1. Akhoon BA, Qiao Q, Stewart A, Chen J, Rodriguez Lopez CM, Corbin KR. Pangenomic analysis of the bacterial cellulose-producing genera Komagataeibacter and Novacetimonas. Int J Biol Macromol 2025:139980. [PMID: 39826720 DOI: 10.1016/j.ijbiomac.2025.139980] [Received: 09/10/2024] [Revised: 12/06/2024] [Accepted: 01/15/2025] [Indexed: 01/22/2025]
Abstract
Bacterial cellulose (BC) holds significant commercial potential due to its unique structural and chemical properties, making it suitable for applications in electronics, medicine, and pharmaceuticals. However, large-scale BC production remains limited by challenges in bacterial performance. In this study, we compared 79 microbial genomes from three genera (Komagataeibacter, Novacetimonas, and Gluconacetobacter) to investigate their pangenomes, genetic diversity, and evolutionary relationships. Through comparative genomic and phylogenetic analyses, we identified distinct genome compositions and evolutionary patterns that differ from previous reports. The role of horizontal gene transfer (HGT) in shaping the genetic diversity and adaptability of these bacteria was also explored. Key determinants in BC production, such as variations in the bacterial cellulose biosynthesis (bcs) operon, carbohydrate uptake genes, and carbohydrate-active enzymes, were examined. Additionally, several biosynthetic gene clusters (BGCs), including Linocin M18 and sactipeptides, which encode antimicrobial peptides known as bacteriocins, were identified. These findings reveal new aspects of the genetic diversity in cellulose-producing bacteria and present a comprehensive genomic toolkit that will support future efforts to optimize BC production and improve microbial performance for commercial applications.
Affiliation(s)
- Bashir A Akhoon
  - Department of Horticulture, Martin-Gatton College of Agriculture, Food and Environment, University of Kentucky, Lexington, KY, USA
- Qi Qiao
  - Department of Horticulture, Martin-Gatton College of Agriculture, Food and Environment, University of Kentucky, Lexington, KY, USA; College of Public Health, University of Kentucky, Lexington, KY, USA
- Alexander Stewart
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
- Jin Chen
  - Department of Internal Medicine and Department of Computer Science, Institute for Biomedical Informatics, University of Kentucky, Lexington, KY, USA; The University of Alabama at Birmingham, School of Medicine - Nephrology, Birmingham, AL, USA
- Carlos M Rodriguez Lopez
  - Environmental Epigenetics and Genetics Group, Department of Horticulture, Martin-Gatton College of Agriculture, Food and Environment, University of Kentucky, Lexington, KY, USA
- Kendall R Corbin
  - Department of Horticulture, Martin-Gatton College of Agriculture, Food and Environment, University of Kentucky, Lexington, KY, USA

2. Madrigal G, Minhas BF, Catchen J. Klumpy: A tool to evaluate the integrity of long-read genome assemblies and illusive sequence motifs. Mol Ecol Resour 2025; 25:e13982. [PMID: 38800997 PMCID: PMC11646305 DOI: 10.1111/1755-0998.13982] [Received: 03/28/2024] [Accepted: 05/13/2024] [Indexed: 05/29/2024]
Abstract
The improvement and decreasing costs of third-generation sequencing technologies have widened the scope of biological questions researchers can address with de novo genome assemblies. With the increasing number of reference genomes, validating their integrity with minimal overhead is vital for establishing confident results in their applications. Here, we present Klumpy, a tool for detecting and visualizing both misassembled regions in a genome assembly and genetic elements (e.g. genes) of interest in a set of sequences. By leveraging the initial raw reads in combination with their respective genome assembly, we illustrate Klumpy's utility by investigating antifreeze glycoprotein (afgp) loci across two icefishes, by searching for a reported absent gene in the northern snakehead fish, and by scanning the reference genomes of a mudskipper and bumblebee for misassembled regions. In the two former cases, we were able to provide support for the noncanonical placement of an afgp locus in the icefishes and locate the missing snakehead gene. Furthermore, our genome scans were able to identify an unmappable locus in the mudskipper reference genome and a putative repetitive element shared among several species of bees.
Affiliation(s)
- Giovanni Madrigal
  - Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- Bushra Fazal Minhas
  - Informatics Program, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- Julian Catchen
  - Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
  - Informatics Program, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA

3. Zhou J, Zhang X, Wang Y, Liang H, Yang Y, Huang X, Deng J. Contamination Survey of Insect Genomic and Transcriptomic Data. Animals (Basel) 2024; 14:3432. [PMID: 39682398 DOI: 10.3390/ani14233432] [Received: 10/09/2024] [Revised: 11/05/2024] [Accepted: 11/25/2024] [Indexed: 12/18/2024] Open
Abstract
The rapid advancement of high-throughput sequencing has led to a great increase in sequencing data and, with it, a significant accumulation of contamination; for example, sequences from non-target species may be present in the target species' sequencing data. Insecta, the most diverse group within Arthropoda, still lacks a comprehensive evaluation of contamination prevalence in public databases and an analysis of potential contamination causes. In this study, COI barcodes were used to investigate contamination from insects and mammals in GenBank's genomic and transcriptomic data across four insect orders. Among the 2796 WGS and 1382 TSA assemblies analyzed, contamination was detected in 32 (1.14%) WGS and 152 (11.0%) TSA assemblies. Key findings from this study include the following: (1) TSA data exhibited more severe contamination than WGS data; (2) contamination levels varied significantly among the four orders, with Hemiptera showing 9.22%, Coleoptera 3.48%, Hymenoptera 7.66%, and Diptera 1.89% contamination rates; (3) possible causes of contamination, such as food, parasitism, sample collection, and cross-contamination, were analyzed. Overall, this study proposes a workflow for checking for contamination in WGS and TSA data and offers some suggestions to mitigate it.
Affiliation(s)
- Jiali Zhou
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Xinrui Zhang
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Yujie Wang
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Haoxian Liang
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Yuhao Yang
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Xiaolei Huang
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Jun Deng
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China

4. Weber CC. Disentangling cobionts and contamination in long-read genomic data using sequence composition. G3 (Bethesda) 2024; 14:jkae187. [PMID: 39148415 PMCID: PMC11540323 DOI: 10.1093/g3journal/jkae187] [Received: 05/30/2024] [Revised: 08/02/2024] [Accepted: 08/02/2024] [Indexed: 08/17/2024]
Abstract
The recent acceleration in genome sequencing targeting previously unexplored parts of the tree of life presents computational challenges. Samples collected from the wild often contain sequences from several organisms, including the target, its cobionts, and contaminants. Effective methods are therefore needed to separate sequences. Though advances in sequencing technology make this task easier, it remains difficult to taxonomically assign sequences from eukaryotic taxa that are not well represented in databases. Therefore, reference-based methods alone are insufficient. Here, I examine how we can take advantage of differences in sequence composition between organisms to identify symbionts, parasites, and contaminants in samples, with minimal reliance on reference data. To this end, I explore data from the Darwin Tree of Life project, including hundreds of high-quality HiFi read sets from insects. Visualizing two-dimensional representations of read tetranucleotide composition learned by a variational autoencoder can reveal distinct components of a sample. Annotating the embeddings with additional information, such as coding density, estimated coverage, or taxonomic labels allows rapid assessment of the contents of a dataset. The approach scales to millions of sequences, making it possible to explore unassembled read sets, even for large genomes. Combined with interactive visualization tools, it allows a large fraction of cobionts reported by reference-based screening to be identified. Crucially, it also facilitates retrieving genomes for which suitable reference data are absent.
Affiliation(s)
- Claudia C Weber
  - Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK

5. Sadowska-Bartosz I, Bartosz G. Antioxidant Defense in the Toughest Animals on the Earth: Its Contribution to the Extreme Resistance of Tardigrades. Int J Mol Sci 2024; 25:8393. [PMID: 39125965 PMCID: PMC11313143 DOI: 10.3390/ijms25158393] [Received: 06/22/2024] [Revised: 07/23/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024] Open
Abstract
Tardigrades are unique among animals in their resistance to dehydration, mainly due to anhydrobiosis and tun formation. They are also very resistant to high-energy radiation, low and high temperatures, low and high pressure, and various chemical agents. Interestingly, they are resistant to ionizing radiation to a similar extent in both the hydrated and dehydrated states. They are able to survive in outer space. Apparently, many mechanisms contribute to the resistance of tardigrades to harmful factors, including the presence of trehalose (though not common to all tardigrades), heat shock proteins, late embryogenesis-abundant proteins, tardigrade-unique proteins, DNA repair proteins, proteins directly protecting DNA (Dsup and TDR1), and an efficient antioxidant system. Antioxidant enzymes and small-molecular-weight antioxidants are an important element of tardigrade resistance. The levels and activities of many antioxidant proteins are elevated by anhydrobiosis and UV radiation; one explanation for their induction during dehydration is provided by the theory of "preparation for oxidative stress", which occurs during rehydration. Genes coding for some antioxidant proteins are expanded in tardigrades; some genes (especially those coding for catalases) were hypothesized to be of bacterial origin, acquired by horizontal gene transfer. An interesting antioxidant protein found in tardigrades is the new Mn-dependent peroxidase.
Affiliation(s)
- Izabela Sadowska-Bartosz
  - Laboratory of Analytical Biochemistry, Institute of Food Technology and Nutrition, College of Natural Sciences, University of Rzeszów, 4 Zelwerowicza Street, 35-601 Rzeszow, Poland

6. Vuruputoor VS, Starovoitov A, Cai Y, Liu Y, Rahmatpour N, Hedderson TA, Wilding N, Wegrzyn JL, Goffinet B. Crossroads of assembling a moss genome: navigating contaminants and horizontal gene transfer in the moss Physcomitrellopsis africana. G3 (Bethesda) 2024; 14:jkae104. [PMID: 38781445 PMCID: PMC11228847 DOI: 10.1093/g3journal/jkae104] [Received: 03/18/2024] [Revised: 05/03/2024] [Accepted: 05/09/2024] [Indexed: 05/25/2024]
Abstract
The first chromosome-scale reference genome of the rare narrow-endemic African moss Physcomitrellopsis africana (P. africana) is presented here. Assembled from 73 × Oxford Nanopore Technologies (ONT) long reads and 163 × Beijing Genomics Institute (BGI)-seq short reads, the 414 Mb reference comprises 26 chromosomes and 22,925 protein-coding genes [Benchmarking Universal Single-Copy Ortholog (BUSCO) scores: C:94.8% (D:13.9%)]. This genome holds 2 genes that withstood rigorous filtration of microbial contaminants, have no homolog in other land plants, and are thus interpreted as resulting from 2 unique horizontal gene transfers (HGTs) from microbes. Further, P. africana shares 176 of the 273 published HGT candidates identified in Physcomitrium patens (P. patens), but lacks 98 of these, highlighting that perhaps as many as 91 genes were acquired in P. patens in the last 40 million years following its divergence from its common ancestor with P. africana. These observations suggest rather continuous gene gains via HGT followed by potential losses during the diversification of the Funariaceae. Our findings showcase both dynamic flux in plant HGTs over evolutionarily "short" timescales, alongside enduring impacts of successful integrations, like those still functionally maintained in extant P. africana. Furthermore, this study describes the informatic processes employed to distinguish contaminants from candidate HGT events.
Affiliation(s)
- Vidya S Vuruputoor
  - Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
- Andrew Starovoitov
  - Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
- Yuqing Cai
  - State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen 518083, China
  - Key Laboratory of Southern Subtropical Plant Diversity, Fairy Lake 518004, China
- Yang Liu
  - State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen 518083, China
  - Key Laboratory of Southern Subtropical Plant Diversity, Fairy Lake 518004, China
- Nasim Rahmatpour
  - Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
- Terry A Hedderson
  - Department of Biological Sciences, Bolus Herbarium, University of Cape Town, Private Bag, 7701 Rondebosch, South Africa
- Nicholas Wilding
  - UMR PVBMT, BP 7151, Université de La Réunion, chemin de l’IRAT, 97410 Saint-Pierre, La Réunion, France
  - Missouri Botanical Garden, P.O. Box 299, St. Louis, MO 63166-0299, USA
- Jill L Wegrzyn
  - Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
  - Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
- Bernard Goffinet
  - Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA

7. Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement (>1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises broad adoption of PacBio sequencing, not only for mutational studies of flies and their symbionts but also for explorations of other small organisms or precious samples.
Affiliation(s)
- Hangxing Jia
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Shengjun Tan
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Yingao Cai
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China
- Yanyan Guo
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China
- Jieyu Shen
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China
- Yaqiong Zhang
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Huijing Ma
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Qingzhu Zhang
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China
- Jinfeng Chen
  - University of Chinese Academy of Sciences, Beijing, China
  - State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Gexia Qiao
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China
- Jue Ruan
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Yong E Zhang
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - University of Chinese Academy of Sciences, Beijing, China

8. Caetano-Anollés G. Are Viruses Taxonomic Units? A Protein Domain and Loop-Centric Phylogenomic Assessment. Viruses 2024; 16:1061. [PMID: 39066224 PMCID: PMC11281659 DOI: 10.3390/v16071061] [Received: 06/05/2024] [Revised: 06/26/2024] [Accepted: 06/27/2024] [Indexed: 07/28/2024] Open
Abstract
Virus taxonomy uses a Linnaean-like subsumption hierarchy to classify viruses into taxonomic units at species and higher rank levels. Virus species are considered monophyletic groups of mobile genetic elements (MGEs) often delimited by the phylogenetic analysis of aligned genomic or metagenomic sequences. Taxonomic units are assumed to be independent organizational, functional and evolutionary units that follow a 'natural history' rationale. Here, I use phylogenomic and other arguments to show that viruses are not self-standing genetically-driven systems acting as evolutionary units. Instead, they are crucial components of holobionts, which are units of biological organization that dynamically integrate the genetic, epigenetic, physiological and functional properties of their co-evolving members. Remarkably, phylogenomic analyses show that viruses share protein domains and loops with cells throughout history via massive processes of reticulate evolution, helping spread evolutionary innovations across a wider taxonomic spectrum. Thus, viruses are not merely MGEs or microbes. Instead, their genomes and proteomes conduct cellularly integrated processes akin to those cataloged by the GO Consortium. This prompts the generation of compositional hierarchies that replace the 'is-a-kind-of' logic with an 'is-a-part-of' logic to better describe the mereology of integrated cellular and viral makeup. My analysis demands a new paradigm that integrates virus taxonomy into a modern evolutionarily centered taxonomy of organisms.
Affiliation(s)
- Gustavo Caetano-Anollés
  - Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL 61801, USA

9. Galas S, Le Goff E, Cazevieille C, Tanaka A, Cuq P, Baghdiguian S, Kunieda T, Godefroy N, Richaud M. A comparative ultrastructure study of the tardigrade Ramazzottius varieornatus in the hydrated state, after desiccation and during the process of rehydration. PLoS One 2024; 19:e0302552. [PMID: 38843161 PMCID: PMC11156355 DOI: 10.1371/journal.pone.0302552] [Received: 09/07/2023] [Accepted: 04/07/2024] [Indexed: 06/09/2024] Open
Abstract
Tardigrades can survive hostile environments such as desiccation by adopting a state of anhydrobiosis. Numerous tardigrade species have been described thus far, and recent genome and transcriptome analyses revealed that several distinct strategies were employed to cope with harsh environments depending on the evolutionary lineages. Detailed analyses at the cellular and subcellular levels are essential to complete these data. In this work, we analyzed a tardigrade species that can withstand rapid dehydration, Ramazzottius varieornatus. Surprisingly, we noted an absence of the anhydrobiotic-specific extracellular structure previously described for the Hypsibius exemplaris species. Both Ramazzottius varieornatus and Hypsibius exemplaris belong to the same evolutionary class of Eutardigrada. Nevertheless, our observations reveal discrepancies in the anhydrobiotic structures correlated with the variation in the anhydrobiotic mechanisms.
Affiliation(s)
- Simon Galas
  - IBMM, University of Montpellier, CNRS, ENSCM, Montpellier, France
- Emilie Le Goff
  - ISEM, University of Montpellier, CNRS, IRD, Montpellier, France
- Akihiro Tanaka
  - Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
- Pierre Cuq
  - IBMM, University of Montpellier, CNRS, ENSCM, Montpellier, France
- Takekazu Kunieda
  - Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
- Nelly Godefroy
  - ISEM, University of Montpellier, CNRS, IRD, Montpellier, France
- Myriam Richaud
  - IBMM, University of Montpellier, CNRS, ENSCM, Montpellier, France

10. Surmacz B, Stec D, Prus-Frankowska M, Buczek M, Michalczyk Ł, Łukasik P. Pinpointing the microbiota of tardigrades: What is really there? Environ Microbiol 2024; 26:e16659. [PMID: 38899728 DOI: 10.1111/1462-2920.16659] [Received: 01/24/2024] [Accepted: 05/09/2024] [Indexed: 06/21/2024]
Abstract
Microbiota are considered significant in the biology of tardigrades, yet their diversity and distribution remain largely unexplored. This is partly due to the methodological challenges associated with studying the microbiota of small organisms that inhabit microbe-rich environments. In our study, we characterized the microbiota of 31 species of cultured tardigrades using 16S rRNA amplicon sequencing. We employed various sample preparation strategies and multiple types of controls and estimated the number of microbes in samples using synthetic DNA spike-ins. We also reanalysed data from previous tardigrade microbiome studies. Our findings suggest that the microbial communities of cultured tardigrades are predominantly composed of bacterial genotypes originating from food, medium, or reagents. Despite numerous experiments, we found it challenging to identify strains that were enriched in certain tardigrades, which would have indicated likely symbiotic associations. Putative tardigrade-associated microbes rarely constituted more than 20% of the datasets, although some matched symbionts identified in other studies. We also uncovered serious contamination issues in previous tardigrade microbiome studies, casting doubt on some of their conclusions. We concluded that tardigrades are not universally dependent on specialized microbes. Our work underscores the need for rigorous safeguards in studies of the microbiota of microscopic organisms and serves as a cautionary tale for studies involving samples with low microbiome abundance.
Affiliation(s)
- Bartłomiej Surmacz
  - Institute of Botany, Faculty of Biology, Jagiellonian University, Kraków, Poland
  - Department of Invertebrate Evolution, Institute of Zoology and Biomedical Research, Faculty of Biology, Jagiellonian University, Kraków, Poland
  - Doctoral School of Exact and Natural Sciences, Jagiellonian University, Kraków, Poland
- Daniel Stec
  - Department of Invertebrate Evolution, Institute of Zoology and Biomedical Research, Faculty of Biology, Jagiellonian University, Kraków, Poland
  - Institute of Systematics and Evolution of Animals, Polish Academy of Sciences, Kraków, Poland
- Monika Prus-Frankowska
  - Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, Kraków, Poland
- Mateusz Buczek
  - Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, Kraków, Poland
- Łukasz Michalczyk
  - Department of Invertebrate Evolution, Institute of Zoology and Biomedical Research, Faculty of Biology, Jagiellonian University, Kraków, Poland
- Piotr Łukasik
  - Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, Kraków, Poland

11. Keeling PJ. Horizontal gene transfer in eukaryotes: aligning theory with data. Nat Rev Genet 2024; 25:416-430. [PMID: 38263430 DOI: 10.1038/s41576-023-00688-5] [Accepted: 12/06/2023] [Indexed: 01/25/2024]
Abstract
Horizontal gene transfer (HGT), or lateral gene transfer, is the non-sexual movement of genetic information between genomes. It has played a pronounced part in bacterial and archaeal evolution, but its role in eukaryotes is less clear. Behaviours unique to eukaryotic cells - phagocytosis and endosymbiosis - have been proposed to increase the frequency of HGT, but nuclear genomes encode fewer HGTs than bacteria and archaea. Here, I review the existing theory in the context of the growing body of data on HGT in eukaryotes, which suggests that any increased chance of acquiring new genes through phagocytosis and endosymbiosis is offset by a reduced need for these genes in eukaryotes, because selection in most eukaryotes operates on variation not readily generated by HGT.
Affiliation(s)
- Patrick J Keeling
  - Department of Botany, University of British Columbia, Vancouver, BC, Canada

12. Satomi S, Takahashi S, Inoue T, Taniguchi M, Sugi M, Natsume M, Suzuki S. Identification and Safety Assessment of Enterococcus casseliflavus KB1733 Isolated from Traditional Japanese Pickle Based on Whole-Genome Sequencing Analysis and Preclinical Toxicity Studies. Microorganisms 2024; 12:953. [PMID: 38792783 PMCID: PMC11123836 DOI: 10.3390/microorganisms12050953] [Received: 04/10/2024] [Revised: 05/02/2024] [Accepted: 05/04/2024] [Indexed: 05/26/2024] Open
Abstract
The present study involves the precise identification and safety evaluation of Enterococcus casseliflavus KB1733, previously identified using 16S rRNA analysis, through whole-genome sequencing, phenotypic analysis, and preclinical toxicity studies. Analyses based on the genome sequencing data confirm the identity of KB1733 as E. casseliflavus and show that the genes related to vancomycin resistance are only present on the chromosome, while no virulence factor genes are present on the chromosome or plasmid. Phenotypic analyses of antibiotic resistance and hemolytic activity also indicated no safety concerns. A bacterial reverse mutation test showed there was no increase in revertant colonies of heat-killed KB1733. An acute toxicity test employing heat-killed KB1733 at a dose of 2000 mg/kg body weight in rats resulted in no deaths and no weight gain or other abnormalities in the general condition of the animals, with renal depression foci and renal cysts only occurring at the same frequency as in the control. Taking the background data into consideration, the effects on the kidneys observed in the current study were not caused by KB1733. Our findings suggest that KB1733 is non-pathogenic to humans/animals, although further studies involving repeated oral toxicity tests and/or clinical tests are required.
Affiliation(s)
- Shohei Satomi
  - Diet and Well-Being Research Institute, KAGOME Co., Ltd., 17 Nishitomiyama, Nasushiobara 329-2762, Tochigi, Japan
- Shingo Takahashi
  - Diet and Well-Being Research Institute, KAGOME Co., Ltd., 17 Nishitomiyama, Nasushiobara 329-2762, Tochigi, Japan
- Takuro Inoue
  - Diet and Well-Being Research Institute, KAGOME Co., Ltd., 17 Nishitomiyama, Nasushiobara 329-2762, Tochigi, Japan
- Makoto Taniguchi
  - Genome Lead Co., Ltd., 2-3-35 Tokiwa-chou, Takamatsu 760-0054, Kagawa, Japan
- Mai Sugi
  - BioSafety Research Center Inc., 582-2 Shioshinden, Iwata 437-1213, Shizuoka, Japan
- Masakatsu Natsume
  - BioSafety Research Center Inc., 582-2 Shioshinden, Iwata 437-1213, Shizuoka, Japan
- Shigenori Suzuki
  - Diet and Well-Being Research Institute, KAGOME Co., Ltd., 17 Nishitomiyama, Nasushiobara 329-2762, Tochigi, Japan

13
|
Wilson CG, Pieszko T, Nowell RW, Barraclough TG. Recombination in bdelloid rotifer genomes: asexuality, transfer and stress. Trends Genet 2024; 40:422-436. [PMID: 38458877 DOI: 10.1016/j.tig.2024.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2023] [Revised: 01/31/2024] [Accepted: 02/01/2024] [Indexed: 03/10/2024]
Abstract
Bdelloid rotifers constitute a class of microscopic animals living in freshwater habitats worldwide. Several strange features of bdelloids have drawn attention: their ability to tolerate desiccation and other stresses, a lack of reported males across the clade despite centuries of study, and unusually high numbers of horizontally acquired, non-metazoan genes. Genome sequencing is transforming our understanding of their lifestyle and its consequences, while in turn providing wider insights about recombination and genome organisation in animals. Many questions remain, not least how to reconcile apparent genomic signatures of sex with the continued absence of reported males, why bdelloids have so many horizontally acquired genes, and how their remarkable ability to survive stress interacts with recombination and other genomic processes.
Affiliation(s)
- Christopher G Wilson
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford OX1 3SZ, UK.
- Tymoteusz Pieszko
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford OX1 3SZ, UK
- Reuben W Nowell
- Institute of Ecology and Evolution, Ashworth Laboratories, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK; Biological and Environmental Sciences, School of Natural Sciences, University of Stirling, Stirling FK9 4LA, UK
|
14
|
Li C, Yang Z, Xu X, Meng L, Liu S, Yang D. Conserved and specific gene expression patterns in the embryonic development of tardigrades. Evol Dev 2024; 26:e12476. [PMID: 38654704 DOI: 10.1111/ede.12476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 02/18/2024] [Accepted: 04/03/2024] [Indexed: 04/26/2024]
Abstract
Tardigrades, commonly known as water bears, are enigmatic organisms characterized by their remarkable resilience to extreme environments despite their simple and compact body structure. Much remains to be understood about the evolutionary and developmental features that contribute to their special body plan and abilities. This research provides preliminary insights into the conserved and specific gene expression patterns during embryonic development of water bears, focusing on the species Hypsibius exemplaris. Analysis of the developmental expression dynamics of genes of various evolutionary ages indicated that the mid-conserved stage of H. exemplaris corresponds to the period of ganglia and midgut development, with the late embryonic stage showing a transition from a non-conserved to a conserved state. Additionally, a comparison with Drosophila melanogaster highlighted the absence of certain nodes in development-related pathways, such as Maml and Hairless, which are respectively the transcriptional co-activator and co-repressor of NOTCH-regulated genes. We also employed Weighted Gene Co-expression Network Analysis (WGCNA) to investigate the expression patterns of tardigrade-specific genes (TSGs) during embryo development. Our findings indicated that the module containing the highest proportion of TSGs exhibits high expression levels before the mid-conserved stage, potentially playing a role in glutathione and lipid metabolism. These functions may be associated with ecdysone synthesis and storage cell formation, which is unique to tardigrades.
Affiliation(s)
- Chaoran Li
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Zhixiang Yang
- School of Life Sciences, Hebei University, Baoding, China
- Xiaofang Xu
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Lingling Meng
- School of Life Sciences, Hebei University, Baoding, China
- Shihao Liu
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Dong Yang
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
|
15
|
Pratt CJ, Meili CH, Jones AL, Jackson DK, England EE, Wang Y, Hartson S, Rogers J, Elshahed MS, Youssef NH. Anaerobic fungi in the tortoise alimentary tract illuminate early stages of host-fungal symbiosis and Neocallimastigomycota evolution. Nat Commun 2024; 15:2714. [PMID: 38548766 PMCID: PMC10978972 DOI: 10.1038/s41467-024-47047-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Accepted: 03/18/2024] [Indexed: 04/01/2024] Open
Abstract
Anaerobic gut fungi (AGF, Neocallimastigomycota) reside in the alimentary tract of herbivores. While their presence in mammals is well documented, evidence for their occurrence in non-mammalian hosts is currently sparse. Culture-independent surveys of AGF in tortoises identified a unique community, with three novel deep-branching genera representing >90% of sequences in most samples. Representatives of all genera were successfully isolated under strict anaerobic conditions. Transcriptomics-enabled phylogenomic and molecular dating analyses indicated an ancient, deep-branching position in the AGF tree for these genera, with an evolutionary divergence time estimate of 104-112 million years ago (Mya). Such estimates push the establishment of animal-Neocallimastigomycota symbiosis from the late to the early Cretaceous. Further, tortoise-associated isolates (T-AGF) exhibited limited capacity for plant polysaccharide metabolism and lacked genes encoding several carbohydrate-active enzyme (CAZyme) families. Finally, we demonstrate that the observed curtailed degradation capacities and reduced CAZyme repertoire are driven by the paucity of horizontal gene transfer (HGT) in T-AGF genomes, compared to their mammalian counterparts. This reduced capacity was reflected in an altered cellulosomal production capacity in T-AGF. Our findings provide insights into the phylogenetic diversity, ecological distribution, evolutionary history, evolution of fungal-host nutritional symbiosis, and dynamics of gene acquisition in Neocallimastigomycota.
Affiliation(s)
- Carrie J Pratt
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Casey H Meili
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Adrienne L Jones
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Darian K Jackson
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Emma E England
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Yan Wang
- Department of Biological Sciences, University of Toronto Scarborough, Toronto, ON, Canada
- Steve Hartson
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, USA
- Janet Rogers
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, USA
- Mostafa S Elshahed
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA
- Noha H Youssef
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK, USA.
|
16
|
Lacerda AL, Frias J, Pedrotti ML. Tardigrades in the marine plastisphere: New hitchhikers surfing plastics. Mar Pollut Bull 2024; 200:116071. [PMID: 38290365 DOI: 10.1016/j.marpolbul.2024.116071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2023] [Revised: 01/19/2024] [Accepted: 01/20/2024] [Indexed: 02/01/2024]
Abstract
Tardigrades are remarkable microscopic animals known for their extraordinary resilience in diverse environments, including extreme conditions such as outer space. They are known for their interactions with natural substrates in terrestrial and aquatic systems, but have remained largely unexplored in relation to marine plastics. This study investigates the colonization of plastics, ranging from fossil fuel-based plastics to bioplastics, in the coastal zones of four countries (Brazil, Ireland, France and Italy). Here, we report the first documented occurrence of tardigrades colonizing plastic substrates. We identified five amplicon sequence variants (ASVs) belonging to the phylum Tardigrada, specifically on post-consumer polypropylene in the coastal zone of Galway, Ireland. This discovery raises questions about how the characteristics of different plastics influence tardigrade adhesion. Tardigrades hitchhiking on plastics in the oceans could expand their habitat range, possibly displacing native species and altering trophic interactions, with potential consequences for overall biodiversity.
Affiliation(s)
- Ana Luzia Lacerda
- Laboratoire d'Océanographie de Villefranche sur mer (LOV), UPMC Université Paris 06, CNRS UMR 7093, Sorbonne Université, Villefranche sur Mer, France.
- João Frias
- Marine and Freshwater Research Centre, Atlantic Technological University, Dublin Road, Galway H91 T8NW, Ireland
- Maria Luiza Pedrotti
- Laboratoire d'Océanographie de Villefranche sur mer (LOV), UPMC Université Paris 06, CNRS UMR 7093, Sorbonne Université, Villefranche sur Mer, France
|
17
|
Astashyn A, Tvedte ES, Sweeney D, Sapojnikov V, Bouk N, Joukov V, Mozes E, Strope PK, Sylla PM, Wagner L, Bidwell SL, Brown LC, Clark K, Davis EW, Smith-White B, Hlavina W, Pruitt KD, Schneider VA, Murphy TD. Rapid and sensitive detection of genome contamination at scale with FCS-GX. Genome Biol 2024; 25:60. [PMID: 38409096 PMCID: PMC10898089 DOI: 10.1186/s13059-024-03198-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Received: 05/11/2023] [Accepted: 02/15/2024] [Indexed: 02/28/2024] Open
Abstract
Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 min. Testing FCS-GX on artificially fragmented genomes demonstrates high sensitivity and specificity for diverse contaminant species. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination, comprising 0.16% of total bases, with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/ or https://doi.org/10.5281/zenodo.10651084.
Affiliation(s)
- Alexander Astashyn
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eric S Tvedte
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Deacon Sweeney
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Victor Sapojnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Nathan Bouk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Victor Joukov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eyal Mozes
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Pooja K Strope
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Pape M Sylla
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Lukas Wagner
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Shelby L Bidwell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Larissa C Brown
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Karen Clark
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Emily W Davis
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Brian Smith-White
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Wratko Hlavina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
|
18
|
Alvarez RV, Landsman D. GTax: improving de novo transcriptome assembly by removing foreign RNA contamination. Genome Biol 2024; 25:12. [PMID: 38191464 PMCID: PMC10773103 DOI: 10.1186/s13059-023-03141-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2022] [Accepted: 12/08/2023] [Indexed: 01/10/2024] Open
Abstract
The cost and complexity of generating a complete reference genome means that many organisms lack an annotated reference. An alternative is to use a de novo reference transcriptome. This technology is cost-effective but is susceptible to off-target RNA contamination. In this manuscript, we present GTax, a taxonomy-structured database of genomic sequences that can be used with BLAST to detect and remove foreign contamination in RNA sequencing samples before assembly. In addition, we use a de novo transcriptome assembly of Solanum lycopersicum (tomato) to demonstrate that removing foreign contamination in sequencing samples reduces the number of assembled chimeric transcripts.
Affiliation(s)
- Roberto Vera Alvarez
- Computational Biology Branch, National Center for Biotechnology Information, Intramural Research Program, National Library of Medicine, NIH, Bethesda, MD, USA
- David Landsman
- Computational Biology Branch, National Center for Biotechnology Information, Intramural Research Program, National Library of Medicine, NIH, Bethesda, MD, USA.
|
19
|
MacLeod AI, Knopp MR, Gould SB. A mysterious cloak: the peptidoglycan layer of algal and plant plastids. Protoplasma 2024; 261:173-178. [PMID: 37603062 PMCID: PMC10784329 DOI: 10.1007/s00709-023-01886-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 05/03/2023] [Accepted: 07/23/2023] [Indexed: 08/22/2023]
Abstract
The plastids of algae and plants originated on a single occasion from an endosymbiotic cyanobacterium at least a billion years ago. Despite the divergent evolution that characterizes the plastids of different lineages, many traits such as membrane organization and means of fission are universal-they pay tribute to the cyanobacterial origin of the organelle. For one such trait, the peptidoglycan (PG) layer, the situation is more complicated. Our view on its distribution keeps on changing and little is known regarding its molecular relevance, especially for land plants. Here, we investigate the extent of PG presence across the Chloroplastida using a phylogenomic approach. Our data support the view of a PG layer being present in the last common ancestor of land plants and its remarkable conservation across bryophytes that are otherwise characterized by gene loss. In embryophytes, the occurrence of the PG layer biosynthetic toolkit becomes patchier and the availability of novel genome data questions previous predictions regarding a functional coevolution of the PG layer and the plastid division machinery-associated gene FtsZ3. Furthermore, our data confirm the presence of penicillin-binding protein (PBP) orthologs in seed plants, which were previously thought to be absent from this clade. The 5-7 nm thick, and seemingly unchanged, PG layer armoring the plastids of glaucophyte algae might still provide the original function of structural support, but the same can likely not be said about the only recently identified PG layer of bryophyte and tracheophyte plastids. There are several issues to be explored regarding the composition, exact function, and biosynthesis of the PG layer in land plants. These issues arise from the fact that land plants seemingly lack certain genes that are believed to be crucial for PG layer production, even though they probably synthesize a PG layer.
Affiliation(s)
- Alexander I MacLeod
- Institute for Molecular Evolution, Heinrich Heine University of Düsseldorf, 40225, Düsseldorf, Germany.
- Michael R Knopp
- Institute for Molecular Evolution, Heinrich Heine University of Düsseldorf, 40225, Düsseldorf, Germany
- Sven B Gould
- Institute for Molecular Evolution, Heinrich Heine University of Düsseldorf, 40225, Düsseldorf, Germany
|
20
|
Matthews AE, Boves TJ, Percy KL, Schelsky WM, Wijeratne AJ. Population Genomics of Pooled Samples: Unveiling Symbiont Infrapopulation Diversity and Host-Symbiont Coevolution. Life (Basel) 2023; 13:2054. [PMID: 37895435 PMCID: PMC10608719 DOI: 10.3390/life13102054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2023] [Revised: 09/30/2023] [Accepted: 10/10/2023] [Indexed: 10/29/2023] Open
Abstract
Microscopic symbionts represent crucial links in biological communities. However, they present technical challenges in high-throughput sequencing (HTS) studies due to their small size and minimal high-quality DNA yields, hindering our understanding of host-symbiont coevolution at microevolutionary and macroevolutionary scales. One approach to overcome those barriers is to pool multiple individuals from the same infrapopulation (i.e., individual host) and sequence them together (Pool-Seq), but individual-level information is then compromised. To simultaneously address both issues (i.e., minimal DNA yields and loss of individual-level information), we implemented a strategic Pool-Seq approach to assess variation in sequencing performance and categorize genetic diversity (single nucleotide polymorphisms (SNPs)) at both the individual-level and infrapopulation-level for microscopic feather mites. To do so, we collected feathers harboring mites (Proctophyllodidae: Amerodectes protonotaria) from four individual Prothonotary Warblers (Parulidae: Protonotaria citrea). From each of the four hosts (i.e., four mite infrapopulations), we conducted whole-genome sequencing on three extraction pools consisting of different numbers of mites (1 mite, 5 mites, and 20 mites). We found that samples containing pools of multiple mites had more sequencing reads map to the feather mite reference genome than did the samples containing only a single mite. Mite infrapopulations were primarily genetically structured by their associated individual hosts (not pool size) and the majority of SNPs were shared by all pools within an infrapopulation. Together, these results suggest that the patterns observed are driven by evolutionary processes occurring at the infrapopulation level and are not technical signals due to pool size. 
In total, despite the challenges presented by microscopic symbionts in HTS studies, this work highlights the value of both individual-level and infrapopulation-level sequencing toward our understanding of host-symbiont coevolution at multiple evolutionary scales.
Affiliation(s)
- Alix E. Matthews
- College of Sciences and Mathematics and Molecular Biosciences Program, Arkansas State University, Jonesboro, AR 72401, USA
- Department of Biological Sciences, Arkansas State University, Jonesboro, AR 72401, USA
- Than J. Boves
- Department of Biological Sciences, Arkansas State University, Jonesboro, AR 72401, USA
- Katie L. Percy
- Audubon Delta, National Audubon Society, Baton Rouge, LA 70808, USA
- United States Department of Agriculture, Natural Resources Conservation Service, Addis, LA 70710, USA
- Wendy M. Schelsky
- Department of Evolution, Ecology, and Behavior, School of Integrative Biology, University of Illinois, Urbana-Champaign, Champaign, IL 61801, USA
- Prairie Research Institute, Illinois Natural History Survey, University of Illinois, Urbana-Champaign, Champaign, IL 61820, USA
- Asela J. Wijeratne
- Department of Biological Sciences, Arkansas State University, Jonesboro, AR 72401, USA
|
21
|
Astashyn A, Tvedte ES, Sweeney D, Sapojnikov V, Bouk N, Joukov V, Mozes E, Strope PK, Sylla PM, Wagner L, Bidwell SL, Clark K, Davis EW, Smith-White B, Hlavina W, Pruitt KD, Schneider VA, Murphy TD. Rapid and sensitive detection of genome contamination at scale with FCS-GX. bioRxiv 2023:2023.06.02.543519. [PMID: 37292984 PMCID: PMC10246020 DOI: 10.1101/2023.06.02.543519] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Indexed: 06/10/2023]
Abstract
Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/.
Affiliation(s)
- Alexander Astashyn
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eric S Tvedte
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Deacon Sweeney
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Victor Sapojnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Nathan Bouk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Victor Joukov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eyal Mozes
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Pooja K Strope
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Pape M Sylla
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Lukas Wagner
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Shelby L Bidwell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Karen Clark
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Emily W Davis
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Brian Smith-White
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Wratko Hlavina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
|
22
|
Gilbert C, Maumus F. Sidestepping Darwin: horizontal gene transfer from plants to insects. Curr Opin Insect Sci 2023; 57:101035. [PMID: 37061183 DOI: 10.1016/j.cois.2023.101035] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/25/2022] [Revised: 04/05/2023] [Accepted: 04/06/2023] [Indexed: 05/06/2023]
Abstract
Horizontal transfer of genetic material (HT) is the passage of DNA between organisms by means other than reproduction. Increasing numbers of HT events are reported in insects, with bacteria, fungi, plants, and insects acting as the main sources of these transfers. Here, we provide a detailed account of plant-to-insect HT events. At least 14 insect species belonging to 6 orders are known to have received plant genetic material through HT. One of them, the whitefly Bemisia tabaci (Middle East Asia Minor 1), accounts for most of these transfers, with no fewer than 28 HT events yielding 55 plant-derived genes in this species. Several plant-to-insect HT events reported so far involve gene families known to play a role in plant-parasite interactions. We highlight methodological approaches that may further help characterize these transfers. We argue that plant-to-insect HT is likely more frequent than currently appreciated and that in-depth studies of these transfers will shed new light on plant-insect interactions.
Affiliation(s)
- Clément Gilbert
- Université Paris-Saclay, CNRS, IRD, UMR Evolution, Génomes, Comportement et Ecologie, Gif-sur-Yvette, France.
- Florian Maumus
- Université Paris-Saclay, INRAE, URGI, Versailles, France
|
23
|
Sperling AL, Glover DM. Parthenogenesis in dipterans: a genetic perspective. Proc Biol Sci 2023; 290:20230261. [PMID: 36946111 PMCID: PMC10031431 DOI: 10.1098/rspb.2023.0261] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/01/2022] [Accepted: 02/28/2023] [Indexed: 03/23/2023] Open
Abstract
Parthenogenesis has been documented in almost every phylum of animals, and yet this phenomenon is largely understudied. It has particular importance in dipterans since some parthenogenetic species are also disease vectors and agricultural pests. Here, we present a catalogue of parthenogenetic dipterans, although it is likely that many more remain to be identified, and we discuss how their developmental biology and interactions with diverse environments may be linked to different types of parthenogenetic reproduction. We discuss how the advances in genetics and genomics have identified chromosomal loci associated with parthenogenesis. In particular, a polygenic cause of facultative parthenogenesis has been uncovered in Drosophila mercatorum, allowing the corresponding genetic variants to be tested for their ability to promote parthenogenesis in another species, Drosophila melanogaster. This study probably identifies just one of many routes that could be followed in the evolution of parthenogenesis. We attempt to account for why the phenomenon has evolved so many times in the dipteran order and why facultative parthenogenesis appears particularly prevalent. We also discuss the significance of coarse genomic changes, including non-disjunction, aneuploidy, and polyploidy and how, together with changes to specific genes, these might relate to both facultative and obligate parthenogenesis in dipterans and other parthenogenetic animals.
Affiliation(s)
- A. L. Sperling
- Department of Genetics, University of Cambridge, Cambridge, UK
- D. M. Glover
- Department of Genetics, University of Cambridge, Cambridge, UK
- California Institute of Technology, Pasadena, CA, USA
|
24
|
Fleming JF. The wealth of shared resources: Improving molecular taxonomy using eDNA and public databases. Zool Scr 2023. [DOI: 10.1111/zsc.12591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/24/2023]
|
25
|
Wang Y, Shahid MQ. Genome sequencing and resequencing identified three horizontal gene transfers and uncovered the genetic mechanism on the intraspecies adaptive evolution of Gastrodia elata Blume. Front Plant Sci 2023; 13:1035157. [PMID: 36684780 PMCID: PMC9848658 DOI: 10.3389/fpls.2022.1035157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2022] [Accepted: 12/13/2022] [Indexed: 06/17/2023]
Abstract
Horizontal gene transfer is a rare and useful genetic mechanism in higher plants. Gastrodia elata Blume (GE) (Orchidaceae), well known as a traditional medicinal material in East Asia, adopts a heterotrophic lifestyle and is thus considered to be more prone to horizontal gene transfer (HGT). GE is a "polytypic species" that currently comprises five recognized forms according to plant morphology. G. elata Blume forma elata (GEE) and G. elata Bl.f.glauca (GEG) are two common forms that naturally grow in habitats that differ in altitude and latitude. G. elata Bl.f.viridis (GEV) often occurs sporadically in cultivated populations of GEE and GEG. However, the genetic relationships and the genetic mechanism underpinning the divergent ecological adaptations of GEE and GEG have not been revealed. Here, we assembled a chromosome-level draft genome of GEE of 1.04 Gb. Among the 17,895 predicted protein-coding genes, we identified three HGTs. We also resequenced 10 GEE accessions, nine GEG accessions, and 10 GEV accessions, and identified two independent genetic lineages: GEG_pedigree (GEG individuals and GEV individuals collected from GEG populations) and GEE_pedigree (GEE individuals and GEV individuals collected from GEE populations), which strongly supports the taxonomic status of GEE and GEG as subspecies, not as different forms. In highly differentiated genomic regions of GEE_pedigree and GEG_pedigree, three chalcone synthase-encoding genes and one gene encoding a Phox/Bem1p (PB1) domain of Auxin (AUX)/Indoleacetic acid (IAA) were identified in selective-sweep regions, which suggests that differentiation between GEE_pedigree and GEG_pedigree was promoted by selection on genes related to photoresponse and to growth and development. Overall, this new genome should be helpful for the breeding and utilization of GE, and the new findings deepen the understanding of the ecological adaptation and evolution of GE.
Affiliation(s)
- Yunsheng Wang
- School of Health and Life Science, Kaili University, Kaili, Guizhou, China
| | - Muhammad Qasim Shahid
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, China
- College of Agriculture, South China Agricultural University, Guangzhou, Guangdong, China
| |
|
26
|
Maleki E, Akbari Rokn Abadi S, Koohi S. HELIOS: High-speed sequence alignment in optics. PLoS Comput Biol 2022; 18:e1010665. [PMID: 36409684 PMCID: PMC9678324 DOI: 10.1371/journal.pcbi.1010665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/01/2022] [Accepted: 10/18/2022] [Indexed: 11/22/2022] Open
Abstract
In response to the imperfections of current sequence alignment methods, which originate from the inherent serialism of their underlying electrical systems, a few optical approaches for biological data comparison have been proposed recently. However, these approaches perform poorly because of their inefficient coding schemes. This paper therefore presents a novel all-optical high-throughput method for aligning DNA, RNA, and protein sequences, named HELIOS. The HELIOS method employs highly sophisticated operations to locate character matches, single or multiple mutations, and single or multiple indels within various biological sequences. In addition, the HELIOS optical architecture exploits high-speed processing and operational parallelism in optics by adopting the wavelength and polarization of optical beams. For evaluation, the functionality and accuracy of the HELIOS method are verified through behavioral and optical simulation studies, while its complexity and performance are estimated through analytical computation. The accuracy evaluations indicate that the HELIOS method achieves a precise pairwise alignment of two sequences, highly similar to those of Smith-Waterman, Needleman-Wunsch, BLAST, MUSCLE, ClustalW, ClustalΩ, T-Coffee, Kalign, and MAFFT. According to our performance evaluations, the HELIOS optical architecture outperforms all alternative electrical and optical algorithms in terms of processing time and memory requirement, relying on its highly sophisticated method and optical architecture. Moreover, the employed compact coding scheme greatly increases the number of input characters and hence offers reduced time and space complexities compared to the electrical and optical alternatives. This makes the HELIOS method and optical architecture highly applicable to biomedical applications.
Affiliation(s)
- Ehsan Maleki
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | | | - Somayyeh Koohi
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| |
|
27
|
Koutsovoulos GD, Granjeon Noriot S, Bailly-Bechet M, Danchin EGJ, Rancurel C. AvP: A software package for automatic phylogenetic detection of candidate horizontal gene transfers. PLoS Comput Biol 2022; 18:e1010686. [PMID: 36350852 PMCID: PMC9678320 DOI: 10.1371/journal.pcbi.1010686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Revised: 11/21/2022] [Accepted: 10/26/2022] [Indexed: 11/10/2022] Open
Abstract
Horizontal gene transfer (HGT) is the transfer of genes between species outside the transmission from parent to offspring. Due to their impact on the genome and biology of various species, HGTs have gained broader attention, but high-throughput methods to robustly identify them are lacking. One rapid method to identify HGT candidates is to calculate the difference in similarity between the most similar gene in closely related species and the most similar gene in distantly related species. Although metrics on similarity associated with taxonomic information can rapidly detect putative HGTs, these methods are hampered by false positives that are difficult to track. Furthermore, they do not inform on the evolutionary trajectory and events such as duplications. Hence, phylogenetic analysis is necessary to confirm HGT candidates and provide a more comprehensive view of their origin and evolutionary history. However, phylogenetic reconstruction requires several time-consuming manual steps to retrieve the homologous sequences, produce a multiple alignment, construct the phylogeny and analyze the topology to assess whether it supports the HGT hypothesis. Here, we present AvP which automatically performs all these steps and detects candidate HGTs within a phylogenetic framework.
Affiliation(s)
- Georgios D. Koutsovoulos
- Institut Sophia Agrobiotech, Université Côte d’Azur, INRAE, CNRS, Sophia Antipolis, France
| | - Solène Granjeon Noriot
- Institut Sophia Agrobiotech, Université Côte d’Azur, INRAE, CNRS, Sophia Antipolis, France
| | - Marc Bailly-Bechet
- Institut Sophia Agrobiotech, Université Côte d’Azur, INRAE, CNRS, Sophia Antipolis, France
| | - Etienne G. J. Danchin
- Institut Sophia Agrobiotech, Université Côte d’Azur, INRAE, CNRS, Sophia Antipolis, France
| | - Corinne Rancurel
- Institut Sophia Agrobiotech, Université Côte d’Azur, INRAE, CNRS, Sophia Antipolis, France
| |
|
28
|
Li L, Peng S, Wang Z, Zhang T, Li H, Xiao Y, Li J, Liu Y, Yin H. Genome mining reveals abiotic stress resistance genes in plant genomes acquired from microbes via HGT. Front Plant Sci 2022; 13:1025122. [PMID: 36407614 PMCID: PMC9667741 DOI: 10.3389/fpls.2022.1025122] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 08/22/2022] [Accepted: 09/07/2022] [Indexed: 06/16/2023]
Abstract
Colonization by beneficial microbes can enhance plant tolerance to abiotic stresses. However, much remains unknown about beneficial plant-microbe interactions. In this study, we assessed the amount or impact of horizontal gene transfer (HGT)-derived genes in plants that have the potential to confer abiotic stress resistance. We identified a total of 235 gene entries in fourteen high-quality plant genomes belonging to the phyla Chlorophyta and Streptophyta that were acquired from microbes through independent HGTs and confer resistance against a wide range of abiotic pressures. These genes encode proteins that contribute to toxic metal resistance (e.g., ChrA, CopA, CorA), osmotic and drought stress resistance (e.g., Na+/proline symporter, potassium/proton antiporter), acid resistance (e.g., PcxA, ArcA, YhdG), heat and cold stress resistance (e.g., DnaJ, Hsp20, CspA), oxidative stress resistance (e.g., GST, PoxA, glutaredoxin), DNA damage resistance (e.g., Rad25, Rad51, UvrD), and organic pollutant resistance (e.g., CytP450, laccase, CbbY). Phylogenetic analyses support the HGT inferences, as the plant lineages all cluster closely with distant microbial lineages. Deep-learning-based protein structure prediction and analyses, in combination with expression assessment based on the codon adaptation index (CAI), further corroborated the functionality and expressivity of the HGT genes in plant genomes. A case study applying fold comparison and molecular dynamics (MD) to the HGT-driven CytP450 gave a more detailed illustration of the resemblance and evolutionary linkage between the plant recipient and microbial donor sequences. Together, the microbe-originated HGT genes identified in plant genomes and their participation in resistance to abiotic pressures indicate a more profound impact of HGT on the adaptive evolution of plants.
Affiliation(s)
- Liangzhi Li
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| | | | - Zhenhua Wang
- Zhangjiajie Tobacco Company of Hunan Province, Zhangjiajie, China
| | - Teng Zhang
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
- Hunan Urban and Rural Environmental Construction Co., Ltd, Changsha, China
| | - Hongguang Li
- Hunan Tobacco Science Institute, Changsha, China
| | - Yansong Xiao
- Chenzhou Tobacco Company of Hunan Province, Chenzhou, China
| | - Jingjun Li
- Chenzhou Tobacco Company of Hunan Province, Chenzhou, China
| | - Yongjun Liu
- Hunan Tobacco Science Institute, Changsha, China
| | - Huaqun Yin
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| |
|
29
|
Zhang X, Hu Y, Smith DR. HSDatabase-a database of highly similar duplicate genes from plants, animals, and algae. Database (Oxford) 2022; 2022:baac086. [PMID: 36208223 PMCID: PMC9547538 DOI: 10.1093/database/baac086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/05/2022] [Revised: 08/16/2022] [Accepted: 09/20/2022] [Indexed: 11/30/2022]
Abstract
Gene duplication is an important evolutionary mechanism capable of providing new genetic material, which in some instances can help organisms adapt to various environmental conditions. Recent studies, for example, have indicated that highly similar duplicate genes (HSDs) are aiding adaptation to extreme conditions via gene dosage. However, for most eukaryotic genomes HSDs remain uncharacterized, partly because they can be hard to identify and categorize efficiently and effectively. Here, we collected and curated HSDs in nuclear genomes from various model animals, land plants and algae and indexed them in an online, open-access sequence repository called HSDatabase. Currently, this database contains 117 864 curated HSDs from 40 distinct genomes; it includes statistics on the total number of HSDs per genome as well as individual HSD copy numbers/lengths and provides sequence alignments of the duplicate gene copies. HSDatabase also allows users to download sequences of gene copies, access genome browsers, and link out to other databases, such as Pfam and Kyoto Encyclopedia of Genes and Genomes. What is more, a built-in Basic Local Alignment Search Tool option is available to conveniently explore potential homologous sequences of interest within and across species. HSDatabase has a user-friendly interface and provides easy access to the source data. It can be used on its own for comparative analyses of gene duplicates or in conjunction with HSDFinder, a newly developed bioinformatics tool for identifying, annotating, categorizing and visualizing HSDs. Database URL: http://hsdfinder.com/database/.
Affiliation(s)
- Xi Zhang
- Institute for Comparative Genomics, Dalhousie University, Halifax, Nova Scotia B3H 4R2, Canada
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 4R2, Canada
| | - Yining Hu
- Department of Computer Science, University of Western Ontario, London, Ontario N6A 3K7, Canada
| | - David Roy Smith
- Department of Biology, University of Western Ontario, London, Ontario N6A 3K7, Canada
| |
|
30
|
Lozano-Fernandez J. A Practical Guide to Design and Assess a Phylogenomic Study. Genome Biol Evol 2022; 14:evac129. [PMID: 35946263 PMCID: PMC9452790 DOI: 10.1093/gbe/evac129] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Accepted: 08/03/2022] [Indexed: 11/13/2022] Open
Abstract
Over the last decade, molecular systematics has undergone a change of paradigm as high-throughput sequencing now makes it possible to reconstruct evolutionary relationships using genome-scale datasets. The advent of "big data" molecular phylogenetics provided a battery of new tools for biologists but simultaneously brought new methodological challenges. The increase in analytical complexity comes at the price of highly specific training in computational biology and molecular phylogenetics, resulting very often in a polarized accumulation of knowledge (technical on one side and biological on the other). Interpreting the robustness of genome-scale phylogenetic studies is not straightforward, particularly as new methodological developments have consistently shown that the general belief of "more genes, more robustness" often does not apply, and because there is a range of systematic errors that plague phylogenomic investigations. This is particularly problematic because phylogenomic studies are highly heterogeneous in their methodology, and best practices are often not clearly defined. The main aim of this article is to present what I consider as the ten most important points to take into consideration when planning a well-thought-out phylogenomic study and while evaluating the quality of published papers. The goal is to provide a practical step-by-step guide that can be easily followed by nonexperts and phylogenomic novices in order to assess the technical robustness of phylogenomic studies or improve the experimental design of a project.
Affiliation(s)
- Jesus Lozano-Fernandez
- Department of Genetics, Microbiology and Statistics, Biodiversity Research Institute (IRBio), University of Barcelona, Avd. Diagonal 643, 08028 Barcelona, Spain
- Institute of Evolutionary Biology (CSIC – Universitat Pompeu Fabra), Passeig marítim de la Barcelona 37-49, 08003 Barcelona, Spain
| |
|
31
|
Yu M. Computational analysis on two putative mitochondrial protein-coding genes from the Emydura subglobosa genome: A functional annotation approach. PLoS One 2022; 17:e0268031. [PMID: 35981005 PMCID: PMC9387794 DOI: 10.1371/journal.pone.0268031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2021] [Accepted: 04/21/2022] [Indexed: 11/19/2022] Open
Abstract
Rapid advancements in automated genomic technologies have uncovered many unique findings about the turtle genome and its associated features, including olfactory gene expansions and duplications of toll-like receptors. However, despite the advent of large-scale sequencing, assembly, and annotation, about 40-50% of genes in eukaryotic genomes are left without functional annotation, severely limiting our knowledge of the biological information of genes. Additionally, these automated processes are prone to errors since draft genomes consist of several disconnected scaffolds whose order is unknown; erroneous draft assemblies may also be contaminated with foreign sequences, and these errors can propagate into the annotation. Many of these automated annotations are thus incomplete and inaccurate, highlighting the need for functional annotation to link gene sequences to biological identity. In this study, we have functionally annotated two genes of the red-bellied short-neck turtle (Emydura subglobosa), a member of the relatively understudied pleurodire lineage of turtles. We improved upon initial ab initio gene predictions through homology-based evidence and generated refined consensus gene models. Through functional, localization, and structural analyses of the predicted proteins, we discovered conserved putative genes encoding mitochondrial proteins that play a role in C21-steroid hormone biosynthetic processes and fatty acid catabolism, both of which are distantly related by the tricarboxylic acid (TCA) cycle and share similar metabolic pathways. Overall, these findings further our knowledge about the genetic features underlying turtle physiology, morphology, and longevity, which have important implications for the treatment of human diseases and evolutionary studies.
Affiliation(s)
- Megan Yu
- Department of Molecular, Cell & Developmental Biology, University of California–Los Angeles, Los Angeles, California, United States of America
| |
|
32
|
Yoshida Y, Tanaka S. Deciphering the Biological Enigma-Genomic Evolution Underlying Anhydrobiosis in the Phylum Tardigrada and the Chironomid Polypedilum vanderplanki. Insects 2022; 13:557. [PMID: 35735894 PMCID: PMC9224920 DOI: 10.3390/insects13060557] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/31/2022] [Revised: 06/13/2022] [Accepted: 06/17/2022] [Indexed: 02/04/2023]
Abstract
Anhydrobiosis, an ametabolic dehydrated state triggered by water loss, is observed in several invertebrate lineages. Anhydrobiotes revive when rehydrated, and seem not to suffer the ultimately lethal cell damage that results from severe loss of water in other organisms. Here, we review the biochemical and genomic evidence that has revealed the protectant molecules, repair systems, and maintenance pathways associated with anhydrobiosis. We then introduce two lineages in which anhydrobiosis has evolved independently: Tardigrada, where anhydrobiosis characterizes many species within the phylum, and the genus Polypedilum, where anhydrobiosis occurs in only two species. Finally, we discuss the complexity of the evolution of anhydrobiosis within invertebrates based on current knowledge, and propose perspectives to enhance the understanding of anhydrobiosis.
Affiliation(s)
- Yuki Yoshida
- Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo 153-8902, Japan
| | - Sae Tanaka
- Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of Natural Sciences, 5-1 Higashiyama, Myodaiji, Okazaki 444-8787, Japan
- Institute for Advanced Biosciences, Keio University, 341-1 Mizukami, Tsuruoka 997-0052, Japan
| |
|
33
|
Yoshida Y, Satoh T, Ota C, Tanaka S, Horikawa DD, Tomita M, Kato K, Arakawa K. Time-series transcriptomic screening of factors contributing to the cross-tolerance to UV radiation and anhydrobiosis in tardigrades. BMC Genomics 2022; 23:405. [PMID: 35643424 PMCID: PMC9145152 DOI: 10.1186/s12864-022-08642-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 07/14/2021] [Accepted: 05/18/2022] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Tardigrades are microscopic animals that are capable of tolerating extreme environments by entering a desiccated state of suspended animation known as anhydrobiosis. While antioxidative stress proteins, antiapoptotic pathways and tardigrade-specific intrinsically disordered proteins have been implicated in the anhydrobiotic machinery, conservation of these mechanisms is not universal within the phylum Tardigrada, suggesting the existence of overlooked components. RESULTS Here, we show that a novel Mn-dependent peroxidase is an important factor in tardigrade anhydrobiosis. Through time-series transcriptome analysis of Ramazzottius varieornatus specimens exposed to ultraviolet light and comparison with anhydrobiosis entry, we first identified several novel gene families without similarity to existing sequences that are induced rapidly after stress exposure. Among these, a single gene family with multiple orthologs that is highly conserved within the phylum Tardigrada and enhances oxidative stress tolerance when expressed in human cells was identified. Crystallographic study of this protein suggested Zn or Mn binding at the active site, and we further confirmed that this protein has Mn-dependent peroxidase activity in vitro. CONCLUSIONS Our results demonstrated novel mechanisms for coping with oxidative stress that may be a fundamental mechanism of anhydrobiosis in tardigrades. Furthermore, localization of these sets of proteins mainly in the Golgi apparatus suggests an indispensable role of the Golgi stress response in desiccation tolerance.
Affiliation(s)
- Yuki Yoshida
- Institute for Advanced Biosciences, Keio University, Nihonkoku, 403-1, Daihouji, Tsuruoka, Yamagata, 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa, Kanagawa, 252-0882, Japan
| | - Tadashi Satoh
- Faculty and Graduate School of Pharmaceutical Sciences, Nagoya City University, 3-1 Tanabe-dori, Mizuho, Nagoya, 467-8603, Japan
| | - Chise Ota
- Faculty and Graduate School of Pharmaceutical Sciences, Nagoya City University, 3-1 Tanabe-dori, Mizuho, Nagoya, 467-8603, Japan
| | - Sae Tanaka
- Exploratory Research Center On Life and Living Systems (ExCELLS), National Institute of Natural Sciences, 5-1 Higashiyama, Myodaiji, Okazaki, Aichi, 444-8787, Japan
| | - Daiki D Horikawa
- Institute for Advanced Biosciences, Keio University, Nihonkoku, 403-1, Daihouji, Tsuruoka, Yamagata, 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa, Kanagawa, 252-0882, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Nihonkoku, 403-1, Daihouji, Tsuruoka, Yamagata, 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa, Kanagawa, 252-0882, Japan
| | - Koichi Kato
- Faculty and Graduate School of Pharmaceutical Sciences, Nagoya City University, 3-1 Tanabe-dori, Mizuho, Nagoya, 467-8603, Japan
- Exploratory Research Center On Life and Living Systems (ExCELLS), National Institute of Natural Sciences, 5-1 Higashiyama, Myodaiji, Okazaki, Aichi, 444-8787, Japan
| | - Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Nihonkoku, 403-1, Daihouji, Tsuruoka, Yamagata, 997-0017, Japan.
- Systems Biology Program, Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa, Kanagawa, 252-0882, Japan.
- Exploratory Research Center On Life and Living Systems (ExCELLS), National Institute of Natural Sciences, 5-1 Higashiyama, Myodaiji, Okazaki, Aichi, 444-8787, Japan.
| |
|
34
|
Irisarri I, de Vries J. Punctuated ancestral gene gains in streptophyte evolution. Mol Plant 2022; 15:799-801. [PMID: 35342001 DOI: 10.1016/j.molp.2022.03.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2022] [Revised: 03/21/2022] [Accepted: 03/23/2022] [Indexed: 06/14/2023]
Affiliation(s)
- Iker Irisarri
- University of Goettingen, Institute for Microbiology and Genetics, Department of Applied Bioinformatics, Goldschmidtstr. 1, 37077 Göttingen, Germany; Campus Institute Data Science (CIDAS), Göttingen, Germany.
| | - Jan de Vries
- University of Goettingen, Institute for Microbiology and Genetics, Department of Applied Bioinformatics, Goldschmidtstr. 1, 37077 Göttingen, Germany; Campus Institute Data Science (CIDAS), Göttingen, Germany; University of Goettingen, Goettingen Center for Molecular Biosciences (GZMB), Department of Applied Bioinformatics, Goldschmidtstr. 1, 37077 Goettingen, Germany.
| |
|
35
|
Intragenomic variation in nuclear ribosomal markers and its implication in species delimitation, identification and barcoding in fungi. Fungal Biol Rev 2022. [DOI: 10.1016/j.fbr.2022.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022]
|
36
|
Abstract
Experimentally tractable organisms like C. elegans, Drosophila, zebrafish, and mouse are popular models for addressing diverse questions in biology. In 1997, two of the most valuable invertebrate model organisms to date, C. elegans and Drosophila, were found to be much more closely related to each other than expected. C. elegans and Drosophila belong to the nematodes and arthropods, respectively, and these two phyla and six other phyla make up a clade of molting animals referred to as the Ecdysozoa. The other ecdysozoan phyla could be valuable models for comparative biology, taking advantage of the rich and continual sources of research findings as well as tools from both C. elegans and Drosophila. But when the Ecdysozoa was first recognized, few tools were available for laboratory studies in any of these six other ecdysozoan phyla. In 1999 I began an effort to develop tools for studying one such phylum, the tardigrades. Here, I describe how the tardigrade species Hypsibius exemplaris and tardigrades more generally have emerged over the past two decades as valuable new models for answering diverse questions. To date, these questions have included how animal body plans evolve and how biological materials can survive some remarkably extreme conditions.
Affiliation(s)
- Bob Goldstein
- Department of Biology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States.
| |
|
37
|
Poprawa I, Bartylak T, Kulpla A, Erdmann W, Roszkowska M, Chajec Ł, Kaczmarek Ł, Karachitos A, Kmita H. Verification of Hypsibius exemplaris Gąsiorek et al., 2018 (Eutardigrada; Hypsibiidae) application in anhydrobiosis research. PLoS One 2022; 17:e0261485. [PMID: 35303010 PMCID: PMC8932574 DOI: 10.1371/journal.pone.0261485] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 11/28/2021] [Accepted: 02/25/2022] [Indexed: 01/03/2023] Open
Abstract
Anhydrobiosis is considered to be an adaptation with important practical implications because it enables resistance to the lack of water. The phenomenon is still not well understood at the molecular level. Thus, a good model invertebrate species for the research is required. The best known anhydrobiotic invertebrates are tardigrades (Tardigrada), considered to be the toughest animals in the world. Hypsibius exemplaris is one of the best studied tardigrade species, with its name "exemplaris" referring to the widespread use of the species as a laboratory model for various types of research. However, available data suggest that the anhydrobiotic capability of the species may be overestimated. Therefore, we determined anhydrobiosis survival of Hys. exemplaris specimens using three different anhydrobiosis protocols. We also checked the ultrastructure of storage cells within the formed dormant structures (tuns), which has not yet been studied for Hys. exemplaris. These cells are known to support the energetic requirements of anhydrobiosis. The obtained results indicate that Hys. exemplaris appears not to be a good model species for anhydrobiosis research.
Affiliation(s)
- Izabela Poprawa
- Institute of Biology, Biotechnology and Environmental Protection, Faculty of Natural Sciences, University of Silesia in Katowice, Bankowa, Katowice, Poland
| | - Tomasz Bartylak
- Department of Animal Taxonomy and Ecology, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
- Department of Bioenergetics, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Adam Kulpla
- Center for Advanced Technology, Adam Mickiewicz University, Uniwersytetu Poznańskiego, Poznań, Poland
- Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Weronika Erdmann
- Department of Animal Taxonomy and Ecology, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Milena Roszkowska
- Department of Bioenergetics, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Łukasz Chajec
- Institute of Biology, Biotechnology and Environmental Protection, Faculty of Natural Sciences, University of Silesia in Katowice, Bankowa, Katowice, Poland
| | - Łukasz Kaczmarek
- Department of Animal Taxonomy and Ecology, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Andonis Karachitos
- Department of Bioenergetics, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| | - Hanna Kmita
- Department of Bioenergetics, Faculty of Biology, Adam Mickiewicz University in Poznań, Uniwersytetu Poznańskiego, Poznań, Poland
| |
|
38
|
Van Dam AR, Covas Orizondo JO, Lam AW, McKenna DD, Van Dam MH. Metagenomic clustering reveals microbial contamination as an essential consideration in ultraconserved element design for phylogenomics with insect museum specimens. Ecol Evol 2022; 12:e8625. [PMID: 35342556 PMCID: PMC8932080 DOI: 10.1002/ece3.8625] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 07/26/2021] [Revised: 01/03/2022] [Accepted: 01/17/2022] [Indexed: 11/30/2022] Open
Abstract
Phylogenomics via ultraconserved elements (UCEs) has led to improved phylogenetic reconstructions across the tree of life. However, inadvertently incorporating non-targeted DNA into the UCE marker design will lead to misinformation being incorporated into subsequent analyses. To date, the effectiveness of basic metagenomic filtering strategies has not been assessed in arthropods. Designing markers from museum specimens requires careful consideration of methods due to the high levels of microbial contamination typically found in such specimens. We investigate if contaminant sequences are carried forward into a UCE marker set we developed from insect museum specimens using a standard bioinformatics pipeline. We find that the methods currently employed by most researchers do not exclude contamination from the final set of targets. Lastly, we highlight several paths forward for reducing contamination in UCE marker design.
Affiliation(s)
- Alex R. Van Dam
- Department of Biology, University of Puerto Rico Mayagüez, Mayagüez, Puerto Rico
- Athena W. Lam
- Department of Entomology, California Academy of Sciences, San Francisco, California, USA
- Duane D. McKenna
- Department of Biological Sciences, University of Memphis, Memphis, Tennessee, USA
- Center for Biodiversity Research, University of Memphis, Memphis, Tennessee, USA
- Matthew H. Van Dam
- Department of Entomology, California Academy of Sciences, San Francisco, California, USA
|
39
|
Cornet L, Baurain D. Contamination detection in genomic data: more is not enough. Genome Biol 2022; 23:60. [PMID: 35189924 PMCID: PMC8862208 DOI: 10.1186/s13059-022-02619-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Received: 10/14/2021] [Accepted: 01/18/2022] [Indexed: 12/20/2022] Open
Abstract
The decreasing cost of sequencing and concomitant augmentation of publicly available genomes have created an acute need for automated software to assess genomic contamination. During the last 6 years, 18 programs have been published, each with its own strengths and weaknesses. Deciding which tools to use becomes more and more difficult without an understanding of the underlying algorithms. We review these programs, benchmarking six of them, and present their main operating principles. This article is intended to guide researchers in the selection of appropriate tools for specific applications. Finally, we present future challenges in the developing field of contamination detection.
Affiliation(s)
- Luc Cornet
- BCCM/IHEM, Mycology and Aerobiology, Sciensano, Bruxelles, Belgium
- Denis Baurain
- InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, Liège, Belgium
|
40
|
Abstract
Tardigrades are ubiquitous meiofauna that are especially renowned for their exceptional extremotolerance to various adverse environments, including pressure, temperature, and even ionizing radiation. This is achieved through a reversible halt of metabolism triggered by desiccation, a phenomenon called anhydrobiosis. Recent establishment of genome resources for two tardigrades, Hypsibius exemplaris and Ramazzottius varieornatus, accelerated research to uncover the molecular mechanisms behind anhydrobiosis, leading to the discovery of many tardigrade-unique proteins. This review focuses on the history, methods, discoveries, and current state and challenges regarding tardigrade genomics, with an emphasis on molecular anhydrobiology. Remaining questions and future perspectives regarding prospective approaches to fully elucidate the molecular machinery of this complex phenomenon are discussed.
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Daishouji, Tsuruoka, Yamagata, Japan
- Faculty of Environment and Information Studies, Keio University, Fujisawa, Kanagawa, Japan
- Graduate School of Media and Governance, Systems Biology Program, Keio University, Fujisawa, Kanagawa, Japan
- Exploratory Research Center on Life and Living Systems (ExCELLS), National Institute of Natural Sciences, Myodaiji, Okazaki, Aichi, Japan
|
41
|
Cerca J, Armstrong EE, Vizueta J, Fernández R, Dimitrov D, Petersen B, Prost S, Rozas J, Petrov D, Gillespie RG. The Tetragnatha kauaiensis Genome Sheds Light on the Origins of Genomic Novelty in Spiders. Genome Biol Evol 2021; 13:evab262. [PMID: 34849853 PMCID: PMC8693713 DOI: 10.1093/gbe/evab262] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Accepted: 11/22/2021] [Indexed: 01/07/2023] Open
Abstract
Spiders (Araneae) have a diverse spectrum of morphologies, behaviors, and physiologies. Attempts to understand the genomic basis of this diversity are often hindered by their large, heterozygous, and AT-rich genomes with high repeat content, resulting in highly fragmented, poor-quality assemblies. As a result, the key attributes of spider genomes, including gene family evolution, repeat content, and gene function, remain poorly understood. Here, we used Illumina and Dovetail Chicago technologies to sequence the genome of the long-jawed spider Tetragnatha kauaiensis, producing an assembly distributed along 3,925 scaffolds with an N50 of ∼2 Mb. Using comparative genomics tools, we explore genome evolution across available spider assemblies. Our findings suggest that the previously reported and vast genome size variation in spiders is linked to the different representation and number of transposable elements. Using statistical tools to uncover gene-family level evolution, we find expansions associated with the sensory perception of taste, immunity, and metabolism. In addition, we report strikingly different histories of chemosensory, venom, and silk gene families, with the first two evolving much earlier, affected by the ancestral whole genome duplication in Arachnopulmonata (∼450 Ma) and exhibiting higher numbers. Together, our findings reveal that spider genomes are highly variable and that genomic novelty may have been driven by the burst of an ancient whole genome duplication, followed by gene family and transposable element expansion.
Affiliation(s)
- José Cerca
- Berkeley Evolab, Department of Environmental Science, Policy, and Management, UC Berkeley, California, USA
- Frontiers in Evolutionary Zoology, Natural History Museum, University of Oslo, Norway
- Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology, Trondheim, Norway
- Ellie E Armstrong
- Berkeley Evolab, Department of Environmental Science, Policy, and Management, UC Berkeley, California, USA
- Department of Biology, Stanford University, California, USA
- Joel Vizueta
- Departament de Genètica, Microbiologia i Estadística & Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Denmark
- Rosa Fernández
- Institute of Evolutionary Biology (CSIC—Universitat Pompeu Fabra), Barcelona, Spain
- Dimitar Dimitrov
- Department of Natural History, University Museum of Bergen, University of Bergen, Norway
- Bent Petersen
- Section for Evolutionary Genomics, The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
- Centre of Excellence for Omics-Driven Computational Biodiscovery, Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
- Stefan Prost
- Central Research Laboratories, Natural History Museum Vienna, Vienna, Austria
- University of Veterinary Medicine, Konrad Lorenz Institute of Ethology, Vienna, Austria
- South African National Biodiversity Institute, National Zoological Garden, Pretoria, South Africa
- Julio Rozas
- Departament de Genètica, Microbiologia i Estadística & Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
- Dmitri Petrov
- Department of Biology, Stanford University, California, USA
- Rosemary G Gillespie
- Berkeley Evolab, Department of Environmental Science, Policy, and Management, UC Berkeley, California, USA
|
42
|
A high-quality fungal genome assembly resolved from a sample accidentally contaminated by multiple taxa. Biotechniques 2021; 72:39-50. [PMID: 34846173 DOI: 10.2144/btn-2021-0097] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/23/2022] Open
Abstract
Contamination in sequenced genomes is a relatively common problem and several methods to remove non-target sequences have been devised. Typically, the target and contaminating organisms reside in different kingdoms, simplifying their separation. The authors present the case of a genome for the ascomycete fungus Teratosphaeria eucalypti, contaminated by another ascomycete fungus and a bacterium. Approaching the problem as a low-complexity metagenomics project, the authors used two available software programs, BlobToolKit and anvi'o, to filter the contaminated genome. Both the de novo and reference-assisted approaches yielded a high-quality draft genome assembly for the target fungus. Incorporating reference sequences increased assembly completeness and visualization elucidated previously unknown genome features. The authors suggest that visualization should be routine in any sequencing project, regardless of suspected contamination.
|
43
|
Murai Y, Yagi-Utsumi M, Fujiwara M, Tanaka S, Tomita M, Kato K, Arakawa K. Multiomics study of a heterotardigrade, Echinisicus testudo, suggests the possibility of convergent evolution of abundant heat-soluble proteins in Tardigrada. BMC Genomics 2021; 22:813. [PMID: 34763673 PMCID: PMC8582207 DOI: 10.1186/s12864-021-08131-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 11/20/2020] [Accepted: 10/28/2021] [Indexed: 11/13/2022] Open
Abstract
Background: Many limno-terrestrial tardigrades can enter an ametabolic state, known as anhydrobiosis, upon desiccation, in which the animals can withstand extreme environments. Through genomics studies, molecular components of anhydrobiosis are beginning to be elucidated, such as the expansion of oxidative stress response genes, loss of stress signaling pathways, and gain of tardigrade-specific heat-soluble protein families designated CAHS and SAHS. However, to date, studies have predominantly investigated the class Eutardigrada, and molecular mechanisms in the remaining class, Heterotardigrada, remain elusive. To address this gap in the research, we report a multiomics study of the heterotardigrade Echiniscus testudo, one of the most desiccation-tolerant species, which is not yet culturable in laboratory conditions. Results: In order to elucidate the molecular basis of anhydrobiosis in E. testudo, we employed a multi-omics strategy encompassing genome sequencing, differential transcriptomics, and proteomics. Using an ultra-low input library sequencing protocol from a single specimen, we sequenced and assembled the 153.7 Mbp genome, annotated using RNA-Seq data. None of the previously identified tardigrade-specific abundant heat-soluble genes was conserved, while the loss and expansion of existing pathways were partly shared. Furthermore, we identified two families of novel abundant heat-soluble proteins, which we named E. testudo Abundant Heat Soluble (EtAHS), that are predicted to contain large stretches of disordered regions. Like the AHS families in Eutardigrada, EtAHS shows structural changes from random coil to alpha-helix as the water content was decreased in vitro. These characteristics of EtAHS proteins are analogous to those of CAHS in eutardigrades, while there is no conservation at the sequence level.
Conclusions: Our results suggest that Heterotardigrada have partly shared but distinct anhydrobiosis machinery compared with Eutardigrada, possibly due to convergent evolution within Tardigrada. Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-021-08131-x.
Affiliation(s)
- Yumi Murai
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa, Japan
- Maho Yagi-Utsumi
- Exploratory Research Center on Life and Living Systems, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya, Aichi, Japan
- Masayuki Fujiwara
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa, Japan
- Sae Tanaka
- Exploratory Research Center on Life and Living Systems, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa, Japan
- Faculty of Environment and Information Studies, Keio University, Fujisawa, Kanagawa, Japan
- Koichi Kato
- Exploratory Research Center on Life and Living Systems, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya, Aichi, Japan
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa, Japan
- Exploratory Research Center on Life and Living Systems, National Institutes of Natural Sciences, Okazaki, Aichi, Japan
- Faculty of Environment and Information Studies, Keio University, Fujisawa, Kanagawa, Japan
|
44
|
Lupo V, Van Vlierberghe M, Vanderschuren H, Kerff F, Baurain D, Cornet L. Contamination in Reference Sequence Databases: Time for Divide-and-Rule Tactics. Front Microbiol 2021; 12:755101. [PMID: 34745061 PMCID: PMC8570097 DOI: 10.3389/fmicb.2021.755101] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 08/07/2021] [Accepted: 10/04/2021] [Indexed: 11/24/2022] Open
Abstract
Contaminating sequences in public genome databases are a pervasive issue with potentially far-reaching consequences. This problem has attracted much attention in the recent literature and many different tools are now available to detect contaminants. Although these methods are based on diverse algorithms that can sometimes produce widely different estimates of the contamination level, the majority of genomic studies rely on a single method of detection, which represents a risk of systematic error. In this work, we used two orthogonal methods to assess the level of contamination among National Center for Biotechnology Information Reference Sequence Database (RefSeq) bacterial genomes. First, we applied the most popular solution, CheckM, which is based on gene markers. We then complemented this approach with a genome-wide method, termed Physeter, which now implements a k-folds algorithm to avoid inaccurate detection due to potential contamination of the reference database. We demonstrate that CheckM cannot currently be applied to all available genomes and bacterial groups. While it performed well on the majority of RefSeq genomes, it produced dubious results for 12,326 organisms. Among those, Physeter identified 239 contaminated genomes that had been missed by CheckM. In conclusion, we emphasize the importance of using multiple methods of detection while providing an upgrade of our own detection tool, Physeter, which minimizes incorrect contamination estimates in the context of unavoidably contaminated reference databases.
Affiliation(s)
- Valérian Lupo
- InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, Liège, Belgium
- InBioS, Center for Protein Engineering, University of Liège, Liège, Belgium
- Mick Van Vlierberghe
- InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, Liège, Belgium
- Hervé Vanderschuren
- Plant Genetics, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, Liège, Belgium
- Frédéric Kerff
- InBioS, Center for Protein Engineering, University of Liège, Liège, Belgium
- Denis Baurain
- InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, Liège, Belgium
- Luc Cornet
- InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, Liège, Belgium
- Plant Genetics, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, Liège, Belgium
|
45
|
Wang Y, Yuan H, Huang J, Li C. Inline index helped in cleaning up data contamination generated during library preparation and the subsequent steps. Mol Biol Rep 2021; 49:385-392. [PMID: 34716505 DOI: 10.1007/s11033-021-06884-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 08/23/2021] [Accepted: 09/23/2021] [Indexed: 11/24/2022]
Abstract
BACKGROUND: High-throughput sequencing involves library preparation and amplification steps, which may induce contamination across samples or between samples and the environment. METHODS: We tested the effect of applying an inline-index strategy, in which DNA indices of 6 bp were added to both ends of the inserts at the ligation step of library prep, for resolving the data contamination problem. RESULTS: Our results showed that the contamination ranged from 0.29 to 1.25% in one experiment and from 0.83 to 27.01% in the other. We also found that, besides cross-contamination between samples, contamination could be environmental or come from reagents. CONCLUSIONS: The inline-index method is a useful experimental design to clean up the data and address the contamination problem that has been plaguing high-throughput sequencing data in many applications.
Affiliation(s)
- Ying Wang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai Ocean University, Shanghai, 201306, China
- Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai Ocean University, Shanghai, 201306, China
- Hao Yuan
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai Ocean University, Shanghai, 201306, China
- Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai Ocean University, Shanghai, 201306, China
- Junman Huang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai Ocean University, Shanghai, 201306, China
- Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai Ocean University, Shanghai, 201306, China
- Chenhong Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai Ocean University, Shanghai, 201306, China
- Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai Ocean University, Shanghai, 201306, China
|
46
|
Wan X, Saito JA, Hou S, Geib SM, Yuryev A, Higa LM, Womersley CZ, Alam M. The Aphelenchus avenae genome highlights evolutionary adaptation to desiccation. Commun Biol 2021; 4:1232. [PMID: 34711923 PMCID: PMC8553787 DOI: 10.1038/s42003-021-02778-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/17/2021] [Accepted: 10/09/2021] [Indexed: 02/08/2023] Open
Abstract
Some organisms can withstand complete body water loss (losing up to 99% of body water) and stay in an ametabolic state for decades until rehydration, which is known as anhydrobiosis. Few multicellular eukaryotes in their adult stage can withstand life without water. We still have an incomplete understanding of the mechanism for metazoan survival of anhydrobiosis. Here we report the 255-Mb genome of Aphelenchus avenae, which can endure zero relative humidity for years. Gene duplications arose genome-wide and contributed to the expansion and diversification of 763 kinases, which represents the second largest metazoan kinome to date. Transcriptome analyses of the ametabolic state of A. avenae indicate the elevation of ATP level for global recycling of macromolecules and enhancement of autophagy in the early stage of anhydrobiosis. We catalogue 74 species-specific intrinsically disordered proteins, which may help A. avenae survive desiccation stress. Our findings refine the molecular basis that evolved for survival of extreme water loss and open the way for discovering new anti-desiccation strategies.
Affiliation(s)
- Xuehua Wan
- Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI, USA
- TEDA Institute of Biological Sciences and Biotechnology, Nankai University, Tianjin, P. R. China
- Jennifer A Saito
- Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI, USA
- Shaobin Hou
- Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI, USA
- Scott M Geib
- Tropical Crop and Commodity Protection Research Unit, USDA-ARS Pacific Basin Agricultural Research Center, Hilo, HI, USA
- Anton Yuryev
- Elsevier Life Sciences Solutions, Rockville, MD, USA
- Lynne M Higa
- School of Life Sciences, University of Hawaii, Honolulu, HI, USA
- Maqsudul Alam
- Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI, USA
|
47
|
Cooley NP, Wright ES. Accurate annotation of protein coding sequences with IDTAXA. NAR Genom Bioinform 2021; 3:lqab080. [PMID: 34541527 PMCID: PMC8445202 DOI: 10.1093/nargab/lqab080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/17/2021] [Revised: 07/07/2021] [Accepted: 08/25/2021] [Indexed: 11/12/2022] Open
Abstract
The observed diversity of protein coding sequences continues to increase far more rapidly than knowledge of their functions, making classification algorithms essential for assigning a function to proteins using only their sequence. Most pipelines for annotating proteins rely on searches for homologous sequences in databases of previously annotated proteins using BLAST or HMMER. Here, we develop a new approach for classifying proteins into a taxonomy of functions and demonstrate its utility for genome annotation. Our algorithm, IDTAXA, was more accurate than BLAST or HMMER at assigning sequences to KEGG ortholog groups. Moreover, IDTAXA correctly avoided classifying sequences with novel functions to existing groups, which is a common error mode for classification approaches that rely on E-values as a proxy for confidence. We demonstrate IDTAXA's utility for annotating eukaryotic and prokaryotic genomes by assigning functions to proteins within a multi-level ontology and applied IDTAXA to detect genome contamination in eukaryotic genomes. Finally, we re-annotated 8604 microbial genomes with known antibiotic resistance phenotypes to discover two novel associations between proteins and antibiotic resistance. IDTAXA is available as a web tool (http://DECIPHER.codes/Classification.html) or as part of the open source DECIPHER R package from Bioconductor.
Affiliation(s)
- Nicholas P Cooley
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, USA
- Erik S Wright
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, USA
- Center for Evolutionary Biology and Medicine, Pittsburgh, PA 15219, USA
|
48
|
Verster KI, Tarnopol RL, Akalu SM, Whiteman NK. Horizontal Transfer of Microbial Toxin Genes to Gall Midge Genomes. Genome Biol Evol 2021; 13:6358723. [PMID: 34450656 PMCID: PMC8455502 DOI: 10.1093/gbe/evab202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Accepted: 08/22/2021] [Indexed: 12/26/2022] Open
Abstract
A growing body of evidence has underscored the role of horizontal gene transfer (HGT) in animal evolution. Previously, we discovered the horizontal transfer of the gene encoding the eukaryotic genotoxin cytolethal distending toxin B (cdtB) from the pea aphid Acyrthosiphon pisum secondary endosymbiont (APSE) phages to drosophilid and aphid nuclear genomes. Here, we report cdtB in the nuclear genome of the gall-forming "swede midge" Contarinia nasturtii (Diptera: Cecidomyiidae) via HGT. We searched all available gall midge genome sequences for evidence of APSE-to-insect HGT events and found five toxin genes (aip56, cdtB, lysozyme, rhs, and sltxB) transferred horizontally to cecidomyiid nuclear genomes. Surprisingly, phylogenetic analyses of HGT candidates indicated APSE phages were often not the ancestral donor lineage of the toxin gene to cecidomyiids. We used a phylogenetic signal statistic to test a transfer-by-proximity hypothesis for animal HGT, which suggested that microbe-to-insect HGT was more likely between taxa that share environments than those from different environments. Many of the toxins we found in midge genomes target eukaryotic cells, and catalytic residues important for toxin function are conserved in insect copies. This class of horizontally transferred, eukaryotic cell-targeting genes is potentially important in insect adaptation.
Affiliation(s)
- Kirsten I Verster
- Department of Integrative Biology, University of California, Berkeley, California, USA
- Rebecca L Tarnopol
- Department of Plant & Microbial Biology, University of California, Berkeley, California, USA
- Saron M Akalu
- Department of Integrative Biology, University of California, Berkeley, California, USA
- Noah K Whiteman
- Department of Integrative Biology, University of California, Berkeley, California, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
|
49
|
Rachtman E, Bafna V, Mirarab S. CONSULT: accurate contamination removal using locality-sensitive hashing. NAR Genom Bioinform 2021; 3:lqab071. [PMID: 34377979 PMCID: PMC8340999 DOI: 10.1093/nargab/lqab071] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 03/10/2021] [Revised: 06/30/2021] [Accepted: 07/19/2021] [Indexed: 12/27/2022] Open
Abstract
A fundamental question appears in many bioinformatics applications: Does a sequencing read belong to a large dataset of genomes from some broad taxonomic group, even when the closest match in the set is evolutionarily divergent from the query? For example, low-coverage genome sequencing (skimming) projects either assemble the organelle genome or compute genomic distances directly from unassembled reads. Using unassembled reads requires contamination detection because samples often include reads from unintended groups of species. Similarly, assembling the organelle genome requires distinguishing organelle reads from nuclear reads. While k-mer-based methods have shown promise in read-matching, prior studies have shown that existing methods are insufficiently sensitive for contamination detection. Here, we introduce a new read-matching tool called CONSULT that tests whether k-mers from a query fall within a user-specified distance of the reference dataset using locality-sensitive hashing. Taking advantage of the large-memory machines available nowadays, CONSULT libraries accommodate tens of thousands of microbial species. Our results show that CONSULT has higher true-positive and lower false-positive rates of contamination detection than leading methods such as Kraken-II and improves distance calculation from genome skims. We also demonstrate that CONSULT can distinguish organelle reads from nuclear reads, leading to dramatic improvements in skim-based mitochondrial assemblies.
Affiliation(s)
- Eleonora Rachtman
- Bioinformatics and Systems Biology Graduate Program, UC San Diego, CA 92093, USA
- Vineet Bafna
- Department of Computer Science and Engineering, UC San Diego, CA 92093, USA
- Siavash Mirarab
- Department of Electrical and Computer Engineering, UC San Diego, CA 92093, USA
|
50
|
Hara Y, Shibahara R, Kondo K, Abe W, Kunieda T. Parallel evolution of trehalose production machinery in anhydrobiotic animals via recurrent gene loss and horizontal transfer. Open Biol 2021; 11:200413. [PMID: 34255978 PMCID: PMC8277472 DOI: 10.1098/rsob.200413] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/13/2022] Open
Abstract
Trehalose is a versatile non-reducing sugar. In some animal groups possessing its intrinsic production machinery, it is used as a potent protectant against environmental stresses, as well as blood sugar. However, the trehalose biosynthesis genes remain unidentified in the large majority of metazoan phyla, including vertebrates. To uncover the evolutionary history of trehalose production machinery in metazoans, we scrutinized the available genome resources and identified bifunctional trehalose-6-phosphate synthase-trehalose-6-phosphate phosphatase (TPS–TPP) genes in various taxa. The scan included our newly sequenced genome assembly of a desiccation-tolerant tardigrade, Paramacrobiotus sp. TYO, revealing that this species retains TPS–TPP genes activated upon desiccation. Phylogenetic analyses identified a monophyletic group comprising many of the metazoan TPS–TPP genes, namely ‘pan-metazoan’ genes, that were acquired in the early ancestors of metazoans. Furthermore, coordination of our results with the previous horizontal gene transfer studies illuminated that the two tardigrade lineages, nematodes and bdelloid rotifers, all of which include desiccation-tolerant species, independently acquired the TPS–TPP homologues via horizontal transfer accompanied by loss of the ‘pan-metazoan’ genes. Our results indicate that the parallel evolution of trehalose synthesis via recurrent loss and horizontal transfer of the biosynthesis genes resulted in the acquisition and/or augmentation of anhydrobiotic lives in animals.
Affiliation(s)
- Yuichiro Hara
- Research Center for Genome and Medical Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
- Reira Shibahara
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
- Koyuki Kondo
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
- Wataru Abe
- Department of Biology, Dokkyo Medical University, Tochigi, Japan
- Takekazu Kunieda
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
|