1
|
Bálint B, Merényi Z, Hegedüs B, Grigoriev IV, Hou Z, Földi C, Nagy LG. ContScout: sensitive detection and removal of contamination from annotated genomes. Nat Commun 2024; 15:936. [PMID: 38296951 PMCID: PMC10831095 DOI: 10.1038/s41467-024-45024-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 01/08/2024] [Indexed: 02/02/2024] Open
Abstract
Contamination of genomes is an increasingly recognized problem affecting several downstream applications, from comparative evolutionary genomics to metagenomics. Here we introduce ContScout, a precise tool for eliminating foreign sequences from annotated genomes. It achieves high specificity and sensitivity on synthetic benchmark data even when the contaminant is a closely related species, outperforms competing tools, and can distinguish horizontal gene transfer from contamination. A screen of 844 eukaryotic genomes for contamination identified bacteria as the most common source, followed by fungi and plants. Furthermore, we show that contaminants in ancestral genome reconstructions lead to erroneous early origins of genes and inflate gene loss rates, leading to a false notion of complex ancestral genomes. Taken together, we offer here a tool for sensitive removal of foreign proteins, identify and remove contaminants from diverse eukaryotic genomes and evaluate their impact on phylogenomic analyses.
Collapse
Affiliation(s)
- Balázs Bálint
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary
| | - Zsolt Merényi
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary
| | - Botond Hegedüs
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary
| | - Igor V Grigoriev
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, 94720, USA
| | - Zhihao Hou
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary
- Doctoral School of Biology, Faculty of Science and Informatics, University of Szeged, Szeged, 6720, Hungary
| | - Csenge Földi
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary
- Doctoral School of Biology, Faculty of Science and Informatics, University of Szeged, Szeged, 6720, Hungary
| | - László G Nagy
- Synthetic and Systems Biology Unit, HUN-REN Biological Research Centre, Szeged, Szeged, 6726, Hungary.
| |
Collapse
|
2
|
Knyshov A, Gordon ERL, Masonick PK, Castillo S, Forero D, Hoey-Chamberlain R, Hwang WS, Johnson KP, Lemmon AR, Moriarty Lemmon E, Standring S, Zhang J, Weirauch C. Chromosome-Aware Phylogenomics of Assassin Bugs (Hemiptera: Reduvioidea) Elucidates Ancient Gene Conflict. Mol Biol Evol 2023; 40:msad168. [PMID: 37494292 PMCID: PMC10411492 DOI: 10.1093/molbev/msad168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 07/15/2023] [Accepted: 07/19/2023] [Indexed: 07/28/2023] Open
Abstract
Though the phylogenetic signal of loci on sex chromosomes can differ from those on autosomes, chromosomal-level genome assemblies for nonvertebrates are still relatively scarce and conservation of chromosomal gene content across deep phylogenetic scales has therefore remained largely unexplored. We here assemble a uniquely large and diverse set of samples (17 anchored hybrid enrichment, 24 RNA-seq, and 70 whole-genome sequencing samples of variable depth) for the medically important assassin bugs (Reduvioidea). We assess the performance of genes based on multiple features (e.g., nucleotide vs. amino acid, nuclear vs. mitochondrial, and autosomal vs. X chromosomal) and employ different methods (concatenation and coalescence analyses) to reconstruct the unresolved phylogeny of this diverse (∼7,000 spp.) and old (>180 Ma) group. Our results show that genes on the X chromosome are more likely to have discordant phylogenies than those on autosomes. We find that the X chromosome conflict is driven by high gene substitution rates that impact the accuracy of phylogenetic inference. However, gene tree clustering showed strong conflict even after discounting variable third codon positions. Alternative topologies were not particularly enriched for sex chromosome loci, but spread across the genome. We conclude that binning genes to autosomal or sex chromosomes may result in a more accurate picture of the complex evolutionary history of a clade.
Collapse
Affiliation(s)
- Alexander Knyshov
- Department of Entomology, University of California, Riverside, CA, USA
| | - Eric R L Gordon
- Ecology and Evolutionary Biology Department, University of Connecticut, Storrs, CT, USA
| | - Paul K Masonick
- Department of Entomology, University of California, Riverside, CA, USA
| | | | - Dimitri Forero
- Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Bogota, Colombia
| | | | - Wei Song Hwang
- Lee Kong Chian Natural History Museum, National University of Singapore, Queenstown, Singapore
| | - Kevin P Johnson
- Illinois Natural History Survey, Prairie Research Institute, University of Illinois, Champaign, IL, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Tallahassee, FL, USA
| | | | | | - Junxia Zhang
- Key Laboratory of Zoological Systematics and Application of Hebei Province, Institute of Life Science and Green Development, College of Life Sciences, Hebei University, Baoding, Hebei, China
| | | |
Collapse
|
3
|
Lack of host phylogenetic structure in the gut bacterial communities of New Zealand cicadas and their interspecific hybrids. Sci Rep 2022; 12:20559. [PMID: 36446872 PMCID: PMC9709078 DOI: 10.1038/s41598-022-24723-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 11/18/2022] [Indexed: 11/30/2022] Open
Abstract
Host-microbe interactions are intimately linked to eukaryotic evolution, particularly in sap-sucking insects that often rely on obligate microbial symbionts for nutrient provisioning. Cicadas (Cicadidae: Auchenorrhyncha) specialize on xylem fluid and derive many essential amino acids and vitamins from intracellular bacteria or fungi (Hodgkinia, Sulcia, and Ophiocordyceps) that are propagated via transmission from mothers to offspring. Despite the beneficial role of these non-gut symbionts in nutrient provisioning, the role of beneficial microbiota within the gut remains unclear. Here, we investigate the relative abundance and impact of host phylogeny and ecology on gut microbial diversity in cicadas using 16S ribosomal RNA gene amplicon sequencing data from 197 wild-collected cicadas and new mitochondrial genomes across 38 New Zealand cicada species, including natural hybrids between one pair of two species. We find low abundance and a lack of phylogenetic structure and hybrid effects but a significant role of elevation in explaining variation in gut microbiota.
Collapse
|