1
|
Hatmaker EA, Barber AE, Drott MT, Sauters TJC, Alastruey-Izquierdo A, Garcia-Hermoso D, Kurzai O, Rokas A. Pathogenicity is associated with population structure in a fungal pathogen of humans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.05.602241. [PMID: 39026826 PMCID: PMC11257439 DOI: 10.1101/2024.07.05.602241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Aspergillus flavus is a clinically and agriculturally important saprotrophic fungus responsible for severe human infections and extensive crop losses. We analyzed genomic data from 250 (95 clinical and 155 environmental) A. flavus isolates from 9 countries, including 70 newly sequenced clinical isolates, to examine population and pan-genome structure and their relationship to pathogenicity. We identified five A. flavus populations, including a new population, D, corresponding to distinct clades in the genome-wide phylogeny. Strikingly, > 75% of clinical isolates were from population D. Accessory genes, including genes within biosynthetic gene clusters, were significantly more common in some populations but rare in others. Population D was enriched for genes associated with zinc ion binding, lipid metabolism, and certain types of hydrolase activity. In contrast to the major human pathogen Aspergillus fumigatus , A. flavus pathogenicity in humans is strongly associated with population structure, making it a great system for investigating how population-specific genes contribute to pathogenicity.
Collapse
|
2
|
Gluck-Thaler E, Vogan A. Systematic identification of cargo-mobilizing genetic elements reveals new dimensions of eukaryotic diversity. Nucleic Acids Res 2024; 52:5496-5513. [PMID: 38686785 PMCID: PMC11162782 DOI: 10.1093/nar/gkae327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 03/12/2024] [Accepted: 04/15/2024] [Indexed: 05/02/2024] Open
Abstract
Cargo-mobilizing mobile elements (CMEs) are genetic entities that faithfully transpose diverse protein coding sequences. Although common in bacteria, we know little about eukaryotic CMEs because no appropriate tools exist for their annotation. For example, Starships are giant fungal CMEs whose functions are largely unknown because they require time-intensive manual curation. To address this knowledge gap, we developed starfish, a computational workflow for high-throughput eukaryotic CME annotation. We applied starfish to 2 899 genomes of 1 649 fungal species and found that starfish recovers known Starships with 95% combined precision and recall while expanding the number of annotated elements ten-fold. Extant Starship diversity is partitioned into 11 families that differ in their enrichment patterns across fungal classes. Starship cargo changes rapidly such that elements from the same family differ substantially in their functional repertoires, which are predicted to contribute to diverse biological processes such as metabolism. Many elements have convergently evolved to insert into 5S rDNA and AT-rich sequence while others integrate into random locations, revealing both specialist and generalist strategies for persistence. Our work establishes a framework for advancing mobile element biology and provides the means to investigate an emerging dimension of eukaryotic genetic diversity, that of genomes within genomes.
Collapse
Affiliation(s)
- Emile Gluck-Thaler
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Neuchâtel 2000, Switzerland
- Department of Plant Pathology, University of Wisconsin-Madison, Madison, WI 53706, USA
- Wisconsin Institute for Discovery, Madison, WI 53706, USA
| | - Aaron A Vogan
- Systematic Biology, Department of Organismal Biology, Uppsala University, Uppsala, 752 36, Sweden
| |
Collapse
|
3
|
Gangurde SS, Korani W, Bajaj P, Wang H, Fountain JC, Agarwal G, Pandey MK, Abbas HK, Chang PK, Holbrook CC, Kemerait RC, Varshney RK, Dutta B, Clevenger JP, Guo B. Aspergillus flavus pangenome (AflaPan) uncovers novel aflatoxin and secondary metabolite associated gene clusters. BMC PLANT BIOLOGY 2024; 24:354. [PMID: 38693487 PMCID: PMC11061970 DOI: 10.1186/s12870-024-04950-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 03/26/2024] [Indexed: 05/03/2024]
Abstract
BACKGROUND Aspergillus flavus is an important agricultural and food safety threat due to its production of carcinogenic aflatoxins. It has high level of genetic diversity that is adapted to various environments. Recently, we reported two reference genomes of A. flavus isolates, AF13 (MAT1-2 and highly aflatoxigenic isolate) and NRRL3357 (MAT1-1 and moderate aflatoxin producer). Where, an insertion of 310 kb in AF13 included an aflatoxin producing gene bZIP transcription factor, named atfC. Observations of significant genomic variants between these isolates of contrasting phenotypes prompted an investigation into variation among other agricultural isolates of A. flavus with the goal of discovering novel genes potentially associated with aflatoxin production regulation. Present study was designed with three main objectives: (1) collection of large number of A. flavus isolates from diverse sources including maize plants and field soils; (2) whole genome sequencing of collected isolates and development of a pangenome; and (3) pangenome-wide association study (Pan-GWAS) to identify novel secondary metabolite cluster genes. RESULTS Pangenome analysis of 346 A. flavus isolates identified a total of 17,855 unique orthologous gene clusters, with mere 41% (7,315) core genes and 59% (10,540) accessory genes indicating accumulation of high genomic diversity during domestication. 5,994 orthologous gene clusters in accessory genome not annotated in either the A. flavus AF13 or NRRL3357 reference genomes. Pan-genome wide association analysis of the genomic variations identified 391 significant associated pan-genes associated with aflatoxin production. Interestingly, most of the significantly associated pan-genes (94%; 369 associations) belonged to accessory genome indicating that genome expansion has resulted in the incorporation of new genes associated with aflatoxin and other secondary metabolites. CONCLUSION In summary, this study provides complete pangenome framework for the species of Aspergillus flavus along with associated genes for pathogen survival and aflatoxin production. The large accessory genome indicated large genome diversity in the species A. flavus, however AflaPan is a closed pangenome represents optimum diversity of species A. flavus. Most importantly, the newly identified aflatoxin producing gene clusters will be a new source for seeking aflatoxin mitigation strategies and needs new attention in research.
Collapse
Affiliation(s)
- Sunil S Gangurde
- Department of Plant Pathology, University of Georgia, Tifton, GA, 31793, USA
- Crop Protection and Management Research Unit, USDA-ARS, Tifton, GA, 31793, USA
| | - Walid Korani
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Prasad Bajaj
- International Crop Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, Telangana, India
| | - Hui Wang
- Department of Plant Pathology, University of Georgia, Tifton, GA, 31793, USA
| | - Jake C Fountain
- Department of Plant Pathology, University of Georgia, Griffin, GA, 30223, USA
| | - Gaurav Agarwal
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48823, USA
| | - Manish K Pandey
- International Crop Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, Telangana, India.
| | - Hamed K Abbas
- Biological Control of Pests Research Unit, USDA-ARS, Stoneville, MS, 38776, USA
| | - Perng-Kuang Chang
- Southern Regional Research Center, USDA-ARS, New Orleans, LA, 70124, USA
| | - C Corley Holbrook
- Crop Protection and Management Research Unit, USDA-ARS, Tifton, GA, 31793, USA
| | - Robert C Kemerait
- Department of Plant Pathology, University of Georgia, Tifton, GA, 31793, USA
| | - Rajeev K Varshney
- WA State Biotechnology Centre, Centre for Crop and Food innovation, Food Futures Institute, Murdoch University, Murdoch, WA, 6150, Australia
| | - Bhabesh Dutta
- Department of Plant Pathology, University of Georgia, Tifton, GA, 31793, USA
| | - Josh P Clevenger
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA.
| | - Baozhu Guo
- Crop Protection and Management Research Unit, USDA-ARS, Tifton, GA, 31793, USA.
| |
Collapse
|
4
|
Joubert PM, Krasileva KV. Distinct genomic contexts predict gene presence-absence variation in different pathotypes of Magnaporthe oryzae. Genetics 2024; 226:iyae012. [PMID: 38290434 PMCID: PMC10990425 DOI: 10.1093/genetics/iyae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 11/28/2023] [Accepted: 12/19/2023] [Indexed: 02/01/2024] Open
Abstract
Fungi use the accessory gene content of their pangenomes to adapt to their environments. While gene presence-absence variation contributes to shaping accessory gene reservoirs, the genomic contexts that shape these events remain unclear. Since pangenome studies are typically species-wide and do not analyze different populations separately, it is yet to be uncovered whether presence-absence variation patterns and mechanisms are consistent across populations. Fungal plant pathogens are useful models for studying presence-absence variation because they rely on it to adapt to their hosts, and members of a species often infect distinct hosts. We analyzed gene presence-absence variation in the blast fungus, Magnaporthe oryzae (syn. Pyricularia oryzae), and found that presence-absence variation genes involved in host-pathogen and microbe-microbe interactions may drive the adaptation of the fungus to its environment. We then analyzed genomic and epigenomic features of presence-absence variation and observed that proximity to transposable elements, gene GC content, gene length, expression level in the host, and histone H3K27me3 marks were different between presence-absence variation genes and conserved genes. We used these features to construct a model that was able to predict whether a gene is likely to experience presence-absence variation with high precision (86.06%) and recall (92.88%) in M. oryzae. Finally, we found that presence-absence variation genes in the rice and wheat pathotypes of M. oryzae differed in their number and their genomic context. Our results suggest that genomic and epigenomic features of gene presence-absence variation can be used to better understand and predict fungal pangenome evolution. We also show that substantial intra-species variation can exist in these features.
Collapse
Affiliation(s)
- Pierre M Joubert
- Department of Plant and Microbial Biology, University of California-Berkeley, Berkeley, CA 94720, USA
- Center for Computational Biology, University of California-Berkeley, Berkeley, CA 94720, USA
| | - Ksenia V Krasileva
- Department of Plant and Microbial Biology, University of California-Berkeley, Berkeley, CA 94720, USA
- Center for Computational Biology, University of California-Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
5
|
Wang M, Li X, Liu X, Hou X, He Y, Yu JH, Hu S, Yin H, Xie BB. Annotation of 2,507 Saccharomyces cerevisiae genomes. Microbiol Spectr 2024; 12:e0358223. [PMID: 38488392 PMCID: PMC10986567 DOI: 10.1128/spectrum.03582-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 02/25/2024] [Indexed: 04/06/2024] Open
Abstract
Saccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Currently, a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain. However, these genomes are distributed at different websites and a large number of them are released without annotation information. To provide one complete annotated genome data resource, we collected 2,507 Saccharomyces cerevisiae genome assemblies and re-annotated 2,506 assemblies using a custom annotation pipeline, producing a total of 15,407,164 protein-coding gene models. With a custom pipeline, all these gene sequences were clustered into families. A total of 1,506 single-copy genes were selected as marker genes, which were then used to evaluate the genome completeness and base qualities of all assemblies. Pangenomic analyses were performed based on a selected subset of 847 medium-high-quality genomes. Statistical comparisons revealed a number of gene families showing copy number variations among different organism sources. To the authors' knowledge, this study represents the largest genome annotation project of S. cerevisiae so far, providing rich genomic resources for the future studies of the model organism S. cerevisiae and its relatives.IMPORTANCESaccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Though a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain, these genomes are distributed at different websites and most are released without annotation, hindering the efficient reuse of these genome resources. Here, we collected 2,507 genomes for Saccharomyces cerevisiae, performed genome annotation, and evaluated the genome qualities. All the obtained data have been deposited at public repositories and are freely accessible to the community. This study represents the largest genome annotation project of S. cerevisiae so far, providing one complete annotated genome data set for S. cerevisiae, an important workhorse for fundamental biology, biotechnology, and industry.
Collapse
Affiliation(s)
- Meng Wang
- Microbial Technology Institute and State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
| | - Xuan Li
- Microbial Technology Institute and State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
| | - Xian Liu
- Microbial Technology Institute and State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
| | - Xiaoping Hou
- State Key Laboratory of Biological Fermentation Engineering of Beer, Tsingtao Brewery Co., Ltd, Qingdao, China
| | - Yang He
- State Key Laboratory of Biological Fermentation Engineering of Beer, Tsingtao Brewery Co., Ltd, Qingdao, China
| | - Jun-Hong Yu
- State Key Laboratory of Biological Fermentation Engineering of Beer, Tsingtao Brewery Co., Ltd, Qingdao, China
| | - Shumin Hu
- State Key Laboratory of Biological Fermentation Engineering of Beer, Tsingtao Brewery Co., Ltd, Qingdao, China
| | - Hua Yin
- State Key Laboratory of Biological Fermentation Engineering of Beer, Tsingtao Brewery Co., Ltd, Qingdao, China
| | - Bin-Bin Xie
- Microbial Technology Institute and State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
| |
Collapse
|
6
|
Welgemoed T, Duong TA, Barnes I, Stukenbrock EH, Berger DK. Population genomic analyses suggest recent dispersal events of the pathogen Cercospora zeina into East and Southern African maize cropping systems. G3 (BETHESDA, MD.) 2023; 13:jkad214. [PMID: 37738420 PMCID: PMC10627275 DOI: 10.1093/g3journal/jkad214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 08/03/2023] [Accepted: 09/06/2023] [Indexed: 09/24/2023]
Abstract
A serious factor hampering global maize production is gray leaf spot disease. Cercospora zeina is one of the causative pathogens, but population genomics analysis of C. zeina is lacking. We conducted whole-genome Illumina sequencing of a representative set of 30 C. zeina isolates from Kenya and Uganda (East Africa) and Zambia, Zimbabwe, and South Africa (Southern Africa). Selection of the diverse set was based on microsatellite data from a larger collection of the pathogen. Pangenome analysis of the C. zeina isolates was done by (1) de novo assembly of the reads with SPAdes, (2) annotation with BRAKER, and (3) protein clustering with OrthoFinder. A published long-read assembly of C. zeina (CMW25467) from Zambia was included and annotated using the same pipeline. This analysis revealed 790 non-shared accessory and 10,677 shared core orthogroups (genes) between the 31 isolates. Accessory gene content was largely shared between isolates from all countries, with a few genes unique to populations from Southern Africa (32) or East Africa (6). There was a significantly higher proportion of effector genes in the accessory secretome (44%) compared to the core secretome (24%). PCA, ADMIXTURE, and phylogenetic analysis using a neighbor-net network indicated a population structure with a geographical subdivision between the East African isolates and the Southern African isolates, although gene flow was also evident. The small pangenome and partial population differentiation indicated recent dispersal of C. zeina into Africa, possibly from 2 regional founder populations, followed by recurrent gene flow owing to widespread maize production across sub-Saharan Africa.
Collapse
Affiliation(s)
- Tanya Welgemoed
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Private Bag X20, Hatfield 0028, South Africa
| | - Tuan A Duong
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Private Bag X20, Hatfield 0028, South Africa
| | - Irene Barnes
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Private Bag X20, Hatfield 0028, South Africa
| | - Eva H Stukenbrock
- Environmental Genomics, Christian-Albrechts University of Kiel, Am Botanischen Garten 1-11, Kiel 24118, Germany
- Max Planck Institute for Evolutionary Biology, August-Thienemann-Str. 2, Plön 24306, Germany
| | - Dave K Berger
- Department of Plant and Soil Sciences, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Private Bag X20, Hatfield 0028, South Africa
| |
Collapse
|
7
|
Saco A, Rey-Campos M, Gallardo-Escárate C, Gerdol M, Novoa B, Figueras A. Gene presence/absence variation in Mytilus galloprovincialis and its implications in gene expression and adaptation. iScience 2023; 26:107827. [PMID: 37744033 PMCID: PMC10514466 DOI: 10.1016/j.isci.2023.107827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/12/2023] [Accepted: 09/01/2023] [Indexed: 09/26/2023] Open
Abstract
Presence/absence variation (PAV) is a well-known phenomenon in prokaryotes that was described for the first time in bivalves in 2020 in Mytilus galloprovincialis. The objective of the present study was to further our understanding of the PAV phenomenon in mussel biology. The distribution of PAV was studied in a mussel chromosome-level genome assembly, revealing a widespread distribution but with hotspots of dispensability. Special attention was given to the effect of PAV in gene expression, since dispensable genes were found to be inherently subject to distortions due to their sparse distribution among individuals. Furthermore, the high expression and strong tissue specificity of some dispensable genes, such as myticins, strongly supported their biological relevance. The significant differences in the repertoire of dispensable genes associated with two geographically distinct populations suggest that PAV is involved in local adaptation. Overall, the PAV phenomenon would provide a key selective advantage at the population level.
Collapse
Affiliation(s)
- Amaro Saco
- Institute of Marine Research, Spanish National Research Council, Vigo, Spain
| | - Magalí Rey-Campos
- Institute of Marine Research, Spanish National Research Council, Vigo, Spain
| | | | - Marco Gerdol
- Department of Life Sciences, University of Trieste, Trieste, Italy
| | - Beatriz Novoa
- Institute of Marine Research, Spanish National Research Council, Vigo, Spain
| | - Antonio Figueras
- Institute of Marine Research, Spanish National Research Council, Vigo, Spain
| |
Collapse
|
8
|
O'Donnell S, Yue JX, Saada OA, Agier N, Caradec C, Cokelaer T, De Chiara M, Delmas S, Dutreux F, Fournier T, Friedrich A, Kornobis E, Li J, Miao Z, Tattini L, Schacherer J, Liti G, Fischer G. Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae. Nat Genet 2023; 55:1390-1399. [PMID: 37524789 PMCID: PMC10412453 DOI: 10.1038/s41588-023-01459-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 06/26/2023] [Indexed: 08/02/2023]
Abstract
Pangenomes provide access to an accurate representation of the genetic diversity of species, both in terms of sequence polymorphisms and structural variants (SVs). Here we generated the Saccharomyces cerevisiae Reference Assembly Panel (ScRAP) comprising reference-quality genomes for 142 strains representing the species' phylogenetic and ecological diversity. The ScRAP includes phased haplotype assemblies for several heterozygous diploid and polyploid isolates. We identified circa (ca.) 4,800 nonredundant SVs that provide a broad view of the genomic diversity, including the dynamics of telomere length and transposable elements. We uncovered frequent cases of complex aneuploidies where large chromosomes underwent large deletions and translocations. We found that SVs can impact gene expression near the breakpoints and substantially contribute to gene repertoire evolution. We also discovered that horizontally acquired regions insert at chromosome ends and can generate new telomeres. Overall, the ScRAP demonstrates the benefit of a pangenome in understanding genome evolution at population scale.
Collapse
Affiliation(s)
- Samuel O'Donnell
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Jia-Xing Yue
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Nicolas Agier
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Claudia Caradec
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Thomas Cokelaer
- Biomics Technological Platform, Center for Technological Resources and Research (C2RT), Institut Pasteur, Paris, France
- Bioinformatics and Biostatistics Hub, Computational Biology Department, Institut Pasteur, Paris, France
| | | | - Stéphane Delmas
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Fabien Dutreux
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Téo Fournier
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Etienne Kornobis
- Biomics Technological Platform, Center for Technological Resources and Research (C2RT), Institut Pasteur, Paris, France
- Bioinformatics and Biostatistics Hub, Computational Biology Department, Institut Pasteur, Paris, France
| | - Jing Li
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Zepu Miao
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
| | | | | | - Gianni Liti
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France.
| | - Gilles Fischer
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France.
| |
Collapse
|
9
|
Steenwyk JL, Knowles S, Bastos RW, Balamurugan C, Rinker D, Mead ME, Roberts CD, Raja HA, Li Y, Colabardini AC, de Castro PA, dos Reis TF, Canóvas D, Sanchez RL, Lagrou K, Torrado E, Rodrigues F, Oberlies NH, Zhou X, Goldman GH, Rokas A. Evolutionary origin, population diversity, and diagnostics for a cryptic hybrid pathogen. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.03.547508. [PMID: 37461539 PMCID: PMC10350022 DOI: 10.1101/2023.07.03.547508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]
Abstract
Cryptic fungal pathogens pose significant identification and disease management challenges due to their morphological resemblance to known pathogenic species while harboring genetic and (often) infectionrelevant trait differences. The cryptic fungal pathogen Aspergillus latus, an allodiploid hybrid originating from Aspergillus spinulosporus and an unknown close relative of Aspergillus quadrilineatus within section Nidulantes, remains poorly understood. The absence of accurate diagnostics for A. latus has led to misidentifications, hindering epidemiological studies and the design of effective treatment plans. We conducted an in-depth investigation of the genomes and phenotypes of 44 globally distributed isolates (41 clinical isolates and three type strains) from Aspergillus section Nidulantes. We found that 21 clinical isolates were A. latus; notably, standard methods of pathogen identification misidentified all A. latus isolates. The remaining isolates were identified as A. spinulosporus (8), A. quadrilineatus (1), or A. nidulans (11). Phylogenomic analyses shed light on the origin of A. latus, indicating one or two hybridization events gave rise to the species during the Miocene, approximately 15.4 to 8.8 million years ago. Characterizing the A. latus pangenome uncovered substantial genetic diversity within gene families and biosynthetic gene clusters. Transcriptomic analysis revealed that both parental genomes are actively expressed in nearly equal proportions and respond to environmental stimuli. Further investigation into infection-relevant chemical and physiological traits, including drug resistance profiles, growth under oxidative stress conditions, and secondary metabolite biosynthesis, highlight distinct phenotypic profiles of the hybrid A. latus compared to its parental and closely related species. Leveraging our comprehensive genomic and phenotypic analyses, we propose five genomic and phenotypic markers as diagnostics for A. latus species identification. These findings provide valuable insights into the evolutionary origin, genomic outcome, and phenotypic implications of hybridization in a cryptic fungal pathogen, thus enhancing our understanding of the underlying processes contributing to fungal pathogenesis. Furthermore, our study underscores the effectiveness of extensive genomic and phenotypic analyses as a promising approach for developing diagnostics applicable to future investigations of cryptic and emerging pathogens.
Collapse
Affiliation(s)
- Jacob L. Steenwyk
- Howards Hughes Medical Institute and the Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Vanderbilt University, Department of Biological Sciences, VU Station B #35–1634, Nashville, TN 37235, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Sonja Knowles
- Department of Chemistry & Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Rafael W. Bastos
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
- Department of Microbiology and Parasitology, Bioscience Center, Federal University of Rio Grande do Norte, Natal-RN, Brazil
| | - Charu Balamurugan
- Vanderbilt University, Department of Biological Sciences, VU Station B #35–1634, Nashville, TN 37235, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - David Rinker
- Vanderbilt University, Department of Biological Sciences, VU Station B #35–1634, Nashville, TN 37235, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Matthew E. Mead
- Vanderbilt University, Department of Biological Sciences, VU Station B #35–1634, Nashville, TN 37235, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Christopher D. Roberts
- Department of Chemistry & Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Huzefa A. Raja
- Department of Chemistry & Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Yuanning Li
- Institute of Marine Science and Technology, Shandong University, 72 Binhai Road, Qingdao 266237, China
| | - Ana Cristina Colabardini
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | - Patrícia Alves de Castro
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | - Thaila Fernanda dos Reis
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | - David Canóvas
- Departamento de Genética, Facultad de Biología, Universidad de Sevilla, Seville, Spain
| | - Rafael Luperini Sanchez
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | - Katrien Lagrou
- Department of Microbiology, Immunology and Transplantation, Katholieke Universiteit Leuven, 3000 Leuven, Belgium
- Department of Laboratory Medicine and National Reference Centre for Mycosis, University Hospitals Leuven, 3000 Leuven, Belgium
| | - Egídio Torrado
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, 4715-495 Braga, Portugal; ICVS/3B’s-PT Government Associate Laboratory, 4715-495 Braga, Portugal
| | - Fernando Rodrigues
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, 4715-495 Braga, Portugal; ICVS/3B’s-PT Government Associate Laboratory, 4715-495 Braga, Portugal
| | - Nicholas H. Oberlies
- Department of Chemistry & Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Xiaofan Zhou
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
| | - Gustavo H. Goldman
- Faculdade de Ciencias Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | - Antonis Rokas
- Vanderbilt University, Department of Biological Sciences, VU Station B #35–1634, Nashville, TN 37235, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| |
Collapse
|
10
|
Sierra-Patev S, Min B, Naranjo-Ortiz M, Looney B, Konkel Z, Slot JC, Sakamoto Y, Steenwyk JL, Rokas A, Carro J, Camarero S, Ferreira P, Molpeceres G, Ruiz-Dueñas FJ, Serrano A, Henrissat B, Drula E, Hughes KW, Mata JL, Ishikawa NK, Vargas-Isla R, Ushijima S, Smith CA, Donoghue J, Ahrendt S, Andreopoulos W, He G, LaButti K, Lipzen A, Ng V, Riley R, Sandor L, Barry K, Martínez AT, Xiao Y, Gibbons JG, Terashima K, Grigoriev IV, Hibbett D. A global phylogenomic analysis of the shiitake genus Lentinula. Proc Natl Acad Sci U S A 2023; 120:e2214076120. [PMID: 36848567 PMCID: PMC10013852 DOI: 10.1073/pnas.2214076120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Accepted: 12/22/2022] [Indexed: 03/01/2023] Open
Abstract
Lentinula is a broadly distributed group of fungi that contains the cultivated shiitake mushroom, L. edodes. We sequenced 24 genomes representing eight described species and several unnamed lineages of Lentinula from 15 countries on four continents. Lentinula comprises four major clades that arose in the Oligocene, three in the Americas and one in Asia-Australasia. To expand sampling of shiitake mushrooms, we assembled 60 genomes of L. edodes from China that were previously published as raw Illumina reads and added them to our dataset. Lentinula edodes sensu lato (s. lat.) contains three lineages that may warrant recognition as species, one including a single isolate from Nepal that is the sister group to the rest of L. edodes s. lat., a second with 20 cultivars and 12 wild isolates from China, Japan, Korea, and the Russian Far East, and a third with 28 wild isolates from China, Thailand, and Vietnam. Two additional lineages in China have arisen by hybridization among the second and third groups. Genes encoding cysteine sulfoxide lyase (lecsl) and γ-glutamyl transpeptidase (leggt), which are implicated in biosynthesis of the organosulfur flavor compound lenthionine, have diversified in Lentinula. Paralogs of both genes that are unique to Lentinula (lecsl 3 and leggt 5b) are coordinately up-regulated in fruiting bodies of L. edodes. The pangenome of L. edodes s. lat. contains 20,308 groups of orthologous genes, but only 6,438 orthogroups (32%) are shared among all strains, whereas 3,444 orthogroups (17%) are found only in wild populations, which should be targeted for conservation.
Collapse
Affiliation(s)
| | - Byoungnam Min
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | | | - Brian Looney
- Biology Department, Clark University, Worcester, MA01610
| | - Zachary Konkel
- Department of Plant Pathology, Ohio State University, Columbus, OH43210
| | - Jason C. Slot
- Department of Plant Pathology, Ohio State University, Columbus, OH43210
| | - Yuichi Sakamoto
- Iwate Biotechnology Research Center, Kitakami, Iwate024-0003, Japan
| | - Jacob L. Steenwyk
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN37235
| | - Antonis Rokas
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN37235
| | - Juan Carro
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Susana Camarero
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Patricia Ferreira
- Department of Biochemistry and Molecular and Cellular Biology, University of Zaragoza, 50009Zaragoza, Spain
- Institute of Biocomputation and Physics of Complex Systems, University of Zaragoza,50018Zaragoza, Spain
| | - Gonzalo Molpeceres
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Francisco J. Ruiz-Dueñas
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Ana Serrano
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Bernard Henrissat
- DTU Bioengineering, Technical University of Denmark2800, Kgs. Lyngby, Denmark
- Department of Biological Sciences, King Abdulaziz University, Jeddah21589, Saudi Arabia
| | - Elodie Drula
- Architecture et Fonction des Macromolécules Biologiques, CNRS, Université13288, Marseille, France
- INRAE, UMR 1163, Biodiversité et Biotechnologie Fongiques13009, Marseille, France
| | - Karen W. Hughes
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN37996
| | - Juan L. Mata
- Department of Biology, University of South Alabama, Mobile, AL36688
| | - Noemia Kazue Ishikawa
- Coordenação de Biodiversidade, Instituto Nacional de Pesquisas da Amazônia, Petrópolis, ManausAM 69067-375, Brazil
| | - Ruby Vargas-Isla
- Coordenação de Biodiversidade, Instituto Nacional de Pesquisas da Amazônia, Petrópolis, ManausAM 69067-375, Brazil
| | - Shuji Ushijima
- The Tottori Mycological Institute, Japan Kinoko Research Center Foundation, Tottori689-1125, Japan
| | - Chris A. Smith
- Manaaki Whenua - Landcare Research, Auckland1072, New Zealand
| | - John Donoghue
- Northwest Mycological Consultants, Corvallis, OR97330
| | - Steven Ahrendt
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - William Andreopoulos
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Guifen He
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Kurt LaButti
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Anna Lipzen
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Vivian Ng
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Robert Riley
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Laura Sandor
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Kerrie Barry
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Angel T. Martínez
- Centro de Investigaciones Biológicas “Margarita Salas,” Consejo Superior de Investigaciones Científicas, MadridE-28040, Spain
| | - Yang Xiao
- Institute of Applied Mycology, Huazhong Agricultural University, Wuhan, Hubei430070, China
| | - John G. Gibbons
- Department of Food Science, University of Massachusetts, Amherst, MA01003
| | - Kazuhisa Terashima
- The Tottori Mycological Institute, Japan Kinoko Research Center Foundation, Tottori689-1125, Japan
| | - Igor V. Grigoriev
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA94720
| | - David Hibbett
- Biology Department, Clark University, Worcester, MA01610
| |
Collapse
|
11
|
Joubert PM, Krasileva KV. Distinct genomic contexts predict gene presence-absence variation in different pathotypes of a fungal plant pathogen. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.17.529015. [PMID: 36824763 PMCID: PMC9949116 DOI: 10.1101/2023.02.17.529015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2023]
Abstract
Background Fungi use the accessory segments of their pan-genomes to adapt to their environments. While gene presence-absence variation (PAV) contributes to shaping these accessory gene reservoirs, whether these events happen in specific genomic contexts remains unclear. Additionally, since pan-genome studies often group together all members of the same species, it is uncertain whether genomic or epigenomic features shaping pan-genome evolution are consistent across populations within the same species. Fungal plant pathogens are useful models for answering these questions because members of the same species often infect distinct hosts, and they frequently rely on gene PAV to adapt to these hosts. Results We analyzed gene PAV in the rice and wheat blast fungus, Magnaporthe oryzae, and found that PAV of disease-causing effectors, antibiotic production, and non-self-recognition genes may drive the adaptation of the fungus to its environment. We then analyzed genomic and epigenomic features and data from available datasets for patterns that might help explain these PAV events. We observed that proximity to transposable elements (TEs), gene GC content, gene length, expression level in the host, and histone H3K27me3 marks were different between PAV genes and conserved genes, among other features. We used these features to construct a random forest classifier that was able to predict whether a gene is likely to experience PAV with high precision (86.06%) and recall (92.88%) in rice-infecting M. oryzae. Finally, we found that PAV in wheat- and rice-infecting pathotypes of M. oryzae differed in their number and their genomic context. Conclusions Our results suggest that genomic and epigenomic features of gene PAV can be used to better understand and even predict fungal pan-genome evolution. We also show that substantial intra-species variation can exist in these features.
Collapse
|
12
|
Chen H, King R, Smith D, Bayon C, Ashfield T, Torriani S, Kanyuka K, Hammond-Kosack K, Bieri S, Rudd J. Combined pangenomics and transcriptomics reveals core and redundant virulence processes in a rapidly evolving fungal plant pathogen. BMC Biol 2023; 21:24. [PMID: 36747219 PMCID: PMC9903594 DOI: 10.1186/s12915-023-01520-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 01/19/2023] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Studying genomic variation in rapidly evolving pathogens potentially enables identification of genes supporting their "core biology", being present, functional and expressed by all strains or "flexible biology", varying between strains. Genes supporting flexible biology may be considered to be "accessory", whilst the "core" gene set is likely to be important for common features of a pathogen species biology, including virulence on all host genotypes. The wheat-pathogenic fungus Zymoseptoria tritici represents one of the most rapidly evolving threats to global food security and was the focus of this study. RESULTS We constructed a pangenome of 18 European field isolates, with 12 also subjected to RNAseq transcription profiling during infection. Combining this data, we predicted a "core" gene set comprising 9807 sequences which were (1) present in all isolates, (2) lacking inactivating polymorphisms and (3) expressed by all isolates. A large accessory genome, consisting of 45% of the total genes, was also defined. We classified genetic and genomic polymorphism at both chromosomal and individual gene scales. Proteins required for essential functions including virulence had lower-than average sequence variability amongst core genes. Both core and accessory genomes encoded many small, secreted candidate effector proteins that likely interact with plant immunity. Viral vector-mediated transient in planta overexpression of 88 candidates failed to identify any which induced leaf necrosis characteristic of disease. However, functional complementation of a non-pathogenic deletion mutant lacking five core genes demonstrated that full virulence was restored by re-introduction of the single gene exhibiting least sequence polymorphism and highest expression. CONCLUSIONS These data support the combined use of pangenomics and transcriptomics for defining genes which represent core, and potentially exploitable, weaknesses in rapidly evolving pathogens.
Collapse
Affiliation(s)
- Hongxin Chen
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK ,grid.12981.330000 0001 2360 039XPresent address: School of Agriculture, Shenzhen Campus of Sun Yat-sen University, Guangming District, Shenzhen, Guangdong People’s Republic of China
| | - Robert King
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK
| | - Dan Smith
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK
| | - Carlos Bayon
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK
| | - Tom Ashfield
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK ,grid.418374.d0000 0001 2227 9389Crop Health and Protection (CHaP), Rothamsted Research, Harpenden, Herts UK
| | - Stefano Torriani
- grid.420222.40000 0001 0669 0426Syngenta Crop Protection AG, Schaffhauserstrasse 101, CH-4332 Stein, Switzerland
| | - Kostya Kanyuka
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK ,grid.17595.3f0000 0004 0383 6532Present address: National Institute for Agricultural Botany (NIAB), 93 Lawrence Weaver Road, Cambridge, UK
| | - Kim Hammond-Kosack
- grid.418374.d0000 0001 2227 9389Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts UK
| | - Stephane Bieri
- grid.420222.40000 0001 0669 0426Syngenta Crop Protection AG, Schaffhauserstrasse 101, CH-4332 Stein, Switzerland
| | - Jason Rudd
- Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts, UK.
| |
Collapse
|
13
|
Dereeper A, Allouch N, Guerlais V, Garnier M, Ma L, De Jonckheere JF, Joseph SJ, Ali IKM, Talarmin A, Marcelino I. Naegleria genus pangenome reveals new structural and functional insights into the versatility of these free-living amoebae. Front Microbiol 2023; 13:1056418. [PMID: 36817109 PMCID: PMC9928731 DOI: 10.3389/fmicb.2022.1056418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 12/21/2022] [Indexed: 02/04/2023] Open
Abstract
Introduction Free-living amoebae of the Naegleria genus belong to the major protist clade Heterolobosea and are ubiquitously distributed in soil and freshwater habitats. Of the 47 Naegleria species described, N. fowleri is the only one being pathogenic to humans, causing a rare but fulminant primary amoebic meningoencephalitis. Some Naegleria genome sequences are publicly available, but the genetic basis for Naegleria diversity and ability to thrive in diverse environments (including human brain) remains unclear. Methods Herein, we constructed a high-quality Naegleria genus pangenome to obtain a comprehensive catalog of genes encoded by these amoebae. For this, we first sequenced, assembled, and annotated six new Naegleria genomes. Results and Discussion Genome architecture analyses revealed that Naegleria may use genome plasticity features such as ploidy/aneuploidy to modulate their behavior in different environments. When comparing 14 near-to-complete genome sequences, our results estimated the theoretical Naegleria pangenome as a closed genome, with 13,943 genes, including 3,563 core and 10,380 accessory genes. The functional annotations revealed that a large fraction of Naegleria genes show significant sequence similarity with those already described in other kingdoms, namely Animalia and Plantae. Comparative analyses highlighted a remarkable genomic heterogeneity, even for closely related strains and demonstrate that Naegleria harbors extensive genome variability, reflected in different metabolic repertoires. If Naegleria core genome was enriched in conserved genes essential for metabolic, regulatory and survival processes, the accessory genome revealed the presence of genes involved in stress response, macromolecule modifications, cell signaling and immune response. Commonly reported N. fowleri virulence-associated genes were present in both core and accessory genomes, suggesting that N. fowleri's ability to infect human brain could be related to its unique species-specific genes (mostly of unknown function) and/or to differential gene expression. The construction of Naegleria first pangenome allowed us to move away from a single reference genome (that does not necessarily represent each species as a whole) and to identify essential and dispensable genes in Naegleria evolution, diversity and biology, paving the way for further genomic and post-genomic studies.
Collapse
Affiliation(s)
- Alexis Dereeper
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France
| | - Nina Allouch
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France
| | - Vincent Guerlais
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France
| | - Maëlle Garnier
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France
| | - Laurence Ma
- Institut Pasteur de Paris, Biomics, Paris, France
| | | | - Sandeep J. Joseph
- Centers for Disease Control and Prevention (CDC), Atlanta, GA, United States
| | - Ibne Karim M. Ali
- Centers for Disease Control and Prevention (CDC), Atlanta, GA, United States
| | - Antoine Talarmin
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France
| | - Isabel Marcelino
- Institut Pasteur de la Guadeloupe, Unité TReD-Path, Les Abymes, Guadeloupe, France,*Correspondence: Isabel Marcelino,
| |
Collapse
|
14
|
Li T, Yin Y. Critical assessment of pan-genomic analysis of metagenome-assembled genomes. Brief Bioinform 2022; 23:6702672. [PMID: 36124775 PMCID: PMC9677465 DOI: 10.1093/bib/bbac413] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 08/23/2022] [Accepted: 08/26/2022] [Indexed: 12/30/2022] Open
Abstract
Pan-genome analyses of metagenome-assembled genomes (MAGs) may suffer from the known issues with MAGs: fragmentation, incompleteness and contamination. Here, we conducted a critical assessment of pan-genomics of MAGs, by comparing pan-genome analysis results of complete bacterial genomes and simulated MAGs. We found that incompleteness led to significant core gene (CG) loss. The CG loss remained when using different pan-genome analysis tools (Roary, BPGA, Anvi'o) and when using a mixture of MAGs and complete genomes. Contamination had little effect on core genome size (except for Roary due to in its gene clustering issue) but had major influence on accessory genomes. Importantly, the CG loss was partially alleviated by lowering the CG threshold and using gene prediction algorithms that consider fragmented genes, but to a less degree when incompleteness was higher than 5%. The CG loss also led to incorrect pan-genome functional predictions and inaccurate phylogenetic trees. Our main findings were supported by a study of real MAG-isolate genome data. We conclude that lowering CG threshold and predicting genes in metagenome mode (as Anvi'o does with Prodigal) are necessary in pan-genome analysis of MAGs. Development of new pan-genome analysis tools specifically for MAGs are needed in future studies.
Collapse
Affiliation(s)
- Tang Li
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska - Lincoln, Lincoln, NE, 68508, USA
| | - Yanbin Yin
- Corresponding author. Yanbin Yin, Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska - Lincoln, Lincoln, NE 68508, USA. Tel.: +1-402-472-4303; E-mail:
| |
Collapse
|
15
|
Genome-Based Multi-Antigenic Epitopes Vaccine Construct Designing against Staphylococcus hominis Using Reverse Vaccinology and Biophysical Approaches. Vaccines (Basel) 2022; 10:vaccines10101729. [PMID: 36298594 PMCID: PMC9611379 DOI: 10.3390/vaccines10101729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 10/11/2022] [Accepted: 10/14/2022] [Indexed: 11/07/2022] Open
Abstract
Staphylococcus hominis is a Gram-positive bacterium from the staphylococcus genus; it is also a member of coagulase-negative staphylococci because of its opportunistic nature and ability to cause life-threatening bloodstream infections in immunocompromised patients. Gram-positive and opportunistic bacteria have become a major concern for the medical community. It has also drawn the attention of scientists due to the evaluation of immune evasion tactics and the development of multidrug-resistant strains. This prompted the need to explore novel therapeutic approaches as an alternative to antibiotics. The current study aimed to develop a broad-spectrum, multi-epitope vaccine to control bacterial infections and reduce the burden on healthcare systems. A computational framework was designed to filter the immunogenic potent vaccine candidate. This framework consists of pan-genomics, subtractive proteomics, and immunoinformatics approaches to prioritize vaccine candidates. A total of 12,285 core proteins were obtained using a pan-genome analysis of all strains. The screening of the core proteins resulted in the selection of only two proteins for the next epitope prediction phase. Eleven B-cell derived T-cell epitopes were selected that met the criteria of different immunoinformatics approaches such as allergenicity, antigenicity, immunogenicity, and toxicity. A vaccine construct was formulated using EAAAK and GPGPG linkers and a cholera toxin B subunit. This formulated vaccine construct was further used for downward analysis. The vaccine was loop refined and improved for structure stability through disulfide engineering. For an efficient expression, the codons were optimized as per the usage pattern of the E coli (K12) expression system. The top three refined docked complexes of the vaccine that docked with the MHC-I, MHC-II, and TLR-4 receptors were selected, which proved the best binding potential of the vaccine with immune receptors; this was followed by molecular dynamic simulations. The results indicate the best intermolecular bonding between immune receptors and vaccine epitopes and that they are exposed to the host’s immune system. Finally, the binding energies were calculated to confirm the binding stability of the docked complexes. This work aimed to provide a manageable list of immunogenic and antigenic epitopes that could be used as potent vaccine candidates for experimental in vivo and in vitro studies.
Collapse
|
16
|
Naranjo-Ortiz MA, Molina M, Fuentes D, Mixão V, Gabaldón T. Karyon: a computational framework for the diagnosis of hybrids, aneuploids, and other nonstandard architectures in genome assemblies. Gigascience 2022; 11:6751106. [PMID: 36205401 PMCID: PMC9540331 DOI: 10.1093/gigascience/giac088] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 11/23/2021] [Accepted: 08/24/2022] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Recent technological developments have made genome sequencing and assembly highly accessible and widely used. However, the presence in sequenced organisms of certain genomic features such as high heterozygosity, polyploidy, aneuploidy, heterokaryosis, or extreme compositional biases can challenge current standard assembly procedures and result in highly fragmented assemblies. Hence, we hypothesized that genome databases must contain a nonnegligible fraction of low-quality assemblies that result from such type of intrinsic genomic factors. FINDINGS Here we present Karyon, a Python-based toolkit that uses raw sequencing data and de novo genome assembly to assess several parameters and generate informative plots to assist in the identification of nonchanonical genomic traits. Karyon includes automated de novo genome assembly and variant calling pipelines. We tested Karyon by diagnosing 35 highly fragmented publicly available assemblies from 19 different Mucorales (Fungi) species. CONCLUSIONS Our results show that 10 (28.57%) of the assemblies presented signs of unusual genomic configurations, suggesting that these are common, at least for some lineages within the Fungi.
Collapse
Affiliation(s)
- Miguel A Naranjo-Ortiz
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain,Health and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona 08003, Spain,Biology Department, Clark University, Worcester, MA 01610, USA,Naturhistoriskmuseum, University of Oslo, Oslo 0562, Norway
| | - Manu Molina
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain,Health and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona 08003, Spain,Life Sciences Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona 08034, Spain
| | - Diego Fuentes
- Life Sciences Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona 08034, Spain,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona 08028, Spain
| | - Verónica Mixão
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain,Health and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona 08003, Spain,Life Sciences Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona 08034, Spain,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona 08028, Spain
| | - Toni Gabaldón
- Correspondence address. Toni Gabaldón, Plaça Eusebi Güell, 1-3, Barcelona 08034, Spain. E-mail:
| |
Collapse
|
17
|
Parikh SB, Houghton C, Van Oss SB, Wacholder A, Carvunis A. Origins, evolution, and physiological implications of de novo genes in yeast. Yeast 2022; 39:471-481. [PMID: 35959631 PMCID: PMC9544372 DOI: 10.1002/yea.3810] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 12/03/2022] Open
Abstract
De novo gene birth is the process by which new genes emerge in sequences that were previously noncoding. Over the past decade, researchers have taken advantage of the power of yeast as a model and a tool to study the evolutionary mechanisms and physiological implications of de novo gene birth. We summarize the mechanisms that have been proposed to explicate how noncoding sequences can become protein-coding genes, highlighting the discovery of pervasive translation of the yeast transcriptome and its presumed impact on evolutionary innovation. We summarize current best practices for the identification and characterization of de novo genes. Crucially, we explain that the field is still in its nascency, with the physiological roles of most young yeast de novo genes identified thus far still utterly unknown. We hope this review inspires researchers to investigate the true contribution of de novo gene birth to cellular physiology and phenotypic diversity across yeast strains and species.
Collapse
Affiliation(s)
- Saurin B. Parikh
- Department of Computational and Systems Biology, School of Medicine, Pittsburgh Center for Evolutionary Biology and EvolutionUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Carly Houghton
- Department of Computational and Systems Biology, School of Medicine, Pittsburgh Center for Evolutionary Biology and EvolutionUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - S. Branden Van Oss
- Department of Computational and Systems Biology, School of Medicine, Pittsburgh Center for Evolutionary Biology and EvolutionUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Aaron Wacholder
- Department of Computational and Systems Biology, School of Medicine, Pittsburgh Center for Evolutionary Biology and EvolutionUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Anne‐Ruxandra Carvunis
- Department of Computational and Systems Biology, School of Medicine, Pittsburgh Center for Evolutionary Biology and EvolutionUniversity of PittsburghPittsburghPennsylvaniaUSA
| |
Collapse
|
18
|
Ogaji YO, Lee RC, Sawbridge TI, Cocks BG, Daetwyler HD, Kaur S. De Novo Long-Read Whole-Genome Assemblies and the Comparative Pan-Genome Analysis of Ascochyta Blight Pathogens Affecting Field Pea. J Fungi (Basel) 2022; 8:jof8080884. [PMID: 36012871 PMCID: PMC9410150 DOI: 10.3390/jof8080884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/12/2022] [Accepted: 08/15/2022] [Indexed: 11/16/2022] Open
Abstract
Ascochyta Blight (AB) is a major disease of many cool-season legumes globally. In field pea, three fungal pathogens have been identified to be responsible for this disease in Australia, namely Peyronellaea pinodes, Peyronellaea pinodella and Phoma koolunga. Limited genomic resources for these pathogens have been generated, which has hampered the implementation of effective management strategies and breeding for resistant cultivars. Using Oxford Nanopore long-read sequencing, we report the first high-quality, fully annotated, near-chromosome-level nuclear and mitochondrial genome assemblies for 18 isolates from the Australian AB complex. Comparative genome analysis was performed to elucidate the differences and similarities between species and isolates using phylogenetic relationships and functional diversity. Our data indicated that P. pinodella and P. koolunga are heterothallic, while P. pinodes is homothallic. More homology and orthologous gene clusters are shared between P. pinodes and P. pinodella compared to P. koolunga. The analysis of the repetitive DNA content showed differences in the transposable repeat composition in the genomes and their expression in the transcriptomes. Significant repeat expansion in P. koolunga’s genome was seen, with strong repeat-induced point mutation (RIP) activity being evident. Phylogenetic analysis revealed that genetic diversity can be exploited for species marker development. This study provided the much-needed genetic resources and characterization of the AB species to further drive research in key areas such as disease epidemiology and host–pathogen interactions.
Collapse
Affiliation(s)
- Yvonne O. Ogaji
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Melbourne, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Melbourne, VIC 3086, Australia
| | - Robert C. Lee
- Centre for Crop and Disease Management, School of Molecular and Life Sciences, Curtin University, Perth, WA 6102, Australia
| | - Tim I. Sawbridge
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Melbourne, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Melbourne, VIC 3086, Australia
| | - Benjamin G. Cocks
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Melbourne, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Melbourne, VIC 3086, Australia
| | - Hans D. Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Melbourne, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Melbourne, VIC 3086, Australia
| | - Sukhjiwan Kaur
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Melbourne, VIC 3083, Australia
- Correspondence:
| |
Collapse
|
19
|
Examination of Genome-Wide Ortholog Variation in Clinical and Environmental Isolates of the Fungal Pathogen Aspergillus fumigatus. mBio 2022; 13:e0151922. [PMID: 35766381 PMCID: PMC9426589 DOI: 10.1128/mbio.01519-22] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Aspergillus fumigatus is both an environmental saprobe and an opportunistic human fungal pathogen. Knowledge of genomic variation across A. fumigatus isolates is essential for understanding the evolution of pathogenicity, virulence, and resistance to antifungal drugs. Here, we investigated 206 A. fumigatus isolates (133 clinical and 73 environmental isolates), aiming to identify genes with variable presence across isolates and test whether this variation was related to the clinical or environmental origin of isolates. The PanOrtho genome of A. fumigatus consists of 13,085 ortholog groups, of which 7,773 (59.4%) are shared by all isolates (core groups) and 5,312 (40.6%) vary in their gene presence across isolates (accessory groups plus singletons). Despite differences in the distribution of orthologs across all isolates, no significant differences were observed among clinical versus environmental isolates when phylogeny was accounted for. Orthologs that differ in their distribution across isolates tend to occur at low frequency and/or be restricted to specific isolates; thus, the degree of genomic conservation between orthologs of A. fumigatus is high. These results suggest that differences in the distribution of orthologs within A. fumigatus cannot be associated with the clinical or environmental origin of isolates. IMPORTANCE Aspergillus fumigatus is a cosmopolitan species of fungus responsible for thousands of cases of invasive disease annually. Clinical and environmental isolates of A. fumigatus exhibit extensive phenotypic differences, including differences related to virulence and antifungal drug resistance. A comprehensive survey of the genomic diversity present in A. fumigatus and its relationship to the clinical or environmental origin of isolates can contribute to the prediction of the mechanisms of evolution and infection of the species. Our results suggest that there is no significant variation in ortholog distribution between clinical and environmental isolates when accounting for evolutionary history. The work supports the hypothesis that environmental and clinical isolates of A. fumigatus do not differ in their gene contents.
Collapse
|
20
|
Maphosa MN, Steenkamp ET, Kanzi AM, van Wyk S, De Vos L, Santana QC, Duong TA, Wingfield BD. Intra-Species Genomic Variation in the Pine Pathogen Fusarium circinatum. J Fungi (Basel) 2022; 8:jof8070657. [PMID: 35887414 PMCID: PMC9316270 DOI: 10.3390/jof8070657] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 06/02/2022] [Accepted: 06/08/2022] [Indexed: 12/10/2022] Open
Abstract
Fusarium circinatum is an important global pathogen of pine trees. Genome plasticity has been observed in different isolates of the fungus, but no genome comparisons are available. To address this gap, we sequenced and assembled to chromosome level five isolates of F. circinatum. These genomes were analysed together with previously published genomes of F. circinatum isolates, FSP34 and KS17. Multi-sample variant calling identified a total of 461,683 micro variants (SNPs and small indels) and a total of 1828 macro structural variants of which 1717 were copy number variants and 111 were inversions. The variant density was higher on the sub-telomeric regions of chromosomes. Variant annotation revealed that genes involved in transcription, transport, metabolism and transmembrane proteins were overrepresented in gene sets that were affected by high impact variants. A core genome representing genomic elements that were conserved in all the isolates and a non-redundant pangenome representing all genomic elements is presented. Whole genome alignments showed that an average of 93% of the genomic elements were present in all isolates. The results of this study reveal that some genomic elements are not conserved within the isolates and some variants are high impact. The described genome-scale variations will help to inform novel disease management strategies against the pathogen.
Collapse
|
21
|
Sedaghatjoo S, Mishra B, Forster MK, Becker Y, Keilwagen J, Killermann B, Thines M, Karlovsky P, Maier W. Comparative genomics reveals low levels of inter- and intraspecies diversity in the causal agents of dwarf and common bunt of wheat and hint at conspecificity of Tilletia caries and T. laevis. IMA Fungus 2022; 13:11. [PMID: 35672841 PMCID: PMC9172201 DOI: 10.1186/s43008-022-00098-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 05/27/2022] [Indexed: 11/10/2022] Open
Abstract
AbstractTilletia caries and T. laevis, which are the causal agents of common bunt, as well as T. controversa, which causes dwarf bunt of wheat, threaten especially organic wheat farming. The three closely related fungal species differ in their teliospore morphology and partially in their physiology and infection biology. The gene content as well as intraspecies variation in these species and the genetic basis of their separation is unknown. We sequenced the genome of four T. caries, five T. controversa, and two T. laevis and extended this dataset with five publicly available ones. The genomes of the three species displayed microsynteny with up to 94.3% pairwise aligned regions excluding repetitive regions. The majority of functionally characterized genes involved in pathogenicity, life cycle, and infection of corn smut, Ustilago maydis, were found to be absent or poorly conserved in the draft genomes and the biosynthetic pathway for trimethylamine in Tilletia spp. could be different from bacteria. Overall, 75% of the identified protein-coding genes comprising 84% of the total predicted carbohydrate utilizing enzymes, 72.5% putatively secreted proteins, and 47.4% of effector-like proteins were conserved and shared across all 16 isolates. We predicted nine highly identical secondary metabolite biosynthesis gene clusters comprising in total 62 genes in all species and none were species-specific. Less than 0.1% of the protein-coding genes were species-specific and their function remained mostly unknown. Tilletia controversa had the highest intraspecies genetic variation, followed by T. caries and the lowest in T. laevis. Although the genomes of the three species are very similar, employing 241 single copy genes T. controversa was phylogenetically distinct from T. caries and T. laevis, however these two could not be resolved as individual monophyletic groups. This was in line with the genome-wide number of single nucleotide polymorphisms and small insertions and deletions. Despite the conspicuously different teliospore ornamentation of T. caries and T. laevis, a high degree of genomic identity and scarcity of species-specific genes indicate that the two species could be conspecific.
Collapse
|
22
|
Pan-Genomes Provide Insights into the Genetic Basis of Auricularia heimuer Domestication. J Fungi (Basel) 2022; 8:jof8060581. [PMID: 35736064 PMCID: PMC9225563 DOI: 10.3390/jof8060581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 05/27/2022] [Accepted: 05/27/2022] [Indexed: 11/17/2022] Open
Abstract
In order to reveal the genetic variation signals of Auricularia heimuer that have occurred during their domestication and to find potential functional gene families, we constructed a monokaryotic pan-genome of A. heimuer representing four cultivated strains and four wild strains. The pan-genome contained 14,089 gene families, of which 67.56% were core gene families and 31.88% were dispensable gene families. We screened substrate utilization-related genes such as the chitinase gene ahchi1 of the glycoside hydrolase (GH) 18 family and a carbohydrate-binding module (CBM)-related gene from the dispensable families of cultivated populations. The genomic difference in the ahchi1 gene between the wild and cultivated genomes was caused by a 33 kb presence/absence variation (PAV). The detection rate of the ahchi1 gene was 93.75% in the cultivated population, significantly higher than that in the wild population (17.39%), indicating that it has been selected in cultivated strains. Principal component analysis (PCA) of the polymorphic markers in fragments near the ahchi1 gene was enriched in cultivated strains, and this was caused by multiple independent instances of artificial selection. We revealed for the first time the genetic basis of the ahchi1 gene in domestication, thereby providing a foundation for elucidating the potential function of the ahchi1 gene in the breeding of A. heimuer.
Collapse
|
23
|
Characterization and genomic analysis of a Demerecviridae phage SP76 with lytic multiple-serotypes of Salmonella. Arch Microbiol 2022; 204:175. [PMID: 35166928 DOI: 10.1007/s00203-022-02762-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Revised: 01/10/2022] [Accepted: 01/12/2022] [Indexed: 11/02/2022]
Abstract
With the increase in antimicrobial resistance of Salmonella, phages have been paid more attention to as an alternative to antibiotics. In this study, a phage designated as SP76 was isolated from sewage. It can lyse several serotypes of Salmonella, including S. typhimurium (21/33), S. enteritidis (7/7), S. dublin (4/4), S. pullorum (2/2) and S. choleraesuis (1/2). SP76 showed a latent time of about 10 min, and maintained good lytic activity at a pH range of 3-10 and temperatures between 4 and 37 °C. Moreover, its optimal multiplicity of infection (MOI) was 0.0001. Based on the results of genomic sequence and analysis, SP76 was found to have a genome of 111,639 bp that encoded 166 predicted ORFs and belong to the Demerecviridae family, order Caudovirales. No virulence or lysogen formation gene clusters were identified in the SP76 genome. A pan-genome analysis based on 100 phages within the subfamily Markadamsvirinae indicated that SP76 had 23 core genes and 1199 accessory genes. We grouped the subfamily Markadamsvirinae and found that the main difference was in group III. In vitro bacteriostasis, experiments showed that the phage SP76 reduced planktonic bacteria by 1.52 log10 CFU/mL, and biofilms (24 h old) by 0.372 log10 CFU/mL, respectively. Thus, we isolated a safe and efficient phage that might be a good antibacterial agent.
Collapse
|
24
|
Yildirir G, Sperschneider J, Malar C M, Chen ECH, Iwasaki W, Cornell C, Corradi N. Long reads and Hi-C sequencing illuminate the two-compartment genome of the model arbuscular mycorrhizal symbiont Rhizophagus irregularis. THE NEW PHYTOLOGIST 2022; 233:1097-1107. [PMID: 34747029 DOI: 10.1111/nph.17842] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 10/29/2021] [Indexed: 06/13/2023]
Abstract
Chromosome folding links genome structure with gene function by generating distinct nuclear compartments and topologically associating domains. In mammals, these undergo preferential interactions and regulate gene expression. However, their role in fungal genome biology is unclear. Here, we combine Nanopore (ONT) sequencing with chromatin conformation capture sequencing (Hi-C) to reveal chromosome and epigenetic diversity in a group of obligate plant symbionts: the arbuscular mycorrhizal fungi (AMF). We find that five phylogenetically distinct strains of the model AMF Rhizophagus irregularis carry 33 chromosomes with substantial within-species variability in size, as well as in gene and repeat content. Strain-specific Hi-C contact maps reveal a 'checkerboard' pattern that underline two dominant euchromatin (A) and heterochromatin (B) compartments. Each compartment differs in the level of gene transcription, regulation of candidate effectors and methylation frequencies. The A-compartment is more gene-dense and contains most core genes, while the B-compartment is more repeat-rich and has higher rates of chromosomal rearrangement. While the B-compartment is transcriptionally repressed, it has significantly more secreted proteins and in planta upregulated candidate effectors, suggesting a possible host-induced change in chromosome conformation. Overall, this study provides a fine-scale view into the genome biology and evolution of model plant symbionts, and opens avenues to study the epigenetic mechanisms that modify chromosome folding during host-microbe interactions.
Collapse
Affiliation(s)
- Gökalp Yildirir
- Department of Biology, University of Ottawa, Ottawa, ON, K1N 6N5, Canada
| | - Jana Sperschneider
- Biological Data Science Institute, The Australian National University, Canberra, ACT, 260, Australia
| | - Mathu Malar C
- Department of Biology, University of Ottawa, Ottawa, ON, K1N 6N5, Canada
| | - Eric C H Chen
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, 1113-0033, Japan
| | - Wataru Iwasaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, 1113-0033, Japan
| | - Calvin Cornell
- Department of Biology, University of Ottawa, Ottawa, ON, K1N 6N5, Canada
| | - Nicolas Corradi
- Department of Biology, University of Ottawa, Ottawa, ON, K1N 6N5, Canada
| |
Collapse
|
25
|
Nie Y, Zhao H, Wang Z, Zhou Z, Liu X, Huang B. The Gene Rearrangement, Loss, Transfer, and Deep Intronic Variation in Mitochondrial Genomes of Conidiobolus. Front Microbiol 2021; 12:765733. [PMID: 34858376 PMCID: PMC8632527 DOI: 10.3389/fmicb.2021.765733] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 10/01/2021] [Indexed: 12/22/2022] Open
Abstract
The genus Conidiobolus s.s. was newly delimited from Conidiobolus s.l. In order to gain insight into its mitochondrial genetic background, this study sequenced six mitochondrial genomes of the genus Conidiobolus s.s. These mitogenomes were all composed of circular DNA molecules, ranging from 29,253 to 48,417 bp in size and from 26.61 to 27.90% in GC content. The order and direction for 14 core protein-coding genes (PCGs) were identical, except for the atp8 gene lost in Conidiobolus chlamydosporus, Conidiobolus polyspermus, and Conidiobolus polytocus, and rearranged in the other Conidiobolus s.s. species. Besides, the atp8 gene split the cox1 gene in Conidiobolus taihushanensis. Phylogenomic analysis based on the 14 core PCGs confirmed that all Conidiobolus s.s. species formed a monophyly in the Entomophthoromycotina lineage. The number and length of introns were the main factors contributing to mitogenomic size, and deep variations and potential transfer were detected in introns. In addition, gene transfer occurred between the mitochondrial and nuclear genomes. This study promoted the understanding of the evolution and phylogeny of the Conidiobolus s.s. genus.
Collapse
Affiliation(s)
- Yong Nie
- Anhui Provincial Key Laboratory for Microbial Pest Control, Anhui Agricultural University, Hefei, China
- School of Civil Engineering and Architecture, Anhui University of Technology, Ma’anshan, China
| | - Heng Zhao
- School of Ecology and Nature Conservation, Institute of Microbiology, Beijing Forestry University, Beijing, China
- College of Life Sciences, Shandong Normal University, Jinan, China
| | - Zimin Wang
- School of Civil Engineering and Architecture, Anhui University of Technology, Ma’anshan, China
| | - Zhengyu Zhou
- School of Civil Engineering and Architecture, Anhui University of Technology, Ma’anshan, China
| | - Xiaoyong Liu
- College of Life Sciences, Shandong Normal University, Jinan, China
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Bo Huang
- Anhui Provincial Key Laboratory for Microbial Pest Control, Anhui Agricultural University, Hefei, China
| |
Collapse
|
26
|
Wang L, Liao B, Gong L, Xiao S, Huang Z. Haploid Genome Analysis Reveals a Tandem Cluster of Four HSP20 Genes Involved in the High-Temperature Adaptation of Coriolopsis trogii. Microbiol Spectr 2021; 9:e0028721. [PMID: 34406871 PMCID: PMC8552761 DOI: 10.1128/spectrum.00287-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 07/16/2021] [Indexed: 01/16/2023] Open
Abstract
Coriolopsis trogii is a typical thermotolerant basidiomycete fungus, but its thermotolerance mechanisms are currently unknown. In this study, two monokaryons of C. trogii strain Ct001 were assembled: Ct001_29 had a genome assembly size of 38.85 Mb and encoded 13,113 genes, while Ct001_31 was 40.19 Mb in length and encoded 13,309 genes. Comparative intra- and interstrain genomic analysis revealed the rich genetic diversity of C. trogii, which included more than 315,194 single-nucleotide polymorphisms (SNPs), 30,387 insertion/deletions (indels), and 1,460 structural variations. Gene family analysis showed that the expanded families of C. trogii were functionally enriched in lignocellulose degradation activities. Furthermore, a total of 14 allelic pairs of heat shock protein 20 (HSP20) genes were identified in the C. trogii genome. The expression profile obtained from RNA sequencing (RNA-Seq) showed that four tandem-duplicated allelic pairs, HSP20.5 to HSP20.8, had more than 5-fold higher expression at 35°C than at 25°C. In particular, HSP20.5 and HSP20.8 were the most highly expressed HSP20 genes. Allelic expression bias was found for HSP20.5 and HSP20.8; the expression of Ct29HSP20.8 was at least 1.34-fold higher than that of Ct31HSP20.8, and that of Ct31HSP20.5 was at least 1.5-fold higher than that of Ct29HSP20.5. The unique structural and expression profiles of the HSP20 genes revealed by these haplotype-resolved genomes provide insight into the molecular mechanisms of high-temperature adaptation in C. trogii. IMPORTANCE Heat stress is one of the most frequently encountered environmental stresses for most mushroom-forming fungi. Currently available fungal genomes are mostly haploid because high heterozygosity hinders diploid genome assembly. Here, two haplotype genomes of C. trogii, a thermotolerant basidiomycete, were assembled separately. A conserved tandem cluster of four HSP20 genes showing allele-specific expression was found to be closely related to high-temperature adaptation in C. trogii. The obtained haploid genomes and their comparison offer a more thorough understanding of the genetic background of C. trogii. In addition, the responses of HSP20 genes at 35°C, which may contribute to the growth and survival of C. trogii at high temperatures, could inform the selection and breeding of elite strains in the future.
Collapse
Affiliation(s)
- Lining Wang
- Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, People’s Republic of China
| | - Baosheng Liao
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Lu Gong
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
| | - Shuiming Xiao
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Zhihai Huang
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
| |
Collapse
|
27
|
Rueda-Mejia MP, Nägeli L, Lutz S, Hayes RD, Varadarajan AR, Grigoriev IV, Ahrens CH, Freimoser FM. Genome, transcriptome and secretome analyses of the antagonistic, yeast-like fungus Aureobasidium pullulans to identify potential biocontrol genes. MICROBIAL CELL 2021; 8:184-202. [PMID: 34395586 PMCID: PMC8329847 DOI: 10.15698/mic2021.08.757] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/21/2021] [Accepted: 05/26/2021] [Indexed: 12/27/2022]
Abstract
Aureobasidium pullulans is an extremotolerant, cosmopolitan yeast-like fungus that successfully colonises vastly different ecological niches. The species is widely used in biotechnology and successfully applied as a commercial biocontrol agent against postharvest diseases and fireblight. However, the exact mechanisms that are responsible for its antagonistic activity against diverse plant pathogens are not known at the molecular level. Thus, it is difficult to optimise and improve the biocontrol applications of this species. As a foundation for elucidating biocontrol mechanisms, we have de novo assembled a high-quality reference genome of a strongly antagonistic A. pullulans strain, performed dual RNA-seq experiments, and analysed proteins secreted during the interaction with the plant pathogen Fusarium oxysporum. Based on the genome annotation, potential biocontrol genes were predicted to encode secreted hydrolases or to be part of secondary metabolite clusters (e.g., NRPS-like, NRPS, T1PKS, terpene, and β-lactone clusters). Transcriptome and secretome analyses defined a subset of 79 A. pullulans genes (among the 10,925 annotated genes) that were transcriptionally upregulated or exclusively detected at the protein level during the competition with F. oxysporum. These potential biocontrol genes comprised predicted secreted hydrolases such as glycosylases, esterases, and proteases, as well as genes encoding enzymes, which are predicted to be involved in the synthesis of secondary metabolites. This study highlights the value of a sequential approach starting with genome mining and consecutive transcriptome and secretome analyses in order to identify a limited number of potential target genes for detailed, functional analyses.
Collapse
Affiliation(s)
- Maria Paula Rueda-Mejia
- Agroscope, Research Division Plant Protection, Müller-Thurgau-Strasse 29, 8820 Wädenswil, Switzerland
| | - Lukas Nägeli
- Agroscope, Research Division Plant Protection, Müller-Thurgau-Strasse 29, 8820 Wädenswil, Switzerland
| | - Stefanie Lutz
- Agroscope, Competence Division Method Development and Analytics, Müller-Thurgau-Strasse 29, 8820, Wädenswil, Switzerland
| | - Richard D Hayes
- U.S. Department of Energy Joint Genome Institute (JGI), Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, California 94720, USA
| | - Adithi R Varadarajan
- Agroscope, Competence Division Method Development and Analytics, Müller-Thurgau-Strasse 29, 8820, Wädenswil, Switzerland
| | - Igor V Grigoriev
- U.S. Department of Energy Joint Genome Institute (JGI), Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, California 94720, USA.,Department of Plant and Microbial Biology, University of California Berkeley, Koshland Hall, Berkeley, CA, USA
| | - Christian H Ahrens
- Agroscope, Competence Division Method Development and Analytics, Müller-Thurgau-Strasse 29, 8820, Wädenswil, Switzerland.,SIB, Swiss Institute of Bioinformatics, Wädenswil, Switzerland
| | - Florian M Freimoser
- Agroscope, Research Division Plant Protection, Müller-Thurgau-Strasse 29, 8820 Wädenswil, Switzerland
| |
Collapse
|
28
|
Calcino AD, Kenny NJ, Gerdol M. Single individual structural variant detection uncovers widespread hemizygosity in molluscs. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200153. [PMID: 33813894 PMCID: PMC8059565 DOI: 10.1098/rstb.2020.0153] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2021] [Indexed: 11/12/2022] Open
Abstract
The advent of complete genomic sequencing has opened a window into genomic phenomena obscured by fragmented assemblies. A good example of these is the existence of hemizygous regions of autosomal chromosomes, which can result in marked differences in gene content between individuals within species. While these hemizygous regions, and presence/absence variation of genes that can result, are well known in plants, firm evidence has only recently emerged for their existence in metazoans. Here, we use recently published, complete genomes from wild-caught molluscs to investigate the prevalence of hemizygosity across a well-known and ecologically important clade. We show that hemizygous regions are widespread in mollusc genomes, not clustered in individual chromosomes, and often contain genes linked to transposition, DNA repair and stress response. With targeted investigations of HSP70-12 and C1qDC, we also show how individual gene families are distributed within pan-genomes. This work suggests that extensive pan-genomes are widespread across the conchiferan Mollusca, and represent useful tools for genomic evolution, allowing the maintenance of additional genetic diversity within the population. As genomic sequencing and re-sequencing becomes more routine, the prevalence of hemizygosity, and its impact on selection and adaptation, are key targets for research across the tree of life. This article is part of the Theo Murphy meeting issue 'Molluscan genomics: broad insights and future directions for a neglected phylum'.
Collapse
Affiliation(s)
- Andrew D. Calcino
- Department of Evolutionary Biology, Integrative Zoology, University of Vienna, Althanstrasse 14, Vienna 1090, Austria
| | - Nathan J. Kenny
- Life Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - Marco Gerdol
- Department of Life Sciences, University of Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
| |
Collapse
|
29
|
Microevolution in the pansecondary metabolome of Aspergillus flavus and its potential macroevolutionary implications for filamentous fungi. Proc Natl Acad Sci U S A 2021; 118:2021683118. [PMID: 34016748 DOI: 10.1073/pnas.2021683118] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Fungi produce a wealth of pharmacologically bioactive secondary metabolites (SMs) from biosynthetic gene clusters (BGCs). It is common practice for drug discovery efforts to treat species' secondary metabolomes as being well represented by a single or a small number of representative genomes. However, this approach misses the possibility that intraspecific population dynamics, such as adaptation to environmental conditions or local microbiomes, may harbor novel BGCs that contribute to the overall niche breadth of species. Using 94 isolates of Aspergillus flavus, a cosmopolitan model fungus, sampled from seven states in the United States, we dereplicate 7,821 BGCs into 92 unique BGCs. We find that more than 25% of pangenomic BGCs show population-specific patterns of presence/absence or protein divergence. Population-specific BGCs make up most of the accessory-genome BGCs, suggesting that different ecological forces that maintain accessory genomes may be partially mediated by population-specific differences in secondary metabolism. We use ultra-high-performance high-resolution mass spectrometry to confirm that these genetic differences in BGCs also result in chemotypic differences in SM production in different populations, which could mediate ecological interactions and be acted on by selection. Thus, our results suggest a paradigm shift that previously unrealized population-level reservoirs of SM diversity may be of significant evolutionary, ecological, and pharmacological importance. Last, we find that several population-specific BGCs from A. flavus are present in Aspergillus parasiticus and Aspergillus minisclerotigenes and discuss how the microevolutionary patterns we uncover inform macroevolutionary inferences and help to align fungal secondary metabolism with existing evolutionary theory.
Collapse
|
30
|
Yildiz G, Ozkilinc H. Pan-Mitogenomics Approach Discovers Diversity and Dynamism in the Prominent Brown Rot Fungal Pathogens. Front Microbiol 2021; 12:647989. [PMID: 34054750 PMCID: PMC8149612 DOI: 10.3389/fmicb.2021.647989] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Accepted: 03/29/2021] [Indexed: 01/26/2023] Open
Abstract
Monilinia fructicola and Monilinia laxa species are the most destructive and economically devastating fungal plant pathogens causing brown rot disease on stone and pome fruits worldwide. Mitochondrial genomes (mitogenomes) play critical roles influencing the mechanisms and directions of the evolution of fungal pathogens. The pan-mitogenomics approach predicts core and accessory regions of the mitochondrial genomes and explains the gain or loss of variation within and between species. The present study is a fungal pan-mitogenome of M. fructicola (N = 8) and M. laxa (N = 8) species. The completely sequenced and annotated mitogenomes showed high variability in size within and between the species. The mitogenomes of M. laxa were larger, ranging from 178,351 to 179,780bp, than the mitogenomes of M. fructicola, ranging from 158,607 to 167,838bp. However, size variation within the species showed that M. fructicola isolates were more variable in the size range than M. laxa isolates. All the mitogenomes included conserved mitochondrial genes, as well as variable regions including different mobile introns encoding homing endonucleases or maturase, non-coding introns, and repetitive elements. The linear model analysis supported the hypothesis that the mitogenome size expansion is due to presence of variable (accessory) regions. Gene synteny was mostly conserved among all samples, with the exception for order of the rps3 in the mitogenome of one isolate. The mitogenomes presented AT richness; however, A/T and G/C skew varied among the mitochondrial genes. The purifying selection was detected in almost all the protein-coding genes (PCGs) between the species. However, cytochrome b was the only gene showing a positive selection signal among the total samples. Combined datasets of amino acid sequences of 14 core mitochondrial PCGs and rps3 obtained from this study together with published mitochondrial genome sequences from some other species from Heliotales were used to infer a maximum likelihood (ML) phylogenetic tree. ML tree indicated that both Monilinia species highly diverged from each other as well as some other fungal species from the same order. Mitogenomes harbor much information about the evolution of fungal plant pathogens, which could be useful to predict pathogenic life strategies.
Collapse
Affiliation(s)
- Gozde Yildiz
- School of Graduate Studies, MSc Program in Biomolecular Sciences, Çanakkale Onsekiz Mart University, Çanakkale, Turkey
| | - Hilal Ozkilinc
- School of Graduate Studies, MSc Program in Biomolecular Sciences, Çanakkale Onsekiz Mart University, Çanakkale, Turkey.,Faculty of Arts and Sciences, Department of Molecular Biology and Genetics, Çanakkale Onsekiz Mart University, Çanakkale, Turkey
| |
Collapse
|
31
|
Peterson D, Li T, Calvo AM, Yin Y. Categorization of Orthologous Gene Clusters in 92 Ascomycota Genomes Reveals Functions Important for Phytopathogenicity. J Fungi (Basel) 2021; 7:337. [PMID: 33925458 PMCID: PMC8146833 DOI: 10.3390/jof7050337] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 04/23/2021] [Accepted: 04/24/2021] [Indexed: 01/01/2023] Open
Abstract
Phytopathogenic Ascomycota are responsible for substantial economic losses each year, destroying valuable crops. The present study aims to provide new insights into phytopathogenicity in Ascomycota from a comparative genomic perspective. This has been achieved by categorizing orthologous gene groups (orthogroups) from 68 phytopathogenic and 24 non-phytopathogenic Ascomycota genomes into three classes: Core, (pathogen or non-pathogen) group-specific, and genome-specific accessory orthogroups. We found that (i) ~20% orthogroups are group-specific and accessory in the 92 Ascomycota genomes, (ii) phytopathogenicity is not phylogenetically determined, (iii) group-specific orthogroups have more enriched functional terms than accessory orthogroups and this trend is particularly evident in phytopathogenic fungi, (iv) secreted proteins with signal peptides and horizontal gene transfers (HGTs) are the two functional terms that show the highest occurrence and significance in group-specific orthogroups, (v) a number of other functional terms are also identified to have higher significance and occurrence in group-specific orthogroups. Overall, our comparative genomics analysis determined positive enrichment existing between orthogroup classes and revealed a prediction of what genomic characteristics make an Ascomycete phytopathogenic. We conclude that genes shared by multiple phytopathogenic genomes are more important for phytopathogenicity than those that are unique in each genome.
Collapse
Affiliation(s)
- Daniel Peterson
- Department of Biological Sciences, Northern Illinois University, DeKalb, IL 60115, USA;
| | - Tang Li
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska–Lincoln, Lincoln, NE 68588, USA;
| | - Ana M. Calvo
- Department of Biological Sciences, Northern Illinois University, DeKalb, IL 60115, USA;
| | - Yanbin Yin
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska–Lincoln, Lincoln, NE 68588, USA;
| |
Collapse
|
32
|
Glick L, Mayrose I. Panoramic: A package for constructing eukaryotic pan-genomes. Mol Ecol Resour 2021; 21:1393-1403. [PMID: 33533167 DOI: 10.1111/1755-0998.13344] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 01/10/2021] [Accepted: 01/27/2021] [Indexed: 11/28/2022]
Abstract
The study of intraspecific genomic variation in eukaryotic species has been the focus of numerous genome resequencing projects in recent years. One emerging approach for the analysis of intraspecific diversity uses the concept of a pan-genome, which theoretically represents the full set of genomic sequences and coding genes from all individuals of a given species. This approach has many advantages over reference-based methods and has been successfully applied to study both prokaryotic and eukaryotic species. However, the process of pan-genome construction still presents considerable scientific and technical challenges, especially for eukaryotic species with large and complex genomes. Although general approaches for the construction of pan-genomes have been devised, currently available software tools implement only certain modules of the entire computational procedure. Therefore, each pan-genome project requires the development of tailored analysis pipelines, thus complicating and prolonging the process and impairing research reproducibility and comparison across studies. Here, we present Panoramic, a software package for the automatic construction of eukaryotic pan-genomes. Panoramic takes raw sequencing reads as input and applies two alternative approaches for pan-genome construction. Panoramic makes pan-genome construction a considerably easier task by providing simple user interface and efficient data processing algorithms. We demonstrate the use of Panoramic by constructing the pan-genome of the model plant species Arabidopsis thaliana from sequencing data of 20 diverse ecotypes.
Collapse
Affiliation(s)
- Lior Glick
- School of Plant Sciences and Food Security, Department of Life Sciences, Tel-Aviv University, Tel Aviv, Israel
| | - Itay Mayrose
- School of Plant Sciences and Food Security, Department of Life Sciences, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
33
|
Zhong C, Chen C, Wang L, Ning K. Integrating pan-genome with metagenome for microbial community profiling. Comput Struct Biotechnol J 2021; 19:1458-1466. [PMID: 33841754 PMCID: PMC8010324 DOI: 10.1016/j.csbj.2021.02.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 02/24/2021] [Accepted: 02/27/2021] [Indexed: 02/07/2023] Open
Abstract
Advances in sequencing technology have led to the increased availability of genomes and metagenomes, which has greatly facilitated microbial pan-genome and metagenome analysis in the community. In line with this trend, studies on microbial genomes and phenotypes have gradually shifted from individuals to environmental communities. Pan-genomics and metagenomics are powerful strategies for in-depth profiling study of microbial communities. Pan-genomics focuses on genetic diversity, dynamics, and phylogeny at the multi-genome level, while metagenomics profiles the distribution and function of culture-free microbial communities in special environments. Combining pan-genome and metagenome analysis can reveal the microbial complicated connections from an individual complete genome to a mixture of genomes, thereby extending the catalog of traditional individual genomic profile to community microbial profile. Therefore, the combination of pan-genome and metagenome approaches has become a promising method to track the sources of various microbes and decipher the population-level evolution and ecosystem functions. This review summarized the pan-genome and metagenome approaches, the combined strategies of pan-genome and metagenome, and applications of these combined strategies in studies of microbial dynamics, evolution, and function in communities. We discussed emerging strategies for the study of microbial communities that integrate information in both pan-genome and metagenome. We emphasized studies in which the integrating pan-genome with metagenome approach improved the understanding of models of microbial community profiles, both structural and functional. Finally, we illustrated future perspectives of microbial community profile: more advanced analytical techniques, including big-data based artificial intelligence, will lead to an even better understanding of the patterns of microbial communities.
Collapse
Affiliation(s)
- Chaofang Zhong
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China.,Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Chaoyun Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Lusheng Wang
- Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China.,City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
| | - Kang Ning
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| |
Collapse
|
34
|
Lind AL, Pollard KS. Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing. MICROBIOME 2021; 9:58. [PMID: 33658077 PMCID: PMC7931531 DOI: 10.1186/s40168-021-01015-y] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 02/02/2021] [Indexed: 05/08/2023]
Abstract
BACKGROUND Microbial eukaryotes are found alongside bacteria and archaea in natural microbial systems, including host-associated microbiomes. While microbial eukaryotes are critical to these communities, they are challenging to study with shotgun sequencing techniques and are therefore often excluded. RESULTS Here, we present EukDetect, a bioinformatics method to identify eukaryotes in shotgun metagenomic sequencing data. Our approach uses a database of 521,824 universal marker genes from 241 conserved gene families, which we curated from 3713 fungal, protist, non-vertebrate metazoan, and non-streptophyte archaeplastida genomes and transcriptomes. EukDetect has a broad taxonomic coverage of microbial eukaryotes, performs well on low-abundance and closely related species, and is resilient against bacterial contamination in eukaryotic genomes. Using EukDetect, we describe the spatial distribution of eukaryotes along the human gastrointestinal tract, showing that fungi and protists are present in the lumen and mucosa throughout the large intestine. We discover that there is a succession of eukaryotes that colonize the human gut during the first years of life, mirroring patterns of developmental succession observed in gut bacteria. By comparing DNA and RNA sequencing of paired samples from human stool, we find that many eukaryotes continue active transcription after passage through the gut, though some do not, suggesting they are dormant or nonviable. We analyze metagenomic data from the Baltic Sea and find that eukaryotes differ across locations and salinity gradients. Finally, we observe eukaryotes in Arabidopsis leaf samples, many of which are not identifiable from public protein databases. CONCLUSIONS EukDetect provides an automated and reliable way to characterize eukaryotes in shotgun sequencing datasets from diverse microbiomes. We demonstrate that it enables discoveries that would be missed or clouded by false positives with standard shotgun sequence analysis. EukDetect will greatly advance our understanding of how microbial eukaryotes contribute to microbiomes. Video abstract.
Collapse
Affiliation(s)
- Abigail L Lind
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA, USA
| | - Katherine S Pollard
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA, USA.
- Institute for Human Genetics, University of California, San Francisco, CA, USA.
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA.
- Institute for Computational Health Sciences, University of California, San Francisco, CA, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|
35
|
Gerdol M, Moreira R, Cruz F, Gómez-Garrido J, Vlasova A, Rosani U, Venier P, Naranjo-Ortiz MA, Murgarella M, Greco S, Balseiro P, Corvelo A, Frias L, Gut M, Gabaldón T, Pallavicini A, Canchaya C, Novoa B, Alioto TS, Posada D, Figueras A. Massive gene presence-absence variation shapes an open pan-genome in the Mediterranean mussel. Genome Biol 2020; 21:275. [PMID: 33168033 PMCID: PMC7653742 DOI: 10.1186/s13059-020-02180-3] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Accepted: 10/15/2020] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND The Mediterranean mussel Mytilus galloprovincialis is an ecologically and economically relevant edible marine bivalve, highly invasive and resilient to biotic and abiotic stressors causing recurrent massive mortalities in other bivalves. Although these traits have been recently linked with the maintenance of a high genetic variation within natural populations, the factors underlying the evolutionary success of this species remain unclear. RESULTS Here, after the assembly of a 1.28-Gb reference genome and the resequencing of 14 individuals from two independent populations, we reveal a complex pan-genomic architecture in M. galloprovincialis, with a core set of 45,000 genes plus a strikingly high number of dispensable genes (20,000) subject to presence-absence variation, which may be entirely missing in several individuals. We show that dispensable genes are associated with hemizygous genomic regions affected by structural variants, which overall account for nearly 580 Mb of DNA sequence not included in the reference genome assembly. As such, this is the first study to report the widespread occurrence of gene presence-absence variation at a whole-genome scale in the animal kingdom. CONCLUSIONS Dispensable genes usually belong to young and recently expanded gene families enriched in survival functions, which might be the key to explain the resilience and invasiveness of this species. This unique pan-genome architecture is characterized by dispensable genes in accessory genomic regions that exceed by orders of magnitude those observed in other metazoans, including humans, and closely mirror the open pan-genomes found in prokaryotes and in a few non-metazoan eukaryotes.
Collapse
Affiliation(s)
- Marco Gerdol
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
| | - Rebeca Moreira
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| | - Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Jessica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Anna Vlasova
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - Umberto Rosani
- Department of Biology, Università degli Studi di Padova, Via Ugo Bassi 58/B, 35131 Padova, Italy
| | - Paola Venier
- Department of Biology, Università degli Studi di Padova, Via Ugo Bassi 58/B, 35131 Padova, Italy
| | - Miguel A. Naranjo-Ortiz
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Maria Murgarella
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| | - Samuele Greco
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
| | - Pablo Balseiro
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
- Norce Norwegian Research Centre AS, Bergen, Norway
| | - André Corvelo
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- New York Genome Center, New York, NY 10013 USA
| | - Leonor Frias
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Toni Gabaldón
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
- Current address: Barelona Supercomputing Centre (BSC-CNS) and Institute for Research in Biomedicine (IRB), 08034 Barcelona, Spain
| | - Alberto Pallavicini
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
- Anton Dohrn Zoological Station, 80121 Villa Comunale, Naples, Italy
| | - Carlos Canchaya
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
- Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute, 36310 Vigo, Spain
| | - Beatriz Novoa
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| | - Tyler S. Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
- Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute, 36310 Vigo, Spain
| | - Antonio Figueras
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| |
Collapse
|
36
|
Naranjo‐Ortiz MA, Gabaldón T. Fungal evolution: cellular, genomic and metabolic complexity. Biol Rev Camb Philos Soc 2020; 95:1198-1232. [PMID: 32301582 PMCID: PMC7539958 DOI: 10.1111/brv.12605] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 03/31/2020] [Accepted: 04/02/2020] [Indexed: 12/13/2022]
Abstract
The question of how phenotypic and genomic complexity are inter-related and how they are shaped through evolution is a central question in biology that historically has been approached from the perspective of animals and plants. In recent years, however, fungi have emerged as a promising alternative system to address such questions. Key to their ecological success, fungi present a broad and diverse range of phenotypic traits. Fungal cells can adopt many different shapes, often within a single species, providing them with great adaptive potential. Fungal cellular organizations span from unicellular forms to complex, macroscopic multicellularity, with multiple transitions to higher or lower levels of cellular complexity occurring throughout the evolutionary history of fungi. Similarly, fungal genomes are very diverse in their architecture. Deep changes in genome organization can occur very quickly, and these phenomena are known to mediate rapid adaptations to environmental changes. Finally, the biochemical complexity of fungi is huge, particularly with regard to their secondary metabolites, chemical products that mediate many aspects of fungal biology, including ecological interactions. Herein, we explore how the interplay of these cellular, genomic and metabolic traits mediates the emergence of complex phenotypes, and how this complexity is shaped throughout the evolutionary history of Fungi.
Collapse
Affiliation(s)
- Miguel A. Naranjo‐Ortiz
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyDr. Aiguader 88, Barcelona08003Spain
| | - Toni Gabaldón
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyDr. Aiguader 88, Barcelona08003Spain
- Department of Experimental Sciences, Universitat Pompeu Fabra (UPF)Dr. Aiguader 88, 08003BarcelonaSpain
- ICREAPg. Lluís Companys 23, 08010BarcelonaSpain
| |
Collapse
|
37
|
Fang H, Xu JB, Nie Y, Wu XL. Pan-genomic analysis reveals that the evolution of Dietzia species depends on their living habitats. Environ Microbiol 2020; 23:861-877. [PMID: 32715552 DOI: 10.1111/1462-2920.15176] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 07/20/2020] [Accepted: 07/23/2020] [Indexed: 12/14/2022]
Abstract
The bacterial genus Dietzia is widely distributed in various environments. The genomes of 26 diverse strains of Dietzia, including almost all the type strains, were analysed in this study. This analysis revealed a lipid metabolism gene richness, which could explain the ability of Dietzia to live in oil related environments. The pan-genome consists of 83,976 genes assigned into 10,327 gene families, 792 of which are shared by all the genomes of Dietzia. Mathematical extrapolation of the data suggests that the Dietzia pan-genome is open. Both gene duplication and gene loss contributed to the open pan-genome, while horizontal gene transfer was limited. Dietzia strains primarily gained their diverse metabolic capacity through more ancient gene duplications. Phylogenetic analysis of Dietzia isolated from aquatic and terrestrial environments showed two distinct clades from the same ancestor. The genome sizes of Dietzia strains from aquatic environments were significantly larger than those from terrestrial environments, which was mainly due to the occurrence of more gene loss events during the evolutionary progress of the strains from terrestrial environments. The evolutionary history of Dietzia was tightly coupled to environmental conditions, and iron concentrations should be one of the key factors shaping the genomes of the Dietzia lineages.
Collapse
Affiliation(s)
- Hui Fang
- College of Engineering, Peking University, Beijing, 100871, China
| | - Jin-Bo Xu
- College of Engineering, Peking University, Beijing, 100871, China
| | - Yong Nie
- College of Engineering, Peking University, Beijing, 100871, China
| | - Xiao-Lei Wu
- College of Engineering, Peking University, Beijing, 100871, China.,Institute of Ocean Research, Peking University, Beijing, 100871, China
| |
Collapse
|
38
|
Badet T, Croll D. The rise and fall of genes: origins and functions of plant pathogen pangenomes. CURRENT OPINION IN PLANT BIOLOGY 2020; 56:65-73. [PMID: 32480355 DOI: 10.1016/j.pbi.2020.04.009] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 04/14/2020] [Accepted: 04/18/2020] [Indexed: 06/11/2023]
Abstract
Plant pathogens can rapidly overcome resistance of their hosts by mutating key pathogenicity genes encoding for effectors. Pathogen adaptation is fuelled by extensive genetic variability in populations and different strains may not share the same set of genes. Recently, such an intra-specific variation in gene content became formalized as pangenomes distinguishing core genes (i.e. shared) and accessory genes (i.e. lineage or strain-specific). Across pathogens species, key effectors tend to be part of the rapidly evolving accessory genome. Here, we show how the construction and analysis of pathogen pangenomes provide deep insights into the dynamic host adaptation process. We also discuss how pangenomes should ideally be built and how geography, niche and lifestyle likely determine pangenome sizes.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Switzerland.
| |
Collapse
|
39
|
Wang M, Fu H, Ruan R. A Small Horizontally Transferred Gene Cluster Contributes to the Sporulation of Alternaria alternata. Genome Biol Evol 2020; 11:3436-3444. [PMID: 31764979 PMCID: PMC6916707 DOI: 10.1093/gbe/evz257] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/23/2019] [Indexed: 12/17/2022] Open
Abstract
Horizontal gene transfer (HGT) has been identified as an important source of genomic innovation in fungi. However, how HGT drove the evolution of Alternaria alternata, a necrotrophic fungus which can be ubiquitously isolated from soil and various plants and decaying plant materials is largely known. In this study, we identified 12 protein-encoding genes that are likely acquired from lineages outside Pezizomycotina. Phylogenetic trees and approximately unbiased comparative topology tests strongly supported the evolutionary origin of these genes. According to their predicted functions, these HGT candidates are involved in nitrogen and carbohydrate metabolism. Especially, five genes of them were likely transferred as a physically linked cluster from Tremellales (Basidiomycota). Functionally knocking out the five-gene cluster in an A. alternata isolate causing citrus brown spot resulted in an 80% decrease in asexual spore production in the deletion mutant. We further knocked out each of these five genes in this cluster and the resultant single-gene deletion mutants exhibited a various degree of reduction in spore production. Except for conidiation, functions of these genes associated with vegetative growth, stress tolerance, and virulence are very limited. Our results provide new evidence that HGT has played important roles over the course of the evolution of filamentous fungi.
Collapse
Affiliation(s)
- Mingshuang Wang
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, China
| | - Huilan Fu
- Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Ruoxin Ruan
- Institute of Biotechnology, Zhejiang University, Hangzhou, China.,Hangzhou Academy of Agricultural Sciences, Hangzhou, China
| |
Collapse
|
40
|
Badet T, Oggenfuss U, Abraham L, McDonald BA, Croll D. A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici. BMC Biol 2020; 18:12. [PMID: 32046716 PMCID: PMC7014611 DOI: 10.1186/s12915-020-0744-3] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/27/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The gene content of a species largely governs its ecological interactions and adaptive potential. A species is therefore defined by both core genes shared between all individuals and accessory genes segregating presence-absence variation. There is growing evidence that eukaryotes, similar to bacteria, show intra-specific variability in gene content. However, it remains largely unknown how functionally relevant such a pangenome structure is for eukaryotes and what mechanisms underlie the emergence of highly polymorphic genome structures. RESULTS Here, we establish a reference-quality pangenome of a fungal pathogen of wheat based on 19 complete genomes from isolates sampled across six continents. Zymoseptoria tritici causes substantial worldwide losses to wheat production due to rapidly evolved tolerance to fungicides and evasion of host resistance. We performed transcriptome-assisted annotations of each genome to construct a global pangenome. Major chromosomal rearrangements are segregating within the species and underlie extensive gene presence-absence variation. Conserved orthogroups account for only ~ 60% of the species pangenome. Investigating gene functions, we find that the accessory genome is enriched for pathogenesis-related functions and encodes genes involved in metabolite production, host tissue degradation and manipulation of the immune system. De novo transposon annotation of the 19 complete genomes shows that the highly diverse chromosomal structure is tightly associated with transposable element content. Furthermore, transposable element expansions likely underlie recent genome expansions within the species. CONCLUSIONS Taken together, our work establishes a highly complex eukaryotic pangenome providing an unprecedented toolbox to study how pangenome structure impacts crop-pathogen interactions.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| |
Collapse
|
41
|
Badet T, Oggenfuss U, Abraham L, McDonald BA, Croll D. A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici. BMC Biol 2020; 18:12. [PMID: 32046716 DOI: 10.1101/803098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/27/2020] [Indexed: 05/26/2023] Open
Abstract
BACKGROUND The gene content of a species largely governs its ecological interactions and adaptive potential. A species is therefore defined by both core genes shared between all individuals and accessory genes segregating presence-absence variation. There is growing evidence that eukaryotes, similar to bacteria, show intra-specific variability in gene content. However, it remains largely unknown how functionally relevant such a pangenome structure is for eukaryotes and what mechanisms underlie the emergence of highly polymorphic genome structures. RESULTS Here, we establish a reference-quality pangenome of a fungal pathogen of wheat based on 19 complete genomes from isolates sampled across six continents. Zymoseptoria tritici causes substantial worldwide losses to wheat production due to rapidly evolved tolerance to fungicides and evasion of host resistance. We performed transcriptome-assisted annotations of each genome to construct a global pangenome. Major chromosomal rearrangements are segregating within the species and underlie extensive gene presence-absence variation. Conserved orthogroups account for only ~ 60% of the species pangenome. Investigating gene functions, we find that the accessory genome is enriched for pathogenesis-related functions and encodes genes involved in metabolite production, host tissue degradation and manipulation of the immune system. De novo transposon annotation of the 19 complete genomes shows that the highly diverse chromosomal structure is tightly associated with transposable element content. Furthermore, transposable element expansions likely underlie recent genome expansions within the species. CONCLUSIONS Taken together, our work establishes a highly complex eukaryotic pangenome providing an unprecedented toolbox to study how pangenome structure impacts crop-pathogen interactions.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| |
Collapse
|
42
|
Golicz AA, Bayer PE, Bhalla PL, Batley J, Edwards D. Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications. Trends Genet 2019; 36:132-145. [PMID: 31882191 DOI: 10.1016/j.tig.2019.11.006] [Citation(s) in RCA: 98] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 11/09/2019] [Accepted: 11/12/2019] [Indexed: 02/01/2023]
Abstract
The pangenome refers to a collection of genomic sequence found in the entire species or population rather than in a single individual; the sequence can be core, present in all individuals, or accessory (variable or dispensable), found in a subset of individuals only. While pangenomic studies were first undertaken in bacterial species, developments in genome sequencing and assembly approaches have allowed construction of pangenomes for eukaryotic organisms, fungi, plants, and animals, including two large-scale human pangenome projects. Analysis of the these pangenomes revealed key differences, most likely stemming from divergent evolutionary histories, but also surprising similarities.
Collapse
Affiliation(s)
- Agnieszka A Golicz
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia.
| | - Philipp E Bayer
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - Prem L Bhalla
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia.
| |
Collapse
|
43
|
Kovalev N, Pogany J, Nagy PD. Interviral Recombination between Plant, Insect, and Fungal RNA Viruses: Role of the Intracellular Ca 2+/Mn 2+ Pump. J Virol 2019; 94:e01015-19. [PMID: 31597780 PMCID: PMC6912095 DOI: 10.1128/jvi.01015-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 10/07/2019] [Indexed: 01/18/2023] Open
Abstract
Recombination is one of the driving forces of viral evolution. RNA recombination events among similar RNA viruses are frequent, although RNA recombination could also take place among unrelated viruses. In this paper, we have established efficient interviral recombination systems based on yeast and plants. We show that diverse RNA viruses, including the plant viruses tomato bushy stunt virus, carnation Italian ringspot virus, and turnip crinkle virus-associated RNA; the insect plus-strand RNA [(+)RNA] viruses Flock House virus and Nodamura virus; and the double-stranded L-A virus of yeast, are involved in interviral recombination events. Most interviral recombinants are minus-strand recombinant RNAs, and the junction sites are not randomly distributed, but there are certain hot spot regions. Formation of interviral recombinants in yeast and plants is accelerated by depletion of the cellular SERCA-like Pmr1 ATPase-driven Ca2+/Mn2+ pump, regulating intracellular Ca2+ and Mn2+ influx into the Golgi apparatus from the cytosol. The interviral recombinants are generated by a template-switching mechanism during RNA replication by the viral replicase. Replication studies revealed that a group of interviral recombinants is replication competent in cell-free extracts, in yeast, and in the plant Nicotiana benthamiana We propose that there are major differences among the viral replicases to generate and maintain interviral recombinants. Altogether, the obtained data promote the model that host factors greatly contribute to the formation of recombinants among related and unrelated viruses. This is the first time that a host factor's role in affecting interviral recombination is established.IMPORTANCE Viruses with RNA genomes are abundant, and their genomic sequences show astonishing variation. Genetic recombination in RNA viruses is a major force behind their rapid evolution, enhanced pathogenesis, and adaptation to their hosts. We utilized a previously identified intracellular Ca2+/Mn2+ pump-deficient yeast to search for interviral recombinants. Noninfectious viral replication systems were used to avoid generating unwanted infectious interviral recombinants. Altogether, interviral RNA recombinants were observed between plant and insect viruses, and between a fungal double-stranded RNA (dsRNA) virus and an insect virus, in the yeast host. In addition, interviral recombinants between two plant virus replicon RNAs were identified in N. benthamiana plants, in which the intracellular Ca2+/Mn2+ pump was depleted. These findings underline the crucial role of the host in promoting RNA recombination among unrelated viruses.
Collapse
Affiliation(s)
- Nikolay Kovalev
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky, USA
| | - Judit Pogany
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky, USA
| | - Peter D Nagy
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky, USA
| |
Collapse
|
44
|
Whole Genome Sequence of the Commercially Relevant Mushroom Strain Agaricus bisporus var. bisporus ARP23. G3-GENES GENOMES GENETICS 2019; 9:3057-3066. [PMID: 31371382 PMCID: PMC6778795 DOI: 10.1534/g3.119.400563] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Agaricus bisporus is an extensively cultivated edible mushroom. Demand for cultivation is continuously growing and difficulties associated with breeding programs now means strains are effectively considered monoculture. While commercial growing practices are highly efficient and tightly controlled, the over-use of a single strain has led to a variety of disease outbreaks from a range of pathogens including bacteria, fungi and viruses. To address this, the Agaricus Resource Program (ARP) was set up to collect wild isolates from diverse geographical locations through a bounty-driven scheme to create a repository of wild Agaricus germplasm. One of the strains collected, Agaricus bisporus var. bisporus ARP23, has been crossed extensively with white commercial varieties leading to the generation of a novel hybrid with a dark brown pileus commonly referred to as 'Heirloom'. Heirloom has been successfully implemented into commercial mushroom cultivation. In this study the whole genome of Agaricus bisporus var. bisporus ARP23 was sequenced and assembled with Illumina and PacBio sequencing technology. The final genome was found to be 33.49 Mb in length and have significant levels of synteny to other sequenced Agaricus bisporus strains. Overall, 13,030 putative protein coding genes were located and annotated. Relative to the other A. bisporus genomes that are currently available, Agaricus bisporus var. bisporus ARP23 is the largest A. bisporus strain in terms of gene number and genetic content sequenced to date. Comparative genomic analysis shows that the A. bisporus mating loci in unifactorial and unsurprisingly highly conserved between strains. The lignocellulolytic gene content of all A. bisporus strains compared is also very similar. Our results show that the pangenome structure of A. bisporus is quite diverse with between 60-70% of the total protein coding genes per strain considered as being orthologous and syntenically conserved. These analyses and the genome sequence described herein are the starting point for more detailed molecular analyses into the growth and phenotypical responses of Agaricus bisporus var. bisporus ARP23 when challenged with economically important mycoviruses.
Collapse
|
45
|
McCarthy CGP, Fitzpatrick DA. Pangloss: A Tool for Pan-Genome Analysis of Microbial Eukaryotes. Genes (Basel) 2019; 10:E521. [PMID: 31295964 PMCID: PMC6678930 DOI: 10.3390/genes10070521] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 07/05/2019] [Accepted: 07/05/2019] [Indexed: 12/27/2022] Open
Abstract
Although the pan-genome concept originated in prokaryote genomics, an increasing number of eukaryote species pan-genomes have also been analysed. However, there is a relative lack of software intended for eukaryote pan-genome analysis compared to that available for prokaryotes. In a previous study, we analysed the pan-genomes of four model fungi with a computational pipeline that constructed pan-genomes using the synteny-dependent Pan-genome Ortholog Clustering Tool (PanOCT) approach. Here, we present a modified and improved version of that pipeline which we have called Pangloss. Pangloss can perform gene prediction for a set of genomes from a given species that the user provides, constructs and optionally refines a species pan-genome from that set using PanOCT, and can perform various functional characterisation and visualisation analyses of species pan-genome data. To demonstrate Pangloss's capabilities, we constructed and analysed a species pan-genome for the oleaginous yeast Yarrowialipolytica and also reconstructed a previously-published species pan-genome for the opportunistic respiratory pathogen Aspergillus fumigatus. Pangloss is implemented in Python, Perl and R and is freely available under an open source GPLv3 licence via GitHub.
Collapse
Affiliation(s)
- Charley G P McCarthy
- Genome Evolution Laboratory, Department of Biology, Maynooth University, W23 F2K8 Maynooth, Co. Kildare, Ireland.
- Human Health Research Institute, Maynooth University, W23 F2K8 Maynooth, Co. Kildare, Ireland.
| | - David A Fitzpatrick
- Genome Evolution Laboratory, Department of Biology, Maynooth University, W23 F2K8 Maynooth, Co. Kildare, Ireland
- Human Health Research Institute, Maynooth University, W23 F2K8 Maynooth, Co. Kildare, Ireland
| |
Collapse
|