1
|
Coomber A, Saville A, Ristaino JB. Evolution of Phytophthora infestans on its potato host since the Irish potato famine. Nat Commun 2024; 15:6488. [PMID: 39103347 DOI: 10.1038/s41467-024-50749-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 07/18/2024] [Indexed: 08/07/2024] Open
Abstract
Phytophthora infestans is a major oomycete plant pathogen, responsible for potato late blight, which led to the Irish Potato Famine from 1845-1852. Since then, potatoes resistant to this disease have been bred and deployed worldwide. Their resistance (R) genes recognize pathogen effectors responsible for virulence and then induce a plant response stopping disease progression. However, most deployed R genes are quickly overcome by the pathogen. We use targeted sequencing of effector and R genes on herbarium specimens to examine the joint evolution in both P. infestans and potato from 1845-1954. Currently relevant effectors are historically present in P. infestans, but with alternative alleles compared to modern reference genomes. The historic FAM-1 lineage has the virulent Avr1 allele and the ability to break the R1 resistance gene before breeders deployed it in potato. The FAM-1 lineage is diploid, but later, triploid US-1 lineages appear. We show that pathogen virulence genes and host resistance genes have undergone significant changes since the Famine, from both natural and artificial selection.
Collapse
Affiliation(s)
- Allison Coomber
- Department of Entomology and Plant Pathology, NC State University, Raleigh, NC, USA
- Functional Genomics Program, NC State University, Raleigh, NC, USA
| | - Amanda Saville
- Department of Entomology and Plant Pathology, NC State University, Raleigh, NC, USA
| | - Jean Beagle Ristaino
- Department of Entomology and Plant Pathology, NC State University, Raleigh, NC, USA.
- Emerging Plant Disease and Global Food Security Cluster, NC State University, Raleigh, NC, USA.
| |
Collapse
|
2
|
Li W, Koshkarov A, Tahiri N. Comparison of phylogenetic trees defined on different but mutually overlapping sets of taxa: A review. Ecol Evol 2024; 14:e70054. [PMID: 39119174 PMCID: PMC11307105 DOI: 10.1002/ece3.70054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 07/03/2024] [Accepted: 07/10/2024] [Indexed: 08/10/2024] Open
Abstract
Phylogenetic trees represent the evolutionary relationships and ancestry of various species or groups of organisms. Comparing these trees by measuring the distance between them is essential for applications such as tree clustering and the Tree of Life project. Many distance metrics for phylogenetic trees focus on trees defined on the same set of taxa. However, some problems require calculating distances between trees with different but overlapping sets of taxa. This study reviews state-of-the-art distance measures for such trees, covering six major approaches, including the constraint-based Robinson-Foulds (RF) distance RF(-), the completion-based RF(+), the generalized RF (GRF), the dissimilarity measure, the vectorial tree distance, and the geodesic distance in the extended Billera-Holmes-Vogtmann tree space. Among these, three RF-based methods, RF(-), RF(+), and GRF, were examined in detail on generated clusters of phylogenetic trees defined on different but mutually overlapping sets of taxa. Additionally, we reviewed nine related techniques, including leaf imputation methods, the tree edit distance, and visual comparison. A comparison of the related distance measures, highlighting their principal advantages and shortcomings, is provided. This review offers valuable insights into their applicability and performance, guiding the appropriate use of these metrics based on tree type (rooted or unrooted) and information type (topological or branch lengths).
Collapse
Affiliation(s)
- Wanlin Li
- Department of Computer ScienceUniversity of SherbrookeSherbrookeQuebecCanada
| | - Aleksandr Koshkarov
- Department of Computer ScienceUniversity of SherbrookeSherbrookeQuebecCanada
| | - Nadia Tahiri
- Department of Computer ScienceUniversity of SherbrookeSherbrookeQuebecCanada
| |
Collapse
|
3
|
Romeo F, Spetter MJ, Pereyra SB, Morán PE, González Altamiranda EA, Louge Uriarte EL, Odeón AC, Pérez SE, Verna AE. Whole Genome Sequence-Based Analysis of Bovine Gammaherpesvirus 4 Isolated from Bovine Abortions. Viruses 2024; 16:739. [PMID: 38793621 PMCID: PMC11125609 DOI: 10.3390/v16050739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 04/24/2024] [Accepted: 04/28/2024] [Indexed: 05/26/2024] Open
Abstract
Bovine gammaherpesvirus 4 (BoGHV4) is a member of the Gammaherspivirinae subfamily, Rhadinovirus genus. Its natural host is the bovine, and it is prevalent among the global cattle population. Although the complete genome of BoGHV4 has been successfully sequenced, the functions of most of its genes remain unknown. Currently, only six strains of BoGHV4, all belonging to Genotype 1, have been sequenced. This is the first report of the nearly complete genome of Argentinean BoGHV4 strains isolated from clinical cases of abortion, representing the first BoGHV4 Genotype 2 and 3 genomes described in the literature. Both Argentinean isolates presented the highest nt p-distance values, indicating a greater level of divergence. Overall, the considerable diversity observed in the complete genomes and open reading frames underscores the distinctiveness of both Argentinean isolates compared to the existing BoGHV4 genomes. These findings support previous studies that categorized the Argentinean BoGHV4 strains 07-435 and 10-154 as Genotypes 3 and 2, respectively. The inclusion of these sequences represents a significant expansion to the currently limited pool of BoGHV4 genomes while providing an important basis to increase the knowledge of local isolates.
Collapse
Affiliation(s)
- Florencia Romeo
- Instituto Nacional de Tecnología Agropecuaria, Instituto de Innovación para la Producción Agropecuaria y El Desarrollo Sostenible (IPADS, INTA-CONICET) Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina (E.L.L.U.)
| | - Maximiliano Joaquín Spetter
- Facultad de Ciencias Veterinarias, Departamento de Fisiopatología, Centro de Investigación Veterinaria de Tandil (CIVETAN), Universidad Nacional del Centro de la Provincia de Buenos Aires, Paraje Arroyo Seco s/n, Tandil CC7000, Buenos Aires, Argentina
| | - Susana Beatriz Pereyra
- Instituto Nacional de Tecnología Agropecuaria, Instituto de Innovación para la Producción Agropecuaria y El Desarrollo Sostenible (IPADS, INTA-CONICET) Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina (E.L.L.U.)
| | - Pedro Edgardo Morán
- Laboratorio de Virología, Facultad de Ciencias Veterinarias, Centro de Investigación Veterinaria de Tandil (CIVETAN), Universidad Nacional del Centro de la Provincia de Buenos Aires, Paraje Arroyo Seco s/n, Tandil CC7000, Buenos Aires, Argentina
| | - Erika Analía González Altamiranda
- Instituto Nacional de Tecnología Agropecuaria, Instituto de Innovación para la Producción Agropecuaria y El Desarrollo Sostenible (IPADS, INTA-CONICET) Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina (E.L.L.U.)
| | - Enrique Leopoldo Louge Uriarte
- Instituto Nacional de Tecnología Agropecuaria, Instituto de Innovación para la Producción Agropecuaria y El Desarrollo Sostenible (IPADS, INTA-CONICET) Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina (E.L.L.U.)
| | - Anselmo Carlos Odeón
- Facultad de Ciencias Agrarias, Universidad Nacional de Mar del Plata, Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina
| | - Sandra Elizabeth Pérez
- Laboratorio de Virología, Facultad de Ciencias Veterinarias, Centro de Investigación Veterinaria de Tandil (CIVETAN), Universidad Nacional del Centro de la Provincia de Buenos Aires, Paraje Arroyo Seco s/n, Tandil CC7000, Buenos Aires, Argentina
| | - Andrea Elizabeth Verna
- Instituto Nacional de Tecnología Agropecuaria, Instituto de Innovación para la Producción Agropecuaria y El Desarrollo Sostenible (IPADS, INTA-CONICET) Ruta 226, km 73.5, Balcarce CC7620, Buenos Aires, Argentina (E.L.L.U.)
| |
Collapse
|
4
|
Culbertson EM, Levin TC. Eukaryotic CD-NTase, STING, and viperin proteins evolved via domain shuffling, horizontal transfer, and ancient inheritance from prokaryotes. PLoS Biol 2023; 21:e3002436. [PMID: 38064485 PMCID: PMC10732462 DOI: 10.1371/journal.pbio.3002436] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 12/20/2023] [Accepted: 11/20/2023] [Indexed: 12/21/2023] Open
Abstract
Animals use a variety of cell-autonomous innate immune proteins to detect viral infections and prevent replication. Recent studies have discovered that a subset of mammalian antiviral proteins have homology to antiphage defense proteins in bacteria, implying that there are aspects of innate immunity that are shared across the Tree of Life. While the majority of these studies have focused on characterizing the diversity and biochemical functions of the bacterial proteins, the evolutionary relationships between animal and bacterial proteins are less clear. This ambiguity is partly due to the long evolutionary distances separating animal and bacterial proteins, which obscures their relationships. Here, we tackle this problem for 3 innate immune families (CD-NTases [including cGAS], STINGs, and viperins) by deeply sampling protein diversity across eukaryotes. We find that viperins and OAS family CD-NTases are ancient immune proteins, likely inherited since the earliest eukaryotes first arose. In contrast, we find other immune proteins that were acquired via at least 4 independent events of horizontal gene transfer (HGT) from bacteria. Two of these events allowed algae to acquire new bacterial viperins, while 2 more HGT events gave rise to distinct superfamilies of eukaryotic CD-NTases: the cGLR superfamily (containing cGAS) that has since diversified via a series of animal-specific duplications and a previously undefined eSMODS superfamily, which more closely resembles bacterial CD-NTases. Finally, we found that cGAS and STING proteins have substantially different histories, with STING protein domains undergoing convergent domain shuffling in bacteria and eukaryotes. Overall, our findings paint a picture of eukaryotic innate immunity as highly dynamic, where eukaryotes build upon their ancient antiviral repertoires through the reuse of protein domains and by repeatedly sampling a rich reservoir of bacterial antiphage genes.
Collapse
Affiliation(s)
- Edward M. Culbertson
- University of Pittsburgh, Department of Biological Sciences, Pittsburgh, Pennsylvania, United States of America
| | - Tera C. Levin
- University of Pittsburgh, Department of Biological Sciences, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
5
|
Fruzangohar M, Moolhuijzen P, Bakaj N, Taylor J. CoreDetector: a flexible and efficient program for core-genome alignment of evolutionary diverse genomes. Bioinformatics 2023; 39:btad628. [PMID: 37878789 PMCID: PMC10663985 DOI: 10.1093/bioinformatics/btad628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 09/20/2023] [Accepted: 10/23/2023] [Indexed: 10/27/2023] Open
Abstract
MOTIVATION Whole genome alignment of eukaryote species remains an important method for the determination of sequence and structural variations and can also be used to ascertain the representative non-redundant core-genome sequence of a population. Many whole genome alignment tools were first developed for the more mature analysis of prokaryote species with few current tools containing the functionality to process larger genomes of eukaryotes as well as genomes of more divergent species. In addition, the functionality of these tools becomes computationally prohibitive due to the significant compute resources needed to handle larger genomes. RESULTS In this research, we present CoreDetector, an easy-to-use general-purpose program that can align the core-genome sequences for a range of genome sizes and divergence levels. To illustrate the flexibility of CoreDetector, we conducted alignments of a large set of closely related fungal pathogen and hexaploid wheat cultivar genomes as well as more divergent fly and rodent species genomes. In all cases, compared to existing multiple genome alignment tools, CoreDetector exhibited improved flexibility, efficiency, and competitive accuracy in tested cases. AVAILABILITY AND IMPLEMENTATION CoreDetector was developed in the cross platform, and easily deployable, Java language. A packaged pipeline is readily executable in a bash terminal without any external need for Perl or Python environments. Installation, example data, and usage instructions for CoreDetector are freely available from https://github.com/mfruzan/CoreDetector.
Collapse
Affiliation(s)
- Mario Fruzangohar
- The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, South Australia 5064, Australia
| | - Paula Moolhuijzen
- Centre for Crop Disease Management, School of Molecular and Life Sciences, Curtin University, Bentley, Western Australia 6102, Australia
| | - Nicolette Bakaj
- The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, South Australia 5064, Australia
| | - Julian Taylor
- The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, South Australia 5064, Australia
| |
Collapse
|
6
|
Shaukat MA, Nguyen TT, Hsu EB, Yang S, Bhatti A. Comparative study of encoded and alignment-based methods for virus taxonomy classification. Sci Rep 2023; 13:18662. [PMID: 37907535 PMCID: PMC10618506 DOI: 10.1038/s41598-023-45461-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 10/19/2023] [Indexed: 11/02/2023] Open
Abstract
The emergence of viruses and their variants has made virus taxonomy more important than ever before in controlling the spread of diseases. The creation of efficient treatments and cures that target particular virus properties can be aided by understanding virus taxonomy. Alignment-based methods are commonly used for this task, but are computationally expensive and time-consuming, especially when dealing with large datasets or when detecting new virus variants is time sensitive. An alternative approach, the encoded method, has been developed that does not require prior sequence alignment and provides faster results. However, each encoded method has its own claimed accuracy. Therefore, careful evaluation and comparison of the performance of different encoded methods are essential to identify the most accurate and reliable approach for virus taxonomy classification. This study aims to address this issue by providing a comprehensive and comparative analysis of the potential of encoded methods for virus classification and phylogenetics. We compared the vectors generated for each encoded method using distance metrics to determine their similarity to alignment-based methods. The results and their validation show that K-merNV followed by CgrDft encoded methods, perform similarly to state-of-the-art multi-sequence alignment methods. This is the first study to incorporate and compare encoded methods that will facilitate future research in making more informed decisions regarding selection of a suitable method for virus taxonomy.
Collapse
Affiliation(s)
- Muhammad Arslan Shaukat
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Victoria, Australia.
| | - Thanh Thi Nguyen
- Faculty of Information Technology, Monash University, Victoria, Australia
| | - Edbert B Hsu
- Department of Emergency Medicine, Johns Hopkins University, Maryland, USA
| | - Samuel Yang
- Department of Emergency Medicine, Stanford University, California, USA
| | - Asim Bhatti
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Victoria, Australia
| |
Collapse
|
7
|
Becker AS, Silva Júnior JVJ, Weiblen R, Flores EF. An appraisal of gene targets for phylogenetic classification of canine distemper virus: Is the hemagglutinin the best candidate? Virus Res 2023; 325:199043. [PMID: 36634899 DOI: 10.1016/j.virusres.2023.199043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 01/04/2023] [Accepted: 01/07/2023] [Indexed: 01/11/2023]
Abstract
Sequence analysis of the canine distemper virus (CDV) hemagglutinin (H) gene may provide important insights on virus-host interactions and has also been frequently used for CDV phylogenetic classification. Herein, we performed an in silico analysis of CDV complete genomes (CGs) available in GenBank in order to investigate the suitability of H for CDV classification into lineages/genotypes. In addition, we analyzed the other viral genes for their potential use in CDV classification. Initially, we collected 116 CDV CGs from GenBank and compared their phylogenetic classification with that of their respective H nucleotide (nt) and amino acid (aa) sequences. Subsequently, we calculated the geodesic distance between the CG and H phylogenetic trees. These analyses were later performed with other CDV genes. All CDV CGs were also evaluated for possible recombination events. Nucleotide and aa analyses of H misclassified some Vaccine/America 1/Asia 3 lineage sequences compared to CG analysis, finding supported by both Maximum Likelihood (ML) and Bayesian Markov Chain Monte Carlo (B-MCMC) methods. Moreover, aa-based H analysis showed additional disagreements with the classification obtained by CG. The geodesic distance between the H and CG trees was 0.0680. Strong recombination signals were identified in the H gene, including Vaccine/America 1/Asia 3 lineage sequences. In contrast, C and P were the only genes that fully reproduced the CG classification (by ML and/or B-MCMC) and that did not show strong recombination signals. Furthermore, the P phylogenetic tree showed the lowest geodesic distance from the CG tree (0.0369). These findings suggest C and P as potential targets for CDV phylogenetic classification, especially when full genome sequencing is not possible. Finally, since our results were obtained considering the CDV CGs available to date, future analyses performed as more CDV sequences become available will be useful to assess probable issues of H-based phylogeny and to consolidate the suitability of the C and P genes for CDV classification.
Collapse
Affiliation(s)
- Alice Silveira Becker
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil; Programa de Pós-graduação em Medicina Veterinária, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil
| | - José Valter Joaquim Silva Júnior
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil; Setor de Virologia, Instituto Keizo Asami, Universidade Federal de Pernambuco, Pernambuco, Brazil; Departamento de Análises Clínicas, Centro de Ciências da Saúde, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil.
| | - Rudi Weiblen
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil
| | - Eduardo Furtado Flores
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Rio Grande do Sul, Brazil.
| |
Collapse
|
8
|
Jackson R, Patapiou PA, Golding G, Helanterä H, Economou CK, Chapuisat M, Henry LM. Evidence of phylosymbiosis in Formica ants. Front Microbiol 2023; 14:1044286. [PMID: 37213490 PMCID: PMC10196114 DOI: 10.3389/fmicb.2023.1044286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 03/31/2023] [Indexed: 05/23/2023] Open
Abstract
Introduction Insects share intimate relationships with microbes that play important roles in their biology. Yet our understanding of how host-bound microbial communities assemble and perpetuate over evolutionary time is limited. Ants host a wide range of microbes with diverse functions and are an emerging model for studying the evolution of insect microbiomes. Here, we ask whether phylogenetically related ant species have formed distinct and stable microbiomes. Methods To answer this question, we investigated the microbial communities associated with queens of 14 Formica species from five clades, using deep coverage 16S rRNA amplicon sequencing. Results We reveal that Formica species and clades harbor highly defined microbial communities that are dominated by four bacteria genera: Wolbachia, Lactobacillus, Liliensternia, and Spiroplasma. Our analysis reveals that the composition of Formica microbiomes mirrors the phylogeny of the host, i.e., phylosymbiosis, in that related hosts harbor more similar microbial communities. In addition, we find there are significant correlations between microbe co-occurrences. Discussion Our results demonstrate Formica ants carry microbial communities that recapitulate the phylogeny of their hosts. Our data suggests that the co-occurrence of different bacteria genera may at least in part be due to synergistic and antagonistic interactions between microbes. Additional factors potentially contributing to the phylosymbiotic signal are discussed, including host phylogenetic relatedness, host-microbe genetic compatibility, modes of transmission, and similarities in host ecologies (e.g., diets). Overall, our results support the growing body of evidence that microbial community composition closely depends on the phylogeny of their hosts, despite bacteria having diverse modes of transmission and localization within the host.
Collapse
Affiliation(s)
- Raphaella Jackson
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
| | - Patapios A. Patapiou
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
- Department of Pathobiology and Population Sciences, Royal Veterinary College, Hatfield, United Kingdom
| | - Gemma Golding
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
| | - Heikki Helanterä
- Ecology and Genetics Research Unit, University of Oulu, Oulu, Finland
- Tvärminne Zoological Station, University of Helsinki, Hanko, Finland
| | - Chloe K. Economou
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
| | - Michel Chapuisat
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Lee M. Henry
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
- *Correspondence: Lee M. Henry,
| |
Collapse
|
9
|
de Oliveira PSB, Silva Júnior JVJ, Weiblen R, Flores EF. A new (old) bovine viral diarrhea virus 2 subtype: BVDV-2e. Arch Virol 2022; 167:2545-2553. [PMID: 36104508 DOI: 10.1007/s00705-022-05565-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 07/02/2022] [Indexed: 12/14/2022]
Abstract
Bovine pestiviruses are members of the species Pestivirus A (bovine viral diarrhea virus 1, BVDV-1), Pestivirus B (BVDV-2) or Pestivirus H (HoBiPeV). To date, BVDV-2 isolates/strains have been classified into three subtypes (a-c) by phylogenetic analysis, and an additional subtype (d) has been proposed based on 5' untranslated region (UTR) secondary structures. In a previous study, we identified some BVDV-2 sequences in the GenBank database that could not be classified as subtype a, b or c by phylogenetic analysis of their genomes, UTRs or individual genes. Here, we performed a detailed study of these sequences and assessed whether they might represent a distinct BVDV-2 subtype. Initially, we collected 85 BVDV-2 complete/near-complete genomes (CNCGs) from GenBank and performed a "proof of equivalence" between phylogenetic analyses based on CNCGs and open reading frames (ORFs), which showed that ORFs may be reliably used as a reference target for BVDV-2 phylogeny, allowing us to increase our dataset to 139 sequences. Among these, we found seven sequences that could not be classified as BVDV-2a-c. The same was observed in the phylogenetic analysis of CNCGs and viral genes. In addition, the seven non-BVDV-2a-c sequences formed a distinct cluster in all phylogenetic trees, which we propose to term BVDV-2e. BVDV-2e also showed 44 amino acid changes compared to BVDV-2a-c, 20 of which are in well-defined positions. Importantly, an additional phylogenetic analysis including BVDV-2d and a pairwise comparison of BVDV-2e and BVDV-2d sequences also supported the difference between these subtypes. Finally, we propose the recognition of BVDV-2e as a distinct BVDV-2 subtype and encourage its inclusion in future phylogenetic analyses to understand its distribution and evolution.
Collapse
Affiliation(s)
- Pablo Sebastian Britto de Oliveira
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Av. Roraima, 1000, Prédio 63A, Camobi, Santa Maria, Rio Grande do Sul, 97105-900, Brazil.,Programa de Pós-graduação em Medicina Veterinária, Universidade Federal de Santa Maria, Santa Maria, Rio Grande do Sul, Brazil
| | - José Valter Joaquim Silva Júnior
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Av. Roraima, 1000, Prédio 63A, Camobi, Santa Maria, Rio Grande do Sul, 97105-900, Brazil. .,Setor de Virologia, Laboratório de Imunopatologia Keizo Asami, Universidade Federal de Pernambuco, Recife, Pernambuco, Brazil.
| | - Rudi Weiblen
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Av. Roraima, 1000, Prédio 63A, Camobi, Santa Maria, Rio Grande do Sul, 97105-900, Brazil
| | - Eduardo Furtado Flores
- Setor de Virologia, Departamento de Medicina Veterinária Preventiva, Universidade Federal de Santa Maria, Av. Roraima, 1000, Prédio 63A, Camobi, Santa Maria, Rio Grande do Sul, 97105-900, Brazil.
| |
Collapse
|
10
|
Goremykin V. Assessment of Absolute Substitution Model Fit Accommodating Time-Reversible and Non-Time-Reversible Evolutionary Processes. Syst Biol 2022:6632685. [PMID: 35792853 DOI: 10.1093/sysbio/syac046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 06/16/2022] [Accepted: 06/24/2022] [Indexed: 11/13/2022] Open
Abstract
The loss of information accompanying assessment of absolute fit of substitution models to phylogenetic data negatively affects the discriminatory power of previous methods and can make them insensitive to lineage-specific changes in the substitution process. As an alternative, I propose evaluating absolute fit of substitution models based on a novel statistic which describes the observed data without information loss and which is unlikely to become zero-inflated with increasing numbers of taxa. This method can accommodate gaps and is sensitive to lineage-specific shifts in the substitution process. In simulation experiments, it exhibits greater discriminatory power than previous methods. The method can be implemented in both Bayesian and Maximum Likelihood phylogenetic analyses, and used to screen any set of models. Recently, it has been suggested that model selection may be an unnecessary step in phylogenetic inference. However, results presented here emphasize the importance of model fit assessment for reliable phylogenetic inference.
Collapse
Affiliation(s)
- Vadim Goremykin
- Research and Innovation Centre, Fondazione Edmund Mach, 38010 San Michele all'Adige (TN), Italy
| |
Collapse
|
11
|
Chemical characterizations of neurotransmission receptors of human and plant to unfold the evolutionary relationships among them. Comput Biol Chem 2022; 98:107685. [DOI: 10.1016/j.compbiolchem.2022.107685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 04/12/2022] [Accepted: 04/16/2022] [Indexed: 11/18/2022]
|
12
|
Nikolaidis M, Markoulatos P, Van de Peer Y, Oliver SG, Amoutzias GD. The Neighborhood of the Spike Gene Is a Hotspot for Modular Intertypic Homologous and Nonhomologous Recombination in Coronavirus Genomes. Mol Biol Evol 2022; 39:msab292. [PMID: 34638137 PMCID: PMC8549283 DOI: 10.1093/molbev/msab292] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Coronaviruses (CoVs) have very large RNA viral genomes with a distinct genomic architecture of core and accessory open reading frames (ORFs). It is of utmost importance to understand their patterns and limits of homologous and nonhomologous recombination, because such events may affect the emergence of novel CoV strains, alter their host range, infection rate, tissue tropism pathogenicity, and their ability to escape vaccination programs. Intratypic recombination among closely related CoVs of the same subgenus has often been reported; however, the patterns and limits of genomic exchange between more distantly related CoV lineages (intertypic recombination) need further investigation. Here, we report computational/evolutionary analyses that clearly demonstrate a substantial ability for CoVs of different subgenera to recombine. Furthermore, we show that CoVs can obtain-through nonhomologous recombination-accessory ORFs from core ORFs, exchange accessory ORFs with different CoV genera, with other viruses (i.e., toroviruses, influenza C/D, reoviruses, rotaviruses, astroviruses) and even with hosts. Intriguingly, most of these radical events result from double crossovers surrounding the Spike ORF, thus highlighting both the instability and mobile nature of this genomic region. Although many such events have often occurred during the evolution of various CoVs, the genomic architecture of the relatively young SARS-CoV/SARS-CoV-2 lineage so far appears to be stable.
Collapse
Affiliation(s)
- Marios Nikolaidis
- Bioinformatics Laboratory, Department of Biochemistry and Biotechnology, University of Thessaly, Larissa, Greece
| | - Panayotis Markoulatos
- Microbial Biotechnology-Molecular Bacteriology-Virology Laboratory, Department of Biochemistry and Biotechnology, University of Thessaly, Larissa, Greece
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Stephen G Oliver
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Grigorios D Amoutzias
- Bioinformatics Laboratory, Department of Biochemistry and Biotechnology, University of Thessaly, Larissa, Greece
| |
Collapse
|
13
|
Papaioannou IA, Dutreux F, Peltier FA, Maekawa H, Delhomme N, Bardhan A, Friedrich A, Schacherer J, Knop M. Sex without crossing over in the yeast Saccharomycodes ludwigii. Genome Biol 2021; 22:303. [PMID: 34732243 PMCID: PMC8567612 DOI: 10.1186/s13059-021-02521-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 10/20/2021] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Intermixing of genomes through meiotic reassortment and recombination of homologous chromosomes is a unifying theme of sexual reproduction in eukaryotic organisms and is considered crucial for their adaptive evolution. Previous studies of the budding yeast species Saccharomycodes ludwigii suggested that meiotic crossing over might be absent from its sexual life cycle, which is predominated by fertilization within the meiotic tetrad. RESULTS We demonstrate that recombination is extremely suppressed during meiosis in Sd. ludwigii. DNA double-strand break formation by the conserved transesterase Spo11, processing and repair involving interhomolog interactions are required for normal meiosis but do not lead to crossing over. Although the species has retained an intact meiotic gene repertoire, genetic and population analyses suggest the exceptionally rare occurrence of meiotic crossovers in its genome. A strong AT bias of spontaneous mutations and the absence of recombination are likely responsible for its unusually low genomic GC level. CONCLUSIONS Sd. ludwigii has followed a unique evolutionary trajectory that possibly derives fitness benefits from the combination of frequent mating between products of the same meiotic event with the extreme suppression of meiotic recombination. This life style ensures preservation of heterozygosity throughout its genome and may enable the species to adapt to its environment and survive with only minimal levels of rare meiotic recombination. We propose Sd. ludwigii as an excellent natural forum for the study of genome evolution and recombination rates.
Collapse
Affiliation(s)
| | - Fabien Dutreux
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - France A. Peltier
- Center for Molecular Biology of Heidelberg University (ZMBH), Heidelberg, Germany
| | - Hiromi Maekawa
- Center for Molecular Biology of Heidelberg University (ZMBH), Heidelberg, Germany
- Current affiliation: Faculty of Agriculture, Kyushu University, Fukuoka, Japan
| | - Nicolas Delhomme
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Amit Bardhan
- Center for Molecular Biology of Heidelberg University (ZMBH), Heidelberg, Germany
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
| | - Michael Knop
- Center for Molecular Biology of Heidelberg University (ZMBH), Heidelberg, Germany
- German Cancer Research Center (DKFZ), DKFZ-ZMBH Alliance, Heidelberg, Germany
| |
Collapse
|
14
|
Undabarrena A, Valencia R, Cumsille A, Zamora-Leiva L, Castro-Nallar E, Barona-Gomez F, Cámara B. Rhodococcus comparative genomics reveals a phylogenomic-dependent non-ribosomal peptide synthetase distribution: insights into biosynthetic gene cluster connection to an orphan metabolite. Microb Genom 2021; 7:000621. [PMID: 34241590 PMCID: PMC8477407 DOI: 10.1099/mgen.0.000621] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 06/04/2021] [Indexed: 01/14/2023] Open
Abstract
Natural products (NPs) are synthesized by biosynthetic gene clusters (BGCs), whose genes are involved in producing one or a family of chemically related metabolites. Advances in comparative genomics have been favourable for exploiting huge amounts of data and discovering previously unknown BGCs. Nonetheless, studying distribution patterns of novel BGCs and elucidating the biosynthesis of orphan metabolites remains a challenge. To fill this knowledge gap, our study developed a pipeline for high-quality comparative genomics for the actinomycete genus Rhodococcus , which is metabolically versatile, yet understudied in terms of NPs, leading to a total of 110 genomes, 1891 BGCs and 717 non-ribosomal peptide synthetases (NRPSs). Phylogenomic inferences showed four major clades retrieved from strains of several ecological habitats. BiG-SCAPE sequence similarity BGC networking revealed 44 unidentified gene cluster families (GCFs) for NRPS, which presented a phylogenomic-dependent evolution pattern, supporting the hypothesis of vertical gene transfer. As a proof of concept, we analysed in-depth one of our marine strains, Rhodococcus sp. H-CA8f, which revealed a unique BGC distribution within its phylogenomic clade, involved in producing a chloramphenicol-related compound. While this BGC is part of the most abundant and widely distributed NRPS GCF, corason analysis unveiled major differences regarding its genetic context, co-occurrence patterns and modularity. This BGC is composed of three sections, two well-conserved right/left arms flanking a very variable middle section, composed of nrps genes. The presence of two non-canonical domains in H-CA8f’s BGC may contribute to adding chemical diversity to this family of NPs. Liquid chromatography-high resolution MS and dereplication efforts retrieved a set of related orphan metabolites, the corynecins, which to our knowledge are reported here for the first time in Rhodococcus . Overall, our data provide insights to connect BGC uniqueness with orphan metabolites, by revealing key comparative genomic features supported by models of BGC distribution along phylogeny.
Collapse
Affiliation(s)
- Agustina Undabarrena
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química y Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
| | - Ricardo Valencia
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química y Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
- Present address: Institute of Quantitative Biology, Biochemistry and Biotechnology, School of Biological Sciences, University of Edinburgh, King’s Buildings, Edinburgh, UK
| | - Andrés Cumsille
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química y Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
| | - Leonardo Zamora-Leiva
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química y Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
| | - Eduardo Castro-Nallar
- Center for Bioinformatics and Integrative Biology, Facultad de Ciencias de la Vida, Universidad Andres Bello, Santiago, Chile
| | - Francisco Barona-Gomez
- Evolution of Metabolic Diversity Laboratory, Unidad de Genómica Avanzada (Langebio), Cinvestav, Irapuato, Guanajuato, Mexico
| | - Beatriz Cámara
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química y Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
| |
Collapse
|
15
|
Gómez-Pérez D, Kemen E. Predicting Lifestyle from Positive Selection Data and Genome Properties in Oomycetes. Pathogens 2021; 10:807. [PMID: 34202069 PMCID: PMC8308905 DOI: 10.3390/pathogens10070807] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/19/2021] [Accepted: 06/21/2021] [Indexed: 11/30/2022] Open
Abstract
As evidenced in parasitism, host and niche shifts are a source of genomic and phenotypic diversification. Exemplary is a reduction in the core metabolism as parasites adapt to a particular host, while the accessory genome often maintains a high degree of diversification. However, selective pressures acting on the genome of organisms that have undergone recent lifestyle or host changes have not been fully investigated. Here, we developed a comparative genomics approach to study underlying adaptive trends in oomycetes, a eukaryotic phylum with a wide and diverse range of economically important plant and animal parasitic lifestyles. Our analysis reveals converging evolution on biological processes for oomycetes that have similar lifestyles. Moreover, we find that certain functions, in particular carbohydrate metabolism, transport, and signaling, are important for host and environmental adaptation in oomycetes. Given the high correlation between lifestyle and genome properties in our oomycete dataset, together with the known convergent evolution of fungal and oomycete genomes, we developed a model that predicts plant pathogenic lifestyles with high accuracy based on functional annotations. These insights into how selective pressures correlate with lifestyle may be crucial to better understand host/lifestyle shifts and their impact on the genome.
Collapse
Affiliation(s)
| | - Eric Kemen
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, 72074 Tübingen, Germany;
| |
Collapse
|
16
|
Deformity Index: A Semi-Reference Clade-Based Quality Metric of Phylogenetic Trees. J Mol Evol 2021; 89:302-312. [PMID: 33811501 DOI: 10.1007/s00239-021-10006-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 03/20/2021] [Indexed: 10/21/2022]
Abstract
Measuring the dissimilarity of a phylogenetic tree with respect to a reference tree or the hypotheses is a fundamental task in the phylogenetic study. A large number of methods have been proposed to compute the distance between the reference tree and the target tree. Due to the presence of unresolved relationships among the species, it is challenging to obtain a precise and an accurate reference tree for a selected dataset. As a result, the existing tree comparison methods may behave unexpectedly in various scenarios. In this paper, we introduce a novel scoring function, called the deformity index, to quantify the dissimilarity of a tree based on the list of clades of a reference tree. The strength of our proposed method is that it depends on the list of clades that can be acquired either from the reference tree or from the hypotheses. We investigate the distributions of different modules of the deformity index and perform different goodness-of-fit tests to understand the cumulative distribution. Then, we examine, in detail, the robustness as well as the scalability of our measure by performing different statistical tests under various models. Finally, we experiment on different biological datasets and show that our proposed scoring function overcomes the limitations of the conventional methods.
Collapse
|