1
|
Samson S, Lord É, Makarenkov V. Assessing the emergence time of SARS-CoV-2 zoonotic spillover. PLoS One 2024; 19:e0301195. [PMID: 38574109 PMCID: PMC10994396 DOI: 10.1371/journal.pone.0301195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 03/12/2024] [Indexed: 04/06/2024] Open
Abstract
Understanding the evolution of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV-2) and its relationship to other coronaviruses in the wild is crucial for preventing future virus outbreaks. While the origin of the SARS-CoV-2 pandemic remains uncertain, mounting evidence suggests the direct involvement of the bat and pangolin coronaviruses in the evolution of the SARS-CoV-2 genome. To unravel the early days of a probable zoonotic spillover event, we analyzed genomic data from various coronavirus strains from both human and wild hosts. Bayesian phylogenetic analysis was performed using multiple datasets, using strict and relaxed clock evolutionary models to estimate the occurrence times of key speciation, gene transfer, and recombination events affecting the evolution of SARS-CoV-2 and its closest relatives. We found strong evidence supporting the presence of temporal structure in datasets containing SARS-CoV-2 variants, enabling us to estimate the time of SARS-CoV-2 zoonotic spillover between August and early October 2019. In contrast, datasets without SARS-CoV-2 variants provided mixed results in terms of temporal structure. However, they allowed us to establish that the presence of a statistically robust clade in the phylogenies of gene S and its receptor-binding (RBD) domain, including two bat (BANAL) and two Guangdong pangolin coronaviruses (CoVs), is due to the horizontal gene transfer of this gene from the bat CoV to the pangolin CoV that occurred in the middle of 2018. Importantly, this clade is closely located to SARS-CoV-2 in both phylogenies. This phylogenetic proximity had been explained by an RBD gene transfer from the Guangdong pangolin CoV to a very recent ancestor of SARS-CoV-2 in some earlier works in the field before the BANAL coronaviruses were discovered. Overall, our study provides valuable insights into the timeline and evolutionary dynamics of the SARS-CoV-2 pandemic.
Collapse
Affiliation(s)
- Stéphane Samson
- Department of Computer Sciences, Université du Québec à Montréal, Montréal, Canada
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, Québec, Canada
| | - Étienne Lord
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, Québec, Canada
| | - Vladimir Makarenkov
- Department of Computer Sciences, Université du Québec à Montréal, Montréal, Canada
- Mila—Quebec AI Institute, Montreal, QC, Canada
| |
Collapse
|
2
|
Tahiri N, Fichet B, Makarenkov V. Building alternative consensus trees and supertrees using k-means and Robinson and Foulds distance. Bioinformatics 2022; 38:3367-3376. [PMID: 35579343 DOI: 10.1093/bioinformatics/btac326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Revised: 04/28/2022] [Accepted: 05/10/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Each gene has its own evolutionary history which can substantially differ from evolutionary histories of other genes. For example, some individual genes or operons can be affected by specific horizontal gene transfer or recombination events. Thus, the evolutionary history of each gene should be represented by its own phylogenetic tree which may display different evolutionary patterns from the species tree that accounts for the main patterns of vertical descent. However, the output of traditional consensus tree or supertree inference methods is a unique consensus tree or supertree. RESULTS We present a new efficient method for inferring multiple alternative consensus trees and supertrees to best represent the most important evolutionary patterns of a given set of gene phylogenies. We show how an adapted version of the popular k-means clustering algorithm, based on some remarkable properties of the Robinson and Foulds distance, can be used to partition a given set of trees into one (for homogeneous data) or multiple (for heterogeneous data) cluster(s) of trees. Moreover, we adapt the popular Caliński-Harabasz, Silhouette, Ball and Hall, and Gap cluster validity indices to tree clustering with k-means. Special attention is given to the relevant but very challenging problem of inferring alternative supertrees. The use of the Euclidean property of the objective function of the method makes it faster than the existing tree clustering techniques, and thus better suited for analyzing large evolutionary datasets. AVAILABILITY AND IMPLEMENTATION Our KMeansSuperTreeClustering program along with its C ++ source code is available at: https://github.com/TahiriNadia/KMeansSuperTreeClustering. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nadia Tahiri
- Département d'informatique, Université du Québec à Montréal, Montreal, QC, Canada.,Département d'informatique, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Bernard Fichet
- Aix-Marseille Université, Faculté de Médecine, 27 Bd. Jean Moulin, F-13385 Marseille, France
| | - Vladimir Makarenkov
- Département d'informatique, Université du Québec à Montréal, Montreal, QC, Canada
| |
Collapse
|
3
|
Makarenkov V, Mazoure B, Rabusseau G, Legendre P. Horizontal gene transfer and recombination analysis of SARS-CoV-2 genes helps discover its close relatives and shed light on its origin. BMC Ecol Evol 2021; 21:5. [PMID: 33514319 PMCID: PMC7817968 DOI: 10.1186/s12862-020-01732-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 12/08/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND The SARS-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history. Human coronavirus strains discovered during previous SARS outbreaks have been hypothesized to pass from bats to humans using intermediate hosts, e.g. civets for SARS-CoV and camels for MERS-CoV. The discovery of an intermediate host of SARS-CoV-2 and the identification of specific mechanism of its emergence in humans are topics of primary evolutionary importance. In this study we investigate the evolutionary patterns of 11 main genes of SARS-CoV-2. Previous studies suggested that the genome of SARS-CoV-2 is highly similar to the horseshoe bat coronavirus RaTG13 for most of the genes and to some Malayan pangolin coronavirus (CoV) strains for the receptor binding (RB) domain of the spike protein. RESULTS We provide a detailed list of statistically significant horizontal gene transfer and recombination events (both intergenic and intragenic) inferred for each of 11 main genes of the SARS-CoV-2 genome. Our analysis reveals that two continuous regions of genes S and N of SARS-CoV-2 may result from intragenic recombination between RaTG13 and Guangdong (GD) Pangolin CoVs. Statistically significant gene transfer-recombination events between RaTG13 and GD Pangolin CoV have been identified in region [1215-1425] of gene S and region [534-727] of gene N. Moreover, some statistically significant recombination events between the ancestors of SARS-CoV-2, RaTG13, GD Pangolin CoV and bat CoV ZC45-ZXC21 coronaviruses have been identified in genes ORF1ab, S, ORF3a, ORF7a, ORF8 and N. Furthermore, topology-based clustering of gene trees inferred for 25 CoV organisms revealed a three-way evolution of coronavirus genes, with gene phylogenies of ORF1ab, S and N forming the first cluster, gene phylogenies of ORF3a, E, M, ORF6, ORF7a, ORF7b and ORF8 forming the second cluster, and phylogeny of gene ORF10 forming the third cluster. CONCLUSIONS The results of our horizontal gene transfer and recombination analysis suggest that SARS-CoV-2 could not only be a chimera virus resulting from recombination of the bat RaTG13 and Guangdong pangolin coronaviruses but also a close relative of the bat CoV ZC45 and ZXC21 strains. They also indicate that a GD pangolin may be an intermediate host of this dangerous virus.
Collapse
Affiliation(s)
- Vladimir Makarenkov
- Département d'informatique, Université du Québec à Montréal, Montreal, QC, Canada.
| | - Bogdan Mazoure
- Montreal Institute for Learning Algorithms (Mila), Montreal, QC, Canada
| | - Guillaume Rabusseau
- Montreal Institute for Learning Algorithms (Mila), Montreal, QC, Canada
- Département d'informatique et de Recherche Opérationnelle, Université de Montréal and Canada CIFAR AI Chair, Montreal, QC, Canada
| | - Pierre Legendre
- Département de Sciences Biologiques, Université de Montréal, C. P. 6128, Succursale Centre-Ville, Montreal, QC, H3C 3J7, Canada
| |
Collapse
|
4
|
Tahiri N, Willems M, Makarenkov V. A new fast method for inferring multiple consensus trees using k-medoids. BMC Evol Biol 2018; 18:48. [PMID: 29621975 PMCID: PMC5887197 DOI: 10.1186/s12862-018-1163-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2017] [Accepted: 03/22/2018] [Indexed: 11/10/2022] Open
Abstract
Background Gene trees carry important information about specific evolutionary patterns which characterize the evolution of the corresponding gene families. However, a reliable species consensus tree cannot be inferred from a multiple sequence alignment of a single gene family or from the concatenation of alignments corresponding to gene families having different evolutionary histories. These evolutionary histories can be quite different due to horizontal transfer events or to ancient gene duplications which cause the emergence of paralogs within a genome. Many methods have been proposed to infer a single consensus tree from a collection of gene trees. Still, the application of these tree merging methods can lead to the loss of specific evolutionary patterns which characterize some gene families or some groups of gene families. Thus, the problem of inferring multiple consensus trees from a given set of gene trees becomes relevant. Results We describe a new fast method for inferring multiple consensus trees from a given set of phylogenetic trees (i.e. additive trees or X-trees) defined on the same set of species (i.e. objects or taxa). The traditional consensus approach yields a single consensus tree. We use the popular k-medoids partitioning algorithm to divide a given set of trees into several clusters of trees. We propose novel versions of the well-known Silhouette and Caliński-Harabasz cluster validity indices that are adapted for tree clustering with k-medoids. The efficiency of the new method was assessed using both synthetic and real data, such as a well-known phylogenetic dataset consisting of 47 gene trees inferred for 14 archaeal organisms. Conclusions The method described here allows inference of multiple consensus trees from a given set of gene trees. It can be used to identify groups of gene trees having similar intragroup and different intergroup evolutionary histories. The main advantage of our method is that it is much faster than the existing tree clustering approaches, while providing similar or better clustering results in most cases. This makes it particularly well suited for the analysis of large genomic and phylogenetic datasets.
Collapse
Affiliation(s)
- Nadia Tahiri
- Département d'Informatique, Université du Québec à Montréal, Case postale 8888, Succursale Centre-ville, Montréal, H3C 3P8, Canada
| | - Matthieu Willems
- Département d'Informatique, Université du Québec à Montréal, Case postale 8888, Succursale Centre-ville, Montréal, H3C 3P8, Canada
| | - Vladimir Makarenkov
- Département d'Informatique, Université du Québec à Montréal, Case postale 8888, Succursale Centre-ville, Montréal, H3C 3P8, Canada.
| |
Collapse
|
5
|
Abstract
Distance-based approaches to phylogeny use estimations of the evolutionary distance between sequences to reconstruct an evolution tree. If the evolution can be represented by an X-tree, the different sequences can be ordered so that the distance matrix [Formula: see text], representing the distance from a leaf n to the path ( i, j), is perfectly ordered meaning that [Formula: see text] and [Formula: see text] For i ≤ j ≤ k. After ordering of the sequences, the distance matrix [Formula: see text] permits to visualize phylogenetic relationships between taxa and to localize deviations from perfect order. The effect of perturbations resulting from lateral gene transfer or crossover can be modeled probabilistically. The order is shown to be quite robust against many perturbations. We have developed algorithms to minimize the level of contradiction in the order of the sequences. These algorithms are tested on the SSU rRNA data for Archaea. The degree of contradiction after optimization is for most taxa quite low. Regions in the taxa space with deviations from perfect order were identified.
Collapse
|
6
|
Naushad S, Barkema HW, Luby C, Condas LAZ, Nobrega DB, Carson DA, De Buck J. Comprehensive Phylogenetic Analysis of Bovine Non- aureus Staphylococci Species Based on Whole-Genome Sequencing. Front Microbiol 2016; 7:1990. [PMID: 28066335 PMCID: PMC5168469 DOI: 10.3389/fmicb.2016.01990] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2016] [Accepted: 11/28/2016] [Indexed: 11/19/2022] Open
Abstract
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity.
Collapse
Affiliation(s)
- Sohail Naushad
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Herman W Barkema
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Christopher Luby
- Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada; Department of Large Animal Clinical Sciences, Western College of Veterinary Medicine, University of SaskatchewanSaskatoon, SK, Canada
| | - Larissa A Z Condas
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Diego B Nobrega
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Domonique A Carson
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Jeroen De Buck
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| |
Collapse
|
7
|
Černý J, Černá Bolfíková B, de A Zanotto PM, Grubhoffer L, Růžek D. A deep phylogeny of viral and cellular right-hand polymerases. INFECTION GENETICS AND EVOLUTION 2015; 36:275-286. [PMID: 26431690 DOI: 10.1016/j.meegid.2015.09.026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 09/22/2015] [Accepted: 09/28/2015] [Indexed: 12/27/2022]
Abstract
Right-hand polymerases are important players in genome replication and repair in cellular organisms as well as in viruses. All right-hand polymerases are grouped into seven related protein families: viral RNA-dependent RNA polymerases, reverse transcriptases, single-subunit RNA polymerases, and DNA polymerase families A, B, D, and Y. Although the evolutionary relationships of right-hand polymerases within each family have been proposed, evolutionary relationships between families remain elusive because their sequence similarity is too low to allow classical phylogenetic analyses. The structure of viral RNA-dependent RNA polymerases recently was shown to be useful in inferring their evolution. Here, we address evolutionary relationships between right-hand polymerase families by combining sequence and structure information. We used a set of 22 viral and cellular polymerases representing all right-hand polymerase families with known protein structure. In contrast to previous studies, which focused only on the evolution of particular families, the current approach allowed us to present the first robust phylogenetic analysis unifying evolution of all right-hand polymerase families. All polymerase families branched into discrete lineages, following a fairly robust adjacency pattern. Only single-subunit RNA polymerases formed an inner group within DNA polymerase family A. RNA-dependent RNA polymerases of RNA viruses and reverse transcriptases of retroviruses formed two sister groups and were distinguishable from all other polymerases. DNA polymerases of DNA bacteriophages did not form a monophyletic group and are phylogenetically mixed with cellular DNA polymerase families A and B. Based on the highest genetic variability and structural simplicity, we assume that RNA-dependent RNA polymerases are the most ancient group of right-hand polymerases, in agreement with the RNA World hypothesis, because RNA-dependent RNA polymerases are enzymes that could serve in replication of RNA genomes. Moreover, our results show that protein structure can be used in phylogenetic analyses of distantly related proteins that share only limited sequence similarity.
Collapse
Affiliation(s)
- Jiří Černý
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic.
| | - Barbora Černá Bolfíková
- Faculty of Tropical AgriSciences, Czech University of Life Sciences Prague, Kamýcká 126, Suchdol, 165 21 Prague 6, Czech Republic
| | - Paolo M de A Zanotto
- Department of Microbiology, Biomedical Sciences Institute, ICB II University of Sao Paulo, 05508-000 Sao Paulo, Brazil
| | - Libor Grubhoffer
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic
| | - Daniel Růžek
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Veterinary Research Institute, Hudcova 296/70, 621 00 Brno, Czech Republic
| |
Collapse
|
8
|
Willems M, Tahiri N, Makarenkov V. A new efficient algorithm for inferring explicit hybridization networks following the Neighbor-Joining principle. J Bioinform Comput Biol 2014; 12:1450024. [PMID: 25219384 DOI: 10.1142/s0219720014500243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Several algorithms and software have been developed for inferring phylogenetic trees. However, there exist some biological phenomena such as hybridization, recombination, or horizontal gene transfer which cannot be represented by a tree topology. We need to use phylogenetic networks to adequately represent these important evolutionary mechanisms. In this article, we present a new efficient heuristic algorithm for inferring hybridization networks from evolutionary distance matrices between species. The famous Neighbor-Joining concept and the least-squares criterion are used for building networks. At each step of the algorithm, before joining two given nodes, we check if a hybridization event could be related to one of them or to both of them. The proposed algorithm finds the exact tree solution when the considered distance matrix is a tree metric (i.e. it is representable by a unique phylogenetic tree). It also provides very good hybrids recovery rates for large trees (with 32 and 64 leaves in our simulations) for both distance and sequence types of data. The results yielded by the new algorithm for real and simulated datasets are illustrated and discussed in detail.
Collapse
Affiliation(s)
- Matthieu Willems
- Département d'informatique, Université du Québec à Montréal, Case postale 8888, Succursale Centre-ville, Montréal (Québec) H3C 3P8, Canada
| | | | | |
Collapse
|
9
|
Layeghifard M, Peres-Neto PR, Makarenkov V. Inferring explicit weighted consensus networks to represent alternative evolutionary histories. BMC Evol Biol 2013; 13:274. [PMID: 24359207 PMCID: PMC3898054 DOI: 10.1186/1471-2148-13-274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Accepted: 12/16/2013] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND The advent of molecular biology techniques and constant increase in availability of genetic material have triggered the development of many phylogenetic tree inference methods. However, several reticulate evolution processes, such as horizontal gene transfer and hybridization, have been shown to blur the species evolutionary history by causing discordance among phylogenies inferred from different genes. METHODS To tackle this problem, we hereby describe a new method for inferring and representing alternative (reticulate) evolutionary histories of species as an explicit weighted consensus network which can be constructed from a collection of gene trees with or without prior knowledge of the species phylogeny. RESULTS We provide a way of building a weighted phylogenetic network for each of the following reticulation mechanisms: diploid hybridization, intragenic recombination and complete or partial horizontal gene transfer. We successfully tested our method on some synthetic and real datasets to infer the above-mentioned evolutionary events which may have influenced the evolution of many species. CONCLUSIONS Our weighted consensus network inference method allows one to infer, visualize and validate statistically major conflicting signals induced by the mechanisms of reticulate evolution. The results provided by the new method can be used to represent the inferred conflicting signals by means of explicit and easy-to-interpret phylogenetic networks.
Collapse
Affiliation(s)
- Mehdi Layeghifard
- Département des Sciences biologiques, Université du Québec à Montréal (UQÀM), CP 8888, Succ. Centre Ville, Montréal, QC H3C 3P8, Canada
- Département d’Informatique, Université du Québec à Montréal (UQÀM), CP 8888, Succ. Centre Ville, Montréal, QC H3C 3P8, Canada
| | - Pedro R Peres-Neto
- Département des Sciences biologiques, Université du Québec à Montréal (UQÀM), CP 8888, Succ. Centre Ville, Montréal, QC H3C 3P8, Canada
| | - Vladimir Makarenkov
- Département d’Informatique, Université du Québec à Montréal (UQÀM), CP 8888, Succ. Centre Ville, Montréal, QC H3C 3P8, Canada
| |
Collapse
|
10
|
Boc A, Diallo AB, Makarenkov V. T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res 2012; 40:W573-9. [PMID: 22675075 PMCID: PMC3394261 DOI: 10.1093/nar/gks485] [Citation(s) in RCA: 278] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
T-REX (Tree and reticulogram REConstruction) is a web server dedicated to the reconstruction of phylogenetic trees, reticulation networks and to the inference of horizontal gene transfer (HGT) events. T-REX includes several popular bioinformatics applications such as MUSCLE, MAFFT, Neighbor Joining, NINJA, BioNJ, PhyML, RAxML, random phylogenetic tree generator and some well-known sequence-to-distance transformation models. It also comprises fast and effective methods for inferring phylogenetic trees from complete and incomplete distance matrices as well as for reconstructing reticulograms and HGT networks, including the detection and validation of complete and partial gene transfers, inference of consensus HGT scenarios and interactive HGT identification, developed by the authors. The included methods allows for validating and visualizing phylogenetic trees and networks which can be built from distance or sequence data. The web server is available at: www.trex.uqam.ca.
Collapse
Affiliation(s)
- Alix Boc
- Département de sciences biologiques, Université de Montréal, C.P. 6128, Succ. Centre-ville, Montréal, QC, H3C 3J7, Canada
| | | | | |
Collapse
|
11
|
Demongeot J, Ben Amor H, Elena A, Gillois P, Noual M, Sené S. Robustness in regulatory interaction networks. A generic approach with applications at different levels: physiologic, metabolic and genetic. Int J Mol Sci 2009; 10:4437-4473. [PMID: 20057955 PMCID: PMC2790118 DOI: 10.3390/ijms10104437] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2009] [Revised: 10/02/2009] [Accepted: 10/14/2009] [Indexed: 12/26/2022] Open
Abstract
Regulatory interaction networks are often studied on their dynamical side (existence of attractors, study of their stability). We focus here also on their robustness, that is their ability to offer the same spatiotemporal patterns and to resist to external perturbations such as losses of nodes or edges in the networks interactions architecture, changes in their environmental boundary conditions as well as changes in the update schedule (or updating mode) of the states of their elements (e.g., if these elements are genes, their synchronous coexpression mode versus their sequential expression). We define the generic notions of boundary, core, and critical vertex or edge of the underlying interaction graph of the regulatory network, whose disappearance causes dramatic changes in the number and nature of attractors (e.g., passage from a bistable behaviour to a unique periodic regime) or in the range of their basins of stability. The dynamic transition of states will be presented in the framework of threshold Boolean automata rules. A panorama of applications at different levels will be given: brain and plant morphogenesis, bulbar cardio-respiratory regulation, glycolytic/oxidative metabolic coupling, and eventually cell cycle and feather morphogenesis genetic control.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université J. Fourier de Grenoble, TIMC-IMAG, CNRS UMR 5525, Faculté de Médecine, 38700 La Tronche, France; E-Mails:
(H.B.);
(A.E.);
(P.G.)
| | - Hedi Ben Amor
- Université J. Fourier de Grenoble, TIMC-IMAG, CNRS UMR 5525, Faculté de Médecine, 38700 La Tronche, France; E-Mails:
(H.B.);
(A.E.);
(P.G.)
| | - Adrien Elena
- Université J. Fourier de Grenoble, TIMC-IMAG, CNRS UMR 5525, Faculté de Médecine, 38700 La Tronche, France; E-Mails:
(H.B.);
(A.E.);
(P.G.)
| | - Pierre Gillois
- Université J. Fourier de Grenoble, TIMC-IMAG, CNRS UMR 5525, Faculté de Médecine, 38700 La Tronche, France; E-Mails:
(H.B.);
(A.E.);
(P.G.)
| | - Mathilde Noual
- Université de Lyon, École Normale Supérieure Lyon, LIP, CNRS UMR 5668, 69007 Lyon, France
- IXXI, Institut rhône-alpin des systèmes complexes, 69007 Lyon, France; E-Mails:
(M.N.);
(S.S.)
| | - Sylvain Sené
- Université d’Evry Val d’Essonne, IBISC, CNRS FRE 3190, 91000 Evry, France
- IXXI, Institut rhône-alpin des systèmes complexes, 69007 Lyon, France; E-Mails:
(M.N.);
(S.S.)
| |
Collapse
|
12
|
Abstract
Minimum contradiction matrices are a useful complement to distance-based phylogenies. A minimum contradiction matrix represents phylogenetic information under the form of an ordered distance matrix Y(i) (,) (j) (n). A matrix element corresponds to the distance from a reference vertex n to the path (i, j). For an X-tree or a split network, the minimum contradiction matrix is a Robinson matrix. It therefore fulfills all the inequalities defining perfect order: Y(i) (,) (j) (n) >or= Y(i) (,) (k) (n) (,)Y(k j) (n) >or= Y(k) (,) (I) (n), i
Collapse
|
13
|
Makarenkov V, Kevorkov D, Legendre P. Phylogenetic Network Construction Approaches. APPLIED MYCOLOGY AND BIOTECHNOLOGY 2006. [DOI: 10.1016/s1874-5334(06)80006-7] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
14
|
Makarenkov V, Legendre P, Desdevises Y. Modelling phylogenetic relationships using reticulated networks. ZOOL SCR 2004. [DOI: 10.1111/j.1463-6409.2004.00141.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
15
|
Abstract
In many phylogenetic problems, assuming that species have evolved from a common ancestor by a simple branching process is unrealistic. Reticulate phylogenetic models, however, have been largely neglected because the concept of reticulate evolution have not been supported by using appropriate analytical tools and software. The reticulate model can adequately describe such complicated mechanisms as hybridization between species or lateral gene transfer in bacteria. In this paper, we describe a new algorithm for inferring reticulate phylogenies from evolutionary distances among species. The algorithm is capable of detecting contradictory signals encompassed in a phylogenetic tree and identifying possible reticulate events that may have occurred during evolution. The algorithm produces a reticulate phylogeny by gradually improving upon the initial solution provided by a phylogenetic tree model. The new algorithm is compared to the popular SplitsGraph method in a reanalysis of the evolution of photosynthetic organisms. A computer program to construct and visualize reticulate phylogenies, called T-Rex (Tree and Reticulogram Reconstruction), is available to researchers at the following URL: www.fas.umontreal.ca/biol/casgrain/en/labo/t-rex.
Collapse
Affiliation(s)
- Vladimir Makarenkov
- Département d'informatique, Université du Québec à Montréal, C.P. 8888, Succ. Centre-Ville, Montréal (Québec), Canada, H3C 3P8.
| | | |
Collapse
|