1
|
Mitochondrial DNA variation of the caracal (Caracal caracal) in Iran and range-wide phylogeographic comparisons. Mamm Biol 2022. [DOI: 10.1007/s42991-022-00328-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
2
|
Derouiche L, Irzagh A, Rahmouni R, Tahri R, Hadjeloum M, Bouhadad R, Fernandes C. Deep mitochondrial DNA phylogeographic divergence in the threatened aoudad Ammotragus lervia (Bovidae, Caprini). Gene 2020; 739:144510. [PMID: 32109559 DOI: 10.1016/j.gene.2020.144510] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 02/20/2020] [Accepted: 02/24/2020] [Indexed: 12/13/2022]
Abstract
The aoudad or Barbary sheep (Ammotragus lervia) is a threatened ungulate emblematic of North Africa, whose population structure and subspecific taxonomy have not been examined genetically. This knowledge is essential and urgently needed to inform ongoing conservation and management efforts. We analysed the mitochondrial cytochrome b gene and four nuclear genes (casein kappa, spectrin beta nonerythrocytic 1, thyroglobulin, thyrotropin subunit beta) for the first phylogeographic survey of the aoudad, and uncovered a deep Mediterranean-Saharan mitochondrial split separating two highly distinct evolutionary lineages. Their level of divergence is greater than or comparable to those observed between several pairs of congeneric species of different caprine genera. The split was estimated to have occurred in the Early Pleistocene, about 1.3 million years ago. None of the four nuclear genes surveyed, chosen because they have been used in phylogeographic and species-level phylogenetic studies of bovids, allowed us to detect, likely due to their slow evolutionary rate, the substantial and geographically coherent subdivision revealed by mitochondrial DNA. This study is evidence and testament to the ability of mitochondrial DNA, probably unrivalled by any other single-locus marker, as an exploratory tool for investigating population genealogy and history and identifying potential evolutionarily significant units for conservation in animals.
Collapse
Affiliation(s)
- Louiza Derouiche
- LBEIG, Population Genetics & Conservation Unit, Faculté des Sciences Biologiques, Université des Sciences et de la Technologie Houari Boumediene, BP 32 El-Alia, Bab Ezzouar, 16111, Algiers, Algeria.
| | - Ahmed Irzagh
- Département de Biotechnologie, Faculté des Sciences de la Nature et de la Vie, Université Saad Dahlab de Blida, route de Soumâa, BP 270, 09000 Blida, Algeria
| | - Rafiq Rahmouni
- Réserve de Chasse de Tlemcen, Direction Générale des Forêts (DGF), Boulevard de Lala Sitti, Tlemcen, Algeria
| | | | - Mohamed Hadjeloum
- Bureau de la Gestion et Protection de la Faune, Direction Générale des Forêts (DGF), Algiers, Algeria
| | - Rachid Bouhadad
- LBEIG, Population Genetics & Conservation Unit, Faculté des Sciences Biologiques, Université des Sciences et de la Technologie Houari Boumediene, BP 32 El-Alia, Bab Ezzouar, 16111, Algiers, Algeria
| | - Carlos Fernandes
- CE3C - Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal.
| |
Collapse
|
3
|
Brown DG, Owen M. Mean and Variance of Phylogenetic Trees. Syst Biol 2019; 69:139-154. [DOI: 10.1093/sysbio/syz041] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Revised: 05/13/2019] [Accepted: 05/24/2019] [Indexed: 11/13/2022] Open
Abstract
Abstract
We describe the use of the Fréchet mean and variance in the Billera–Holmes–Vogtmann (BHV) treespace to summarize and explore the diversity of a set of phylogenetic trees. We show that the Fréchet mean is comparable to other summary methods, and, despite its stickiness property, is more likely to be binary than the majority-rule consensus tree. We show that the Fréchet variance is faster and more precise than commonly used variance measures. The Fréchet mean and variance are more theoretically justified, and more robust, than previous estimates of this type and can be estimated reasonably efficiently, providing a foundation for building more advanced statistical methods and leading to applications such as mean hypothesis testing and outlier detection.
Collapse
Affiliation(s)
- Daniel G Brown
- David R. Cheriton School of Computer Science, University of Waterloo, 200 University Ave. W, Waterloo ON N2L 3G1, Canada
| | - Megan Owen
- Department of Mathematics, Lehman College, City University of New York, 250 Bedford Park Blvd West, Bronx, New York, NY 10468, USA
| |
Collapse
|
4
|
Sousa-Santos C, Jesus TF, Fernandes C, Robalo JI, Coelho MM. Fish diversification at the pace of geomorphological changes: evolutionary history of western Iberian Leuciscinae (Teleostei: Leuciscidae) inferred from multilocus sequence data. Mol Phylogenet Evol 2018; 133:263-285. [PMID: 30583043 DOI: 10.1016/j.ympev.2018.12.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Revised: 12/12/2018] [Accepted: 12/15/2018] [Indexed: 01/05/2023]
Abstract
The evolutionary history of western Iberian Leuciscinae, obligatory freshwater fish, is directly linked to the evolution of the hydrographic network of the Iberian Peninsula after its isolation from the rest of Europe, which involved dramatic rearrangements such as the transition from endorheic lakes to open basins draining to the Atlantic. Previous phylogenetic research on western Iberian leuciscines, using mainly mitochondrial DNA and more recently one or two nuclear genes, has found contradictory results and there remain many unresolved issues regarding species relationships, taxonomy, and evolutionary history. Moreover, there is a lack of integration between phylogenetic and divergence time estimates and information on the timing of geomorphological changes and paleobasin rearrangements in the Iberian Peninsula. This study presents the first comprehensive fossil-calibrated multilocus coalescent species tree of western Iberian Leuciscinae (including 14 species of Achondrostoma, Iberochondrostoma, Pseudochondrostoma and Squalius endemic to the Iberian Peninsula, seven of which endemic to Portugal) based on seven nuclear genes, and from which we infer their biogeographic history by comparing divergence time estimates to known dated geological events. The phylogenetic pattern suggests slow-paced evolution of leuciscines during the Early-Middle Miocene endorheic phase of the main Iberian river basins, with the shift to exorheism in the late Neogene-Quaternary allowing westward dispersals that resulted in many cladogenetic events and a high rate of endemism in western Iberia. The results of this study also: (i) confirm the paraphyly of S. pyrenaicus with respect to S. carolitertii, and thus the possible presence of a new taxon in the Portuguese Tagus currently assigned to S. pyrenaicus; (ii) support the taxonomic separation of the Guadiana and Sado populations of S. pyrenaicus; (iii) show the need for further population sampling and taxonomic research to clarify the phylogenetic status of A. arcasii from the Minho basin and of the I. lusitanicum populations in the Sado and Tagus basins; and (iv) indicate that A. occidentale, I. olisiponensis and P. duriensis are the most ancient lineages within their respective genera.
Collapse
Affiliation(s)
- C Sousa-Santos
- MARE - Marine and Environmental Sciences Centre, ISPA-Instituto Universitário, Rua Jardim do Tabaco 34, 1149-041 Lisbon, Portugal.
| | - T F Jesus
- cE3c - Center for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, 1749-016 Lisbon, Portugal; Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina da Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisbon, Portugal(2).
| | - C Fernandes
- cE3c - Center for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, 1749-016 Lisbon, Portugal.
| | - J I Robalo
- MARE - Marine and Environmental Sciences Centre, ISPA-Instituto Universitário, Rua Jardim do Tabaco 34, 1149-041 Lisbon, Portugal.
| | - M M Coelho
- cE3c - Center for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, 1749-016 Lisbon, Portugal.
| |
Collapse
|
5
|
Genetic evidence supporting the taxonomic separation of the Arabian and Northwest African subspecies of the desert hedgehog ( Paraechinus aethiopicus ). Gene 2017; 620:54-65. [DOI: 10.1016/j.gene.2017.04.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Revised: 03/30/2017] [Accepted: 04/07/2017] [Indexed: 11/21/2022]
|
6
|
Gavryushkin A, Drummond AJ. The space of ultrametric phylogenetic trees. J Theor Biol 2016; 403:197-208. [PMID: 27188249 DOI: 10.1016/j.jtbi.2016.05.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Revised: 03/17/2016] [Accepted: 05/01/2016] [Indexed: 10/21/2022]
Abstract
The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods.
Collapse
Affiliation(s)
- Alex Gavryushkin
- Centre for Computational Evolution, The University of Auckland, New Zealand.
| | - Alexei J Drummond
- Centre for Computational Evolution, The University of Auckland, New Zealand
| |
Collapse
|
7
|
De Oliveira Martins L, Mallo D, Posada D. A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction. Syst Biol 2016; 65:397-416. [PMID: 25281847 PMCID: PMC4851173 DOI: 10.1093/sysbio/syu082] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Accepted: 09/30/2014] [Indexed: 11/16/2022] Open
Abstract
Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models.
Collapse
Affiliation(s)
| | - Diego Mallo
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, 36310, Spain
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, 36310, Spain
| |
Collapse
|
8
|
Li J, You X, Bian C, Yu H, Coon SL, Shi Q. Molecular Evolution of Aralkylamine N-Acetyltransferase in Fish: A Genomic Survey. Int J Mol Sci 2015; 17:E51. [PMID: 26729109 PMCID: PMC4730296 DOI: 10.3390/ijms17010051] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2015] [Revised: 12/21/2015] [Accepted: 12/23/2015] [Indexed: 11/17/2022] Open
Abstract
All living organisms synchronize biological functions with environmental changes; melatonin plays a vital role in regulating daily and seasonal variations. Due to rhythmic activity of the timezyme aralkylamine N-acetyltransferase (AANAT), the blood level of melatonin increases at night and decreases during daytime. Whereas other vertebrates have a single form of AANAT, bony fishes possess various isoforms of aanat genes, though the reasons are still unclear. Here, we have taken advantage of multiple unpublished teleost aanat sequences to explore and expand our understanding of the molecular evolution of aanat in fish. Our results confirm that two rounds of whole-genome duplication (WGD) led to the existence of three fish isoforms of aanat, i.e., aanat1a, aanat1b, and aanat2; in addition, gene loss led to the absence of some forms from certain special fish species. Furthermore, we suggest the different roles of two aanat1s in amphibious mudskippers, and speculate that the loss of aanat1a, may be related to terrestrial vision change. Several important sites of AANAT proteins and regulatory elements of aanat genes were analyzed for structural comparison and functional forecasting, respectively, which provides insights into the molecular evolution of the differences between AANAT1 and AANAT2.
Collapse
Affiliation(s)
- Jia Li
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China.
- Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI, Shenzhen 518083, China.
| | - Xinxin You
- Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI, Shenzhen 518083, China.
| | - Chao Bian
- Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI, Shenzhen 518083, China.
- BGI-Zhenjiang Institute of Hydrobiology, Zhenjiang 212000, China.
| | - Hui Yu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China.
- Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI, Shenzhen 518083, China.
| | - Steven L Coon
- Molecular Genomics Laboratory, National Institutes of Health, Bethesda, MD 20892, USA.
| | - Qiong Shi
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China.
- Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI, Shenzhen 518083, China.
- BGI-Zhenjiang Institute of Hydrobiology, Zhenjiang 212000, China.
| |
Collapse
|
9
|
Abstract
Motivation: The construction of statistics for summarizing posterior samples returned by a Bayesian phylogenetic study has so far been hindered by the poor geometric insights available into the space of phylogenetic trees, and ad hoc methods such as the derivation of a consensus tree makeup for the ill-definition of the usual concepts of posterior mean, while bootstrap methods mitigate the absence of a sound concept of variance. Yielding satisfactory results with sufficiently concentrated posterior distributions, such methods fall short of providing a faithful summary of posterior distributions if the data do not offer compelling evidence for a single topology. Results: Building upon previous work of Billera et al., summary statistics such as sample mean, median and variance are defined as the geometric median, Fréchet mean and variance, respectively. Their computation is enabled by recently published works, and embeds an algorithm for computing shortest paths in the space of trees. Studying the phylogeny of a set of plants, where several tree topologies occur in the posterior sample, the posterior mean balances correctly the contributions from the different topologies, where a consensus tree would be biased. Comparisons of the posterior mean, median and consensus trees with the ground truth using simulated data also reveals the benefits of a sound averaging method when reconstructing phylogenetic trees. Availability and implementation: We provide two independent implementations of the algorithm for computing Fréchet means, geometric medians and variances in the space of phylogenetic trees. TFBayes: https://github.com/pbenner/tfbayes, TrAP: https://github.com/bacak/TrAP. Contact:philipp.benner@mis.mpg.de
Collapse
Affiliation(s)
- Philipp Benner
- Max-Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany and Isthmus SARL, 75002 Paris, France
| | - Miroslav Bačák
- Max-Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany and Isthmus SARL, 75002 Paris, France
| | - Pierre-Yves Bourguignon
- Max-Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany and Isthmus SARL, 75002 Paris, France Max-Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany and Isthmus SARL, 75002 Paris, France
| |
Collapse
|
10
|
Heled J, Bouckaert RR. Looking for trees in the forest: summary tree from posterior samples. BMC Evol Biol 2013; 13:221. [PMID: 24093883 PMCID: PMC3853548 DOI: 10.1186/1471-2148-13-221] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Accepted: 09/18/2013] [Indexed: 11/28/2022] Open
Abstract
Background Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis. Results We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the “truth”, i.e to the tree used to generate the sequences. Conclusions Our simulations indicate that no single method is “best”. Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree.
Collapse
Affiliation(s)
- Joseph Heled
- Department of Computer Science, University of Auckland, Auckland New Zealand.
| | | |
Collapse
|
11
|
Arnaoudova E, Haws DC, Huggins P, Jaromczyk JW, Moore N, Schardl CL, Yoshida R. Statistical phylogenetic tree analysis using differences of means. Front Neurosci 2010; 4. [PMID: 20802801 PMCID: PMC2928700 DOI: 10.3389/fnins.2010.00047] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2010] [Accepted: 06/09/2010] [Indexed: 11/13/2022] Open
Abstract
We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by two input alignments, instead of comparing point estimations of trees. This statistical approach can be applied to gene tree analysis for example, detecting unusual events in genome evolution such as horizontal gene transfer and reshuffling. Our method uses difference of means to compare two distributions of trees, after mapping trees into a vector space. Bootstrapping alignment columns can then be applied to obtain p-values. To compute distances between means, we employ a "kernel method" which speeds up distance calculations when trees are mapped in a high-dimensional feature space, e.g., splits or quartets feature space. In this pilot study, first we test our statistical method on data sets simulated under a coalescence model, to test whether two alignments are generated by congruent gene trees. We follow our simulation results with applications to data sets of gophers and lice, grasses and their endophytes, and different fungal genes from the same genome. A companion toolkit, Phylotree, is provided to facilitate computational experiments.
Collapse
|