1
|
Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.10.579721. [PMID: 38496530 PMCID: PMC10942266 DOI: 10.1101/2024.02.10.579721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these two fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we derive a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., Genome-Wide Association Studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur using analytical theory and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate this by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study of this, we re-examine an analysis testing for co-evolution of expression levels between genes across a fungal phylogeny, and show that including covariance matrix eigenvectors as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.
Collapse
|
2
|
Stankowski S, Zagrodzka ZB, Garlovsky MD, Pal A, Shipilina D, Castillo DG, Lifchitz H, Le Moan A, Leder E, Reeve J, Johannesson K, Westram AM, Butlin RK. The genetic basis of a recent transition to live-bearing in marine snails. Science 2024; 383:114-119. [PMID: 38175895 DOI: 10.1126/science.adi2982] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 10/25/2023] [Indexed: 01/06/2024]
Abstract
Key innovations are fundamental to biological diversification, but their genetic basis is poorly understood. A recent transition from egg-laying to live-bearing in marine snails (Littorina spp.) provides the opportunity to study the genetic architecture of an innovation that has evolved repeatedly across animals. Individuals do not cluster by reproductive mode in a genome-wide phylogeny, but local genealogical analysis revealed numerous small genomic regions where all live-bearers carry the same core haplotype. Candidate regions show evidence for live-bearer-specific positive selection and are enriched for genes that are differentially expressed between egg-laying and live-bearing reproductive systems. Ages of selective sweeps suggest that live-bearer-specific alleles accumulated over more than 200,000 generations. Our results suggest that new functions evolve through the recruitment of many alleles rather than in a single evolutionary step.
Collapse
Affiliation(s)
- Sean Stankowski
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
- Department of Ecology and Evolution, University of Sussex, Brighton BN1 9RH, UK
| | - Zuzanna B Zagrodzka
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| | - Martin D Garlovsky
- Department of Applied Zoology, Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
| | - Arka Pal
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
| | - Daria Shipilina
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
- Department of Ecology and Genetics, Program of Evolutionary Biology, Uppsala University, SE-752 36 Uppsala, Sweden
| | | | - Hila Lifchitz
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
| | - Alan Le Moan
- CNRS and Sorbonne Université, Station Biologique de Roscoff, 29680 Roscoff, France
- Department of Marine Sciences, Tjärnö Marine Laboratory, University of Gothenburg, 452 96 Strömstad, Sweden
| | - Erica Leder
- Department of Marine Sciences, Tjärnö Marine Laboratory, University of Gothenburg, 452 96 Strömstad, Sweden
- Natural History Museum, University of Oslo, 0562 Oslo, Norway
| | - James Reeve
- Department of Marine Sciences, Tjärnö Marine Laboratory, University of Gothenburg, 452 96 Strömstad, Sweden
| | - Kerstin Johannesson
- Department of Marine Sciences, Tjärnö Marine Laboratory, University of Gothenburg, 452 96 Strömstad, Sweden
| | - Anja M Westram
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
- Faculty of Biosciences and Aquaculture, Nord University, N-8049 Bodø, Norway
| | - Roger K Butlin
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
- Department of Marine Sciences, Tjärnö Marine Laboratory, University of Gothenburg, 452 96 Strömstad, Sweden
| |
Collapse
|
3
|
Roberts WR, Ruck EC, Downey KM, Pinseel E, Alverson AJ. Resolving Marine-Freshwater Transitions by Diatoms Through a Fog of Gene Tree Discordance. Syst Biol 2023; 72:984-997. [PMID: 37335140 DOI: 10.1093/sysbio/syad038] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 06/02/2023] [Accepted: 06/16/2023] [Indexed: 06/21/2023] Open
Abstract
Despite the obstacles facing marine colonists, most lineages of aquatic organisms have colonized and diversified in freshwaters repeatedly. These transitions can trigger rapid morphological or physiological change and, on longer timescales, lead to increased rates of speciation and extinction. Diatoms are a lineage of ancestrally marine microalgae that have diversified throughout freshwater habitats worldwide. We generated a phylogenomic data set of genomes and transcriptomes for 59 diatom taxa to resolve freshwater transitions in one lineage, the Thalassiosirales. Although most parts of the species tree were consistently resolved with strong support, we had difficulties resolving a Paleocene radiation, which affected the placement of one freshwater lineage. This and other parts of the tree were characterized by high levels of gene tree discordance caused by incomplete lineage sorting and low phylogenetic signal. Despite differences in species trees inferred from concatenation versus summary methods and codons versus amino acids, traditional methods of ancestral state reconstruction supported six transitions into freshwaters, two of which led to subsequent species diversification. Evidence from gene trees, protein alignments, and diatom life history together suggest that habitat transitions were largely the product of homoplasy rather than hemiplasy, a condition where transitions occur on branches in gene trees not shared with the species tree. Nevertheless, we identified a set of putatively hemiplasious genes, many of which have been associated with shifts to low salinity, indicating that hemiplasy played a small but potentially important role in freshwater adaptation. Accounting for differences in evolutionary outcomes, in which some taxa became locked into freshwaters while others were able to return to the ocean or become salinity generalists, might help further distinguish different sources of adaptive mutation in freshwater diatoms.
Collapse
Affiliation(s)
- Wade R Roberts
- Department of Biological Sciences, University of Arkansas, 1 University of Arkansas, Fayetteville, AR, 72701, USA
| | - Elizabeth C Ruck
- Department of Biological Sciences, University of Arkansas, 1 University of Arkansas, Fayetteville, AR, 72701, USA
| | - Kala M Downey
- Department of Biological Sciences, University of Arkansas, 1 University of Arkansas, Fayetteville, AR, 72701, USA
| | - Eveline Pinseel
- Department of Biological Sciences, University of Arkansas, 1 University of Arkansas, Fayetteville, AR, 72701, USA
| | - Andrew J Alverson
- Department of Biological Sciences, University of Arkansas, 1 University of Arkansas, Fayetteville, AR, 72701, USA
| |
Collapse
|
4
|
Pereira AG, Kohlsdorf T. Repeated evolution of similar phenotypes: Integrating comparative methods with developmental pathways. Genet Mol Biol 2023; 46:e20220384. [PMID: 37486083 PMCID: PMC10364090 DOI: 10.1590/1678-4685-gmb-2022-0384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 05/24/2023] [Indexed: 07/25/2023] Open
Abstract
Repeated phenotypes, often referred to as 'homoplasies' in cladistic analyses, may evolve through changes in developmental processes. Genetic bases of recurrent evolution gained attention and have been studied in the past years using approaches that combine modern analytical phylogenetic tools with the stunning assemblage of new information on developmental mechanisms. In this review, we evaluated the topic under an integrated perspective, revisiting the classical definitions of convergence and parallelism and detailing comparative methods used to evaluate evolution of repeated phenotypes, which include phylogenetic inference, estimates of evolutionary rates and reconstruction of ancestral states. We provide examples to illustrate how a given methodological approach can be used to identify evolutionary patterns and evaluate developmental mechanisms associated with the intermittent expression of a given trait along the phylogeny. Finally, we address why repeated trait loss challenges strict definitions of convergence and parallelism, discussing how changes in developmental pathways might explain the high frequency of repeated trait loss in specific lineages.
Collapse
Affiliation(s)
- Anieli Guirro Pereira
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), Departamento de Biologia, Ribeirão Preto, SP, Brazil
| | - Tiana Kohlsdorf
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), Departamento de Biologia, Ribeirão Preto, SP, Brazil
| |
Collapse
|
5
|
Hibbins MS, Breithaupt LC, Hahn MW. Phylogenomic comparative methods: Accurate evolutionary inferences in the presence of gene tree discordance. Proc Natl Acad Sci U S A 2023; 120:e2220389120. [PMID: 37216509 PMCID: PMC10235958 DOI: 10.1073/pnas.2220389120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 04/24/2023] [Indexed: 05/24/2023] Open
Abstract
Phylogenetic comparative methods have long been a mainstay of evolutionary biology, allowing for the study of trait evolution across species while accounting for their common ancestry. These analyses typically assume a single, bifurcating phylogenetic tree describing the shared history among species. However, modern phylogenomic analyses have shown that genomes are often composed of mosaic histories that can disagree both with the species tree and with each other-so-called discordant gene trees. These gene trees describe shared histories that are not captured by the species tree, and therefore that are unaccounted for in classic comparative approaches. The application of standard comparative methods to species histories containing discordance leads to incorrect inferences about the timing, direction, and rate of evolution. Here, we develop two approaches for incorporating gene tree histories into comparative methods: one that constructs an updated phylogenetic variance-covariance matrix from gene trees, and another that applies Felsenstein's pruning algorithm over a set of gene trees to calculate trait histories and likelihoods. Using simulation, we demonstrate that our approaches generate much more accurate estimates of tree-wide rates of trait evolution than standard methods. We apply our methods to two clades of the wild tomato genus Solanum with varying rates of discordance, demonstrating the contribution of gene tree discordance to variation in a set of floral traits. Our approaches have the potential to be applied to a broad range of classic inference problems in phylogenetics, including ancestral state reconstruction and the inference of lineage-specific rate shifts.
Collapse
Affiliation(s)
- Mark S. Hibbins
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ONM5S 3B2, Canada
- Department of Biology, Indiana University, Bloomington, IN47405
| | - Lara C. Breithaupt
- Department of Biology, Indiana University, Bloomington, IN47405
- Department of Computer Science, Duke University, Durham, NC27710
| | - Matthew W. Hahn
- Department of Biology, Indiana University, Bloomington, IN47405
- Department of Computer Science, Indiana University, Bloomington, IN47405
| |
Collapse
|
6
|
Doyle JJ. Cell types as species: Exploring a metaphor. FRONTIERS IN PLANT SCIENCE 2022; 13:868565. [PMID: 36072310 PMCID: PMC9444152 DOI: 10.3389/fpls.2022.868565] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 07/29/2022] [Indexed: 06/05/2023]
Abstract
The concept of "cell type," though fundamental to cell biology, is controversial. Cells have historically been classified into types based on morphology, physiology, or location. More recently, single cell transcriptomic studies have revealed fine-scale differences among cells with similar gross phenotypes. Transcriptomic snapshots of cells at various stages of differentiation, and of cells under different physiological conditions, have shown that in many cases variation is more continuous than discrete, raising questions about the relationship between cell type and cell state. Some researchers have rejected the notion of fixed types altogether. Throughout the history of discussions on cell type, cell biologists have compared the problem of defining cell type with the interminable and often contentious debate over the definition of arguably the most important concept in systematics and evolutionary biology, "species." In the last decades, systematics, like cell biology, has been transformed by the increasing availability of molecular data, and the fine-grained resolution of genetic relationships have generated new ideas about how that variation should be classified. There are numerous parallels between the two fields that make exploration of the "cell types as species" metaphor timely. These parallels begin with philosophy, with discussion of both cell types and species as being either individuals, groups, or something in between (e.g., homeostatic property clusters). In each field there are various different types of lineages that form trees or networks that can (and in some cases do) provide criteria for grouping. Developing and refining models for evolutionary divergence of species and for cell type differentiation are parallel goals of the two fields. The goal of this essay is to highlight such parallels with the hope of inspiring biologists in both fields to look for new solutions to similar problems outside of their own field.
Collapse
Affiliation(s)
- Jeff J. Doyle
- Section of Plant Biology and Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| |
Collapse
|
7
|
Mika K, Whittington CM, McAllan BM, Lynch VJ. Gene expression phylogenies and ancestral transcriptome reconstruction resolves major transitions in the origins of pregnancy. eLife 2022; 11:e74297. [PMID: 35770963 PMCID: PMC9275820 DOI: 10.7554/elife.74297] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 06/29/2022] [Indexed: 11/13/2022] Open
Abstract
Structural and physiological changes in the female reproductive system underlie the origins of pregnancy in multiple vertebrate lineages. In mammals, the glandular portion of the lower reproductive tract has transformed into a structure specialized for supporting fetal development. These specializations range from relatively simple maternal nutrient provisioning in egg-laying monotremes to an elaborate suite of traits that support intimate maternal-fetal interactions in Eutherians. Among these traits are the maternal decidua and fetal component of the placenta, but there is considerable uncertainty about how these structures evolved. Previously, we showed that changes in uterine gene expression contributes to several evolutionary innovations during the origins of pregnancy (Mika et al., 2021b). Here, we reconstruct the evolution of entire transcriptomes ('ancestral transcriptome reconstruction') and show that maternal gene expression profiles are correlated with degree of placental invasion. These results indicate that an epitheliochorial-like placenta evolved early in the mammalian stem-lineage and that the ancestor of Eutherians had a hemochorial placenta, and suggest maternal control of placental invasiveness. These data resolve major transitions in the evolution of pregnancy and indicate that ancestral transcriptome reconstruction can be used to study the function of ancestral cell, tissue, and organ systems.
Collapse
Affiliation(s)
- Katelyn Mika
- Department of Human Genetics, University of ChicagoChicagoUnited States
- Department of Organismal Biology and Anatomy, University of ChicagoChicagoUnited States
| | | | | | - Vincent J Lynch
- Department of Biological Sciences, University at Buffalo, State University of New YorkBuffalo,NewyorkUnited States
| |
Collapse
|
8
|
Cooper BJ, Moore MJ, Douglas NA, Wagner WL, Johnson MG, Overson RP, Kinosian SP, McDonnell AJ, Levin RA, Raguso RA, Flores Olvera H, Ochoterena H, Fant JB, Skogen KA, Wickett NJ. Target enrichment and extensive population sampling help untangle the recent, rapid radiation of Oenothera sect. Calylophus. Syst Biol 2022:6588089. [PMID: 35583314 DOI: 10.1093/sysbio/syac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/14/2022] [Accepted: 04/22/2022] [Indexed: 11/14/2022] Open
Abstract
Oenothera sect. Calylophus is a North American group of 13 recognized taxa in the evening primrose family (Onagraceae) with an evolutionary history that may include independent origins of bee pollination, edaphic endemism, and permanent translocation heterozygosity. Like other groups that radiated relatively recently and rapidly, taxon boundaries within Oenothera sect. Calylophus have remained challenging to circumscribe. In this study, we used target enrichment, flanking non-coding regions, gene tree/species tree methods, tests for gene flow modified for target-enrichment data, and morphometric analysis to reconstruct phylogenetic hypotheses, evaluate current taxon circumscriptions, and examine character evolution in Oenothera sect. Calylophus. Because sect. Calylophus comprises a clade with a relatively restricted geographic range, we were able to extensively sample across the range of geographic, edaphic and morphological diversity in the group. We found that the combination of exons and flanking non-coding regions led to improved support for species relationships. We reconstructed potential hybrid origins of some accessions and note that if processes such as hybridization are not taken into account, the number of inferred evolutionary transitions may be artificially inflated. We recovered strong evidence for multiple evolutionary origins of bee pollination from ancestral hawkmoth pollination, edaphic specialization on gypsum, and permanent translocation heterozygosity. This study applies newly emerging techniques alongside dense infraspecific sampling and morphological analyses to effectively reconstruct the recalcitrant history of a rapid radiation.
Collapse
Affiliation(s)
- Benjamin J Cooper
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Michael J Moore
- Oberlin College, Department of Biology, 119 Woodland St., Oberlin, OH 44074, USA
| | - Norman A Douglas
- Department of Biology, University of Florida, Gainesville, FL 32611, USA
| | - Warren L Wagner
- Department of Botany, MRC-166, Smithsonian Institution, PO Box 37012, Washington, DC 20013-7012, USA
| | - Matthew G Johnson
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Department of Biological Sciences, Texas Tech University, Box 43131 Lubbock, TX 79409, USA
| | - Rick P Overson
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,School of Sustainability, Arizona State University, PO Box 875502, Tempe, AZ 85287-5502, USA
| | - Sylvia P Kinosian
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA
| | - Angela J McDonnell
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA
| | - Rachel A Levin
- Department of Biology, Amherst College, 25 East Drive, Amherst, MA, 01002, USA
| | - Robert A Raguso
- Department of Neurobiology and Behavior, Cornell University, 215 Tower Road, Ithaca, NY 14853, USA
| | - Hilda Flores Olvera
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Helga Ochoterena
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Jeremie B Fant
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Krissa A Skogen
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Norman J Wickett
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| |
Collapse
|
9
|
Lagomarsino LP, Frankel L, Uribe-Convers S, Antonelli A, Muchhala N. Increased resolution in the face of conflict: phylogenomics of the Neotropical bellflowers (Campanulaceae: Lobelioideae), a rapid plant radiation. ANNALS OF BOTANY 2022; 129:723-736. [PMID: 35363863 PMCID: PMC9113290 DOI: 10.1093/aob/mcac046] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 03/24/2022] [Indexed: 06/12/2023]
Abstract
BACKGROUND AND AIMS The centropogonid clade (Lobelioideae: Campanulaceae) is an Andean-centred rapid radiation characterized by repeated convergent evolution of morphological traits, including fruit type and pollination syndromes. While previous studies have resolved relationships of lineages with fleshy fruits into subclades, relationships among capsular species remain unresolved. This lack of resolution has impeded reclassification of non-monophyletic genera, whose current taxonomy relies heavily on traits that have undergone convergent evolution. METHODS Targeted sequence capture using a probe-set recently developed for the centropogonid clade was used to obtain phylogenomic data from DNA extracted from both silica-dried and herbarium leaf tissue. These data were used to infer relationships among species using concatenated and partitioned species tree methods, and to quantify gene tree discordance. KEY RESULTS While silica-dried leaf tissue resulted in longer assembled sequence data, the inclusion of herbarium samples improved taxonomic representation. Relationships among baccate lineages are similar to those inferred in previous studies, although they differ for lineages within and among capsular clades. We improve the phylogenetic resolution of Siphocampylus, which forms ten groups of closely related species which we informally name. Two subclades of Siphocampylus and two individual species are rogue taxa whose placement differs widely across analyses. Gene tree discordance (including cytonuclear discordance) is rampant. CONCLUSIONS This first phylogenomic study of the centropogonid clade considerably improves our understanding of relationships in this rapid radiation. Differences across analyses and the possibility of additional lineage discoveries still hamper a solid and stable reclassification. Rapid morphological innovation corresponds with a high degree of phylogenomic complexity, including cytonuclear discordance, nuclear gene tree conflict and well-supported differences between analyses based on different nuclear loci. Together, these results point to a potential role of hemiplasy underlying repeated convergent evolution. This hallmark of rapid radiations is probably present in many other species-rich Andean plant radiations.
Collapse
Affiliation(s)
- Laura P Lagomarsino
- Shirley C. Tucker Herbarium, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO, USA
| | - Lauren Frankel
- Department of Botany, University of Wisconsin-Madison, Madison, WI, USA
| | - Simon Uribe-Convers
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO, USA
- Invitae Corporation, San Francisco, CA, USA
| | - Alexandre Antonelli
- Royal Botanic Gardens, Kew, TW9 3AE, UK
- Gothenburg Global Biodiversity Centre, Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, 405 30, Sweden
- Department of Plant Science, University of Oxford, Oxford, UK
| | - Nathan Muchhala
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO, USA
| |
Collapse
|
10
|
Li F, Rane RV, Luria V, Xiong Z, Chen J, Li Z, Catullo RA, Griffin PC, Schiffer M, Pearce S, Lee SF, McElroy K, Stocker A, Shirriffs J, Cockerell F, Coppin C, Sgrò CM, Karger A, Cain JW, Weber JA, Santpere G, Kirschner MW, Hoffmann AA, Oakeshott JG, Zhang G. Phylogenomic analyses of the genus Drosophila reveals genomic signals of climate adaptation. Mol Ecol Resour 2022; 22:1559-1581. [PMID: 34839580 PMCID: PMC9299920 DOI: 10.1111/1755-0998.13561] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 11/10/2021] [Indexed: 01/13/2023]
Abstract
Many Drosophila species differ widely in their distributions and climate niches, making them excellent subjects for evolutionary genomic studies. Here, we have developed a database of high-quality assemblies for 46 Drosophila species and one closely related Zaprionus. Fifteen of the genomes were newly sequenced, and 20 were improved with additional sequencing. New or improved annotations were generated for all 47 species, assisted by new transcriptomes for 19. Phylogenomic analyses of these data resolved several previously ambiguous relationships, especially in the melanogaster species group. However, it also revealed significant phylogenetic incongruence among genes, mainly in the form of incomplete lineage sorting in the subgenus Sophophora but also including asymmetric introgression in the subgenus Drosophila. Using the phylogeny as a framework and taking into account these incongruences, we then screened the data for genome-wide signals of adaptation to different climatic niches. First, phylostratigraphy revealed relatively high rates of recent novel gene gain in three temperate pseudoobscura and five desert-adapted cactophilic mulleri subgroup species. Second, we found differing ratios of nonsynonymous to synonymous substitutions in several hundred orthologues between climate generalists and specialists, with trends for significantly higher ratios for those in tropical and lower ratios for those in temperate-continental specialists respectively than those in the climate generalists. Finally, resequencing natural populations of 13 species revealed tropics-restricted species generally had smaller population sizes, lower genome diversity and more deleterious mutations than the more widespread species. We conclude that adaptation to different climates in the genus Drosophila has been associated with large-scale and multifaceted genomic changes.
Collapse
Affiliation(s)
- Fang Li
- BGI‐ShenzhenShenzhenChina
- Section for Ecology and EvolutionDepartment of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Rahul V. Rane
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
| | - Victor Luria
- Department of Systems BiologyHarvard Medical SchoolBostonMassachusettsUSA
| | - Zijun Xiong
- BGI‐ShenzhenShenzhenChina
- State Key Laboratory of Genetic Resources and EvolutionKunming Institute of ZoologyChinese Academy of Sciences (CAS)KunmingYunnanChina
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | | | | | - Renee A. Catullo
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
- Division of Ecology and EvolutionCentre for Biodiversity AnalysisThe Australian National UniversityActonACTAustralia
| | - Philippa C. Griffin
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
| | - Michele Schiffer
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
- Daintree Rainforest ObservatoryJames Cook UniversityCape TribulationQldAustralia
| | - Stephen Pearce
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
| | - Siu Fai Lee
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
- Applied BioSciencesMacquarie UniversityNorth RydeNSWAustralia
| | - Kerensa McElroy
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
| | - Ann Stocker
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
| | - Jennifer Shirriffs
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
| | - Fiona Cockerell
- School of Biological SciencesMonash UniversityClaytonVic.Australia
| | - Chris Coppin
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
| | - Carla M. Sgrò
- School of Biological SciencesMonash UniversityClaytonVic.Australia
| | - Amir Karger
- IT ‐ Research ComputingHarvard Medical SchoolBostonMassachusettsUSA
| | - John W. Cain
- Department of MathematicsHarvard UniversityCambridgeMassachusettsUSA
| | - Jessica A. Weber
- Department of GeneticsHarvard Medical SchoolBostonMassachusettsUSA
| | - Gabriel Santpere
- Neurogenomics Group, Research Programme on Biomedical Informatics (GRIB)Department of Experimental and Health Sciences (DCEXS)Hospital del Mar Medical Research Institute (IMIM)Universitat Pompeu FabraBarcelonaCataloniaSpain
| | - Marc W. Kirschner
- Department of Systems BiologyHarvard Medical SchoolBostonMassachusettsUSA
| | - Ary A. Hoffmann
- Bio21 InstituteSchool of BioSciencesUniversity of MelbourneParkvilleVic.Australia
| | - John G. Oakeshott
- Commonwealth Scientific and Industrial Research OrganisationActonACTAustralia
- Applied BioSciencesMacquarie UniversityNorth RydeNSWAustralia
| | - Guojie Zhang
- BGI‐ShenzhenShenzhenChina
- Section for Ecology and EvolutionDepartment of BiologyUniversity of CopenhagenCopenhagenDenmark
- State Key Laboratory of Genetic Resources and EvolutionKunming Institute of ZoologyChinese Academy of Sciences (CAS)KunmingYunnanChina
- Center for Excellence in Animal Evolution and GeneticsChinese Academy of SciencesKunmingChina
| |
Collapse
|
11
|
James ME, Arenas-Castro H, Groh JS, Allen SL, Engelstädter J, Ortiz-Barrientos D. Highly Replicated Evolution of Parapatric Ecotypes. Mol Biol Evol 2021; 38:4805-4821. [PMID: 34254128 PMCID: PMC8557401 DOI: 10.1093/molbev/msab207] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Parallel evolution of ecotypes occurs when selection independently drives the evolution of similar traits across similar environments. The multiple origins of ecotypes are often inferred based on a phylogeny that clusters populations according to geographic location and not by the environment they occupy. However, the use of phylogenies to infer parallel evolution in closely related populations is problematic because gene flow and incomplete lineage sorting can uncouple the genetic structure at neutral markers from the colonization history of populations. Here, we demonstrate multiple origins within ecotypes of an Australian wildflower, Senecio lautus. We observed strong genetic structure as well as phylogenetic clustering by geography and show that this is unlikely due to gene flow between parapatric ecotypes, which was surprisingly low. We further confirm this analytically by demonstrating that phylogenetic distortion due to gene flow often requires higher levels of migration than those observed in S. lautus. Our results imply that selection can repeatedly create similar phenotypes despite the perceived homogenizing effects of gene flow.
Collapse
Affiliation(s)
- Maddie E James
- School of Biological Sciences, The University of Queensland,St. Lucia, QLD, Australia
| | - Henry Arenas-Castro
- School of Biological Sciences, The University of Queensland,St. Lucia, QLD, Australia
| | - Jeffrey S Groh
- School of Biological Sciences, The University of Queensland,St. Lucia, QLD, Australia
| | - Scott L Allen
- School of Biological Sciences, The University of Queensland,St. Lucia, QLD, Australia
| | - Jan Engelstädter
- School of Biological Sciences, The University of Queensland,St. Lucia, QLD, Australia
| | | |
Collapse
|
12
|
Phylogenomics, floral evolution, and biogeography of Lithospermum L. (Boraginaceae). Mol Phylogenet Evol 2021; 166:107317. [PMID: 34547439 DOI: 10.1016/j.ympev.2021.107317] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 08/29/2021] [Accepted: 09/15/2021] [Indexed: 11/23/2022]
Abstract
Lithospermum (Boraginaceae), a geographically cosmopolitan medium-sized genus, includes diverse floral morphology, with variation in corolla size and shape and in breeding system. Over the past decade, multiple studies have examined the evolutionary history of Lithospermum, with most utilizing DNA regions from the plastid genome and/or the nuclear ribosomal internal transcribed spacer. These studies have, in general, not resulted in well-resolved and well-supported phylogenies. In the present study, 298 nuclear DNA regions, amplified via target sequence capture, were utilized for phylogenetic reconstruction for Lithospermum and relatives in Boraginaceae, and patterns of floral evolution, species diversification, and biogeography were examined. Based on multiple phylogenetic methods, Lithospermum is resolved as monophyletic, and the New World species of the genus are also monophyletic. While minimal phylogenetic incongruence is resolved within the nuclear genome, incongruence between the nuclear and plastid genomes is recovered. This is likely due to incomplete lineage sorting during early diversification of the genus in the Americas approximately 7.8 million years ago. At least four shifts to longer corollas are identified throughout Lithospermum, and this may be due to selection for hummingbird-pollinated flowers, particularly for species in Mexico and the southwestern United States. In the New World, one clade of species of the genus diversified primarily across the United States and Canada, and another radiated throughout the mountains of Mexico.
Collapse
|
13
|
Wang Y, Cao Z, Ogilvie HA, Nakhleh L. Phylogenomic assessment of the role of hybridization and introgression in trait evolution. PLoS Genet 2021; 17:e1009701. [PMID: 34407067 PMCID: PMC8405015 DOI: 10.1371/journal.pgen.1009701] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 08/30/2021] [Accepted: 07/07/2021] [Indexed: 11/30/2022] Open
Abstract
Trait evolution among a set of species-a central theme in evolutionary biology-has long been understood and analyzed with respect to a species tree. However, the field of phylogenomics, which has been propelled by advances in sequencing technologies, has ushered in the era of species/gene tree incongruence and, consequently, a more nuanced understanding of trait evolution. For a trait whose states are incongruent with the branching patterns in the species tree, the same state could have arisen independently in different species (homoplasy) or followed the branching patterns of gene trees, incongruent with the species tree (hemiplasy). Another evolutionary process whose extent and significance are better revealed by phylogenomic studies is gene flow between different species. In this work, we present a phylogenomic method for assessing the role of hybridization and introgression in the evolution of polymorphic or monomorphic binary traits. We apply the method to simulated evolutionary scenarios to demonstrate the interplay between the parameters of the evolutionary history and the role of introgression in a binary trait's evolution (which we call xenoplasy). Very importantly, we demonstrate, including on a biological data set, that inferring a species tree and using it for trait evolution analysis in the presence of gene flow could lead to misleading hypotheses about trait evolution.
Collapse
Affiliation(s)
- Yaxuan Wang
- Department of Computer Science, Rice University, Houston, Texas, United States of America
| | - Zhen Cao
- Department of Computer Science, Rice University, Houston, Texas, United States of America
| | - Huw A. Ogilvie
- Department of Computer Science, Rice University, Houston, Texas, United States of America
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, Texas, United States of America
- Department of BioSciences, Rice University, Houston, Texas, United States of America
| |
Collapse
|
14
|
Suvorov A, Scornavacca C, Fujimoto MS, Bodily P, Clement M, Crandall KA, Whiting MF, Schrider DR, Bybee SM. Deep ancestral introgression shapes evolutionary history of dragonflies and damselflies. Syst Biol 2021; 71:526-546. [PMID: 34324671 PMCID: PMC9017697 DOI: 10.1093/sysbio/syab063] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Revised: 07/20/2021] [Accepted: 07/26/2021] [Indexed: 11/13/2022] Open
Abstract
Introgression is an important biological process affecting at least 10% of the extant species in the animal kingdom. Introgression significantly impacts inference of phylogenetic species relationships where a strictly binary tree model cannot adequately explain reticulate net-like species relationships. Here we use phylogenomic approaches to understand patterns of introgression along the evolutionary history of a unique, non-model insect system: dragonflies and damselflies (Odonata). We demonstrate that introgression is a pervasive evolutionary force across various taxonomic levels within Odonata. In particular, we show that the morphologically "intermediate" species of Anisozygoptera (one of the three primary suborders within Odonata besides Zygoptera and Anisoptera), which retain phenotypic characteristics of the other two suborders, experienced high levels of introgression likely coming from zygopteran genomes. Additionally, we find evidence for multiple cases of deep inter-superfamilial ancestral introgression.
Collapse
Affiliation(s)
- Anton Suvorov
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Celine Scornavacca
- Institut des Sciences de l'Evolution Université de Montpellier, CNRS, IRD, EPHE CC 064, Place Eugène Bataillon, 34095 Montpellier Cedex 05, France
| | - M Stanley Fujimoto
- Department of Computer Science, Brigham Young University, Provo, UT, United States
| | - Paul Bodily
- Department of Computer Science, Idaho State University, Pocatello, ID, United States
| | - Mark Clement
- Department of Computer Science, Brigham Young University, Provo, UT, United States
| | - Keith A Crandall
- Computational Biology Institute, Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, George Washington University, Washington, DC, United States
| | - Michael F Whiting
- Department of Biology, Brigham Young University, Provo, UT, United States.,M.L. Bean Museum, Brigham Young University, Provo, UT, United States
| | - Daniel R Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Seth M Bybee
- Department of Biology, Brigham Young University, Provo, UT, United States.,M.L. Bean Museum, Brigham Young University, Provo, UT, United States
| |
Collapse
|
15
|
Adams RH, Blackmon H, DeGiorgio M. Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data. Syst Biol 2021; 70:660-680. [PMID: 33587145 PMCID: PMC8208806 DOI: 10.1093/sysbio/syab009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 02/01/2021] [Indexed: 12/03/2022] Open
Abstract
Stochastic models of character trait evolution have become a cornerstone of evolutionary biology in an array of contexts. While probabilistic models have been used extensively for statistical inference, they have largely been ignored for the purpose of measuring distances between phylogeny-aware models. Recent contributions to the problem of phylogenetic distance computation have highlighted the importance of explicitly considering evolutionary model parameters and their impacts on molecular sequence data when quantifying dissimilarity between trees. By comparing two phylogenies in terms of their induced probability distributions that are functions of many model parameters, these distances can be more informative than traditional approaches that rely strictly on differences in topology or branch lengths alone. Currently, however, these approaches are designed for comparing models of nucleotide substitution and gene tree distributions, and thus, are unable to address other classes of traits and associated models that may be of interest to evolutionary biologists. Here, we expand the principles of probabilistic phylogenetic distances to compute tree distances under models of continuous trait evolution along a phylogeny. By explicitly considering both the degree of relatedness among species and the evolutionary processes that collectively give rise to character traits, these distances provide a foundation for comparing models and their predictions, and for quantifying the impacts of assuming one phylogenetic background over another while studying the evolution of a particular trait. We demonstrate the properties of these approaches using theory, simulations, and several empirical data sets that highlight potential uses of probabilistic distances in many scenarios. We also introduce an open-source R package named PRDATR for easy application by the scientific community for computing phylogenetic distances under models of character trait evolution.[Brownian motion; comparative methods; phylogeny; quantitative traits.].
Collapse
Affiliation(s)
- Richard H Adams
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| | - Heath Blackmon
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| |
Collapse
|
16
|
Abstract
Evolutionary biologists have long been fascinated with the episodes of rapid phenotypic innovation that underlie the emergence of major lineages. Although our understanding of the environmental and ecological contexts of such episodes has steadily increased, it has remained unclear how population processes contribute to emergent macroevolutionary patterns. One insight gleaned from phylogenomics is that gene-tree conflict, frequently caused by population-level processes, is often rampant during the origin of major lineages. With the understanding that phylogenomic conflict is often driven by complex population processes, we hypothesized that there may be a direct correspondence between instances of high conflict and elevated rates of phenotypic innovation if both patterns result from the same processes. We evaluated this hypothesis in six clades spanning vertebrates and plants. We found that the most conflict-rich regions of these six clades also tended to experience the highest rates of phenotypic innovation, suggesting that population processes shaping both phenotypic and genomic evolution may leave signatures at deep timescales. Closer examination of the biological significance of phylogenomic conflict may yield improved connections between micro- and macroevolution and increase our understanding of the processes that shape the origin of major lineages across the Tree of Life.
Collapse
|
17
|
Mullen SP, VanKuren NW, Zhang W, Nallu S, Kristiansen EB, Wuyun Q, Liu K, Hill RI, Briscoe AD, Kronforst MR. Disentangling Population History and Character Evolution among Hybridizing Lineages. Mol Biol Evol 2021; 37:1295-1305. [PMID: 31930401 DOI: 10.1093/molbev/msaa004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Understanding the origin and maintenance of adaptive phenotypic novelty is a central goal of evolutionary biology. However, both hybridization and incomplete lineage sorting can lead to genealogical discordance between the regions of the genome underlying adaptive traits and the remainder of the genome, decoupling inferences about character evolution from population history. Here, to disentangle these effects, we investigated the evolutionary origins and maintenance of Batesian mimicry between North American admiral butterflies (Limenitis arthemis) and their chemically defended model (Battus philenor) using a combination of de novo genome sequencing, whole-genome resequencing, and statistical introgression mapping. Our results suggest that balancing selection, arising from geographic variation in the presence or absence of the unpalatable model, has maintained two deeply divergent color patterning haplotypes that have been repeatedly sieved among distinct mimetic and nonmimetic lineages of Limenitis via introgressive hybridization.
Collapse
Affiliation(s)
- Sean P Mullen
- Department of Biology, Boston University, Boston, MA
| | | | - Wei Zhang
- School of Life Sciences, Peking University, Beijing, P.R. China
| | - Sumitha Nallu
- Department of Ecology and Evolution, University of Chicago, Chicago, IL
| | | | - Qiqige Wuyun
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI
| | - Kevin Liu
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI
| | - Ryan I Hill
- Department of Biological Sciences, University of the Pacific, Stockton, CA
| | - Adriana D Briscoe
- Department of Ecology and Evolutionary Biology, University of California-Irvine, Irvine, CA
| | | |
Collapse
|
18
|
Porto DS, Almeida EAB, Pennell MW. Investigating Morphological Complexes Using Informational Dissonance and Bayes Factors: A Case Study in Corbiculate Bees. Syst Biol 2021; 70:295-306. [PMID: 32722788 PMCID: PMC7882150 DOI: 10.1093/sysbio/syaa059] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 07/16/2020] [Accepted: 07/17/2020] [Indexed: 11/22/2022] Open
Abstract
It is widely recognized that different regions of a genome often have different evolutionary histories and that ignoring this variation when estimating phylogenies can be misleading. However, the extent to which this is also true for morphological data is still largely unknown. Discordance among morphological traits might plausibly arise due to either variable convergent selection pressures or else phenomena such as hemiplasy. Here, we investigate patterns of discordance among 282 morphological characters, which we scored for 50 bee species particularly targeting corbiculate bees, a group that includes the well-known eusocial honeybees and bumblebees. As a starting point for selecting the most meaningful partitions in the data, we grouped characters as morphological modules, highly integrated trait complexes that as a result of developmental constraints or coordinated selection we expect to share an evolutionary history and trajectory. In order to assess conflict and coherence across and within these morphological modules, we used recently developed approaches for computing Bayesian phylogenetic information allied with model comparisons using Bayes factors. We found that despite considerable conflict among morphological complexes, accounting for among-character and among-partition rate variation with individual gamma distributions, rate multipliers, and linked branch lengths can lead to coherent phylogenetic inference using morphological data. We suggest that evaluating information content and dissonance among partitions is a useful step in estimating phylogenies from morphological data, just as it is with molecular data. Furthermore, we argue that adopting emerging approaches for investigating dissonance in genomic datasets may provide new insights into the integration and evolution of anatomical complexes. [Apidae; entropy; morphological modules; phenotypic integration; phylogenetic information.].
Collapse
Affiliation(s)
- Diego S Porto
- Laboratório de Biologia Comparada e Abelhas (LBCA), Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver BC V6T 1Z4, Canada
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, 926 West Campus Drive, Blacksburg, VA 24061 USA
| | - Eduardo A B Almeida
- Laboratório de Biologia Comparada e Abelhas (LBCA), Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil
| | - Matthew W Pennell
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver BC V6T 1Z4, Canada
| |
Collapse
|
19
|
Hibbins MS, Gibson MJS, Hahn MW. Determining the probability of hemiplasy in the presence of incomplete lineage sorting and introgression. eLife 2020; 9:e63753. [PMID: 33345772 PMCID: PMC7800383 DOI: 10.7554/elife.63753] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 12/18/2020] [Indexed: 12/11/2022] Open
Abstract
The incongruence of character states with phylogenetic relationships is often interpreted as evidence of convergent evolution. However, trait evolution along discordant gene trees can also generate these incongruences - a phenomenon known as hemiplasy. Classic comparative methods do not account for discordance, resulting in incorrect inferences about the number, timing, and direction of trait transitions. Biological sources of discordance include incomplete lineage sorting (ILS) and introgression, but only ILS has received theoretical consideration in the context of hemiplasy. Here, we present a model that shows introgression makes hemiplasy more likely, such that methods that account for ILS alone will be conservative. We also present a method and software (HeIST) for making statistical inferences about the probability of hemiplasy and homoplasy in large datasets that contain both ILS and introgression. We apply our methods to two empirical datasets, finding that hemiplasy is likely to contribute to the observed trait incongruences in both.
Collapse
Affiliation(s)
- Mark S Hibbins
- Department of Biology, Indiana UniversityBloomingtonUnited States
| | | | - Matthew W Hahn
- Department of Biology, Indiana UniversityBloomingtonUnited States
- Department of Computer Science, Indiana UniversityBloomingtonUnited States
| |
Collapse
|
20
|
Stott CM, Bobay LM. Impact of homologous recombination on core genome phylogenies. BMC Genomics 2020; 21:829. [PMID: 33238876 PMCID: PMC7691112 DOI: 10.1186/s12864-020-07262-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 11/19/2020] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Core genome phylogenies are widely used to build the evolutionary history of individual prokaryote species. By using hundreds or thousands of shared genes, these approaches are the gold standard to reconstruct the relationships of large sets of strains. However, there is growing evidence that bacterial strains exchange DNA through homologous recombination at rates that vary widely across prokaryote species, indicating that core genome phylogenies might not be able to reconstruct true phylogenies when recombination rate is high. Few attempts have been made to evaluate the robustness of core genome phylogenies to recombination, but some analyses suggest that reconstructed trees are not always accurate. RESULTS In this study, we tested the robustness of core genome phylogenies to various levels of recombination rates. By analyzing simulated and empirical data, we observed that core genome phylogenies are relatively robust to recombination rates; nevertheless, our results suggest that many reconstructed trees are not completely accurate even when bootstrap supports are high. We found that some core genome phylogenies are highly robust to recombination whereas others are strongly impacted by it, and we identified that the robustness of core genome phylogenies to recombination is highly linked to the levels of selective pressures acting on a species. Stronger selective pressures lead to less accurate tree reconstructions, presumably because selective pressures more strongly bias the routes of DNA transfers, thereby causing phylogenetic artifacts. CONCLUSIONS Overall, these results have important implications for the application of core genome phylogenies in prokaryotes.
Collapse
Affiliation(s)
- Caroline M Stott
- Department of Biology, University of North Carolina Greensboro, 321 McIver Street, PO Box 26170, Greensboro, NC, 27402, USA
| | - Louis-Marie Bobay
- Department of Biology, University of North Carolina Greensboro, 321 McIver Street, PO Box 26170, Greensboro, NC, 27402, USA.
| |
Collapse
|
21
|
Li Q, Scornavacca C, Galtier N, Chan YB. The Multilocus Multispecies Coalescent: A Flexible New Model of Gene Family Evolution. Syst Biol 2020; 70:822-837. [PMID: 33169795 DOI: 10.1093/sysbio/syaa084] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 05/07/2020] [Accepted: 10/19/2020] [Indexed: 02/06/2023] Open
Abstract
Incomplete lineage sorting (ILS), the interaction between coalescence and speciation, can generate incongruence between gene trees and species trees, as can gene duplication (D), transfer (T), and loss (L). These processes are usually modeled independently, but in reality, ILS can affect gene copy number polymorphism, that is, interfere with DTL. This has been previously recognized, but not treated in a satisfactory way, mainly because DTL events are naturally modeled forward-in-time, while ILS is naturally modeled backward-in-time with the coalescent. Here, we consider the joint action of ILS and DTL on the gene tree/species tree problem in all its complexity. In particular, we show that the interaction between ILS and duplications/transfers (without losses) can result in patterns usually interpreted as resulting from gene loss, and that the realized rate of D, T, and L becomes nonhomogeneous in time when ILS is taken into account. We introduce algorithmic solutions to these problems. Our new model, the multilocus multispecies coalescent, which also accounts for any level of linkage between loci, generalizes the multispecies coalescent (MSC) model and offers a versatile, powerful framework for proper simulation, and inference of gene family evolution. [Gene duplication; gene loss; horizontal gene transfer; incomplete lineage sorting; multispecies coalescent; hemiplasy; recombination.].
Collapse
Affiliation(s)
- Qiuyi Li
- School of Mathematics and Statistics / Melbourne Integrative Genomics, The University of Melbourne, Melbourne 3010, Australia
| | - Celine Scornavacca
- Institut des Sciences de l'Evolution, Université Montpellier, CNRS, IRD, EPHE, Montpellier, 34095, France
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution, Université Montpellier, CNRS, IRD, EPHE, Montpellier, 34095, France
| | - Yao-Ban Chan
- School of Mathematics and Statistics / Melbourne Integrative Genomics, The University of Melbourne, Melbourne 3010, Australia
| |
Collapse
|
22
|
Holland BR, Ketelaar-Jones S, O'Mara AR, Woodhams MD, Jordan GJ. Accuracy of ancestral state reconstruction for non-neutral traits. Sci Rep 2020; 10:7644. [PMID: 32376845 PMCID: PMC7203120 DOI: 10.1038/s41598-020-64647-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 04/09/2020] [Indexed: 12/05/2022] Open
Abstract
The assumptions underpinning ancestral state reconstruction are violated in many evolutionary systems, especially for traits under directional selection. However, the accuracy of ancestral state reconstruction for non-neutral traits is poorly understood. To investigate the accuracy of ancestral state reconstruction methods, trees and binary characters were simulated under the BiSSE (Binary State Speciation and Extinction) model using a wide range of character-state-dependent rates of speciation, extinction and character-state transition. We used maximum parsimony (MP), BiSSE and two-state Markov (Mk2) models to reconstruct ancestral states. Under each method, error rates increased with node depth, true number of state transitions, and rates of state transition and extinction; exceeding 30% for the deepest 10% of nodes and highest rates of extinction and character-state transition. Where rates of character-state transition were asymmetrical, error rates were greater when the rate away from the ancestral state was largest. Preferential extinction of species with the ancestral character state also led to higher error rates. BiSSE outperformed Mk2 in all scenarios where either speciation or extinction was state dependent and outperformed MP under most conditions. MP outperformed Mk2 in most scenarios except when the rates of character-state transition and/or extinction were highly asymmetrical and the ancestral state was unfavoured.
Collapse
Affiliation(s)
- Barbara R Holland
- School of Natural Sciences, University of Tasmania, Private Bag 55, Hobart, Tas, 7001, Australia.
| | - Saan Ketelaar-Jones
- School of Natural Sciences, University of Tasmania, Private Bag 55, Hobart, Tas, 7001, Australia
| | - Aidan R O'Mara
- School of Health Sciences, University of Tasmania, Private Bag 121, Hobart, Tas, 7001, Australia
| | - Michael D Woodhams
- School of Natural Sciences, University of Tasmania, Private Bag 55, Hobart, Tas, 7001, Australia
| | - Gregory J Jordan
- School of Natural Sciences, University of Tasmania, Private Bag 55, Hobart, Tas, 7001, Australia
| |
Collapse
|
23
|
Smith SD, Pennell MW, Dunn CW, Edwards SV. Phylogenetics is the New Genetics (for Most of Biodiversity). Trends Ecol Evol 2020; 35:415-425. [DOI: 10.1016/j.tree.2020.01.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 01/15/2020] [Accepted: 01/20/2020] [Indexed: 12/15/2022]
|
24
|
Wang M, Zhang L, Zhang Z, Li M, Wang D, Zhang X, Xi Z, Keefover-Ring K, Smart LB, DiFazio SP, Olson MS, Yin T, Liu J, Ma T. Phylogenomics of the genus Populus reveals extensive interspecific gene flow and balancing selection. THE NEW PHYTOLOGIST 2020; 225:1370-1382. [PMID: 31550399 DOI: 10.1111/nph.16215] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Accepted: 09/16/2019] [Indexed: 05/10/2023]
Abstract
Phylogenetic analysis is complicated by interspecific gene flow and the presence of shared ancestral polymorphisms, particularly those maintained by balancing selection. In this study, we aimed to examine the prevalence of these factors during the diversification of Populus, a model tree genus in the Northern Hemisphere. We constructed phylogenetic trees of 29 Populus taxa using 80 individuals based on re-sequenced genomes. Our species tree analyses recovered four main clades in the genus based on consensus nuclear phylogenies, but in conflict with the plastome phylogeny. A few interspecific relationships remained unresolved within the multiple-species clade because of inconsistent gene trees. Our results indicated that gene flow has been widespread within each clade and also occurred among the four clades during their early divergence. We identified 45 candidate genes with ancient polymorphisms maintained by balancing selection. These genes were mainly associated with mating compatibility, growth and stress resistance. Both gene flow and selection-mediated ancient polymorphisms are prevalent in the genus Populus. These are potentially important contributors to adaptive variation. Our results provide a framework for the diversification of model tree genus that will facilitate future comparative studies.
Collapse
Affiliation(s)
- Mingcheng Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Lei Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Zhiyang Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Mengmeng Li
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Deyan Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Xu Zhang
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology & College of Life Sciences, Lanzhou University, Lanzhou, 730000, China
| | - Zhenxiang Xi
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Ken Keefover-Ring
- Departments of Botany and Geography, University of Wisconsin-Madison, 430 Lincoln Dr., Madison, WI, 53706, USA
| | - Lawrence B Smart
- Horticulture Section, School of Integrative Plant Science, New York State Agricultural Experiment Station, Cornell University, Geneva, NY, 14456, USA
| | - Stephen P DiFazio
- Department of Biology, West Virginia University, Morgantown, WV, 25606, USA
| | - Matthew S Olson
- Department of Biological Sciences, Texas Tech University, Box 43131, Lubbock, TX, 79409-3131, USA
| | - Tongming Yin
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Forestry, Nanjing Forestry University, Nanjing, 210037, China
| | - Jianquan Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology & College of Life Sciences, Lanzhou University, Lanzhou, 730000, China
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| |
Collapse
|
25
|
Vizueta J, Macías‐Hernández N, Arnedo MA, Rozas J, Sánchez‐Gracia A. Chance and predictability in evolution: The genomic basis of convergent dietary specializations in an adaptive radiation. Mol Ecol 2019; 28:4028-4045. [DOI: 10.1111/mec.15199] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Revised: 07/18/2019] [Accepted: 07/19/2019] [Indexed: 12/11/2022]
Affiliation(s)
- Joel Vizueta
- Departament de Genètica, Microbiologia i Estadística Facultat de Biologia and Institut de Recerca de la Biodiversitat (IRBio) Universitat de Barcelona Barcelona Spain
| | - Nuria Macías‐Hernández
- Laboratory for Integrative Biodiversity Research Finnish Museum of Natural History University of Helsinki Helsinki Finland
- Island Ecology and Evolution Research Group Instituto de Productos Naturales y Agrobiología (IPNA‐CSIC) Tenerife Spain
| | - Miquel A. Arnedo
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals Facultat de Biologia Institut de Recerca de la Biodiversitat (IRBio) Universitat de Barcelona Barcelona Spain
| | - Julio Rozas
- Departament de Genètica, Microbiologia i Estadística Facultat de Biologia and Institut de Recerca de la Biodiversitat (IRBio) Universitat de Barcelona Barcelona Spain
| | - Alejandro Sánchez‐Gracia
- Departament de Genètica, Microbiologia i Estadística Facultat de Biologia and Institut de Recerca de la Biodiversitat (IRBio) Universitat de Barcelona Barcelona Spain
| |
Collapse
|
26
|
Lamichhaney S, Card DC, Grayson P, Tonini JFR, Bravo GA, Näpflin K, Termignoni-Garcia F, Torres C, Burbrink F, Clarke JA, Sackton TB, Edwards SV. Integrating natural history collections and comparative genomics to study the genetic architecture of convergent evolution. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180248. [PMID: 31154982 PMCID: PMC6560268 DOI: 10.1098/rstb.2018.0248] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/25/2019] [Indexed: 12/20/2022] Open
Abstract
Evolutionary convergence has been long considered primary evidence of adaptation driven by natural selection and provides opportunities to explore evolutionary repeatability and predictability. In recent years, there has been increased interest in exploring the genetic mechanisms underlying convergent evolution, in part, owing to the advent of genomic techniques. However, the current 'genomics gold rush' in studies of convergence has overshadowed the reality that most trait classifications are quite broadly defined, resulting in incomplete or potentially biased interpretations of results. Genomic studies of convergence would be greatly improved by integrating deep 'vertical', natural history knowledge with 'horizontal' knowledge focusing on the breadth of taxonomic diversity. Natural history collections have and continue to be best positioned for increasing our comprehensive understanding of phenotypic diversity, with modern practices of digitization and databasing of morphological traits providing exciting improvements in our ability to evaluate the degree of morphological convergence. Combining more detailed phenotypic data with the well-established field of genomics will enable scientists to make progress on an important goal in biology: to understand the degree to which genetic or molecular convergence is associated with phenotypic convergence. Although the fields of comparative biology or comparative genomics alone can separately reveal important insights into convergent evolution, here we suggest that the synergistic and complementary roles of natural history collection-derived phenomic data and comparative genomics methods can be particularly powerful in together elucidating the genomic basis of convergent evolution among higher taxa. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.
Collapse
Affiliation(s)
- Sangeet Lamichhaney
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Daren C. Card
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
- Department of Biology, University of Texas Arlington, Arlington, TX 76019, USA
| | - Phil Grayson
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - João F. R. Tonini
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Gustavo A. Bravo
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Kathrin Näpflin
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Flavia Termignoni-Garcia
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Christopher Torres
- Department of Biology, The University of Texas at Austin, Austin, MA 78712, USA
- Department of Geological Sciences, The University of Texas at Austin, Austin, MA 78712, USA
| | - Frank Burbrink
- Department of Herpetology, The American Museum of Natural History, New York, NY 10024, USA
| | - Julia A. Clarke
- Department of Biology, The University of Texas at Austin, Austin, MA 78712, USA
- Department of Geological Sciences, The University of Texas at Austin, Austin, MA 78712, USA
| | | | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
27
|
Lee KM, Coop G. Population genomics perspectives on convergent adaptation. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180236. [PMID: 31154979 PMCID: PMC6560269 DOI: 10.1098/rstb.2018.0236] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2018] [Indexed: 01/12/2023] Open
Abstract
Convergent adaptation is the independent evolution of similar traits conferring a fitness advantage in two or more lineages. Cases of convergent adaptation inform our ideas about the ecological and molecular basis of adaptation. In judging the degree to which putative cases of convergent adaptation provide an independent replication of the process of adaptation, it is necessary to establish the degree to which the evolutionary change is unexpected under null models and to show that selection has repeatedly, independently driven these changes. Here, we discuss the issues that arise from these questions particularly for closely related populations, where gene flow and standing variation add additional layers of complexity. We outline a conceptual framework to guide intuition as to the extent to which evolutionary change represents the independent gain of information owing to selection and show that this is a measure of how surprised we should be by convergence. Additionally, we summarize the ways population and quantitative genetics and genomics may help us address questions related to convergent adaptation, as well as open new questions and avenues of research. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.
Collapse
Affiliation(s)
- Kristin M. Lee
- Center for Population Biology, University of California, Davis, CA 95616, USA
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Graham Coop
- Center for Population Biology, University of California, Davis, CA 95616, USA
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| |
Collapse
|
28
|
Fungal species boundaries in the genomics era. Fungal Genet Biol 2019; 131:103249. [PMID: 31279976 DOI: 10.1016/j.fgb.2019.103249] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 05/21/2019] [Accepted: 06/28/2019] [Indexed: 12/30/2022]
Abstract
Genomic data has opened new possibilities to understand how organisms change over time, and could enable the discovery of previously undescribed species. Although taxonomy used to be based on phenotypes, molecular data has frequently revealed that morphological traits are insufficient to describe biodiversity. Genomics holds the promise of revealing even more genetic discontinuities, but the parameters on how to describe species from genomic data remain unclear. Fungi have been a successful case in which the use of molecular markers has uncovered the existence of genetic boundaries where no crosses are possible. In this minireview, we highlight recent advances, propose a set of standards to use genomic sequences to uncover species boundaries, point out potential pitfalls, and present possible future research directions.
Collapse
|
29
|
Mendes FK, Livera AP, Hahn MW. The perils of intralocus recombination for inferences of molecular convergence. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180244. [PMID: 31154973 DOI: 10.1098/rstb.2018.0244] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Accurate inferences of convergence require that the appropriate tree topology be used. If there is a mismatch between the tree a trait has evolved along and the tree used for analysis, then false inferences of convergence ('hemiplasy') can occur. To avoid problems of hemiplasy when there are high levels of gene tree discordance with the species tree, researchers have begun to construct tree topologies from individual loci. However, due to intralocus recombination, even locus-specific trees may contain multiple topologies within them. This implies that the use of individual tree topologies discordant with the species tree can still lead to incorrect inferences about molecular convergence. Here, we examine the frequency with which single exons and single protein-coding genes contain multiple underlying tree topologies, in primates and Drosophila, and quantify the effects of hemiplasy when using trees inferred from individual loci. In both clades, we find that there are most often multiple diagnosable topologies within single exons and whole genes, with 91% of Drosophila protein-coding genes containing multiple topologies. Because of this underlying topological heterogeneity, even using trees inferred from individual protein-coding genes results in 25% and 38% of substitutions falsely labelled as convergent in primates and Drosophila, respectively. While constructing local trees can reduce the problem of hemiplasy, our results suggest that it will be difficult to completely avoid false inferences of convergence. We conclude by suggesting several ways forward in the analysis of convergent evolution, for both molecular and morphological characters. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.
Collapse
Affiliation(s)
- Fábio K Mendes
- 1 Department of Computer Science, The University of Auckland , Auckland 1010 , New Zealand.,2 Department of Biology, Indiana University , Bloomington, IN 47405 , USA
| | - Andrew P Livera
- 2 Department of Biology, Indiana University , Bloomington, IN 47405 , USA
| | - Matthew W Hahn
- 2 Department of Biology, Indiana University , Bloomington, IN 47405 , USA.,3 Department of Computer Science, Indiana University , Bloomington, IN 47405 , USA
| |
Collapse
|
30
|
Bravo GA, Antonelli A, Bacon CD, Bartoszek K, Blom MPK, Huynh S, Jones G, Knowles LL, Lamichhaney S, Marcussen T, Morlon H, Nakhleh LK, Oxelman B, Pfeil B, Schliep A, Wahlberg N, Werneck FP, Wiedenhoeft J, Willows-Munro S, Edwards SV. Embracing heterogeneity: coalescing the Tree of Life and the future of phylogenomics. PeerJ 2019; 7:e6399. [PMID: 30783571 PMCID: PMC6378093 DOI: 10.7717/peerj.6399] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 01/07/2019] [Indexed: 12/23/2022] Open
Abstract
Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress.
Collapse
Affiliation(s)
- Gustavo A. Bravo
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Alexandre Antonelli
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
- Gothenburg Botanical Garden, Göteborg, Sweden
| | - Christine D. Bacon
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Krzysztof Bartoszek
- Department of Computer and Information Science, Linköping University, Linköping, Sweden
| | - Mozes P. K. Blom
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Stella Huynh
- Institut de Biologie, Université de Neuchâtel, Neuchâtel, Switzerland
| | - Graham Jones
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - L. Lacey Knowles
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Sangeet Lamichhaney
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Thomas Marcussen
- Centre for Ecological and Evolutionary Synthesis, University of Oslo, Oslo, Norway
| | - Hélène Morlon
- Institut de Biologie, Ecole Normale Supérieure de Paris, Paris, France
| | - Luay K. Nakhleh
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Bengt Oxelman
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Bernard Pfeil
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Alexander Schliep
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| | | | - Fernanda P. Werneck
- Coordenação de Biodiversidade, Programa de Coleções Científicas Biológicas, Instituto Nacional de Pesquisa da Amazônia, Manaus, AM, Brazil
| | - John Wiedenhoeft
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
- Department of Computer Science, Rutgers University, Piscataway, NJ, USA
| | - Sandi Willows-Munro
- School of Life Sciences, University of Kwazulu-Natal, Pietermaritzburg, South Africa
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Centre for Advanced Studies in Science and Technology, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| |
Collapse
|