1
|
Liu S, Chen Q, Hou C, Dong C, Qiu X, Tang K. Recovery of 1559 metagenome-assembled genomes from the East China Sea's low-oxygen region. Sci Data 2024; 11:994. [PMID: 39266528 PMCID: PMC11393323 DOI: 10.1038/s41597-024-03850-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 09/02/2024] [Indexed: 09/14/2024] Open
Abstract
The Changjiang Estuary and adjacent East China Sea are well-known hypoxic aquatic environments. Eutrophication-driven hypoxia frequently occurs in coastal areas, posing a major threat to the ecological environment, including altering community structure and metabolic processes of marine organisms, and enhancing diversion of energy shunt into microbial communities. However, the responses of microbial communities and their metabolic pathways to coastal hypoxia remain poorly understood. Here, we studied the microbial communities collected from spatiotemporal samplings using metagenomic sequencing in the Changjiang Estuary and adjacent East China Sea. This generated 1.31 Tbp of metagenomics data, distributed across 103 samples corresponding to 8 vertical profiles. We further reported 1,559 metagenome-assembled genomes (MAGs), of which 508 were high-quality MAGs (Completeness > 90% and Contamination < 10%). Phylogenomic analysis classified them into 181 archaeal and 1,378 bacterial MAGs. These results provided a valuable metagenomic dataset available for further investigation of the effects of hypoxia on marine microorganisms.
Collapse
Affiliation(s)
- Shujing Liu
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China
| | - Quanrui Chen
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China
| | - Congcong Hou
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China
| | - Changjie Dong
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China
| | - Xuanyun Qiu
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China
| | - Kai Tang
- State Key Laboratory of Marine Environmental Science, Fujian Key Laboratory of Marine Carbon Sequestration, College of Ocean and Earth Sciences, Xiamen University, Fujian, China.
| |
Collapse
|
2
|
Leal JL, Milesi P, Hodková E, Zhou Q, James J, Eklund DM, Pyhäjärvi T, Salojärvi J, Lascoux M. Complex Polyploids: Origins, Genomic Composition, and Role of Introgressed Alleles. Syst Biol 2024; 73:392-418. [PMID: 38613229 PMCID: PMC11282369 DOI: 10.1093/sysbio/syae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 12/18/2023] [Accepted: 03/28/2024] [Indexed: 04/14/2024] Open
Abstract
Introgression allows polyploid species to acquire new genomic content from diploid progenitors or from other unrelated diploid or polyploid lineages, contributing to genetic diversity and facilitating adaptive allele discovery. In some cases, high levels of introgression elicit the replacement of large numbers of alleles inherited from the polyploid's ancestral species, profoundly reshaping the polyploid's genomic composition. In such complex polyploids, it is often difficult to determine which taxa were the progenitor species and which taxa provided additional introgressive blocks through subsequent hybridization. Here, we use population-level genomic data to reconstruct the phylogenetic history of Betula pubescens (downy birch), a tetraploid species often assumed to be of allopolyploid origin and which is known to hybridize with at least four other birch species. This was achieved by modeling polyploidization and introgression events under the multispecies coalescent and then using an approximate Bayesian computation rejection algorithm to evaluate and compare competing polyploidization models. We provide evidence that B. pubescens is the outcome of an autoploid genome doubling event in the common ancestor of B. pendula and its extant sister species, B. platyphylla, that took place approximately 178,000-188,000 generations ago. Extensive hybridization with B. pendula, B. nana, and B. humilis followed in the aftermath of autopolyploidization, with the relative contribution of each of these species to the B. pubescens genome varying markedly across the species' range. Functional analysis of B. pubescens loci containing alleles introgressed from B. nana identified multiple genes involved in climate adaptation, while loci containing alleles derived from B. humilis revealed several genes involved in the regulation of meiotic stability and pollen viability in plant species.
Collapse
Affiliation(s)
- J Luis Leal
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Pascal Milesi
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| | - Eva Hodková
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Faculty of Environmental Sciences, Czech University of Life Sciences Prague, Kamýcká 129, 16521 Prague, Czech Republic
| | - Qiujie Zhou
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Jennifer James
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - D Magnus Eklund
- Physiology and Environmental Toxicology, Department of Organismal Biology, Uppsala University, Norbyvägen 18A, 75236 Uppsala, Sweden
| | - Tanja Pyhäjärvi
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65 (Viikinkaari 1), 00014 Helsinki, Finland
- Department of Forest Sciences, University of Helsinki, 00014 Helsinki, Finland
| | - Jarkko Salojärvi
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65 (Viikinkaari 1), 00014 Helsinki, Finland
| | - Martin Lascoux
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| |
Collapse
|
3
|
Rick JA, Brock CD, Lewanski AL, Golcher-Benavides J, Wagner CE. Reference Genome Choice and Filtering Thresholds Jointly Influence Phylogenomic Analyses. Syst Biol 2024; 73:76-101. [PMID: 37881861 DOI: 10.1093/sysbio/syad065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 09/20/2023] [Accepted: 10/20/2023] [Indexed: 10/27/2023] Open
Abstract
Molecular phylogenies are a cornerstone of modern comparative biology and are commonly employed to investigate a range of biological phenomena, such as diversification rates, patterns in trait evolution, biogeography, and community assembly. Recent work has demonstrated that significant biases may be introduced into downstream phylogenetic analyses from processing genomic data; however, it remains unclear whether there are interactions among bioinformatic parameters or biases introduced through the choice of reference genome for sequence alignment and variant calling. We address these knowledge gaps by employing a combination of simulated and empirical data sets to investigate the extent to which the choice of reference genome in upstream bioinformatic processing of genomic data influences phylogenetic inference, as well as the way that reference genome choice interacts with bioinformatic filtering choices and phylogenetic inference method. We demonstrate that more stringent minor allele filters bias inferred trees away from the true species tree topology, and that these biased trees tend to be more imbalanced and have a higher center of gravity than the true trees. We find the greatest topological accuracy when filtering sites for minor allele count (MAC) >3-4 in our 51-taxa data sets, while tree center of gravity was closest to the true value when filtering for sites with MAC >1-2. In contrast, filtering for missing data increased accuracy in the inferred topologies; however, this effect was small in comparison to the effect of minor allele filters and may be undesirable due to a subsequent mutation spectrum distortion. The bias introduced by these filters differs based on the reference genome used in short read alignment, providing further support that choosing a reference genome for alignment is an important bioinformatic decision with implications for downstream analyses. These results demonstrate that attributes of the study system and dataset (and their interaction) add important nuance for how best to assemble and filter short-read genomic data for phylogenetic inference.
Collapse
Affiliation(s)
- Jessica A Rick
- School of Natural Resources & the Environment, University of Arizona, Tucson, AZ 85719, USA
| | - Chad D Brock
- Department of Biological Sciences, Tarleton State University, Stephenville, TX 76401, USA
| | - Alexander L Lewanski
- Department of Integrative Biology and W.K. Kellogg Biological Station, Michigan State University, East Lansing, MI 48824, USA
| | - Jimena Golcher-Benavides
- Department of Natural Resource Ecology and Management, Iowa State University, Ames, IA 50011, USA
| | - Catherine E Wagner
- Program in Ecology and Evolution, University of Wyoming, Laramie, WY 82071, USA
- Department of Botany, University of Wyoming, Laramie, WY 82071, USA
| |
Collapse
|
4
|
Cai ZY, Niu ZY, Zhang YY, Tong YH, Vu TC, Goh WL, Sungkaew S, Teerawatananon A, Xia NH. Phylogenomic analyses reveal reticulate evolution between Neomicrocalamus and Temochloa (Poaceae: Bambusoideae). FRONTIERS IN PLANT SCIENCE 2023; 14:1274337. [PMID: 38111884 PMCID: PMC10726129 DOI: 10.3389/fpls.2023.1274337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/14/2023] [Indexed: 12/20/2023]
Abstract
Neomicrocalamus and Temochloa are closely related to bamboo genera. However, when considered with newly discovered and morphologically similar material from China and Vietnam, the phylogenetic relationship among these three groups was ambiguous in the analyses based on DNA regions. Here, as a means of investigating the relationships among the three bamboo groups and exploring potential sources of genomic conflicts, we present a phylogenomic examination based on the whole plastome, single-nucleotide polymorphism (SNP), and single-copy nuclear (SCN) gene datasets. Three different phylogenetic hypotheses were found. The inconsistency is attributed to the combination of incomplete lineage sorting and introgression. The origin of newly discovered bamboos is from introgressive hybridization between Temochloa liliana (which contributed 80.7% of the genome) and Neomicrocalamus prainii (19.3%), indicating that the newly discovered bamboos are closer to T. liliana in genetics. The more similar morphology and closer distribution elevation also imply a closer relationship between Temochloa and newly discovered bamboos.
Collapse
Affiliation(s)
- Zhuo-Yu Cai
- Key Laboratory of Plant Resources, Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Digital Botanical Garden, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- South China National Botanical Garden, Guangzhou, China
| | - Zheng-Yang Niu
- Key Laboratory of Plant Resources, Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Digital Botanical Garden, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- South China National Botanical Garden, Guangzhou, China
| | - You-Yuan Zhang
- Key Laboratory of Plant Resources, Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Digital Botanical Garden, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- Quality Management Office, Guiyang Vocational and Technical College, Guiyang, China
| | - Yi-Hua Tong
- Key Laboratory of Plant Resources, Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Digital Botanical Garden, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- South China National Botanical Garden, Guangzhou, China
- State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
| | - Tien Chinh Vu
- Vietnam National Museum of Nature, Vietnam Academy of Science and Technology, Hanoi, Vietnam
- Graduate University of Science and Technology, Vietnam Academy of Science and Technology, Hanoi, Vietnam
| | - Wei Lim Goh
- Faculty of Science, Universiti Tunku Abdul Rahman, Jalan Universiti, Kampar, Perak, Malaysia
| | - Sarawood Sungkaew
- Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok, Thailand
| | | | - Nian-He Xia
- Key Laboratory of Plant Resources, Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Digital Botanical Garden, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- South China National Botanical Garden, Guangzhou, China
| |
Collapse
|
5
|
Myers BM, Burns KJ, Clark CJ, Brelsford A. Sampling affects population genetic inference: A case study of the Allen's (Selasphorus sasin) and rufous hummingbird (Selasphorus rufus). J Hered 2023; 114:625-636. [PMID: 37455658 DOI: 10.1093/jhered/esad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 07/12/2023] [Indexed: 07/18/2023] Open
Abstract
Gene flow can affect evolutionary inference when species are undersampled. Here, we evaluate the effects of gene flow and geographic sampling on demographic inference of 2 hummingbirds that hybridize, Allen's hummingbird (Selasphorus sasin) and rufous hummingbird (Selasphorus rufus). Using whole-genome data and extensive geographic sampling, we find widespread connectivity, with introgression far beyond the Allen's × rufous hybrid zone, although the Z chromosome resists introgression beyond the hybrid zone. We test alternative hypotheses of speciation history of Allen's, rufous, and Calliope (S. calliope) hummingbird and find that rufous hummingbird is the sister taxon to Allen's hummingbird, and Calliope hummingbird is the outgroup. A model treating the 2 subspecies of Allen's hummingbird as a single panmictic population fit observed genetic data better than models treating the subspecies as distinct populations, in contrast to morphological and behavioral differences and analyses of spatial population structure. With additional sampling, our study builds upon recent studies that came to conflicting conclusions regarding the evolutionary histories of these 2 species. Our results stress the importance of thorough geographic sampling when assessing demographic history in the presence of gene flow.
Collapse
Affiliation(s)
- Brian M Myers
- Department of Biological Sciences, San Diego State University, San Diego, CA, United States
| | - Kevin J Burns
- Department of Biological Sciences, San Diego State University, San Diego, CA, United States
| | - Christopher J Clark
- Department of Evolution, Ecology, and Organismal Biology, Speith Hall, University of California, Riverside, CA, United States
| | - Alan Brelsford
- Department of Evolution, Ecology, and Organismal Biology, Speith Hall, University of California, Riverside, CA, United States
| |
Collapse
|
6
|
DeRaad DA, McCullough JM, DeCicco LH, Hime PM, Joseph L, Andersen MJ, Moyle RG. Mitonuclear discordance results from incomplete lineage sorting, with no detectable evidence for gene flow, in a rapid radiation of Todiramphus kingfishers. Mol Ecol 2023; 32:4844-4862. [PMID: 37515525 DOI: 10.1111/mec.17080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 03/15/2023] [Accepted: 06/19/2023] [Indexed: 07/31/2023]
Abstract
Many organisms possess multiple discrete genomes (i.e. nuclear and organellar), which are inherited separately and may have unique and even conflicting evolutionary histories. Phylogenetic reconstructions from these discrete genomes can yield different patterns of relatedness, a phenomenon known as cytonuclear discordance. In many animals, mitonuclear discordance (i.e. discordant evolutionary histories between the nuclear and mitochondrial genomes) has been widely documented, but its causes are often considered idiosyncratic and inscrutable. We show that a case of mitonuclear discordance in Todiramphus kingfishers can be explained by extensive genome-wide incomplete lineage sorting (ILS), likely a result of the explosive diversification history of this genus. For these kingfishers, quartet frequencies reveal that the nuclear genome is dominated by discordant topologies, with none of the internal branches in our consensus nuclear tree recovered in >50% of genome-wide gene trees. Meanwhile, a lack of inter-species shared ancestry, non-significant pairwise tests for gene flow, and little evidence for meaningful migration edges between species, leads to the conclusion that gene flow cannot explain the mitonuclear discordance we observe. This lack of evidence for gene flow combined with evidence for extensive genome-wide gene tree discordance, a hallmark of ILS, leads us to conclude that the mitonuclear discordance we observe likely results from ILS, specifically deep coalescence of the mitochondrial genome. Based on this case study, we hypothesize that similar demographic histories in other 'great speciator' taxa across the Indo-Pacific likely predispose these groups to high levels of ILS and high likelihoods of mitonuclear discordance.
Collapse
Affiliation(s)
- Devon A DeRaad
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Jenna M McCullough
- Department of Biology and Museum of Southwestern Biology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Lucas H DeCicco
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Paul M Hime
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Leo Joseph
- Australian National Wildlife Collection, CSIRO National Research Collections Australia, Canberra, Australian Capital Territory, Australia
| | - Michael J Andersen
- Department of Biology and Museum of Southwestern Biology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Robert G Moyle
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| |
Collapse
|
7
|
Leal JL, Milesi P, Salojärvi J, Lascoux M. Phylogenetic Analysis of Allotetraploid Species Using Polarized Genomic Sequences. Syst Biol 2023; 72:372-390. [PMID: 36932679 PMCID: PMC10275558 DOI: 10.1093/sysbio/syad009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 10/14/2022] [Accepted: 03/10/2023] [Indexed: 03/19/2023] Open
Abstract
Phylogenetic analysis of polyploid hybrid species has long posed a formidable challenge as it requires the ability to distinguish between alleles of different ancestral origins in order to disentangle their individual evolutionary history. This problem has been previously addressed by conceiving phylogenies as reticulate networks, using a two-step phasing strategy that first identifies and segregates homoeologous loci and then, during a second phasing step, assigns each gene copy to one of the subgenomes of an allopolyploid species. Here, we propose an alternative approach, one that preserves the core idea behind phasing-to produce separate nucleotide sequences that capture the reticulate evolutionary history of a polyploid-while vastly simplifying its implementation by reducing a complex multistage procedure to a single phasing step. While most current methods used for phylogenetic reconstruction of polyploid species require sequencing reads to be pre-phased using experimental or computational methods-usually an expensive, complex, and/or time-consuming endeavor-phasing executed using our algorithm is performed directly on the multiple-sequence alignment (MSA), a key change that allows for the simultaneous segregation and sorting of gene copies. We introduce the concept of genomic polarization that, when applied to an allopolyploid species, produces nucleotide sequences that capture the fraction of a polyploid genome that deviates from that of a reference sequence, usually one of the other species present in the MSA. We show that if the reference sequence is one of the parental species, the polarized polyploid sequence has a close resemblance (high pairwise sequence identity) to the second parental species. This knowledge is harnessed to build a new heuristic algorithm where, by replacing the allopolyploid genomic sequence in the MSA by its polarized version, it is possible to identify the phylogenetic position of the polyploid's ancestral parents in an iterative process. The proposed methodology can be used with long-read and short-read high-throughput sequencing data and requires only one representative individual for each species to be included in the phylogenetic analysis. In its current form, it can be used in the analysis of phylogenies containing tetraploid and diploid species. We test the newly developed method extensively using simulated data in order to evaluate its accuracy. We show empirically that the use of polarized genomic sequences allows for the correct identification of both parental species of an allotetraploid with up to 97% certainty in phylogenies with moderate levels of incomplete lineage sorting (ILS) and 87% in phylogenies containing high levels of ILS. We then apply the polarization protocol to reconstruct the reticulate histories of Arabidopsis kamchatica and Arabidopsis suecica, two allopolyploids whose ancestry has been well documented. [Allopolyploidy; Arabidopsis; genomic polarization; homoeologs; incomplete lineage sorting; phasing; polyploid phylogenetics; reticulate evolution.].
Collapse
Affiliation(s)
- J Luis Leal
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Pascal Milesi
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| | - Jarkko Salojärvi
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65 (Viikinkaari 1), 00014 Helsinki, Finland
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| | - Martin Lascoux
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| |
Collapse
|
8
|
Faltings L, Young MW, Ross CF, Granatosky MC. Got rhythm? Rhythmicity differences reflect different optimality criteria in feeding and locomotor systems. Evolution 2022; 76:2181-2190. [PMID: 35862552 DOI: 10.1111/evo.14569] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 04/19/2022] [Accepted: 05/24/2022] [Indexed: 01/22/2023]
Abstract
Evolutionary analyses of joint kinematics and muscle mechanics suggest that, during cyclic behaviors, tetrapod feeding systems are optimized for precise application of forces over small displacements during chewing, whereas locomotor systems are more optimized for large and rapid joint excursions during walking and running. If this hypothesis is correct, then it stands to reason that other biomechanical variables in the feeding and locomotor systems should also reflect these divergent functions. We compared rhythmicity of cyclic jaw and limb movements in feeding and locomotor systems in 261 tetrapod species in a phylogenetic context. Accounting for potential confounding variables, our analyses reveal higher rhythmicity of cyclic movements of the limbs than of the jaw. Higher rhythmicity in the locomotor system corroborates a hypothesis of stronger optimization for energetic efficiency: deviation from the limbs' natural frequency results in greater variability of center of mass movements and limb inertial changes, and therefore more work by limb muscles. Relatively lower rhythmicity in the feeding system may be a consequence of the necessity to prevent tooth breakage and wear, the greater complexity of coordination with tongue movements, and/or a greater emphasis on energy storage in elastic elements rather than the kinetics of limb movement.
Collapse
Affiliation(s)
- Lukas Faltings
- College of Osteopathic Medicine, New York Institute of Technology, Old Westbury, New York, 11568, USA
| | - Melody W Young
- College of Osteopathic Medicine, New York Institute of Technology, Old Westbury, New York, 11568, USA
- Department of Anatomy, Center for Biomedical Innovation, College of Osteopathic Medicine, New York Institute of Technology, Old Westbury, New York, 11568, USA
| | - Callum F Ross
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, Illinois, 60637, USA
| | - Michael C Granatosky
- College of Osteopathic Medicine, New York Institute of Technology, Old Westbury, New York, 11568, USA
- Department of Anatomy, Center for Biomedical Innovation, College of Osteopathic Medicine, New York Institute of Technology, Old Westbury, New York, 11568, USA
| |
Collapse
|
9
|
Zhang Y, Wang M, Cheng R, Luo Y, Li Y, Liu Z, Chen Q, Shen Y. Mitochondrial characteristics of Pseudohynobius flavomaculatus a protected salamander in China, and biogeographical implications for the family Hynobiidae (Amphibia, Caudata). ZOOSYST EVOL 2022. [DOI: 10.3897/zse.98.66578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Pseudohynobius flavomaculatus a provincially-protected salamander species, inhabits mountainous areas of Chongqing and surrounding provinces in China. In the present study, the complete mitochondrial genome of P. flavomaculatus was sequenced and analyzed. The mitogenome is 16,401 bp in length and consisted of 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. We performed a novel phylogenetic analysis, which demonstrated a sister relationship between P. flavomaculatus and P. jinfo. The 95% confidence interval around our new divergence date estimate suggest that Hynobiidae originated at 101.62–119.84 (mean=110.87) Ma. Species within Hynobiidae diverged successively in the Cenozoic era, and hynobiid speciation coincides primarily with geologic events. Our biogeographical inference demonstrates that nearly all early hynobiids divergences correspond to geological estimates of orogeny, which may have contributed to the notably high dN/dS ratio in this clade. We conclude that orogeny is likely a primary, dynamic factor, which may have repeatedly initiated the process of speciation in the family Hynobiidae.
Collapse
|
10
|
DeRaad DA, McCormack JE, Chen N, Peterson AT, Moyle RG. Combining Species Delimitation, Species Trees, and Tests for Gene Flow Clarifies Complex Speciation in Scrub-Jays. Syst Biol 2022; 71:1453-1470. [PMID: 35552760 DOI: 10.1093/sysbio/syac034] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 05/02/2022] [Accepted: 05/06/2022] [Indexed: 11/13/2022] Open
Abstract
Complex speciation, involving rapid divergence and multiple bouts of post-divergence gene flow, can obfuscate phylogenetic relationships and species limits. In North America, cases of complex speciation are common, due at least in part to the cyclical Pleistocene glacial history of the continent. Scrub-jays in the genus Aphelocoma provide a useful case study in complex speciation because their range throughout North America is structured by phylogeographic barriers with multiple cases of secondary contact between divergent lineages. Here, we show that a comprehensive approach to genomic reconstruction of evolutionary history, i.e., synthesizing results from species delimitation, species tree reconstruction, demographic model testing, and tests for gene flow, is capable of clarifying evolutionary history despite complex speciation. We find concordant evidence across all statistical approaches for the distinctiveness of an endemic southern Mexico lineage (A. w. sumichrasti), culminating in support for the species status of this lineage under any commonly applied species concept. We also find novel genomic evidence for the species status of a Texas endemic lineage A. w. texana, for which equivocal species delimitation results were clarified by demographic modeling and spatially explicit models of gene flow. Finally, we find that complex signatures of both ancient and modern gene flow between the non-sister California Scrub-Jay (A. californica) and Woodhouse's Scrub-Jay (A. woodhouseii), result in discordant gene trees throughout the species' genomes despite clear support for their overall isolation and species status. In sum, we find that a multi-faceted approach to genomic analysis can increase our understanding of complex speciation histories, even in well-studied groups. Given the emerging recognition that complex speciation is relatively commonplace, the comprehensive framework that we demonstrate for interrogation of species limits and evolutionary history using genomic data can provide a necessary roadmap for disentangling the impacts of gene flow and incomplete lineage sorting to better understand the systematics of other groups with similarly complex evolutionary histories.
Collapse
Affiliation(s)
- Devon A DeRaad
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence KS, 66045, USA
| | - John E McCormack
- Moore Laboratory of Zoology,Occidental College, Los Angeles, CA, 90041, USA
| | - Nancy Chen
- Department of Biology, University of Rochester, Rochester, NY, 14627, USA
| | - A Townsend Peterson
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence KS, 66045, USA
| | - Robert G Moyle
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence KS, 66045, USA
| |
Collapse
|
11
|
Abstract
Motivation Phylogenomics faces a dilemma: on the one hand, most accurate species and gene tree estimation methods are those that co-estimate them; on the other hand, these co-estimation methods do not scale to moderately large numbers of species. The summary-based methods, which first infer gene trees independently and then combine them, are much more scalable but are prone to gene tree estimation error, which is inevitable when inferring trees from limited-length data. Gene tree estimation error is not just random noise and can create biases such as long-branch attraction. Results We introduce a scalable likelihood-based approach to co-estimation under the multi-species coalescent model. The method, called quartet co-estimation (QuCo), takes as input independently inferred distributions over gene trees and computes the most likely species tree topology and internal branch length for each quartet, marginalizing over gene tree topologies and ignoring branch lengths by making several simplifying assumptions. It then updates the gene tree posterior probabilities based on the species tree. The focus on gene tree topologies and the heuristic division to quartets enables fast likelihood calculations. We benchmark our method with extensive simulations for quartet trees in zones known to produce biased species trees and further with larger trees. We also run QuCo on a biological dataset of bees. Our results show better accuracy than the summary-based approach ASTRAL run on estimated gene trees. Availability and implementation QuCo is available on https://github.com/maryamrabiee/quco. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Maryam Rabiee
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093, USA
| | | |
Collapse
|
12
|
Brock KM, McTavish EJ, Edwards DL. Color Polymorphism is a Driver of Diversification in the Lizard Family Lacertidae. Syst Biol 2021; 71:24-39. [PMID: 34146110 PMCID: PMC8677543 DOI: 10.1093/sysbio/syab046] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 06/05/2021] [Accepted: 06/11/2021] [Indexed: 11/13/2022] Open
Abstract
Color polymorphism-two or more heritable color phenotypes maintained within a single breeding population-is an extreme type of intraspecific diversity widespread across the tree of life. Color polymorphism is hypothesized to be an engine for speciation, where morph loss or divergence between distinct color morphs within a species results in the rapid evolution of new lineages, and thus, color polymorphic lineages are expected to display elevated diversification rates. Multiple species in the lizard family Lacertidae are color polymorphic, making them an ideal group to investigate the evolutionary history of this trait and its influence on macroevolution. Here, we produce a comprehensive species-level phylogeny of the lizard family Lacertidae to reconstruct the evolutionary history of color polymorphism and test if color polymorphism has been a driver of diversification. Accounting for phylogenetic uncertainty with multiple phylogenies and simulation studies, we estimate an ancient origin of color polymorphism (111 Ma) within the Lacertini tribe (subfamily Lacertinae). Color polymorphism most likely evolved few times in the Lacertidae and has been lost at a much faster rate than gained. Evolutionary transitions to color polymorphism are associated with shifts in increased net diversification rate in this family of lizards. Taken together, our empirical results support long-standing theoretical expectations that color polymorphism is a driver of diversification.[Color polymorphism; Lacertidae; state-dependent speciation extinction models; trait-dependent diversification.].
Collapse
Affiliation(s)
- Kinsey M Brock
- Department of Life & Environmental Sciences, School of Natural Sciences, University of California, Merced 5400 N. Lake Rd., Merced, CA 95340 USA
- Quantitative & Systems Biology Graduate Group, School of Natural Sciences, University of California, Merced 5400 N. Lake Rd., Merced, CA 95340 USA
| | - Emily Jane McTavish
- Department of Life & Environmental Sciences, School of Natural Sciences, University of California, Merced 5400 N. Lake Rd., Merced, CA 95340 USA
| | - Danielle L Edwards
- Department of Life & Environmental Sciences, School of Natural Sciences, University of California, Merced 5400 N. Lake Rd., Merced, CA 95340 USA
| |
Collapse
|
13
|
Chafin TK, Douglas MR, Bangs MR, Martin BT, Mussmann SM, Douglas ME. Taxonomic Uncertainty and the Anomaly Zone: Phylogenomics Disentangle a Rapid Radiation to Resolve Contentious Species (Gila robusta Complex) in the Colorado River. Genome Biol Evol 2021; 13:evab200. [PMID: 34432005 PMCID: PMC8449829 DOI: 10.1093/gbe/evab200] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/19/2021] [Indexed: 12/18/2022] Open
Abstract
Species are indisputable units for biodiversity conservation, yet their delimitation is fraught with both conceptual and methodological difficulties. A classic example is the taxonomic controversy surrounding the Gila robusta complex in the lower Colorado River of southwestern North America. Nominal species designations were originally defined according to weakly diagnostic morphological differences, but these conflicted with subsequent genetic analyses. Given this ambiguity, the complex was re-defined as a single polytypic unit, with the proposed "threatened" status under the U.S. Endangered Species Act of two elements being withdrawn. Here we re-evaluated the status of the complex by utilizing dense spatial and genomic sampling (n = 387 and >22 k loci), coupled with SNP-based coalescent and polymorphism-aware phylogenetic models. In doing so, we found that all three species were indeed supported as evolutionarily independent lineages, despite widespread phylogenetic discordance. To juxtapose this discrepancy with previous studies, we first categorized those evolutionary mechanisms driving discordance, then tested (and subsequently rejected) prior hypotheses which argued phylogenetic discord in the complex was driven by the hybrid origin of Gila nigra. The inconsistent patterns of diversity we found within G. robusta were instead associated with rapid Plio-Pleistocene drainage evolution, with subsequent divergence within the "anomaly zone" of tree space producing ambiguities that served to confound prior studies. Our results not only support the resurrection of the three species as distinct entities but also offer an empirical example of how phylogenetic discordance can be categorized within other recalcitrant taxa, particularly when variation is primarily partitioned at the species level.
Collapse
Affiliation(s)
- Tyler K Chafin
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Colorado, USA
| | - Marlis R Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| | - Max R Bangs
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Department of Biological Science, Florida State University, Tallahassee, Florida, USA
| | - Bradley T Martin
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Global Campus, University of Arkansas, Fayetteville, Arkansas, USA
| | - Steven M Mussmann
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Southwestern Native Aquatic Resources and Recovery Center, U.S. Fish & Wildlife Service, Dexter, New Mexico, USA
| | - Michael E Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| |
Collapse
|
14
|
Vázquez-Miranda H, Barker FK. Autosomal, sex-linked and mitochondrial loci resolve evolutionary relationships among wrens in the genus Campylorhynchus. Mol Phylogenet Evol 2021; 163:107242. [PMID: 34224849 DOI: 10.1016/j.ympev.2021.107242] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 06/14/2021] [Accepted: 06/29/2021] [Indexed: 01/18/2023]
Abstract
Although there is general consensus that sampling of multiple genetic loci is critical in accurate reconstruction of species trees, the exact numbers and the best types of molecular markers remain an open question. In particular, the phylogenetic utility of sex-linked loci is underexplored. Here, we sample all species and 70% of the named diversity of the New World wren genus Campylorhynchus using sequences from 23 loci, to evaluate the effects of linkage on efficiency in recovering a well-supported tree for the group. At a tree-wide level, we found that most loci supported fewer than half the possible clades and that sex-linked loci produced similar resolution to slower-coalescing autosomal markers, controlling for locus length. By contrast, we did find evidence that linkage affected the efficiency of recovery of individual relationships; as few as two sex-linked loci were necessary to resolve a selection of clades with long to medium subtending branches, whereas 4-6 autosomal loci were necessary to achieve comparable results. These results support an expanded role for sampling of the avian Z chromosome in phylogenetic studies, including target enrichment approaches. Our concatenated and species tree analyses represent significant improvements in our understanding of diversification in Campylorhynchus, and suggest a relatively complex scenario for its radiation across the Miocene/Pliocene boundary, with multiple invasions of South America.
Collapse
Affiliation(s)
- Hernán Vázquez-Miranda
- Departamento de Zoología, Instituto de Biología, Universidad Nacional Autónoma de México, Ciudad de México C.P. 04510, Mexico
| | - F Keith Barker
- Department of Ecology, Evolution and Behavior, Bell Museum of Natural History, University of Minnesota, 40 Gortner Laboratory, 1479 Gortner Avenue, Saint Paul, MN 55108, USA
| |
Collapse
|
15
|
Thomas SK, Liu X, Du Z, Dong Y, Cummings A, Pokorny L, Xiang Q(J, Leebens‐Mack JH. Comprehending Cornales: phylogenetic reconstruction of the order using the Angiosperms353 probe set. AMERICAN JOURNAL OF BOTANY 2021; 108:1112-1121. [PMID: 34263456 PMCID: PMC8361741 DOI: 10.1002/ajb2.1696] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 05/12/2021] [Indexed: 05/08/2023]
Abstract
PREMISE Cornales is an order of flowering plants containing ecologically and horticulturally important families, including Cornaceae (dogwoods) and Hydrangeaceae (hydrangeas), among others. While many relationships in Cornales are strongly supported by previous studies, some uncertainty remains with regards to the placement of Hydrostachyaceae and to relationships among families in Cornales and within Cornaceae. Here we analyzed hundreds of nuclear loci to test published phylogenetic hypotheses and estimated a robust species tree for Cornales. METHODS Using the Angiosperms353 probe set and existing data sets, we generated phylogenomic data for 158 samples, representing all families in the Cornales, with intensive sampling in the Cornaceae. RESULTS We curated an average of 312 genes per sample, constructed maximum likelihood gene trees, and inferred a species tree using the summary approach implemented in ASTRAL-III, a method statistically consistent with the multispecies coalescent model. CONCLUSIONS The species tree we constructed generally shows high support values and a high degree of concordance among individual nuclear gene trees. Relationships among families are largely congruent with previous molecular studies, except for the placement of the nyssoids and the Grubbiaceae-Curtisiaceae clades. Furthermore, we were able to place Hydrostachyaceae within Cornales, and within Cornaceae, the monophyly of known morphogroups was well supported. However, patterns of gene tree discordance suggest potential ancient reticulation, gene flow, and/or ILS in the Hydrostachyaceae lineage and the early diversification of Cornus. Our findings reveal new insights into the diversification process across Cornales and demonstrate the utility of the Angiosperms353 probe set.
Collapse
Affiliation(s)
- Shawn K. Thomas
- Department of Plant BiologyUniversity of GeorgiaAthensGA30602USA
- Division of Biological SciencesUniversity of MissouriColumbiaMO65203USA
| | - Xiang Liu
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
- SyngentaResearch Triangle ParkNC27709USA
| | - Zhi‐Yuan Du
- Wuhan Botanical GardenThe Chinese Academy of SciencesWuhanHubei430074China
| | - Yibo Dong
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
- Global Health Infectious Disease ResearchCollege of Public HealthUniversity of South FloridaTampaFL33612USA
| | - Amanda Cummings
- Department of Plant BiologyUniversity of GeorgiaAthensGA30602USA
| | - Lisa Pokorny
- Royal Botanic Gardens, KewRichmondLondonTW9 3AEUK
- Computational/Systems Biology and Genomics ProgramCentre for Plant Biotechnology and GenomicsUPM‐INIA‐CSICPozuelo de Alarcón (Madrid)28223Spain
| | - Qui‐Yun (Jenny) Xiang
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
| | | |
Collapse
|
16
|
Jayawardena RS, Hyde KD, Chen YJ, Papp V, Palla B, Papp D, Bhunjun CS, Hurdeal VG, Senwanna C, Manawasinghe IS, Harischandra DL, Gautam AK, Avasthi S, Chuankid B, Goonasekara ID, Hongsanan S, Zeng X, Liyanage KK, Liu N, Karunarathna A, Hapuarachchi KK, Luangharn T, Raspé O, Brahmanage R, Doilom M, Lee HB, Mei L, Jeewon R, Huanraluek N, Chaiwan N, Stadler M, Wang Y. One stop shop IV: taxonomic update with molecular phylogeny for important phytopathogenic genera: 76–100 (2020). FUNGAL DIVERS 2020. [DOI: 10.1007/s13225-020-00460-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
AbstractThis is a continuation of a series focused on providing a stable platform for the taxonomy of phytopathogenic fungi and fungus-like organisms. This paper focuses on one family: Erysiphaceae and 24 phytopathogenic genera: Armillaria, Barriopsis, Cercospora, Cladosporium, Clinoconidium, Colletotrichum, Cylindrocladiella, Dothidotthia,, Fomitopsis, Ganoderma, Golovinomyces, Heterobasidium, Meliola, Mucor, Neoerysiphe, Nothophoma, Phellinus, Phytophthora, Pseudoseptoria, Pythium, Rhizopus, Stemphylium, Thyrostroma and Wojnowiciella. Each genus is provided with a taxonomic background, distribution, hosts, disease symptoms, and updated backbone trees. Species confirmed with pathogenicity studies are denoted when data are available. Six of the genera are updated from previous entries as many new species have been described.
Collapse
|
17
|
Perea S, Sousa‐Santos C, Robalo J, Doadrio I. Multilocus phylogeny and systematics of Iberian endemicSqualius(Actinopterygii, Leuciscidae). ZOOL SCR 2020. [DOI: 10.1111/zsc.12420] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Silvia Perea
- Department of Biodiversity and Evolutionary Biology Museo Nacional de Ciencias Naturales - CSIC Madrid Spain
| | - Carla Sousa‐Santos
- MARE – Marine and Environmental Sciences Centre ISPA‐Instituto Universitário Lisbon Portugal
| | - Joana Robalo
- MARE – Marine and Environmental Sciences Centre ISPA‐Instituto Universitário Lisbon Portugal
| | - Ignacio Doadrio
- Department of Biodiversity and Evolutionary Biology Museo Nacional de Ciencias Naturales - CSIC Madrid Spain
| |
Collapse
|
18
|
Cardoso LW, Silva-Brandão KL, Duarte M. Adhemarius eurysthenes (Felder, 1874) (Lepidoptera, Sphingidae) in the Atlantic Rain Forest: A phylogeographic perspective. ZOOL ANZ 2018. [DOI: 10.1016/j.jcz.2018.10.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
19
|
Rabiee M, Sayyari E, Mirarab S. Multi-allele species reconstruction using ASTRAL. Mol Phylogenet Evol 2018; 130:286-296. [PMID: 30393186 DOI: 10.1016/j.ympev.2018.10.033] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Revised: 10/23/2018] [Accepted: 10/24/2018] [Indexed: 11/29/2022]
Abstract
Genome-wide phylogeny reconstruction is becoming increasingly common, and one driving factor behind these phylogenomic studies is the promise that the potential discordance between gene trees and the species tree can be modeled. Incomplete lineage sorting is one cause of discordance that bridges population genetic and phylogenetic processes. ASTRAL is a species tree reconstruction method that seeks to find the tree with minimum quartet distance to an input set of inferred gene trees. However, the published ASTRAL algorithm only works with one sample per species. To account for polymorphisms in present-day species, one can sample multiple individuals per species to create multi-allele datasets. Here, we introduce how ASTRAL can handle multi-allele datasets. We show that the quartet-based optimization problem extends naturally, and we introduce heuristic methods for building the search space specifically for the case of multi-individual datasets. We study the accuracy and scalability of the multi-individual version of ASTRAL-III using extensive simulation studies and compare it to NJst, the only other scalable method that can handle these datasets. We do not find strong evidence that using multiple individuals dramatically improves accuracy. When we study the trade-off between sampling more genes versus more individuals, we find that sampling more genes is more effective than sampling more individuals, even under conditions that we study where trees are shallow (median length: ≈1Ne) and ILS is extremely high.
Collapse
Affiliation(s)
- Maryam Rabiee
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Erfan Sayyari
- Department of Electrical and Computer Engineering, University of California, San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California, San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States.
| |
Collapse
|
20
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|
21
|
Adams RH, Schield DR, Card DC, Castoe TA. Assessing the Impacts of Positive Selection on Coalescent-Based Species Tree Estimation and Species Delimitation. Syst Biol 2018; 67:1076-1090. [DOI: 10.1093/sysbio/syy034] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Accepted: 05/05/2018] [Indexed: 11/13/2022] Open
Affiliation(s)
- Richard H Adams
- Department of Biology, University of Texas at Arlington, 501 S. Nedderman Dr., Arlington, TX 76019, USA
| | - Drew R Schield
- Department of Biology, University of Texas at Arlington, 501 S. Nedderman Dr., Arlington, TX 76019, USA
| | - Daren C Card
- Department of Biology, University of Texas at Arlington, 501 S. Nedderman Dr., Arlington, TX 76019, USA
| | - Todd A Castoe
- Department of Biology, University of Texas at Arlington, 501 S. Nedderman Dr., Arlington, TX 76019, USA
| |
Collapse
|
22
|
Knowles LL, Huang H, Sukumaran J, Smith SA. A matter of phylogenetic scale: Distinguishing incomplete lineage sorting from lateral gene transfer as the cause of gene tree discord in recent versus deep diversification histories. AMERICAN JOURNAL OF BOTANY 2018; 105:376-384. [PMID: 29710372 DOI: 10.1002/ajb2.1064] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 01/30/2018] [Indexed: 06/08/2023]
Abstract
PREMISE OF THE STUDY Discordant gene trees are commonly encountered when sequences from thousands of loci are applied to estimate phylogenetic relationships. Several processes contribute to this discord. Yet, we have no methods that jointly model different sources of conflict when estimating phylogenies. An alternative to analyzing entire genomes or all the sequenced loci is to identify a subset of loci for phylogenetic analysis. If we can identify data partitions that are most likely to reflect descent from a common ancestor (i.e., discordant loci that indeed reflect incomplete lineage sorting [ILS], as opposed to some other process, such as lateral gene transfer [LGT]), we can analyze this subset using powerful coalescent-based species-tree approaches. METHODS Test data sets were simulated where discord among loci could arise from ILS and LGT. Data sets where analyzed using the newly developed program CLASSIPHY (Huang et al., ) to assess whether our ability to distinguish the cause of discord among loci varied when ILS and LGT occurred in the recent versus deep past and whether the accuracy of these inferences were affected by the mutational process. KEY RESULTS We show that accuracy of probabilistic classification of individual loci by the cause of discord differed when ILS and LGT events occurred more recently compared with the distant past and that the signal-to-noise ratio arising from the mutational process contributes to difficulties in inferring LGT data partitions. CONCLUSIONS We discuss our findings in terms of the promise and limitations of identifying subsets of loci for species-tree inference that will not violate the underlying coalescent model (i.e., data partitions in which ILS, and not LGT, contributes to discord). We also discuss the empirical implications of our work given the many recalcitrant nodes in the tree of life (e.g., origins of angiosperms, amniotes, or Neoaves), and recent arguments for concatenating loci.
Collapse
Affiliation(s)
- L Lacey Knowles
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, 1109 Geddes Avenue, Ann Arbor, MI, 48109-1079, USA
| | - Huateng Huang
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, 1109 Geddes Avenue, Ann Arbor, MI, 48109-1079, USA
| | - Jeet Sukumaran
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, 1109 Geddes Avenue, Ann Arbor, MI, 48109-1079, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, 1109 Geddes Avenue, Ann Arbor, MI, 48109-1079, USA
| |
Collapse
|
23
|
Blom MPK, Bragg JG, Potter S, Moritz C. Accounting for Uncertainty in Gene Tree Estimation: Summary-Coalescent Species Tree Inference in a Challenging Radiation of Australian Lizards. Syst Biol 2018; 66:352-366. [PMID: 28039387 DOI: 10.1093/sysbio/syw089] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 09/27/2016] [Indexed: 11/12/2022] Open
Abstract
Accurate gene tree inference is an important aspect of species tree estimation in a summary-coalescent framework. Yet, in empirical studies, inferred gene trees differ in accuracy due to stochastic variation in phylogenetic signal between targeted loci. Empiricists should, therefore, examine the consistency of species tree inference, while accounting for the observed heterogeneity in gene tree resolution of phylogenomic data sets. Here, we assess the impact of gene tree estimation error on summary-coalescent species tree inference by screening ${\sim}2000$ exonic loci based on gene tree resolution prior to phylogenetic inference. We focus on a phylogenetically challenging radiation of Australian lizards (genus Cryptoblepharus, Scincidae) and explore effects on topology and support. We identify a well-supported topology based on all loci and find that a relatively small number of high-resolution gene trees can be sufficient to converge on the same topology. Adding gene trees with decreasing resolution produced a generally consistent topology, and increased confidence for specific bipartitions that were poorly supported when using a small number of informative loci. This corroborates coalescent-based simulation studies that have highlighted the need for a large number of loci to confidently resolve challenging relationships and refutes the notion that low-resolution gene trees introduce phylogenetic noise. Further, our study also highlights the value of quantifying changes in nodal support across locus subsets of increasing size (but decreasing gene tree resolution). Such detailed analyses can reveal anomalous fluctuations in support at some nodes, suggesting the possibility of model violation. By characterizing the heterogeneity in phylogenetic signal among loci, we can account for uncertainty in gene tree estimation and assess its effect on the consistency of the species tree estimate. We suggest that the evaluation of gene tree resolution should be incorporated in the analysis of empirical phylogenomic data sets. This will ultimately increase our confidence in species tree estimation using summary-coalescent methods and enable us to exploit genomic data for phylogenetic inference. [Coalescence; concatenation; Cryptoblepharus; exon capture; gene tree; phylogenomics; species tree.].
Collapse
Affiliation(s)
- Mozes P K Blom
- Research School of Biology, Australian National University, Canberra ACT 0200, Australia
| | - Jason G Bragg
- Research School of Biology, Australian National University, Canberra ACT 0200, Australia
| | - Sally Potter
- Research School of Biology, Australian National University, Canberra ACT 0200, Australia
| | - Craig Moritz
- Research School of Biology, Australian National University, Canberra ACT 0200, Australia
| |
Collapse
|
24
|
Molloy EK, Warnow T. To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods. Syst Biol 2017; 67:285-303. [DOI: 10.1093/sysbio/syx077] [Citation(s) in RCA: 138] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Accepted: 09/13/2017] [Indexed: 01/27/2023] Open
Affiliation(s)
- Erin K Molloy
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
25
|
Distribution of coalescent histories under the coalescent model with gene flow. Mol Phylogenet Evol 2016; 105:177-192. [DOI: 10.1016/j.ympev.2016.08.024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2016] [Revised: 08/16/2016] [Accepted: 08/31/2016] [Indexed: 12/19/2022]
|
26
|
Linkem CW, Minin VN, Leaché AD. Detecting the Anomaly Zone in Species Trees and Evidence for a Misleading Signal in Higher-Level Skink Phylogeny (Squamata: Scincidae). Syst Biol 2016; 65:465-77. [PMID: 26738927 PMCID: PMC6383586 DOI: 10.1093/sysbio/syw001] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2014] [Accepted: 12/29/2015] [Indexed: 01/28/2023] Open
Abstract
The anomaly zone, defined by the presence of gene tree topologies that are more probable than the true species tree, presents a major challenge to the accurate resolution of many parts of the Tree of Life. This discrepancy can result from consecutive rapid speciation events in the species tree. Similar to the problem of long-branch attraction, including more data via loci concatenation will only reinforce the support for the incorrect species tree. Empirical phylogenetic studies often employ coalescent-based species tree methods to avoid the anomaly zone, but to this point these studies have not had a method for providing any direct evidence that the species tree is actually in the anomaly zone. In this study, we use 16 species of lizards in the family Scincidae to investigate whether nodes that are difficult to resolve place the species tree within the anomaly zone. We analyze new phylogenomic data (429 loci), using both concatenation and coalescent-based species tree estimation, to locate conflicting topological signal. We then use the unifying principle of the anomaly zone, together with estimates of ancestral population sizes and species persistence times, to determine whether the observed phylogenetic conflict is a result of the anomaly zone. We identify at least three regions of the Scincidae phylogeny that provide demographic signatures consistent with the anomaly zone, and this new information helps reconcile the phylogenetic conflict in previously published studies on these lizards. The anomaly zone presents a real problem in phylogenetics, and our new framework for identifying anomalous relationships will help empiricists leverage their resources appropriately for investigating and overcoming this challenge.
Collapse
Affiliation(s)
| | - Vladimir N Minin
- Department of Biology, University of Washington, Seattle WA; Department of Statistics, University of Washington, Seattle WA
| | - Adam D Leaché
- Department of Biology, University of Washington, Seattle WA; Burke Museum of Natural History and Culture, University of Washington, Seattle, WA, 98195, USA
| |
Collapse
|
27
|
Edwards T, Tollis M, Hsieh P, Gutenkunst RN, Liu Z, Kusumi K, Culver M, Murphy RW. Assessing models of speciation under different biogeographic scenarios; an empirical study using multi-locus and RNA-seq analyses. Ecol Evol 2016; 6:379-96. [PMID: 26843925 PMCID: PMC4729248 DOI: 10.1002/ece3.1865] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2015] [Revised: 11/01/2015] [Accepted: 11/02/2015] [Indexed: 12/30/2022] Open
Abstract
Evolutionary biology often seeks to decipher the drivers of speciation, and much debate persists over the relative importance of isolation and gene flow in the formation of new species. Genetic studies of closely related species can assess if gene flow was present during speciation, because signatures of past introgression often persist in the genome. We test hypotheses on which mechanisms of speciation drove diversity among three distinct lineages of desert tortoise in the genus Gopherus. These lineages offer a powerful system to study speciation, because different biogeographic patterns (physical vs. ecological segregation) are observed at opposing ends of their distributions. We use 82 samples collected from 38 sites, representing the entire species' distribution and generate sequence data for mtDNA and four nuclear loci. A multilocus phylogenetic analysis in *BEAST estimates the species tree. RNA‐seq data yield 20,126 synonymous variants from 7665 contigs from two individuals of each of the three lineages. Analyses of these data using the demographic inference package ∂a∂i serve to test the null hypothesis of no gene flow during divergence. The best‐fit demographic model for the three taxa is concordant with the *BEAST species tree, and the ∂a∂i analysis does not indicate gene flow among any of the three lineages during their divergence. These analyses suggest that divergence among the lineages occurred in the absence of gene flow and in this scenario the genetic signature of ecological isolation (parapatric model) cannot be differentiated from geographic isolation (allopatric model).
Collapse
Affiliation(s)
- Taylor Edwards
- School of Natural Resources and the Environment The University of Arizona Tucson Arizona 85721; University of Arizona Genetics Core University of Arizona Tucson Arizona 85721
| | - Marc Tollis
- School of Life Sciences Arizona State University Tempe Arizona 85287
| | - PingHsun Hsieh
- Department of Ecology and Evolutionary Biology The University of Arizona Tucson Arizona 85721
| | - Ryan N Gutenkunst
- Department of Ecology and Evolutionary Biology The University of Arizona Tucson Arizona 85721; Department of Molecular and Cellular Biology The University of Arizona Tucson Arizona 85721
| | - Zhen Liu
- State Key Laboratory of Genetic Resources and Evolution Kunming Institute of Zoology Chinese Academy of Sciences Kunming 650223 China
| | - Kenro Kusumi
- School of Life Sciences Arizona State University Tempe Arizona 85287
| | - Melanie Culver
- School of Natural Resources and the Environment The University of Arizona Tucson Arizona 85721; Arizona Cooperative Fish & Wildlife Research Unit USGS University of Arizona Tucson Arizona 85721
| | - Robert W Murphy
- State Key Laboratory of Genetic Resources and Evolution Kunming Institute of Zoology Chinese Academy of Sciences Kunming 650223 China; Centre for Biodiversity and Conservation Biology Royal Ontario Museum Toronto ON Canada
| |
Collapse
|
28
|
Implementing and testing the multispecies coalescent model: A valuable paradigm for phylogenomics. Mol Phylogenet Evol 2016; 94:447-62. [DOI: 10.1016/j.ympev.2015.10.027] [Citation(s) in RCA: 265] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
29
|
Impact of gene family evolutionary histories on phylogenetic species tree inference by gene tree parsimony. Mol Phylogenet Evol 2015; 96:9-16. [PMID: 26702957 DOI: 10.1016/j.ympev.2015.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 10/11/2015] [Accepted: 12/03/2015] [Indexed: 11/21/2022]
Abstract
Complicated history of gene duplication and loss brings challenge to molecular phylogenetic inference, especially in deep phylogenies. However, phylogenomic approaches, such as gene tree parsimony (GTP), show advantage over some other approaches in its ability to use gene families with duplications. GTP searches the 'optimal' species tree by minimizing the total cost of biological events such as duplications, but accuracy of GTP and phylogenetic signal in the context of different gene families with distinct histories of duplication and loss are unclear. To evaluate how different evolutionary properties of different gene families can impact on species tree inference, 3900 gene families from seven angiosperms encompassing a wide range of gene content, lineage-specific expansions and contractions were analyzed. It was found that the gene content and total duplication number in a gene family strongly influence species tree inference accuracy, with the highest accuracy achieved at either very low or very high gene content (or duplication number) and lowest accuracy centered in intermediate gene content (or duplication number), as the relationship can fit a binomial regression. Besides, for gene families of similar level of average gene content, those with relatively higher lineage-specific expansion or duplication rates tend to show lower accuracy. Additional correlation tests support that high accuracy for those gene families with large gene content may rely on abundant ancestral copies to provide many subtrees to resolve conflicts, whereas high accuracy for single or low copy gene families are just subject to sequence substitution per se. Very low accuracy reached by gene families of intermediate gene content or duplication number can be due to insufficient subtrees to resolve the conflicts from loss of alternative copies. As these evolutionary properties can significantly influence species tree accuracy, I discussed the potential weighting of the duplication cost by evolutionary properties of gene families in future GTP analyses.
Collapse
|
30
|
Huang JP, Knowles LL. The Species versus Subspecies Conundrum: Quantitative Delimitation from Integrating Multiple Data Types within a Single Bayesian Approach in Hercules Beetles. Syst Biol 2015; 65:685-99. [PMID: 26681696 DOI: 10.1093/sysbio/syv119] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 12/10/2015] [Indexed: 11/13/2022] Open
Abstract
With the recent attention and focus on quantitative methods for species delimitation, an overlooked but equally important issue regards what has actually been delimited. This study investigates the apparent arbitrariness of some taxonomic distinctions, and in particular how species and subspecies are assigned. Specifically, we use a recently developed Bayesian model-based approach to show that in the Hercules beetles (genus Dynastes) there is no statistical difference in the probability that putative taxa represent different species, irrespective of whether they were given species or subspecies designations. By considering multiple data types, as opposed to relying exclusively on genetic data alone, we also show that both previously recognized species and subspecies represent a variety of points along the speciation spectrum (i.e., previously recognized species are not systematically further along the continuum than subspecies). For example, based on evolutionary models of divergence, some taxa are statistically distinguishable on more than one axis of differentiation (e.g., along both phenotypic and genetic dimensions), whereas other taxa can only be delimited statistically from a single data type. Because both phenotypic and genetic data are analyzed in a common Bayesian framework, our study provides a framework for investigating whether disagreements in species boundaries among data types reflect (i) actual discordance with the actual history of lineage splitting, or instead (ii) differences among data types in the amount of time required for differentiation to become apparent among the delimited taxa. We discuss what the answers to these questions imply about what characters are used to delimit species, as well as the diverse processes involved in the origin and maintenance of species boundaries. With this in mind, we then reflect more generally on how quantitative methods for species delimitation are used to assign taxonomic status.
Collapse
Affiliation(s)
- Jen-Pan Huang
- Department of Ecology and Evolutionary Biology, 1109 Geddes Avenue, Museum of Zoology, University of Michigan, Ann Arbor, MI 48109-1079, USA
| | - L Lacey Knowles
- Department of Ecology and Evolutionary Biology, 1109 Geddes Avenue, Museum of Zoology, University of Michigan, Ann Arbor, MI 48109-1079, USA
| |
Collapse
|
31
|
De Maio N, Schrempf D, Kosiol C. PoMo: An Allele Frequency-Based Approach for Species Tree Estimation. Syst Biol 2015; 64:1018-31. [PMID: 26209413 PMCID: PMC4604832 DOI: 10.1093/sysbio/syv048] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 06/11/2015] [Indexed: 11/24/2022] Open
Abstract
Incomplete lineage sorting can cause incongruencies of the overall species-level phylogenetic tree with the phylogenetic trees for individual genes or genomic segments. If these incongruencies are not accounted for, it is possible to incur several biases in species tree estimation. Here, we present a simple maximum likelihood approach that accounts for ancestral variation and incomplete lineage sorting. We use a POlymorphisms-aware phylogenetic MOdel (PoMo) that we have recently shown to efficiently estimate mutation rates and fixation biases from within and between-species variation data. We extend this model to perform efficient estimation of species trees. We test the performance of PoMo in several different scenarios of incomplete lineage sorting using simulations and compare it with existing methods both in accuracy and computational speed. In contrast to other approaches, our model does not use coalescent theory but is allele frequency based. We show that PoMo is well suited for genome-wide species tree estimation and that on such data it is more accurate than previous approaches.
Collapse
Affiliation(s)
- Nicola De Maio
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria; Vienna Graduate School of Population Genetics, Wien, Austria; and Nuffield Department of Clinical Medicine, University of Oxford, Oxford OX3 7BN, UK
| | - Dominik Schrempf
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria; Vienna Graduate School of Population Genetics, Wien, Austria; and
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria;
| |
Collapse
|
32
|
Crowl AA, Visger CJ, Mansion G, Hand R, Wu HH, Kamari G, Phitos D, Cellinese N. Evolution and biogeography of the endemic Roucela complex (Campanulaceae: Campanula) in the Eastern Mediterranean. Ecol Evol 2015; 5:5329-5343. [PMID: 30151135 PMCID: PMC6102515 DOI: 10.1002/ece3.1791] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Revised: 09/23/2015] [Accepted: 09/26/2015] [Indexed: 11/10/2022] Open
Abstract
At the intersection of geological activity, climatic fluctuations, and human pressure, the Mediterranean Basin – a hotspot of biodiversity – provides an ideal setting for studying endemism, evolution, and biogeography. Here, we focus on the Roucela complex (Campanula subgenus Roucela), a group of 13 bellflower species found primarily in the eastern Mediterranean Basin. Plastid and low‐copy nuclear markers were employed to reconstruct evolutionary relationships and estimate divergence times within the Roucela complex using both concatenation and species tree analyses. Niche modeling, ancestral range estimation, and diversification analyses were conducted to provide further insights into patterns of endemism and diversification through time. Diversification of the Roucela clade appears to have been primarily the result of vicariance driven by the breakup of an ancient landmass. We found geologic events such as the formation of the mid‐Aegean trench and the Messinian Salinity Crisis to be historically important in the evolutionary history of this group. Contrary to numerous past studies, the onset of the Mediterranean climate has not promoted diversification in the Roucela complex and, in fact, may be negatively affecting these species. This study highlights the diversity and complexity of historical processes driving plant evolution in the Mediterranean Basin.
Collapse
Affiliation(s)
- Andrew A Crowl
- Florida Museum of Natural History University of Florida Gainesville Florida.,Department of Biology University of Florida Gainesville Florida
| | - Clayton J Visger
- Florida Museum of Natural History University of Florida Gainesville Florida.,Department of Biology University of Florida Gainesville Florida
| | - Guilhem Mansion
- Botanischer Garten und Botanisches Museum Berlin-Dahlem Freie Universität Berlin Berlin Germany
| | - Ralf Hand
- Botanischer Garten und Botanisches Museum Berlin-Dahlem Freie Universität Berlin Berlin Germany
| | - Hsin-Hui Wu
- Florida Museum of Natural History University of Florida Gainesville Florida
| | - Georgia Kamari
- Department of Biology University of Patras Patras Greece
| | | | - Nico Cellinese
- Florida Museum of Natural History University of Florida Gainesville Florida
| |
Collapse
|
33
|
Ruane S, Raxworthy CJ, Lemmon AR, Lemmon EM, Burbrink FT. Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes. BMC Evol Biol 2015; 15:221. [PMID: 26459325 PMCID: PMC4603904 DOI: 10.1186/s12862-015-0503-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 10/01/2015] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86 % of Madagascar's serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci. METHODS Using Sanger sequencing and an anchored phylogenomics approach, we sequenced datasets comprised of 5 and 377 loci, respectively, for 23 pseudoxyrhophiine taxa. For each dataset, we estimated phylogenies using both gene-tree (concatenation) and species-tree (STAR, MP-EST) approaches. We determined the similarity of resulting tree topologies from the different datasets using Robinson-Foulds distances. In addition, we examined how subsets of these data performed compared to the complete Sanger and anchored datasets for phylogenetic accuracy using the same tree inference methodologies, as well as the program *BEAST to determine if a full coalescent model for species tree estimation could generate robust results with fewer loci compared to the summary statistics species tree approaches. We also examined the individual gene trees in comparison to the 377-locus species tree using the program MetaTree. RESULTS Using the full anchored dataset under a variety of methods gave us the same, well-supported phylogeny for pseudoxyrhophiines. The African pseudoxyrhophiine Duberria is the sister taxon to the Malagasy pseudoxyrhophiines genera, providing evidence for a monophyletic radiation in Madagascar. In addition, within Madagascar, the two major clades inferred correspond largely to the aglyphous and opisthoglyphous genera, suggesting that feeding specializations associated with tooth venom delivery may have played a major role in the early diversification of this radiation. The comparison of tree topologies from the concatenated and species-tree methods using different datasets indicated the 5-locus dataset cannot beused to infer a correct phylogeny for the pseudoxyrhophiines under any method tested here and that summary statistics methods require 50 or more loci to consistently recover the species-tree inferred using the complete anchored dataset. However, as few as 15 loci may infer the correct topology when using the full coalescent species tree method *BEAST. MetaTree analyses of each gene tree from the Sanger and anchored datasets found that none of the individual gene trees matched the 377-locus species tree, and that no gene trees were identical with respect to topology. CONCLUSIONS Our results suggest that ≥50 loci may be necessary to confidently infer phylogenies when using summaryspecies-tree methods, but that the coalescent-based method *BEAST consistently recovers the same topology using only 15 loci. These results reinforce that datasets with small numbers of markers may result in misleading topologies, and further, that the method of inference used to generate a phylogeny also has a major influence on the number of loci necessary to infer robust species trees.
Collapse
Affiliation(s)
- Sara Ruane
- Department of Herpetology, American Museum of Natural History, Central Park West at 79th Street, New York, NY, 10024, USA.
| | - Christopher J Raxworthy
- Department of Herpetology, American Museum of Natural History, Central Park West at 79th Street, New York, NY, 10024, USA.
| | - Alan R Lemmon
- Department of Biology, Florida State University, 319 Stadium Drive, P.O. Box 3064295, Tallahassee, FL, 32306-4295, USA.
| | - Emily Moriarty Lemmon
- Department of Biology, Florida State University, 319 Stadium Drive, P.O. Box 3064295, Tallahassee, FL, 32306-4295, USA.
| | - Frank T Burbrink
- Department of Herpetology, American Museum of Natural History, Central Park West at 79th Street, New York, NY, 10024, USA.
- Biology Department, College of Staten Island/CUNY, 2800 Victory Boulevard, Staten Island, NY, 10314, USA.
| |
Collapse
|
34
|
Zhong B, Sun L, Penny D. The Origin of Land Plants: A Phylogenomic Perspective. Evol Bioinform Online 2015; 11:137-41. [PMID: 26244002 PMCID: PMC4498653 DOI: 10.4137/ebo.s29089] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Revised: 06/04/2015] [Accepted: 06/08/2015] [Indexed: 11/16/2022] Open
Abstract
Land plants are a natural group, and Charophyte algae are the closest lineages of land plants and have six morphologically diverged groups. The conjugating green algae (Zygnematales) are now suggested to be the extant sister group to land plants, providing the novel understanding for character evolution and early multicellular innovations in land plants. We review recent molecular phylogenetic work on the origin of land plants and discuss some future directions in phylogenomic analyses.
Collapse
Affiliation(s)
- Bojian Zhong
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Linhua Sun
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - David Penny
- Institute of Fundamental Sciences, Massey University, Palmerston North, New Zealand
| |
Collapse
|
35
|
Pyron RA. Post-molecular systematics and the future of phylogenetics. Trends Ecol Evol 2015; 30:384-9. [DOI: 10.1016/j.tree.2015.04.016] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 04/29/2015] [Accepted: 04/30/2015] [Indexed: 12/21/2022]
|
36
|
Giarla TC, Esselstyn JA. The Challenges of Resolving a Rapid, Recent Radiation: Empirical and Simulated Phylogenomics of Philippine Shrews. Syst Biol 2015; 64:727-40. [DOI: 10.1093/sysbio/syv029] [Citation(s) in RCA: 113] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 05/07/2015] [Indexed: 01/30/2023] Open
|
37
|
Liu L, Xi Z, Wu S, Davis CC, Edwards SV. Estimating phylogenetic trees from genome-scale data. Ann N Y Acad Sci 2015; 1360:36-53. [DOI: 10.1111/nyas.12747] [Citation(s) in RCA: 129] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Liang Liu
- Department of Statistics; University of Georgia; Athens Georgia
- Institute of Bioinformatics; University of Georgia; Athens Georgia
| | - Zhenxiang Xi
- Department of Organismic and Evolutionary Biology; Harvard University; Cambridge Massachusetts
| | - Shaoyuan Wu
- Department of Biochemistry and Molecular Biology & Tianjin Key Laboratory of Medical Epigenetics, School of Basic Medical Sciences; Tianjin Medical University; Tianjin China
| | - Charles C. Davis
- Department of Organismic and Evolutionary Biology; Harvard University; Cambridge Massachusetts
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology; Harvard University; Cambridge Massachusetts
| |
Collapse
|
38
|
Lanier HC, Knowles LL. Applying species-tree analyses to deep phylogenetic histories: Challenges and potential suggested from a survey of empirical phylogenetic studies. Mol Phylogenet Evol 2015; 83:191-9. [DOI: 10.1016/j.ympev.2014.10.022] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Revised: 08/30/2014] [Accepted: 10/29/2014] [Indexed: 10/24/2022]
|
39
|
Rittmeyer EN, Austin CC. Combined next-generation sequencing and morphology reveal fine-scale speciation in Crocodile Skinks (Squamata: Scincidae: Tribolonotus). Mol Ecol 2015; 24:466-83. [PMID: 25470077 DOI: 10.1111/mec.13030] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2013] [Revised: 11/17/2014] [Accepted: 11/28/2014] [Indexed: 02/03/2023]
Abstract
Next-generation sequencing has vast potential to revolutionize the fields of phylogenetics and population genetics through its ability to collect genomic scale data sets of thousands of orthologous loci. Despite this potential, other types of data (e.g. morphology, ecology) remain important, particularly for studies endeavouring to delimit species. Here, we integrate next-generation sequencing with morphology to examine divergence between populations of Tribolonotus pseudoponceleti on the islands of Buka and Bougainville in the Solomon Archipelago. We used the Ion Torrent PGM to collect over 648 Mbp of sequence data for 12 samples, representing 1526 loci recovered from all samples, and 3342 were recovered from at least six samples. Genetic structure analyses strongly support the distinctiveness of these two populations, and Bayes factor delimitations decisively select speciation between Buka and Bougainville. Principal components and discriminant function analyses reveal concordant morphological divergence. Finally, demographic analyses via diffusion approximation and approximate Bayesian computation prefer a complex model of mid-Pleistocene divergence with migration, and a later decrease or cessation of migration and population size shift, suggesting a scenario in which migration was enabled by Pleistocene merging of these two islands, and limited when isolated by higher sea levels. Further analysis of four Sanger sequenced loci in IMa2 had limited power to distinguish among models including and excluding migration, but resulted in similar population size and divergence time estimates, although with much broader confidence intervals. This study represents a framework for how next-generation sequencing and morphological data can be combined and leveraged towards validating putative species and testing demographic scenarios for speciation.
Collapse
Affiliation(s)
- Eric N Rittmeyer
- Department of Biological Sciences, Museum of Natural Science, Louisiana State University, 119 Foster Hall, Baton Rouge, LA, 70803, USA
| | | |
Collapse
|
40
|
New species of Ophiocordyceps unilateralis, an ubiquitous pathogen of ants from Thailand. Fungal Biol 2015; 119:44-52. [DOI: 10.1016/j.funbio.2014.10.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Revised: 10/02/2014] [Accepted: 10/29/2014] [Indexed: 11/30/2022]
|
41
|
Jockusch EL, Martínez-Solano I, Timpe EK. The Effects of Inference Method, Population Sampling, and Gene Sampling on Species Tree Inferences: An Empirical Study in Slender Salamanders (Plethodontidae: Batrachoseps). Syst Biol 2014; 64:66-83. [DOI: 10.1093/sysbio/syu078] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Elizabeth L. Jockusch
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, U-3043, Storrs, CT 06269-3043, USA; and 2CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Campus Agrário de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Iñigo Martínez-Solano
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, U-3043, Storrs, CT 06269-3043, USA; and 2CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Campus Agrário de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, U-3043, Storrs, CT 06269-3043, USA; and 2CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Campus Agrário de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Elizabeth K. Timpe
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, U-3043, Storrs, CT 06269-3043, USA; and 2CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Campus Agrário de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| |
Collapse
|
42
|
Brunes TO, Alexandrino J, Baêta D, Zina J, Haddad CF, Sequeira F. Species limits, phylogeographic and hybridization patterns in Neotropical leaf frogs (Phyllomedusinae). ZOOL SCR 2014. [DOI: 10.1111/zsc.12079] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Tuliana O. Brunes
- Departamento de Biologia; Faculdade de Ciências da Universidade do Porto; 4169-007 Porto Portugal
- CIBIO/InBIO; Centro de Investigação em Biodiversidade e Recursos Genéticos; Laboratório Associado; Universidade do Porto; Campus Agrário de Vairão 4485-661 Vairão Portugal
| | - João Alexandrino
- Departamento de Ciências Biológicas; Universidade Federal de São Paulo; 09972-270 Diadema Brasil
| | - Délio Baêta
- Departamento de Zoologia; Instituto de Biociências; Universidade Estadual Paulista; 13506-900 Rio Claro São Paulo Brasil
- Museu Nacional; Departamento de Vertebrados; Setor de Herpetologia; Universidade Federal do Rio de Janeiro; 20940-040 Rio de Janeiro Brasil
| | - Juliana Zina
- Departamento de Ciências Biológicas; Universidade Estadual do Sudoeste da Bahia; 45206-190 Jequié Bahia Brasil
| | - Célio F.B. Haddad
- Departamento de Zoologia; Instituto de Biociências; Universidade Estadual Paulista; 13506-900 Rio Claro São Paulo Brasil
| | - Fernando Sequeira
- CIBIO/InBIO; Centro de Investigação em Biodiversidade e Recursos Genéticos; Laboratório Associado; Universidade do Porto; Campus Agrário de Vairão 4485-661 Vairão Portugal
| |
Collapse
|
43
|
Arias CF, Salazar C, Rosales C, Kronforst MR, Linares M, Bermingham E, McMillan WO. Phylogeography of Heliconius cydno and its closest relatives: disentangling their origin and diversification. Mol Ecol 2014; 23:4137-52. [PMID: 24962067 DOI: 10.1111/mec.12844] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Revised: 06/14/2014] [Accepted: 06/16/2014] [Indexed: 11/28/2022]
Abstract
The origins of the extraordinary diversity within the Neotropics have long fascinated biologists and naturalists. Yet, the underlying factors that have given rise to this diversity remain controversial. To test the relative importance of Quaternary climatic change and Neogene tectonic and paleogeographic reorganizations in the generation of biodiversity, we examine intraspecific variation across the Heliconius cydno radiation and compare this variation to that within the closely related Heliconius melpomene and Heliconius timareta radiations. Our data, which consist of both mtDNA and genome-scan data from nearly 2250 amplified fragment length polymorphism (AFLP) loci, reveal a complex history of differentiation and admixture at different geographic scales. Both mtDNA and AFLP phylogenies suggest that H. timareta and H. cydno are probably geographic extremes of the same radiation that probably diverged from H. melpomene prior to the Pliocene-Pleistocene boundary, consistent with hypotheses of diversification that rely on geological events in the Pliocene. The mtDNA suggests that this radiation originated in Central America or the northwestern region of South America, with a subsequent colonization of the eastern and western slopes of the Andes. Our genome-scan data indicate significant admixture among sympatric H. cydno/H. timareta and H. melpomene populations across the extensive geographic ranges of the two radiations. Within H. cydno, both mtDNA and AFLP data indicate significant population structure at local scales, with strong genetic differences even among adjacent H. cydno colour pattern races. These genetic patterns highlight the importance of past geoclimatic events, intraspecific gene flow, and local population differentiation in the origin and establishment of new adaptive forms.
Collapse
Affiliation(s)
- Carlos F Arias
- Department of Biology, McGill University, 1205 Ave. Dr. Penfield, Montreal, QC, Canada, H3A 1B1; Smithsonian Tropical Research Institute, Apartado 0843-03092, Panamá, Panamá
| | | | | | | | | | | | | |
Collapse
|
44
|
Huang H, Tran LAP, Knowles LL. Do estimated and actual species phylogenies match? Evaluation of East African cichlid radiations. Mol Phylogenet Evol 2014; 78:56-65. [PMID: 24837624 DOI: 10.1016/j.ympev.2014.05.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 05/02/2014] [Accepted: 05/06/2014] [Indexed: 10/25/2022]
Abstract
A large number of published phylogenetic estimates are based on a single locus or the concatenation of multiple loci, even though genealogies of single or concatenated loci may not accurately reflect the true history of species diversification (i.e., the species tree). The increased availability of genomic data, coupled with new computational methods, improves resolution of species relationships beyond what was possible in the past. Such developments will no doubt benefit future phylogenetic studies. It remains unclear how robust phylogenies that predate these developments (i.e., the bulk of phylogenetic studies) are to departures from the assumption of strict gene tree-species tree concordance. Here, we present a parametric bootstrap (PBST) approach that assesses the reliability of past phylogenetic estimates in which gene tree-species tree discord was ignored. We focus on a universal cause of discord-the random loss of gene lineages from genetic drift-and apply the method in a meta-analysis of East African cichlids, a group encompassing historical scenarios that are particularly challenging for phylogenetic estimation. Although we identify some evolutionary relationships that are robust to gene tree discord, many past phylogenetic estimates of cichlids are not. We discuss the utility of the PBST method for evaluating the robustness of gene tree-based phylogenetic estimations in general as well as for testing the clade-specific performance of species tree estimation methods and designing sampling strategies that increase the accuracy of estimated species relationships.
Collapse
Affiliation(s)
- Huateng Huang
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, Ann Arbor, MI 48109-1079, USA.
| | - Lucy A P Tran
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, Ann Arbor, MI 48109-1079, USA.
| | - L Lacey Knowles
- Department of Ecology and Evolutionary Biology, Museum of Zoology, University of Michigan, Ann Arbor, MI 48109-1079, USA.
| |
Collapse
|
45
|
Kimball RT, Braun EL. Does more sequence data improve estimates of galliform phylogeny? Analyses of a rapid radiation using a complete data matrix. PeerJ 2014; 2:e361. [PMID: 24795852 PMCID: PMC4006227 DOI: 10.7717/peerj.361] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2013] [Accepted: 04/03/2014] [Indexed: 01/19/2023] Open
Abstract
The resolution of rapid evolutionary radiations or "bushes" in the tree of life has been one of the most difficult and interesting problems in phylogenetics. The avian order Galliformes appears to have undergone several rapid radiations that have limited the resolution of prior studies and obscured the position of taxa important both agriculturally and as model systems (chicken, turkey, Japanese quail). Here we present analyses of a multi-locus data matrix comprising over 15,000 sites, primarily from nuclear introns but also including three mitochondrial regions, from 46 galliform taxa with all gene regions sampled for all taxa. The increased sampling of unlinked nuclear genes provided strong bootstrap support for all but a small number of relationships. Coalescent-based methods to combine individual gene trees and analyses of datasets that are independent of published data indicated that this well-supported topology is likely to reflect the galliform species tree. The inclusion or exclusion of mitochondrial data had a limited impact upon analyses upon analyses using either concatenated data or multispecies coalescent methods. Some of the key phylogenetic findings include support for a second major clade within the core phasianids that includes the chicken and Japanese quail and clarification of the phylogenetic relationships of turkey. Jackknifed datasets suggested that there is an advantage to sampling many independent regions across the genome rather than obtaining long sequences for a small number of loci, possibly reflecting the differences among gene trees that differ due to incomplete lineage sorting. Despite the novel insights we obtained using this increased sampling of gene regions, some nodes remain unresolved, likely due to periods of rapid diversification. Resolving these remaining groups will likely require sequencing a very large number of gene regions, but our analyses now appear to support a robust backbone for this order.
Collapse
Affiliation(s)
- Rebecca T Kimball
- Department of Biology, University of Florida , Gainesville, FL , USA
| | - Edward L Braun
- Department of Biology, University of Florida , Gainesville, FL , USA
| |
Collapse
|
46
|
DeGiorgio M, Syring J, Eckert AJ, Liston A, Cronn R, Neale DB, Rosenberg NA. An empirical evaluation of two-stage species tree inference strategies using a multilocus dataset from North American pines. BMC Evol Biol 2014; 14:67. [PMID: 24678701 PMCID: PMC4021425 DOI: 10.1186/1471-2148-14-67] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Accepted: 02/10/2014] [Indexed: 12/26/2022] Open
Abstract
Background As it becomes increasingly possible to obtain DNA sequences of orthologous genes from diverse sets of taxa, species trees are frequently being inferred from multilocus data. However, the behavior of many methods for performing this inference has remained largely unexplored. Some methods have been proven to be consistent given certain evolutionary models, whereas others rely on criteria that, although appropriate for many parameter values, have peculiar zones of the parameter space in which they fail to converge on the correct estimate as data sets increase in size. Results Here, using North American pines, we empirically evaluate the behavior of 24 strategies for species tree inference using three alternative outgroups (72 strategies total). The data consist of 120 individuals sampled in eight ingroup species from subsection Strobus and three outgroup species from subsection Gerardianae, spanning ∼47 kilobases of sequence at 121 loci. Each “strategy” for inferring species trees consists of three features: a species tree construction method, a gene tree inference method, and a choice of outgroup. We use multivariate analysis techniques such as principal components analysis and hierarchical clustering to identify tree characteristics that are robustly observed across strategies, as well as to identify groups of strategies that produce trees with similar features. We find that strategies that construct species trees using only topological information cluster together and that strategies that use additional non-topological information (e.g., branch lengths) also cluster together. Strategies that utilize more than one individual within a species to infer gene trees tend to produce estimates of species trees that contain clades present in trees estimated by other strategies. Strategies that use the minimize-deep-coalescences criterion to construct species trees tend to produce species tree estimates that contain clades that are not present in trees estimated by the Concatenation, RTC, SMRT, STAR, and STEAC methods, and that in general are more balanced than those inferred by these other strategies. Conclusions When constructing a species tree from a multilocus set of sequences, our observations provide a basis for interpreting differences in species tree estimates obtained via different approaches that have a two-stage structure in common, one step for gene tree estimation and a second step for species tree estimation. The methods explored here employ a number of distinct features of the data, and our analysis suggests that recovery of the same results from multiple methods that tend to differ in their patterns of inference can be a valuable tool for obtaining reliable estimates.
Collapse
Affiliation(s)
- Michael DeGiorgio
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA.
| | | | | | | | | | | | | |
Collapse
|
47
|
García-Pereira MJ, Carvajal-Rodríguez A, Whelan S, Caballero A, Quesada H. Impact of deep coalescence and recombination on the estimation of phylogenetic relationships among species using AFLP markers. Mol Phylogenet Evol 2014; 76:102-9. [PMID: 24631855 DOI: 10.1016/j.ympev.2014.03.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2013] [Revised: 02/26/2014] [Accepted: 03/04/2014] [Indexed: 10/25/2022]
Abstract
Deep coalescence and the nongenealogical pattern of descent caused by recombination have emerged as a common problem for phylogenetic inference at the species level. Here we use computer simulations to assess whether AFLP-based phylogenies are robust to the uncertainties introduced by these factors. Our results indicate that phylogenetic signal can prevail even in the face of extensive deep coalescence allowing recovering the correct species tree topology. The impact of recombination on tree accuracy was related to total tree depth and species effective population size. The correct tree topology could be recovered upon many simulation settings due to a trade-off between the conflicting signals resulting from intra-locus recombination and the benefits of the joint consideration of unlinked loci that better matched overall the true species tree. Errors in tree topology were not only determined by deep coalescence, but also by the timing of divergence and the tree-building errors arising from an insufficient number of characters. DNA sequences generally outperformed AFLPs upon any simulated scenario, but this difference in performance was nearly negligible when a sufficient number of AFLP characters were sampled. Our simulations suggest that the impact of deep coalescence and intra-locus recombination on the reliability of AFLP trees could be minimal for effective population sizes equal to or lower than 10,000 (typical of many vertebrates and tree plants) given tree depths above 0.02 substitutions per site.
Collapse
Affiliation(s)
- María Jesús García-Pereira
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, 36310 Vigo, Spain.
| | - Antonio Carvajal-Rodríguez
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, 36310 Vigo, Spain.
| | - Simon Whelan
- Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala 75236-SE, Sweden.
| | - Armando Caballero
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, 36310 Vigo, Spain.
| | - Humberto Quesada
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, 36310 Vigo, Spain.
| |
Collapse
|
48
|
Lanier HC, Huang H, Knowles LL. How low can you go? The effects of mutation rate on the accuracy of species-tree estimation. Mol Phylogenet Evol 2014; 70:112-9. [DOI: 10.1016/j.ympev.2013.09.006] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Revised: 08/12/2013] [Accepted: 09/06/2013] [Indexed: 11/16/2022]
|
49
|
Harris RB, Carling MD, Lovette IJ. The influence of sampling design on species tree inference: a new relationship for the New World chickadees (Aves: Poecile). Evolution 2013; 68:501-13. [PMID: 24111665 DOI: 10.1111/evo.12280] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Accepted: 09/19/2013] [Indexed: 11/28/2022]
Abstract
In this study, we explore the long-standing issue of how many loci are needed to infer accurate phylogenetic relationships, and whether loci with particular attributes (e.g., parsimony informativeness, variability, gene tree resolution) outperform others. To do so, we use an empirical data set consisting of the seven species of chickadees (Aves: Paridae), an analytically tractable, recently diverged group, and well-studied ecologically but lacking a nuclear phylogeny. We estimate relationships using 40 nuclear loci and mitochondrial DNA using four coalescent-based species tree inference methods (BEST, *BEAST, STEM, STELLS). Collectively, our analyses contrast with previous studies and support a sister relationship between the Black-capped and Carolina Chickadee, two superficially similar species that hybridize along a long zone of contact. Gene flow is a potential source of conflict between nuclear and mitochondrial gene trees, yet we find a significant, albeit low, signal of gene flow. Our results suggest that relatively few loci with high information content may be sufficient for estimating an accurate species tree, but that substantially more loci are necessary for accurate parameter estimation. We provide an empirical reference point for researchers designing sampling protocols with the purpose of inferring phylogenies and population parameters of closely related taxa.
Collapse
Affiliation(s)
- Rebecca B Harris
- Fuller Evolutionary Biology Program, Cornell Lab of Ornithology, Cornell University, Ithaca, New York, 14850; Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York, 14850; Department of Biology and Burke Museum, University of Washington, Seattle, Washington.
| | | | | |
Collapse
|
50
|
Willis SC, Farias IP, Ortí G. TESTING MITOCHONDRIAL CAPTURE AND DEEP COALESCENCE IN AMAZONIAN CICHLID FISHES (CICHLIDAE:CICHLA). Evolution 2013; 68:256-68. [DOI: 10.1111/evo.12230] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2013] [Accepted: 08/05/2013] [Indexed: 11/30/2022]
Affiliation(s)
- Stuart C. Willis
- School of Biological Sciences, 348 Manter Hall; University of Nebraska-Lincoln; Lincoln Nebraska 68588
| | - Izeni P. Farias
- Laboratório de Evolução e Genética Animal, ICB; Universidade Federal do Amazonas, Estrada do Contorno; 3000 Manaus AM Brazil
| | - Guillermo Ortí
- Department of Biology; The George Washington University; 2023 G St. NW Suite 340 Washington District of Columbia 20052
| |
Collapse
|