1
|
Carlson KB, Nguyen C, Wcisel DJ, Yoder JA, Dornburg A. Ancient fish lineages illuminate toll-like receptor diversification in early vertebrate evolution. Immunogenetics 2023; 75:465-478. [PMID: 37555888 DOI: 10.1007/s00251-023-01315-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/28/2023] [Indexed: 08/10/2023]
Abstract
Since its initial discovery over 50 years ago, understanding the evolution of the vertebrate RAG- mediated adaptive immune response has been a major area of research focus for comparative geneticists. However, how the evolutionary novelty of an adaptive immune response impacted the diversity of receptors associated with the innate immune response has received considerably less attention until recently. Here, we investigate the diversification of vertebrate toll-like receptors (TLRs), one of the most ancient and well conserved innate immune receptor families found across the Tree of Life, integrating genomic data that represent all major vertebrate lineages with new transcriptomic data from Polypteriformes, the earliest diverging ray-finned fish lineage. Our analyses reveal TLR sequences that reflect the 6 major TLR subfamilies, TLR1, TLR3, TLR4, TLR5, TLR7, and TLR11, and also currently unnamed, yet phylogenetically distinct TLR clades. We additionally recover evidence for a pulse of gene gain coincident with the rise of the RAG-mediated adaptive immune response in jawed vertebrates, followed by a period of rapid gene loss during the Cretaceous. These gene losses are primarily concentrated in marine teleost fish and synchronous with the mid Cretaceous anoxic event, a period of rapid extinction for marine species. Finally, we reveal a mismatch between phylogenetic placement and gene nomenclature for up to 50% of TLRs found in clades such as ray-finned fishes, cyclostomes, amphibians, and elasmobranchs. Collectively, these results provide an unparalleled perspective of TLR diversity and offer a ready framework for testing gene annotations in non-model species.
Collapse
Affiliation(s)
- Kara B Carlson
- Department of Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
- Genetics and Genomics Academy, North Carolina State University, Raleigh, NC, USA
| | - Cameron Nguyen
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
| | - Dustin J Wcisel
- Department of Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
| | - Jeffrey A Yoder
- Department of Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
- Genetics and Genomics Academy, North Carolina State University, Raleigh, NC, USA
- Comparative Medicine Institute, North Carolina State University, Raleigh, NC, USA
- Center for Human Health and the Environment, North Carolina State University, Raleigh, NC, USA
| | - Alex Dornburg
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA.
| |
Collapse
|
2
|
Forthman M, Gordon ERL, Kimball RT. Low hybridization temperatures improve target capture success of invertebrate loci: a case study of leaf-footed bugs (Hemiptera: Coreoidea). ROYAL SOCIETY OPEN SCIENCE 2023; 10:230307. [PMID: 37388308 PMCID: PMC10300676 DOI: 10.1098/rsos.230307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 06/12/2023] [Indexed: 07/01/2023]
Abstract
Target capture is widely used in phylogenomic, ecological and functional genomic studies. Bait sets that allow capture from a diversity of species can be advantageous, but high-sequence divergence from baits can limit yields. Currently, only four experimental comparisons of a critical target capture parameter, hybridization temperature, have been published. These have been in vertebrates, where bait divergences are typically low, and none include invertebrates where bait-target divergences may be higher. Most invertebrate capture studies use a fixed, high hybridization temperature to maximize the proportion of on-target data, but many report low locus recovery. Using leaf-footed bugs (Hemiptera: Coreoidea), we investigate the effect of hybridization temperature on capture success of ultraconserved elements targeted by (i) baits developed from divergent hemipteran genomes and (ii) baits developed from less divergent coreoid transcriptomes. Lower temperatures generally resulted in more contigs and improved recovery of targets despite a lower proportion of on-target reads, lower read depth and more putative paralogues. Hybridization temperatures had less of an effect when using transcriptome-derived baits, which is probably due to lower bait-target divergences and greater bait tiling density. Thus, accommodating low hybridization temperatures during target capture can provide a cost-effective, widely applicable solution to improve invertebrate locus recovery.
Collapse
Affiliation(s)
- Michael Forthman
- California State Collection of Arthropods, Plant Pest Diagnostics Branch, California Department of Food and Agriculture, 3294 Meadowview Road, Sacramento, CA 95832, USA
- Entomology and Nematology Department, University of Florida, 1881 Natural Area Drive, Gainesville, FL 32611, USA
| | - Eric R. L. Gordon
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Rebecca T. Kimball
- Department of Biology, University of Florida, 876 Newell Drive, Gainesville, FL 32611, USA
| |
Collapse
|
3
|
Dornburg A, Mallik R, Wang Z, Bernal MA, Thompson B, Bruford EA, Nebert DW, Vasiliou V, Yohe LR, Yoder JA, Townsend JP. Placing human gene families into their evolutionary context. Hum Genomics 2022; 16:56. [PMID: 36369063 PMCID: PMC9652883 DOI: 10.1186/s40246-022-00429-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 10/12/2022] [Indexed: 11/13/2022] Open
Abstract
Following the draft sequence of the first human genome over 20 years ago, we have achieved unprecedented insights into the rules governing its evolution, often with direct translational relevance to specific diseases. However, staggering sequence complexity has also challenged the development of a more comprehensive understanding of human genome biology. In this context, interspecific genomic studies between humans and other animals have played a critical role in our efforts to decode human gene families. In this review, we focus on how the rapid surge of genome sequencing of both model and non-model organisms now provides a broader comparative framework poised to empower novel discoveries. We begin with a general overview of how comparative approaches are essential for understanding gene family evolution in the human genome, followed by a discussion of analyses of gene expression. We show how homology can provide insights into the genes and gene families associated with immune response, cancer biology, vision, chemosensation, and metabolism, by revealing similarity in processes among distant species. We then explain methodological tools that provide critical advances and show the limitations of common approaches. We conclude with a discussion of how these investigations position us to gain fundamental insights into the evolution of gene families among living organisms in general. We hope that our review catalyzes additional excitement and research on the emerging field of comparative genomics, while aiding the placement of the human genome into its existentially evolutionary context.
Collapse
Affiliation(s)
- Alex Dornburg
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA.
| | - Rittika Mallik
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Zheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Moisés A Bernal
- Department of Biological Sciences, College of Science and Mathematics, Auburn University, Auburn, AL, USA
| | - Brian Thompson
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Elspeth A Bruford
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Daniel W Nebert
- Department of Environmental Health, Center for Environmental Genetics, University of Cincinnati Medical Center, P.O. Box 670056, Cincinnati, OH, 45267, USA
- Department of Pediatrics and Molecular Developmental Biology, Division of Human Genetics, Cincinnati Children's Hospital, Cincinnati, OH, 45229, USA
| | - Vasilis Vasiliou
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Laurel R Yohe
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Jeffrey A Yoder
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - Jeffrey P Townsend
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| |
Collapse
|
4
|
Piller KR, Parker E, Lemmon AR, Moriarty Lemmon E. Investigating the utility of Anchored Hybrid Enrichment data to investigate the relationships among the Killifishes (Actinopterygii: Cyprinodontiformes), a globally distributed group of fishes. Mol Phylogenet Evol 2022; 173:107482. [PMID: 35452841 DOI: 10.1016/j.ympev.2022.107482] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 04/06/2022] [Accepted: 04/06/2022] [Indexed: 10/18/2022]
Abstract
The Killifishes (Cyprinodontiformes) are a diverse and well-known group of fishes that contains sixteen families inclusive of Anablepidae, Aphaniidae Aplocheilidae, Cubanichthyidae, Cyprinodontidae, Fluviphylacidae, Fundulidae, Goodeidae, Nothobranchiidae, Orestiidae, Pantanodontidae, Poeciliidae, Procatopodidae, Profundulidae, Rivulidae, and Valenciidae and more than 1,200 species that are globally distributed in tropical and temperate, freshwater and estuarine habitats. The evolutionary relationships among the families within the group, based on different molecular and morphological data sets, have remained uncertain. Therefore, the objective of this study was to use a targeted approach, anchored hybrid enrichment, to investigate the phylogenetic relationships among the families within the Cyprindontiformes. This study included more than 100 individuals, representing all sixteen families within the Cyprinodontiformes, including many recently diagnosed families. We recovered an average of 244 loci per individual. These data were submitted to phylogenetic analyses (RaxML and ASTRAL) and although we recovered many of the same relationships as in previous studies of the group, several novel sets of relationships for other families also were recovered. In addition, two well-established clades (Suborders Cyprinodontoidei and Aplocheilodei) were recovered as monophyletic and are in agreement with most previous studies. We also assessed the degree of gene tree discordance in our dataset to evaluate support for alternative topological hypotheses for interfamilial relationships within the Cyprinodontiformes using a variety of different analyses. The results from this study will provide a robust, historical framework needed to investigate a plethora of biogeographic, taxonomic, ecological, and physiological questions for this group of fishes.
Collapse
Affiliation(s)
- Kyle R Piller
- Department of Biological Science, Southeastern Louisiana University, Hammond, LA 70402, USA.
| | - Elyse Parker
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, 06511, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Dirac Science Library, Tallahassee, FL, 32306-4120, USA
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, Biomedical Research Facility, Tallahassee, FL, 32306-4295, USA
| |
Collapse
|
5
|
Dornburg A, Near TJ. The Emerging Phylogenetic Perspective on the Evolution of Actinopterygian Fishes. ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS 2021. [DOI: 10.1146/annurev-ecolsys-122120-122554] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The emergence of a new phylogeny of ray-finned fishes at the turn of the twenty-first century marked a paradigm shift in understanding the evolutionary history of half of living vertebrates. We review how the new ray-finned fish phylogeny radically departs from classical expectations based on morphology. We focus on evolutionary relationships that span the backbone of ray-finned fish phylogeny, from the earliest divergences among teleosts and nonteleosts to the resolution of major lineages of Percomorpha. Throughout, we feature advances gained by the new phylogeny toward a broader understanding of ray-finned fish evolutionary history and the implications for topics that span from the genetics of human health to reconsidering the concept of living fossils. Additionally, we discuss conceptual challenges that involve reconciling taxonomic classification with phylogenetic relationships and propose an alternate higher-level classification for Percomorpha. Our review highlights remaining areas of phylogenetic uncertainty and opportunities for comparative investigations empowered by this new phylogenetic perspective on ray-finned fishes.
Collapse
Affiliation(s)
- Alex Dornburg
- Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, North Carolina 28223, USA
| | - Thomas J. Near
- Department of Ecology and Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, Connecticut 06511, USA
| |
Collapse
|
6
|
Alda F, Ludt WB, Elías DJ, McMahan CD, Chakrabarty P. Comparing Ultraconserved Elements and Exons for Phylogenomic Analyses of Middle American Cichlids: When Data Agree to Disagree. Genome Biol Evol 2021; 13:evab161. [PMID: 34272856 PMCID: PMC8369075 DOI: 10.1093/gbe/evab161] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/05/2021] [Indexed: 12/20/2022] Open
Abstract
Choosing among types of genomic markers to be used in a phylogenomic study can have a major influence on the cost, design, and results of a study. Yet few attempts have been made to compare categories of next-generation sequence markers limiting our ability to compare the suitability of these different genomic fragment types. Here, we explore properties of different genomic markers to find if they vary in the accuracy of component phylogenetic trees and to clarify the causes of conflict obtained from different data sets or inference methods. As a test case, we explore the causes of discordance between phylogenetic hypotheses obtained using a novel data set of ultraconserved elements (UCEs) and a recently published exon data set of the cichlid tribe Heroini. Resolving relationships among heroine cichlids has historically been difficult, and the processes of colonization and diversification in Middle America and the Greater Antilles are not yet well understood. Despite differences in informativeness and levels of gene tree discordance between UCEs and exons, the resulting phylogenomic hypotheses generally agree on most relationships. The independent data sets disagreed in areas with low phylogenetic signal that were overwhelmed by incomplete lineage sorting and nonphylogenetic signals. For UCEs, high levels of incomplete lineage sorting were found to be the major cause of gene tree discordance, whereas, for exons, nonphylogenetic signal is most likely caused by a reduced number of highly informative loci. This paucity of informative loci in exons might be due to heterogeneous substitution rates that are problematic to model (i.e., computationally restrictive) resulting in systematic errors that UCEs (being less informative individually but more uniform) are less prone to. These results generally demonstrate the robustness of phylogenomic methods to accommodate genomic markers with different biological and phylogenetic properties. However, we identify common and unique pitfalls of different categories of genomic fragments when inferring enigmatic phylogenetic relationships.
Collapse
Affiliation(s)
- Fernando Alda
- Department of Biology, Geology and Environmental Science, University of Tennessee at Chattanooga, Tennessee, USA
| | - William B Ludt
- Department of Ichthyology, Natural History Museum of Los Angeles County, Los Angeles, California, USA
| | - Diego J Elías
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | | | - Prosanta Chakrabarty
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| |
Collapse
|
7
|
Felix Afonso GV, Dario FD, Eduardo LN, Lucena-Frédou F, Bertrand A, Mincarone MM. Taxonomy and Distribution of Deep-Sea Bigscales and Whalefishes (Teleostei: Stephanoberycoidei) Collected off Northeastern Brazil, Including Seamounts and Oceanic Islands. ICHTHYOLOGY & HERPETOLOGY 2021. [DOI: 10.1643/i2020069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
8
|
Tang KL, Stiassny MLJ, Mayden RL, DeSalle R. Systematics of Damselfishes. ICHTHYOLOGY & HERPETOLOGY 2021. [DOI: 10.1643/i2020105] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Kevin L. Tang
- University of Michigan–Flint, Department of Biology, 303 East Kearsley St., Flint, Michigan 48502; . Send reprint requests to this address
| | - Melanie L. J. Stiassny
- American Museum of Natural History, Department of Ichthyology, Central Park West at 79th St., New York, New York 10024;
| | - Richard L. Mayden
- Saint Louis University, Department of Biology, 3507 Laclede Ave., St. Louis, Missouri 63103;
| | - Robert DeSalle
- American Museum of Natural History, Division of Invertebrate Zoology, Central Park West at 79th St., New York, New York 10024;
| |
Collapse
|
9
|
Ghedotti MJ, DeKay HM, Maile AJ, Smith WL, Davis MP. Anatomy and evolution of bioluminescent organs in the slimeheads (Teleostei: Trachichthyidae). J Morphol 2021; 282:820-832. [PMID: 33733466 DOI: 10.1002/jmor.21349] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 03/11/2021] [Accepted: 03/14/2021] [Indexed: 11/10/2022]
Abstract
Bacterial bioluminescent organs in fishes have a diverse range of tissues of origin, patterns of compartmentalization, and associated light-conducting structures. The morphology of the perianal, bacterial bioluminescent organ of Aulotrachichthys prosthemius was described previously, but the light organ in other species of slimeheads, family Trachichthyidae, is poorly known. Here, we describe the anatomy of the bioluminescent organs in trachichthyids and places the evolution of this light-producing system in the context of a new phylogeny of the Trachichthyoidei to test the hypothesis that bioluminescence evolved twice in the suborder and that the light-producing component derives from the perianal ectoderm. We use gross and histological examination to provide the first description of the bioluminescent organ of Paratrachichthys and four additional species of Aulotrachichthys. Observations also strongly suggest the presence of a perianal bioluminescent organ in Sorosichthys ananasa. The updated phylogeny of the Trachichthyoidei is the first to combine morphological and DNA-sequence (11-gene fragments) evidence, and supports a monophyletic Trachichthyidae with component subfamilies Hoplostethinae and Trachichthyinae, supporting continued recognition of the family Anoplogastridae. All bioluminescent trachichthyoids share a similar bioluminescent-organ structure with elongate chambers filled with bacteria and connected to collecting ducts that, in turn, connect to superficial ducts that lead to and have lining epithelia continuous with the epidermis. In the context of the phylogeny, the bioluminescent organ of trachichthyids is inferred to have evolved as an elaboration of the proctodeum in the ancestor of Aulotrachichthys, Paratrachichthys, and Sorosichthys independently from the structurally similar cephalic bioluminescent organs in Anomalopidae and Monocentridae.
Collapse
Affiliation(s)
- Michael J Ghedotti
- Department of Biology, Regis University, Denver, Colorado, USA.,Bell Museum of Natural History, University of Minnesota, St. Paul, Minnesota, USA
| | - Hannah M DeKay
- Department of Biology, Regis University, Denver, Colorado, USA
| | - Alex J Maile
- Department of Biological Sciences, St. Cloud State University, St. Cloud, Minnesota, USA
| | - W Leo Smith
- Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, Kansas, USA
| | - Matthew P Davis
- Department of Biological Sciences, St. Cloud State University, St. Cloud, Minnesota, USA
| |
Collapse
|
10
|
Takezaki N. Resolving the Early Divergence Pattern of Teleost Fish Using Genome-Scale Data. Genome Biol Evol 2021; 13:6178791. [PMID: 33739405 PMCID: PMC8103497 DOI: 10.1093/gbe/evab052] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/10/2021] [Indexed: 12/13/2022] Open
Abstract
Regarding the phylogenetic relationship of the three primary groups of teleost fishes, Osteoglossomorpha (bonytongues and others), Elopomorpha (eels and relatives), Clupeocephala (the remaining teleost fish), early morphological studies hypothesized the first divergence of Osteoglossomorpha, whereas the recent prevailing view is the first divergence of Elopomorpha. Molecular studies supported all the possible relationships of the three primary groups. This study analyzed genome-scale data from four previous studies: 1) 412 genes from 12 species, 2) 772 genes from 15 species, 3) 1,062 genes from 30 species, and 4) 491 UCE loci from 27 species. The effects of the species, loci, and models used on the constructed tree topologies were investigated. In the analyses of the data sets (1)–(3), although the first divergence of Clupeocephala that left the other two groups in a sister relationship was supported by concatenated sequences and gene trees of all the species and genes, the first divergence of Elopomorpha among the three groups was supported using species and/or genes with low divergence of sequence and amino-acid frequencies. This result corresponded to that of the UCE data set (4), whose sequence divergence was low, which supported the first divergence of Elopomorpha with high statistical significance. The increase in accuracy of the phylogenetic construction by using species and genes with low sequence divergence was predicted by a phylogenetic informativeness approach and confirmed by computer simulation. These results supported that Elopomorpha was the first basal group of teleost fish to have diverged, consistent with the prevailing view of recent morphological studies.
Collapse
Affiliation(s)
- Naoko Takezaki
- Life Science Research Center, Kagawa University, Mikicho, Kitagun, Kagawa, Japan
| |
Collapse
|
11
|
Hughes LC, Ortí G, Saad H, Li C, White WT, Baldwin CC, Crandall KA, Arcila D, Betancur-R R. Exon probe sets and bioinformatics pipelines for all levels of fish phylogenomics. Mol Ecol Resour 2020; 21:816-833. [PMID: 33084200 DOI: 10.1111/1755-0998.13287] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Accepted: 10/09/2020] [Indexed: 11/28/2022]
Abstract
Exon markers have a long history of use in phylogenetics of ray-finned fishes, the most diverse clade of vertebrates with more than 35,000 species. As the number of published genomes increases, it has become easier to test exons and other genetic markers for signals of ancient duplication events and filter out paralogues that can mislead phylogenetic analysis. We present seven new probe sets for current target-capture phylogenomic protocols that capture 1,104 exons explicitly filtered for paralogues using gene trees. These seven probe sets span the diversity of teleost fishes, including four sets that target five hyperdiverse percomorph clades which together comprise ca. 17,000 species (Carangaria, Ovalentaria, Eupercaria, and Syngnatharia + Pelagiaria combined). We additionally included probes to capture legacy nuclear exons and mitochondrial markers that have been commonly used in fish phylogenetics (despite some exons being flagged for paralogues) to facilitate integration of old and new molecular phylogenetic matrices. We tested these probes experimentally for 56 fish species (eight species per probe set) and merged new exon-capture sequence data into an existing data matrix of 1,104 exons and 300 ray-finned fish species. We provide an optimized bioinformatics pipeline to assemble exon capture data from raw reads to alignments for downstream analysis. We show that legacy loci with known paralogues are at risk of assembling duplicated sequences with target-capture, but we also assembled many useful orthologous sequences that can be integrated with many PCR-generated matrices. These probe sets are a valuable resource for advancing fish phylogenomics because targeted exons can easily be extracted from increasingly available whole genome and transcriptome data sets, and also may be integrated with existing PCR-based exon and mitochondrial data.
Collapse
Affiliation(s)
- Lily C Hughes
- Department of Biological Sciences, George Washington University, Washington, DC, USA.,Computational Biology Institute, Milken Institute of Public Health, George Washington University, Washington, DC, USA.,Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Guillermo Ortí
- Department of Biological Sciences, George Washington University, Washington, DC, USA.,Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Hadeel Saad
- Department of Biological Sciences, George Washington University, Washington, DC, USA
| | - Chenhong Li
- College of Fisheries and Life Sciences, Shanghai Ocean University, Shanghai, China
| | - William T White
- CSIRO Australian National Fish Collection, National Research Collections of Australia, Hobart, TAS, Australia
| | - Carole C Baldwin
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Keith A Crandall
- Department of Biological Sciences, George Washington University, Washington, DC, USA.,Computational Biology Institute, Milken Institute of Public Health, George Washington University, Washington, DC, USA
| | - Dahiana Arcila
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA.,Sam Noble Oklahoma Museum of Natural History, Norman, OK, USA.,Department of Biology, University of Oklahoma, Norman, OK, USA
| | | |
Collapse
|
12
|
Quek RZB, Jain SS, Neo ML, Rouse GW, Huang D. Transcriptome-based target-enrichment baits for stony corals (Cnidaria: Anthozoa: Scleractinia). Mol Ecol Resour 2020; 20:807-818. [PMID: 32077619 PMCID: PMC7468246 DOI: 10.1111/1755-0998.13150] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 02/01/2020] [Accepted: 02/10/2020] [Indexed: 01/09/2023]
Abstract
Despite the ecological and economic significance of stony corals (Scleractinia), a robust understanding of their phylogeny remains elusive due to patchy taxonomic and genetic sampling, as well as the limited availability of informative markers. To increase the number of genetic loci available for phylogenomic analyses in Scleractinia, we designed 15,919 DNA enrichment baits targeting 605 orthogroups (mean 565 ± SD 366 bp) over 1,139 exon regions. A further 236 and 62 barcoding baits were designed for COI and histone H3 genes respectively for quality and contamination checks. Hybrid capture using these baits was performed on 18 coral species spanning the presently understood scleractinian phylogeny, with two corallimorpharians as outgroup. On average, 74% of all loci targeted were successfully captured for each species. Barcoding baits were matched unambiguously to their respective samples and revealed low levels of cross-contamination in accordance with expectation. We put the data through a series of stringent filtering steps to ensure only scleractinian and phylogenetically informative loci were retained, and the final probe set comprised 13,479 baits, targeting 452 loci (mean 531 ± SD 307 bp) across 865 exon regions. Maximum likelihood, Bayesian and species tree analyses recovered maximally supported, topologically congruent trees consistent with previous phylogenomic reconstructions. The phylogenomic method presented here allows for consistent capture of orthologous loci among divergent coral taxa, facilitating the pooling of data from different studies and increasing the phylogenetic sampling of scleractinians in the future.
Collapse
Affiliation(s)
- Randolph Z. B. Quek
- Department of Biological SciencesNational University of SingaporeSingaporeSingapore
| | - Sudhanshi S. Jain
- Department of Biological SciencesNational University of SingaporeSingaporeSingapore
| | - Mei Lin Neo
- Department of Biological SciencesNational University of SingaporeSingaporeSingapore
- Tropical Marine Science InstituteNational University of SingaporeSingaporeSingapore
| | - Greg W. Rouse
- Scripps Institution of OceanographyUniversity of California San DiegoSan DiegoCAUSA
| | - Danwei Huang
- Department of Biological SciencesNational University of SingaporeSingaporeSingapore
- Tropical Marine Science InstituteNational University of SingaporeSingaporeSingapore
| |
Collapse
|
13
|
Wcisel DJ, Howard JT, Yoder JA, Dornburg A. Transcriptome Ortholog Alignment Sequence Tools (TOAST) for phylogenomic dataset assembly. BMC Evol Biol 2020; 20:41. [PMID: 32228442 PMCID: PMC7106827 DOI: 10.1186/s12862-020-01603-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 03/11/2020] [Indexed: 01/05/2023] Open
Abstract
Background Advances in next-generation sequencing technologies have reduced the cost of whole transcriptome analyses, allowing characterization of non-model species at unprecedented levels. The rapid pace of transcriptomic sequencing has driven the public accumulation of a wealth of data for phylogenomic analyses, however lack of tools aimed towards phylogeneticists to efficiently identify orthologous sequences currently hinders effective harnessing of this resource. Results We introduce TOAST, an open source R software package that can utilize the ortholog searches based on the software Benchmarking Universal Single-Copy Orthologs (BUSCO) to assemble multiple sequence alignments of orthologous loci from transcriptomes for any group of organisms. By streamlining search, query, and alignment, TOAST automates the generation of locus and concatenated alignments, and also presents a series of outputs from which users can not only explore missing data patterns across their alignments, but also reassemble alignments based on user-defined acceptable missing data levels for a given research question. Conclusions TOAST provides a comprehensive set of tools for assembly of sequence alignments of orthologs for comparative transcriptomic and phylogenomic studies. This software empowers easy assembly of public and novel sequences for any target database of candidate orthologs, and fills a critically needed niche for tools that enable quantification and testing of the impact of missing data. As open-source software, TOAST is fully customizable for integration into existing or novel custom informatic pipelines for phylogenomic inference. Software, a detailed manual, and example data files are available through github carolinafishes.github.io
Collapse
Affiliation(s)
- Dustin J Wcisel
- Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA
| | - J Thomas Howard
- Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA
| | - Jeffrey A Yoder
- Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA.,Comparative Medicine Institute, NC State University, Raleigh, NC, USA.,Center for Human Health and the Environment, NC State University, Raleigh, NC, USA
| | - Alex Dornburg
- Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA.
| |
Collapse
|
14
|
Phillips AJ, Dornburg A, Zapfe KL, Anderson FE, James SW, Erséus C, Moriarty Lemmon E, Lemmon AR, Williams BW. Phylogenomic Analysis of a Putative Missing Link Sparks Reinterpretation of Leech Evolution. Genome Biol Evol 2020; 11:3082-3093. [PMID: 31214691 PMCID: PMC6598468 DOI: 10.1093/gbe/evz120] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/12/2019] [Indexed: 12/17/2022] Open
Abstract
Leeches (Hirudinida) comprise a charismatic, yet often maligned group of worms. Despite their ecological, economic, and medical importance, a general consensus on the phylogenetic relationships of major hirudinidan lineages is lacking. This absence of a consistent, robust phylogeny of early-diverging lineages has hindered our understanding of the underlying processes that enabled evolutionary diversification of this clade. Here, we used an anchored hybrid enrichment-based phylogenomic approach, capturing hundreds of loci to investigate phylogenetic relationships among major hirudinidan lineages and their closest living relatives. We recovered Branchiobdellida as sister to a clade that includes all major lineages of hirudinidans and Acanthobdella, casting doubt on the utility of Acanthobdella as a “missing link” between hirudinidans and the clitellate group formerly known as Oligochaeta. Further, our results corroborate the reciprocal monophyly of jawed and proboscis-bearing leeches. Our phylogenomic resolution of early-diverging leeches provides a useful framework for illuminating the evolution of key adaptations and host–symbiont associations that have allowed leeches to colonize a wide diversity of habitats worldwide.
Collapse
Affiliation(s)
- Anna J Phillips
- Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina
| | - Katerina L Zapfe
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina.,Department of Biological Sciences, Clemson University
| | | | | | - Christer Erséus
- Department of Biological and Environmental Sciences, University of Gothenburg, Sweden
| | | | - Alan R Lemmon
- Department of Scientific Computing, Florida State University
| | - Bronwyn W Williams
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina
| |
Collapse
|
15
|
Parker E, Dornburg A, Domínguez-Domínguez O, Piller KR. Assessing phylogenetic information to reveal uncertainty in historical data: An example using Goodeinae (Teleostei: Cyprinodontiformes: Goodeidae). Mol Phylogenet Evol 2019; 134:282-290. [DOI: 10.1016/j.ympev.2019.01.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 01/17/2019] [Accepted: 01/30/2019] [Indexed: 01/18/2023]
|
16
|
MacGuigan DJ, Near TJ. Phylogenomic Signatures of Ancient Introgression in a Rogue Lineage of Darters (Teleostei: Percidae). Syst Biol 2019; 68:329-346. [PMID: 30395332 DOI: 10.1093/sysbio/syy074] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 10/29/2018] [Indexed: 12/17/2022] Open
Abstract
Evolutionary history is typically portrayed as a branching phylogenetic tree, yet not all evolution proceeds in a purely bifurcating manner. Introgressive hybridization is one process that results in reticulate evolution. Most known examples of genome-wide introgression occur among closely related species with relatively recent common ancestry; however, we present evidence for ancient hybridization and genome-wide introgression between major stem lineages of darters, a species-rich clade of North American freshwater fishes. Previous attempts to resolve the relationships of darters have been confounded by the uncertain phylogenetic resolution of the lineage Allohistium. In this study, we investigate the phylogenomics of darters, specifically the relationships of Allohistium, through analyses of approximately 30,000 RADseq loci sampled from 112 species. Our phylogenetic inferences are based on traditional approaches in combination with strategies that accommodate reticulate evolution. These analyses result in a novel phylogenetic hypothesis for darters that includes ancient introgression between Allohistium and other two major darter lineages, minimally occurring 20 million years ago. Darters offer a compelling case for the necessity of incorporating phylogenetic networks in reconstructing the evolutionary history of diversification in species-rich lineages. We anticipate that the growing wealth of genomic data for clades of non-model organisms will reveal more examples of ancient hybridization, eventually requiring a re-evaluation of how evolutionary history is visualized and utilized in macroevolutonary investigations.
Collapse
Affiliation(s)
- Daniel J MacGuigan
- Department of Ecology and Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Thomas J Near
- Department of Ecology and Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA.,Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
17
|
Dornburg A, Su Z, Townsend JP. Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets. Syst Biol 2018; 68:145-156. [PMID: 29939341 DOI: 10.1093/sysbio/syy047] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 06/13/2018] [Indexed: 02/02/2023] Open
Abstract
With the rise of genome-scale data sets, there has been a call for increased data scrutiny and careful selection of loci that are appropriate to use in an attempt to resolve a phylogenetic problem. Such loci should maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve at an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Herein, we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters-not genes-are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design. [Divergence time; GC bias; homoplasy; incongruence; information content; internode length; optimal rates; phylogenetic informativeness; phylogenetic theory; phylogenetic utility; phylogenomics; signal and noise; subtending branch length; state space; taxon and character sampling.].
Collapse
Affiliation(s)
- Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, 1671 Goldstar Drive, NC 27601, USA
| | - Zhuo Su
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, 165 Prospect Street, CT 06525, USA
| | - Jeffrey P Townsend
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, 165 Prospect Street, CT 06525, USA
- Department of Biostatistics, Yale University, New Haven, 60 College Street, CT 06510, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, 300 George Street, CT 06511, USA
| |
Collapse
|
18
|
Alda F, Tagliacollo VA, Bernt MJ, Waltz BT, Ludt WB, Faircloth BC, Alfaro ME, Albert JS, Chakrabarty P. Resolving Deep Nodes in an Ancient Radiation of Neotropical Fishes in the Presence of Conflicting Signals from Incomplete Lineage Sorting. Syst Biol 2018; 68:573-593. [DOI: 10.1093/sysbio/syy085] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 11/30/2018] [Accepted: 12/03/2018] [Indexed: 12/13/2022] Open
Affiliation(s)
- Fernando Alda
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Department of Biology, Geology and Environmental Science, University of Tennessee at Chattanooga, Chattanooga, TN 37403, USA
| | - Victor A Tagliacollo
- Museu de Zoologia da Universidade de São Paulo (MZUSP), Ipirianga, 04263-000, São Paulo, São Paulo, Brazil
| | - Maxwell J Bernt
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - Brandon T Waltz
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - William B Ludt
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Brant C Faircloth
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Michael E Alfaro
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
| | - James S Albert
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - Prosanta Chakrabarty
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
19
|
Near TJ, MacGuigan DJ, Parker E, Struthers CD, Jones CD, Dornburg A. Phylogenetic analysis of Antarctic notothenioids illuminates the utility of RADseq for resolving Cenozoic adaptive radiations. Mol Phylogenet Evol 2018; 129:268-279. [PMID: 30195039 DOI: 10.1016/j.ympev.2018.09.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 08/31/2018] [Accepted: 09/01/2018] [Indexed: 10/28/2022]
Abstract
Notothenioids are a clade of ∼120 species of marine fishes distributed in extreme southern hemisphere temperate near-shore habitats and in the Southern Ocean surrounding Antarctica. Over the past 25 years, molecular and morphological approaches have redefined hypotheses of relationships among notothenioid lineages as well as their relationships among major lineages of percomorph teleosts. These phylogenies provide a basis for investigation of mechanisms of evolutionary diversification within the clade and have enhanced our understanding of the notothenioid adaptive radiation. Despite extensive efforts, there remain several questions concerning the phylogeny of notothenioids. In this study, we deploy DNA sequences of ∼100,000 loci obtained using RADseq to investigate the phylogenetic relationships of notothenioids and to assess the utility of RADseq loci for lineages that exhibit divergence times ranging from the Paleogene to the Quaternary. The notothenioid phylogenies inferred from the RADseq loci provide unparalleled resolution and node support for several long-standing problems including, (1) relationships among species of Trematomus, (2) resolution of Indonotothenia cyanobrancha as the sister lineage of Trematomus, (3) the deep paraphyly of Nototheniidae, (4) the paraphyly of Lepidonotothen s.l., (5) paraphyly of Artedidraco, and 6) the monophyly of the Bathydraconidae. Assessment of site rates demonstrates that RADseq loci are similar to mtDNA protein coding genes and exhibit peak phylogenetic informativeness at the time interval during which the major Antarctic notothenioid lineages originated and diversified. In addition to providing a well-resolved phylogenetic hypothesis for notothenioids, our analyses quantify the predicted utility of RADseq loci for Cenozoic phylogenetic inferences.
Collapse
Affiliation(s)
- Thomas J Near
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA; Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA.
| | - Daniel J MacGuigan
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Elyse Parker
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Carl D Struthers
- Museum of New Zealand Te Papa Tongarewa, Wellington, New Zealand
| | - Christopher D Jones
- Antarctic Ecosystem Research Division, NOAA Southwest Fisheries Science Center, La Jolla, CA 92037, USA
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
| |
Collapse
|
20
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|
21
|
Kuang T, Tornabene L, Li J, Jiang J, Chakrabarty P, Sparks JS, Naylor GJP, Li C. Phylogenomic analysis on the exceptionally diverse fish clade Gobioidei (Actinopterygii: Gobiiformes) and data-filtering based on molecular clocklikeness. Mol Phylogenet Evol 2018; 128:192-202. [PMID: 30036699 DOI: 10.1016/j.ympev.2018.07.018] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 07/11/2018] [Accepted: 07/17/2018] [Indexed: 11/30/2022]
Abstract
The use of genome-scale data to infer phylogenetic relationships has gained in popularity in recent years due to the progress made in target-gene capture and sequencing techniques. Data filtering, the approach of excluding data inconsistent with the model from analyses, presumably could alleviate problems caused by systematic errors in phylogenetic inference. Different data filtering criteria, such as those based on evolutionary rate and molecular clocklikeness as well as others have been proposed for selecting useful phylogenetic markers, yet few studies have tested these criteria using phylogenomic data. We developed a novel set of single-copy nuclear coding markers to capture thousands of target genes in gobioid fishes, a species-rich lineages of vertebrates, and tested the effects of data-filtering methods based on substitution rate and molecular clocklikeness while attempting to control for the compounding effects of missing data and variation in locus length. We found that molecular clocklikeness was a better predictor than overall substitution rate for phylogenetic usefulness of molecular markers in our study. In addition, when the 100 best ranked loci for our predictors were concatenated and analyzed using maximum likelihood, or combined in a coalescent-based species-tree analysis, the resulting trees showed a well-resolved topology of Gobioidei that mostly agrees with previous studies. However, trees generated from the 100 least clocklike frequently recovered conflicting, and in some cases clearly erroneous topologies with strong support, thus indicating strong systematic biases in those datasets. Collectively these results suggest that data filtering has the potential improve the performance of phylogenetic inference when using both a concatenation approach as well as methods that rely on input from individual gene trees (i.e. coalescent species-tree approaches), which may be preferred in scenarios where incomplete lineage sorting is likely to be an issue.
Collapse
Affiliation(s)
- Ting Kuang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai, China; Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai, China; National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), China
| | - Luke Tornabene
- School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98105, USA
| | - Jingyan Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai, China; Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai, China; National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), China
| | - Jiamei Jiang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai, China; Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai, China; National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), China
| | - Prosanta Chakrabarty
- Louisiana State University, Museum of Natural Science, Department of Biological Sciences, Baton Rouge, LA 70803, USA
| | - John S Sparks
- American Museum of Natural History, Central Park West at 79th Street, NY, NY 10024, USA
| | | | - Chenhong Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Shanghai, China; Shanghai Collaborative Innovation for Aquatic Animal Genetics and Breeding, Shanghai, China; National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), China.
| |
Collapse
|
22
|
Abstract
Kimura's neutral theory argued that positive selection was not responsible for an appreciable fraction of molecular substitutions. Correspondingly, quantitative analysis reveals that the vast majority of substitutions in cancer genomes are not detectably under selection. Insights from the somatic evolution of cancer reveal that beneficial substitutions in cancer constitute a small but important fraction of the molecular variants. The molecular evolution of cancer community will benefit by incorporating the neutral theory of molecular evolution into their understanding and analysis of cancer evolution-and accepting the use of tractable, predictive models, even when there is some evidence that they are not perfect.
Collapse
Affiliation(s)
| | - Jeffrey P Townsend
- Department of Biostatistics, Yale University, New Haven, CT
- Program in Computational Biology and Bioinformatics
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT
| |
Collapse
|
23
|
Gates DJ, Pilson D, Smith SD. Filtering of target sequence capture individuals facilitates species tree construction in the plant subtribe Iochrominae (Solanaceae). Mol Phylogenet Evol 2018; 123:26-34. [DOI: 10.1016/j.ympev.2018.02.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 01/30/2018] [Accepted: 02/01/2018] [Indexed: 10/18/2022]
|
24
|
Dornburg A, Townsend JP, Wang Z. Maximizing Power in Phylogenetics and Phylogenomics: A Perspective Illuminated by Fungal Big Data. ADVANCES IN GENETICS 2017; 100:1-47. [PMID: 29153398 DOI: 10.1016/bs.adgen.2017.09.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Since its original inception over 150 years ago by Darwin, we have made tremendous progress toward the reconstruction of the Tree of Life. In particular, the transition from analyzing datasets comprised of small numbers of loci to those comprised of hundreds of loci, if not entire genomes, has aided in resolving some of the most vexing of evolutionary problems while giving us a new perspective on biodiversity. Correspondingly, phylogenetic trees have taken a central role in fields that span ecology, conservation, and medicine. However, the rise of big data has also presented phylogenomicists with a new set of challenges to experimental design, quantitative analyses, and computation. The sequencing of a number of very first genomes presented significant challenges to phylogenetic inference, leading fungal phylogenomicists to begin addressing pitfalls and postulating solutions to the issues that arise from genome-scale analyses relevant to any lineage across the Tree of Life. Here we highlight insights from fungal phylogenomics for topics including systematics and species delimitation, ecological and phenotypic diversification, and biogeography while providing an overview of progress made on the reconstruction of the fungal Tree of Life. Finally, we provide a review of considerations to phylogenomic experimental design for robust tree inference. We hope that this special issue of Advances in Genetics not only excites the continued progress of fungal evolutionary biology but also motivates the interdisciplinary development of new theory and methods designed to maximize the power of genomic scale data in phylogenetic analyses.
Collapse
Affiliation(s)
- Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC, United States
| | | | - Zheng Wang
- Yale University, New Haven, CT, United States.
| |
Collapse
|
25
|
Molloy EK, Warnow T. To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods. Syst Biol 2017; 67:285-303. [DOI: 10.1093/sysbio/syx077] [Citation(s) in RCA: 138] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Accepted: 09/13/2017] [Indexed: 01/27/2023] Open
Affiliation(s)
- Erin K Molloy
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
26
|
Betancur-R R, Wiley EO, Arratia G, Acero A, Bailly N, Miya M, Lecointre G, Ortí G. Phylogenetic classification of bony fishes. BMC Evol Biol 2017; 17:162. [PMID: 28683774 PMCID: PMC5501477 DOI: 10.1186/s12862-017-0958-3] [Citation(s) in RCA: 410] [Impact Index Per Article: 58.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 04/26/2017] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. RESULTS The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. CONCLUSIONS This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution for more taxa than previous versions, based on more densely sampled phylogenetic trees. The classification presented in this study represents, unlike any other, the most up-to-date hypothesis of the Tree of Life of fishes.
Collapse
Affiliation(s)
- Ricardo Betancur-R
- Department of Biology, University of Puerto Rico, Río Piedras, P.O. Box 23360, San Juan, PR 00931 USA
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC USA
| | - Edward O. Wiley
- Biodiversity Institute and Department of Ecology & Evolutionary Biology, University of Kansas, Lawrence, KS USA
- Sam Houston State Natural History Collections, Sam Houston State University, Huntsville, Texas USA
| | - Gloria Arratia
- Biodiversity Institute and Department of Ecology & Evolutionary Biology, University of Kansas, Lawrence, KS USA
| | - Arturo Acero
- Universidad Nacional de Colombia sede Caribe, Cecimar, El Rodadero, Santa Marta, Magdalena Colombia
| | - Nicolas Bailly
- FishBase Information and Research Group, Los Baños, Philippines
| | - Masaki Miya
- Department Ecology and Environmental Sciences, Natural History Museum and Institute, Chiba, Japan
| | - Guillaume Lecointre
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d’Histoire Naturelle, Paris, France
| | - Guillermo Ortí
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC USA
- Department of Biology, The George Washington University, Washington, DC USA
| |
Collapse
|
27
|
Eytan RI, Evans BR, Dornburg A, Lemmon AR, Lemmon EM, Wainwright PC, Near TJ. Are 100 enough? Inferring acanthomorph teleost phylogeny using Anchored Hybrid Enrichment. BMC Evol Biol 2015; 15:113. [PMID: 26071950 PMCID: PMC4465735 DOI: 10.1186/s12862-015-0415-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 06/08/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The past decade has witnessed remarkable progress towards resolution of the Tree of Life. However, despite the increased use of genomic scale datasets, some phylogenetic relationships remain difficult to resolve. Here we employ anchored phylogenomics to capture 107 nuclear loci in 29 species of acanthomorph teleost fishes, with 25 of these species sampled from the recently delimited clade Ovalentaria. Previous studies employing multilocus nuclear exon datasets have not been able to resolve the nodes at the base of the Ovalentaria tree with confidence. Here we test whether a phylogenomic approach will provide better support for these nodes, and if not, why this may be. RESULTS After using a novel method to account for paralogous loci, we estimated phylogenies with maximum likelihood and species tree methods using DNA sequence alignments of over 80,000 base pairs. Several key relationships within Ovalentaria are well resolved, including 1) the sister taxon relationship between Cichlidae and Pholidichthys, 2) a clade containing blennies, grammas, clingfishes, and jawfishes, and 3) monophyly of Atherinomorpha (topminnows, flyingfishes, and silversides). However, many nodes in the phylogeny associated with the early diversification of Ovalentaria are poorly resolved in several analyses. Through the use of rarefaction curves we show that limited phylogenetic resolution among the earliest nodes in the Ovalentaria phylogeny does not appear to be due to a deficiency of data, as average global node support ceases to increase when only 1/3rd of the sampled loci are used in analyses. Instead this lack of resolution may be driven by model misspecification as a Bayesian mixed model analysis of the amino acid dataset provided good support for parts of the base of the Ovalentaria tree. CONCLUSIONS Although it does not appear that the limited phylogenetic resolution among the earliest nodes in the Ovalentaria phylogeny is due to a deficiency of data, it may be that both stochastic and systematic error resulting from model misspecification play a role in the poor resolution at the base of the Ovalentaria tree as a Bayesian approach was able to resolve some of the deeper nodes, where the other methods failed.
Collapse
Affiliation(s)
- Ron I Eytan
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, 06520, CT, USA.
- Department of Marine Biology, Texas A&M University at Galveston, Galveston, 77553, TX, USA.
| | - Benjamin R Evans
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, 06520, CT, USA.
| | - Alex Dornburg
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, 06520, CT, USA.
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Dirac Science Library, Tallahassee, 32306, FL, USA.
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, Biomedical Research Facility, Tallahassee, 32306, FL, USA.
| | - Peter C Wainwright
- Department of Evolution & Ecology, University of California, One Shields Avenue, Davis, 95616, CA, USA.
| | - Thomas J Near
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, 06520, CT, USA.
| |
Collapse
|