1. Gàlvez-Morante A, Guéguen L, Natsidis P, Telford MJ, Richter DJ. Dollo Parsimony Overestimates Ancestral Gene Content Reconstructions. Genome Biol Evol 2024; 16:evae062. [PMID: 38518756 PMCID: PMC10995720 DOI: 10.1093/gbe/evae062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/11/2024] [Revised: 03/15/2024] [Accepted: 03/19/2024] [Indexed: 03/24/2024] Open
Abstract
Ancestral reconstruction is a widely used technique that has been applied to understand the evolutionary history of gain and loss of gene families. Ancestral gene content can be reconstructed via different phylogenetic methods, but many current and previous studies employ Dollo parsimony. We hypothesize that Dollo parsimony is not appropriate for ancestral gene content reconstruction inferences based on sequence homology, as Dollo parsimony is derived from the assumption that a complex character cannot be regained. This premise does not accurately model molecular sequence evolution, in which false orthology can result from sequence convergence or lateral gene transfer. The aim of this study is to test Dollo parsimony's suitability for ancestral gene content reconstruction and to compare its inferences with a maximum likelihood-based approach that allows a gene family to be gained more than once within a tree. We first compared the performance of the two approaches on a series of artificial data sets each of 5,000 genes that were simulated according to a spectrum of evolutionary rates without gene gain or loss, so that inferred deviations from the true gene count would arise only from errors in orthology inference and ancestral reconstruction. Next, we reconstructed protein domain evolution on a phylogeny representing known eukaryotic diversity. We observed that Dollo parsimony produced numerous ancestral gene content overestimations, especially at nodes closer to the root of the tree. These observations led us to the conclusion that, confirming our hypothesis, Dollo parsimony is not an appropriate method for ancestral reconstruction studies based on sequence homology.
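The overestimation described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy tree, the leaf names, and the function names are all hypothetical, and the rule used is the textbook single-gain assumption (the family is gained once at the most recent common ancestor of every leaf where it is detected, then only lost). Under that rule, a single spurious detection in a distant leaf is enough to push the family back to the root.

```python
def leaf_names(tree):
    """Return the leaf labels of a nested-tuple tree, left to right."""
    if isinstance(tree, str):
        return [tree]
    return [name for child in tree for name in leaf_names(child)]


def dollo_internal_presence(tree, present):
    """Map each internal node (keyed by its leaf labels) to presence under a single gain.

    Assumes `present` only contains leaf labels that occur in `tree`.
    """
    total = len(present)
    states = {}

    def visit(node):
        if isinstance(node, str):
            return 1 if node in present else 0
        counts = [visit(child) for child in node]
        k = sum(counts)
        # If one child already holds every detected leaf, the single gain
        # happened below this node, so the family is absent here.
        above_gain = total in counts
        states[tuple(leaf_names(node))] = k > 0 and not above_gain
        return k

    visit(tree)
    return states


# Hypothetical six-leaf tree; the family truly exists only in clade (C, D).
TREE = (("A", "B"), (("C", "D"), ("E", "F")))
ROOT = tuple(leaf_names(TREE))

clean = dollo_internal_presence(TREE, {"C", "D"})
noisy = dollo_internal_presence(TREE, {"A", "C", "D"})  # "A" is a false orthology hit

print("root has family (clean):", clean[ROOT])
print("root has family (noisy):", noisy[ROOT])
print("internal nodes with family:", sum(clean.values()), "vs", sum(noisy.values()))
```

In this toy case one false hit moves the inferred gain from the (C, D) ancestor to the root, which is the kind of root-ward inflation the abstract reports. A model that allows more than one gain could instead explain the hit in "A" as a separate event.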
Affiliation(s)
- Alex Gàlvez-Morante
  - Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
- Laurent Guéguen
  - LBBE, UMR 5558, CNRS, Université Claude Bernard Lyon 1, Villeurbanne 69622, France
- Paschalis Natsidis
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Maximilian J Telford
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Daniel J Richter
  - Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
2. Dai J, Rubel T, Han Y, Molloy EK. Dollo-CDP: a polynomial-time algorithm for the clade-constrained large Dollo parsimony problem. Algorithms Mol Biol 2024; 19:2. [PMID: 38191515 PMCID: PMC10775561 DOI: 10.1186/s13015-023-00249-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Accepted: 12/10/2023] [Indexed: 01/10/2024] Open
Abstract
The last decade of phylogenetics has seen the development of many methods that leverage constraints plus dynamic programming. The goal of this algorithmic technique is to produce a phylogeny that is optimal with respect to some objective function and that lies within a constrained version of tree space. The popular species tree estimation method ASTRAL, for example, returns a tree that (1) maximizes the quartet score computed with respect to the input gene trees and that (2) draws its branches (bipartitions) from the input constraint set. This technique has yet to be used for parsimony problems where the inputs are binary characters, sometimes with missing values. Here, we introduce the clade-constrained character parsimony problem and present an algorithm that solves this problem for the Dollo criterion score in [Formula: see text] time, where n is the number of leaves, k is the number of characters, and [Formula: see text] is the set of clades used as constraints. Dollo parsimony, which requires traits/mutations to be gained at most once but allows them to be lost any number of times, is widely used for tumor phylogenetics as well as species phylogenetics, for example in analyses of low-homoplasy retroelement insertions across the vertebrate tree of life. This motivated us to implement our algorithm in a software package, called Dollo-CDP, and evaluate its utility for analyzing retroelement insertion presence/absence patterns for bats, birds, and toothed whales, as well as simulated data. Our results show that Dollo-CDP can improve upon heuristic search from a single starting tree, often recovering a better scoring tree. Moreover, Dollo-CDP scales to data sets with much larger numbers of taxa than branch-and-bound while still having an optimality guarantee, albeit a more restricted one. Lastly, we show that our algorithm for Dollo parsimony can easily be adapted to Camin-Sokal parsimony but not Fitch parsimony.
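The Dollo criterion mentioned in this abstract can be sketched for a fixed tree as follows. This is only the scoring step, not the Dollo-CDP algorithm itself, and it makes two assumptions the abstract does not confirm: the score is taken to be the number of losses implied by a single gain, and missing values are ignored. The trees, characters, and function names are hypothetical.

```python
def dollo_losses(tree, present):
    """Count the losses one binary character needs on a nested-tuple tree.

    Assumes a single gain at the most recent common ancestor of the leaves
    in `present`, no missing data, and that `present` only names tree leaves.
    """
    total = len(present)
    losses = 0

    def visit(node):
        nonlocal losses
        if isinstance(node, str):
            return 1 if node in present else 0
        counts = [visit(child) for child in node]
        k = sum(counts)
        # The node carries the trait if it lies at or below the gain and
        # still has at least one descendant leaf with the trait.
        carries = k > 0 and total not in counts
        if carries:
            # Each child subtree with no carriers costs exactly one loss.
            losses += counts.count(0)
        return k

    visit(tree)
    return losses


def dollo_score(tree, characters):
    """Sum the loss counts over all characters (lower is better)."""
    return sum(dollo_losses(tree, present) for present in characters)


# Hypothetical presence sets for three insertion characters.
CHARACTERS = [{"A", "B"}, {"C", "D"}, {"A", "B", "C"}]
TREE_1 = (("A", "B"), (("C", "D"), ("E", "F")))
TREE_2 = (("A", "C"), (("B", "D"), ("E", "F")))

print(dollo_score(TREE_1, CHARACTERS))  # 2
print(dollo_score(TREE_2, CHARACTERS))  # 8
```

A tree search would prefer `TREE_1` here because it explains the same characters with fewer losses. The constrained dynamic programming described in the abstract addresses the harder problem of finding the best-scoring tree rather than scoring a given one.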
Affiliation(s)
- Junyan Dai
  - Department of Computer Science, University of Maryland, College Park, MD, USA
- Tobias Rubel
  - Department of Computer Science, University of Maryland, College Park, MD, USA
- Yunheng Han
  - Department of Computer Science, University of Maryland, College Park, MD, USA
- Erin K Molloy
  - Department of Computer Science, University of Maryland, College Park, MD, USA
  - University of Maryland Institute for Advanced Computer Studies, College Park, MD, USA
3. Li Y, Li X, Nie S, Zhang M, Yang Q, Xu W, Duan Y, Wang X. Reticulate evolution of the tertiary relict Osmanthus. The Plant Journal: for Cell and Molecular Biology 2024; 117:145-160. [PMID: 37837261 DOI: 10.1111/tpj.16480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2023] [Revised: 09/10/2023] [Accepted: 09/13/2023] [Indexed: 10/15/2023]
Abstract
When interspecific gene flow is common, species relationships are more accurately represented by a phylogenetic network than by a bifurcating tree. This study aimed to uncover the role of introgression in the evolution of Osmanthus, the only genus of the subtribe Oleinae (Oleaceae) with its distribution center in East Asia. We built species trees, detected introgression, and constructed networks using multiple kinds of sequencing data (whole genome resequencing, transcriptome sequencing, and Sanger sequencing of nrDNA) combined with concatenation and coalescence approaches. Then, based on well-understood species relationships, historical biogeographic analyses and diversification rate estimates were employed to reveal the history of Osmanthus. Osmanthus originated in mid-Miocene Europe and dispersed to the eastern Tibetan Plateau in the late Miocene. Thereafter, it continued to spread eastwards. Phylogenetic conflict is common within the 'Core Osmanthus' clade and is seen at both early and late stages of diversification, leading to hypotheses of net-like species relationships. Incomplete lineage sorting proved ineffective in explaining phylogenetic conflicts and thus supported introgression as the main cause of conflicts. This study elucidates the diversification history of a relict genus in the subtropical regions of eastern Asia and reveals that introgression had profound effects on its evolutionary history.
Affiliation(s)
- Yongfu Li
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
- Xuan Li
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
- Shuai Nie
  - Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, Guangdong, China
- Min Zhang
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
- Qinghua Yang
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
- Wenbin Xu
  - Wuhan Botanical Garden, the Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Yifan Duan
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
- Xianrong Wang
  - Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, International Cultivar Registration Center for Osmanthus, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
4. Liu GM, Pan Q, Du J, Zhu PF, Liu WQ, Li ZH, Wang L, Hu CY, Dai YC, Zhang XX, Zhang Z, Yu Y, Li M, Wang PC, Wang X, Li M, Zhou XM. Improved mammalian family phylogeny using gap-rare multiple sequence alignment: A timetree of extant placentals and marsupials. Zool Res 2023; 44:1064-1079. [PMID: 37914522 PMCID: PMC10802097 DOI: 10.24272/j.issn.2095-8137.2023.189] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 09/04/2023] [Accepted: 10/23/2023] [Indexed: 11/03/2023] Open
Abstract
The timing of mammalian diversification in relation to the Cretaceous-Paleogene (KPg) mass extinction continues to be a subject of substantial debate. Previous studies have either focused on limited taxonomic samples with available whole-genome data or relied on short sequence alignments coupled with extensive species samples. In the present study, we improved an existing dataset from the landmark study of Meredith et al. (2011) by filling in missing fragments and further generated another dataset containing 120 taxa and 98 exonic markers. Using these two datasets, we then constructed phylogenies for extant mammalian families, providing improved resolution of many conflicting relationships. Moreover, the timetrees generated, which were calibrated using appropriate molecular clock models and multiple fossil records, indicated that the interordinal diversification of placental mammals initiated before the Late Cretaceous period. Additionally, intraordinal diversification of both extant placental and marsupial lineages accelerated after the KPg boundary, supporting the hypothesis that the availability of numerous vacant ecological niches subsequent to the mass extinction event facilitated rapid diversification. Thus, our results support a scenario of placental radiation characterized by both basal cladogenesis and active interordinal divergences spanning from the Late Cretaceous into the Paleogene.
Affiliation(s)
- Gao-Ming Liu
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Qi Pan
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Juan Du
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Ping-Fen Zhu
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Wei-Qiang Liu
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Zi-Hao Li
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Ling Wang
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Chun-Yan Hu
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Yi-Chen Dai
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Xiao-Xiao Zhang
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Zhan Zhang
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Yang Yu
  - School of Life Sciences, University of Science and Technology of China, Hefei, Anhui 230026, China
- Meng Li
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Peng-Cheng Wang
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  - College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
- Xiao Wang
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Ming Li
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Xu-Ming Zhou
  - Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
5. Doronina L, Ogoniak L, Schmitz J. Homoplasy of Retrotransposon Insertions in Toothed Whales. Genes (Basel) 2023; 14:1830. [PMID: 37761970 PMCID: PMC10531181 DOI: 10.3390/genes14091830] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 09/01/2023] [Revised: 09/19/2023] [Accepted: 09/20/2023] [Indexed: 09/29/2023] Open
Abstract
Retrotransposon insertion patterns facilitate a virtually homoplasy-free picture of phylogenetic history. Still, a few most likely random parallel insertions or deletions result in rare cases of homoplasy in primates. The following question arises: how frequent is retrotransposon homoplasy in other phylogenetic clades? Here, we derived genome insertion data of toothed whales to evaluate the extent of homoplasy in a representative laurasiatherian group. Among more than a thousand extracted and aligned retrotransposon loci, we detected 37 cases of precise parallel insertions in species that are separated by more than 10 million years, a time frame which minimizes the effects of incomplete lineage sorting. We compared the phylogenetic signal of insertions with the flanking sequences of these loci to further exclude potential polymorphic loci derived by incomplete lineage sorting. We found that the phylogenetic signals of retrotransposon insertion patterns exhibiting true homoplasy differ from the signals of their flanking sequences. In toothed whales, precise parallel insertions account for around 0.18-0.29% of insertion cases, which is about 12.5 times the frequency of such insertions among Alus in primates. We also detected five specific deletions of retrotransposons on various lineages of toothed whale evolution, a frequency of 0.003%, which is slightly higher than such occurrences in primates. Overall, the level of retrotransposon homoplasy in toothed whales is still marginal compared to the phylogenetic diagnostic retrotransposon presence/absence signal.
Affiliation(s)
- Liliya Doronina
  - Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany
  - Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
- Lynn Ogoniak
  - Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany
- Jürgen Schmitz
  - Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany
6. Scheben A, Mendivil Ramos O, Kramer M, Goodwin S, Oppenheim S, Becker DJ, Schatz MC, Simmons NB, Siepel A, McCombie WR. Long-Read Sequencing Reveals Rapid Evolution of Immunity- and Cancer-Related Genes in Bats. Genome Biol Evol 2023; 15:evad148. [PMID: 37728212 PMCID: PMC10510315 DOI: 10.1093/gbe/evad148] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Accepted: 08/03/2023] [Indexed: 09/21/2023] Open
Abstract
Bats are exceptional among mammals for their powered flight, extended lifespans, and robust immune systems and therefore have been of particular interest in comparative genomics. Using the Oxford Nanopore Technologies long-read platform, we sequenced the genomes of two bat species with key phylogenetic positions, the Jamaican fruit bat (Artibeus jamaicensis) and the Mesoamerican mustached bat (Pteronotus mesoamericanus), and carried out a comprehensive comparative genomic analysis with a diverse collection of bats and other mammals. The high-quality, long-read genome assemblies revealed a contraction of interferon (IFN)-α at the immunity-related type I IFN locus in bats, resulting in a shift in relative IFN-ω and IFN-α copy numbers. Contradicting previous hypotheses of constitutive expression of IFN-α being a feature of the bat immune system, three bat species lost all IFN-α genes. This shift to IFN-ω could contribute to the increased viral tolerance that has made bats a common reservoir for viruses that can be transmitted to humans. Antiviral genes stimulated by type I IFNs also showed evidence of rapid evolution, including a lineage-specific duplication of IFN-induced transmembrane genes and positive selection in IFIT2. In addition, 33 tumor suppressors and 6 DNA-repair genes showed signs of positive selection, perhaps contributing to increased longevity and reduced cancer rates in bats. The robust immune systems of bats rely on both bat-wide and lineage-specific evolution in the immune gene repertoire, suggesting diverse immune strategies. Our study provides new genomic resources for bats and sheds new light on the extraordinary molecular evolution in this critically important group of mammals.
Affiliation(s)
- Armin Scheben
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
- Melissa Kramer
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
- Sara Goodwin
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
- Sara Oppenheim
  - American Museum of Natural History, Institute for Comparative Genomics, New York, New York, USA
- Daniel J Becker
  - School of Biological Sciences, University of Oklahoma, Norman, Oklahoma, USA
- Michael C Schatz
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
  - Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, Maryland, USA
- Nancy B Simmons
  - Department of Mammalogy, Division of Vertebrate Zoology, American Museum of Natural History, New York, New York, USA
- Adam Siepel
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
7. Churakov G, Kuritzin A, Chukharev K, Zhang F, Wünnemann F, Ulyantsev V, Schmitz J. A 4-lineage Statistical Suite to Evaluate the Support of Large-Scale Retrotransposon Insertion Data to Reconstruct Evolutionary Trees. Syst Biol 2023; 72:649-661. [PMID: 36688484 DOI: 10.1093/sysbio/syac082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/26/2020] [Revised: 12/04/2022] [Accepted: 12/23/2022] [Indexed: 01/24/2023] Open
Abstract
Retrophylogenomics makes use of genome-wide retrotransposon presence/absence insertion patterns to resolve questions in phylogeny and population genetics. In the genomics era, evaluating high-throughput data requires the associated development of appropriately powerful statistical tools. The currently used KKSC 3-lineage statistical test for estimating the significance of retrophylogenomic data is limited by the number of possible tree topologies it can assess in one step. To improve on this, we have extended the analysis to simultaneously compare four lineages, enabling us to evaluate ten distinct presence/absence insertion patterns for 26 possible tree topologies plus 129 trees with different incidences of hybridization or introgression. The new tool provides statistics for cases involving multiple ancestral hybridizations/introgressions, ancestral incomplete lineage sorting, bifurcation, and polytomy. The test is embedded in a user-friendly web R application (http://retrogenomics.uni-muenster.de:3838/hammlet/) and is available for use by the scientific community. [ancestral hybridization/introgression; ancestral incomplete lineage sorting (ILS); empirical distribution; KKSC-statistics; 4-lineage (4-LIN) insertion polymorphism; polytomy; retrophylogenomics.].
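One way to arrive at the "ten distinct presence/absence insertion patterns" mentioned in this abstract is sketched below. It rests on an assumption the abstract does not state: that the patterns counted are those in which an insertion is present in exactly two or exactly three of the four lineages, since insertions found in one lineage or in all four would not separate any grouping. The lineage labels are hypothetical.

```python
from itertools import combinations

LINEAGES = ("A", "B", "C", "D")  # hypothetical labels for the four lineages

# Assumed informative patterns: insertion shared by exactly 2 or exactly 3 lineages.
patterns = [
    frozenset(group)
    for size in (2, 3)
    for group in combinations(LINEAGES, size)
]

for pattern in patterns:
    # Render each pattern as a presence (+) / absence (-) string, e.g. "++--".
    print("".join("+" if name in pattern else "-" for name in LINEAGES))

print(len(patterns))  # 10
```

Six pairs plus four triples give the ten patterns. A three-lineage test under the same assumption would have only the three pairwise patterns, which is consistent with the abstract's remark that the earlier test can assess fewer topologies in one step.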
Affiliation(s)
- Gennady Churakov
  - Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
  - Department of Biochemistry, Institute of Experimental Medicine, St. Petersburg, Russia
- Andrej Kuritzin
  - Department of System Analysis, Saint Petersburg State Institute of Technology, St. Petersburg, Russia
- Konstantin Chukharev
  - Information Technologies, Mechanics and Optics, University Saint Petersburg, St. Petersburg, Russia
- Fengjun Zhang
  - Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
- Florian Wünnemann
  - Institute for Computational Biomedicine, University Heidelberg, Heidelberg, Germany
- Vladimir Ulyantsev
  - Information Technologies, Mechanics and Optics, University Saint Petersburg, St. Petersburg, Russia
- Jürgen Schmitz
  - Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
8. de la Fuente R, Díaz-Villanueva W, Arnau V, Moya A. Genomic Signature in Evolutionary Biology: A Review. Biology 2023; 12:biology12020322. [PMID: 36829597 PMCID: PMC9953303 DOI: 10.3390/biology12020322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2022] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/19/2023]
Abstract
Organisms are unique physical entities in which information is stored and continuously processed. The digital nature of DNA sequences enables the construction of a dynamic information reservoir. However, the distinction between the hardware and software components in the information flow is crucial to identify the mechanisms generating specific genomic signatures. In this work, we perform a bibliometric analysis to identify the different purposes of looking for particular patterns in DNA sequences associated with a given phenotype. This study has enabled us to make a conceptual breakdown of the genomic signature and differentiate the leading applications. On the one hand, it refers to gene expression profiling associated with a biological function, which may be shared across taxa. This signature is the focus of study in precision medicine. On the other hand, it also refers to characteristic patterns in species-specific DNA sequences. This interpretation plays a key role in comparative genomics, identifying evolutionary relationships. Looking at the relevant studies in our bibliographic database, we highlight the main factors causing heterogeneities in genome composition and how they can be quantified. All these findings lead us to reformulate some questions relevant to evolutionary biology.
Affiliation(s)
- Rebeca de la Fuente
  - Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Wladimiro Díaz-Villanueva
  - Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Vicente Arnau
  - Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Andrés Moya
  - Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
  - Foundation for the Promotion of Sanitary and Biomedical Research of the Valencian Community (FISABIO), 46020 Valencia, Spain
  - CIBER in Epidemiology and Public Health (CIBEResp), 28029 Madrid, Spain
9. Kelley DB. Convergent and divergent neural circuit architectures that support acoustic communication. Front Neural Circuits 2022; 16:976789. [PMID: 36466364 PMCID: PMC9712726 DOI: 10.3389/fncir.2022.976789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/23/2022] [Accepted: 10/19/2022] [Indexed: 11/18/2022] Open
Abstract
Vocal communication is used across extant vertebrates, is evolutionarily ancient, and has been maintained in many lineages. Here I review the neural circuit architectures that support intraspecific acoustic signaling in representative anuran, mammalian and avian species as well as two invertebrates, fruit flies and Hawaiian crickets. I focus on hindbrain motor control motifs and their ties to respiratory circuits, expression of receptors for gonadal steroids in motor, sensory, and limbic neurons as well as divergent modalities that evoke vocal responses. Hindbrain and limbic participants in acoustic communication are highly conserved, while forebrain participants have diverged between anurans and mammals, as well as songbirds and rodents. I discuss the roles of natural and sexual selection in driving speciation, as well as exaptation of circuit elements with ancestral roles in respiration, for producing sounds and driving rhythmic vocal features. Recent technical advances in whole brain fMRI across species will enable real time imaging of acoustic signaling partners, tying auditory perception to vocal production.
10. Wu J, Zhang L, Shen C, Sin SYW, Lei C, Zhao H. Comparative transcriptome analysis reveals molecular adaptations underlying distinct immunity and inverted resting posture in bats. Integr Zool 2022; 18:493-505. [PMID: 36049759 DOI: 10.1111/1749-4877.12676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]
Abstract
Understanding how natural selection shapes unique traits in mammals is a central topic in evolutionary biology. The mammalian order Chiroptera (bats) is attractive for biologists as well as the general public due to their specific traits of extraordinary immunity and inverted resting posture. However, genomic resources for bats that occupy key phylogenetic positions are not sufficient, which hinders comprehensive investigation of the molecular mechanisms underpinning the origin of specific traits in bats. Here, we sequenced the transcriptomes of five bats that are phylogenetically divergent and occupy key positions in the phylogenetic tree of bats. In combination with the available genomes of 19 bats and 21 other mammals, we built a database consisting of 10,918 one-to-one ortholog genes and reconstructed phylogenetic relationships of these mammals. We found that genes related to immunity, bone remodeling, and the cardiovascular system are targets of natural selection along the ancestral branch of bats. Further analyses revealed that the T cell receptor signaling pathway involved in immune adaptation is specifically enriched in bats. Moreover, molecular adaptations of bone remodeling, the cardiovascular system, and balance sensing may help to explain the inverted resting posture in bats. Our study provides valuable transcriptome resources, enabling us to tentatively identify genetic changes associated with bat-specific traits. This work is among the first to advance our understanding of molecular underpinnings of inverted resting posture in bats, which could provide insight into healthcare applications such as hypertension in humans.
Affiliation(s)
- Jinwei Wu
  - Engineering Research Center of Eco-environment in Three Gorges Reservoir Region of Ministry of Education, China Three Gorges University, Yichang, China
- Libiao Zhang
  - Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou, China
- Chao Shen
  - College of Life Sciences, Wuhan University, Wuhan, China
- Simon Yung Wa Sin
  - School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR, China
- Caoqi Lei
  - College of Life Sciences, Wuhan University, Wuhan, China
- Huabin Zhao
  - College of Life Sciences, Wuhan University, Wuhan, China
11. Gatesy J, Springer MS. Phylogenomic Coalescent Analyses of Avian Retroelements Infer Zero-Length Branches at the Base of Neoaves, Emergent Support for Controversial Clades, and Ancient Introgressive Hybridization in Afroaves. Genes (Basel) 2022; 13:genes13071167. [PMID: 35885951 PMCID: PMC9324441 DOI: 10.3390/genes13071167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/23/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 01/25/2023] Open
Abstract
Retroelement insertions (RIs) are low-homoplasy characters that are ideal data for addressing deep evolutionary radiations, where gene tree reconstruction errors can severely hinder phylogenetic inference with DNA and protein sequence data. Phylogenomic studies of Neoaves, a large clade of birds (>9000 species) that first diversified near the Cretaceous−Paleogene boundary, have yielded an array of robustly supported, contradictory relationships among deep lineages. Here, we reanalyzed a large RI matrix for birds using recently proposed quartet-based coalescent methods that enable inference of large species trees including branch lengths in coalescent units, clade-support, statistical tests for gene flow, and combined analysis with DNA-sequence-based gene trees. Genome-scale coalescent analyses revealed extremely short branches at the base of Neoaves, meager branch support, and limited congruence with previous work at the most challenging nodes. Despite widespread topological conflicts with DNA-sequence-based trees, combined analyses of RIs with thousands of gene trees show emergent support for multiple higher-level clades (Columbea, Passerea, Columbimorphae, Otidimorphae, Phaethoquornithes). RIs express asymmetrical support for deep relationships within the subclade Afroaves that hints at ancient gene flow involving the owl lineage (Strigiformes). Because DNA-sequence data are challenged by gene tree-reconstruction error, analysis of RIs represents one approach for improving gene tree-based methods when divergences are deep, internodes are short, terminal branches are long, and introgressive hybridization further confounds species−tree inference.
Affiliation(s)
- John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Correspondence:
| | - Mark S. Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA 92521, USA;
|
12
|
SINEs as Credible Signs to Prove Common Ancestry in the Tree of Life: A Brief Review of Pioneering Case Studies in Retroposon Systematics. Genes (Basel) 2022; 13:genes13060989. [PMID: 35741751 PMCID: PMC9223172 DOI: 10.3390/genes13060989] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/13/2022] [Revised: 05/28/2022] [Accepted: 05/28/2022] [Indexed: 12/31/2022] Open
Abstract
Currently, the insertions of SINEs (and other retrotransposed elements) are regarded as one of the most reliable synapomorphies in molecular systematics. The methodological mainstream of molecular systematics is the calculation of nucleotide (or amino acid) sequence divergences under a suitable substitution model. In contrast, SINE insertion analysis does not require any complex model because SINE insertions are unidirectional and irreversible. This straightforward methodology was named the “SINE method,” which resolved various taxonomic issues that could not be settled by sequence comparison alone. The SINE method has challenged several traditional hypotheses proposed based on the fossil record and anatomy, prompting constructive discussions in the Evo/Devo era. Here, we review our pioneering SINE studies on salmon, cichlids, cetaceans, Afrotherian mammals, and birds. We emphasize the power of the SINE method in detecting incomplete lineage sorting by tracing the genealogy of specific genomic loci with minimal noise. Finally, in the context of the whole-genome era, we discuss how the SINE method can be applied to further our understanding of the tree of life.
|
13
|
Doronina L, Reising O, Clawson H, Churakov G, Schmitz J. Euarchontoglires Challenged by Incomplete Lineage Sorting. Genes (Basel) 2022; 13:774. [PMID: 35627160 PMCID: PMC9141288 DOI: 10.3390/genes13050774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/16/2022] [Revised: 04/08/2022] [Accepted: 04/20/2022] [Indexed: 11/17/2022] Open
Abstract
Euarchontoglires, once described as Supraprimates, comprise primates, colugos, tree shrews, rodents, and lagomorphs in a clade that evolved about 90 million years ago (mya) from a shared ancestor with Laurasiatheria. The rapid speciation of groups within Euarchontoglires, and the subsequent inherent incomplete marker fixation in ancestral lineages, led to challenged attempts at phylogenetic reconstructions, particularly for the phylogenetic position of tree shrews. To resolve this conundrum, we sampled genome-wide presence/absence patterns of transposed elements (TEs) from all representatives of Euarchontoglires. This specific marker system has the advantage that phylogenetic diagnostic characters can be extracted in a nearly unbiased fashion genome-wide from reference genomes. Their insertions are virtually free of homoplasy. We simultaneously employed two computational tools, the genome presence/absence compiler (GPAC) and 2-n-way, to find a maximum of diagnostic insertions from more than 3 million TE positions. From 361 extracted diagnostic TEs, 132 provide significant support for the current resolution of Primatomorpha (Primates plus Dermoptera), 94 support the union of Euarchonta (Primates, Dermoptera, plus Scandentia), and 135 marker insertion patterns support a variety of alternative phylogenetic scenarios. Thus, whole genome-level analysis and a virtually homoplasy-free marker system offer an opportunity to finally resolve the notorious phylogenetic challenges that nature produces in rapidly diversifying groups.
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; (O.R.); (G.C.); (J.S.)
| | - Olga Reising
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; (O.R.); (G.C.); (J.S.)
| | - Hiram Clawson
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA;
| | - Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; (O.R.); (G.C.); (J.S.)
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; (O.R.); (G.C.); (J.S.)
- EvoPAD-RTG, University of Münster, 48149 Münster, Germany
|
14
|
Contradictory Phylogenetic Signals in the Laurasiatheria Anomaly Zone. Genes (Basel) 2022; 13:genes13050766. [PMID: 35627151 PMCID: PMC9141728 DOI: 10.3390/genes13050766] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 03/29/2022] [Revised: 04/12/2022] [Accepted: 04/21/2022] [Indexed: 02/04/2023] Open
Abstract
Relationships among laurasiatherian clades represent one of the most highly disputed topics in mammalian phylogeny. In this study, we attempt to disentangle laurasiatherian interordinal relationships using two independent genome-level approaches: (1) quantifying retrotransposon presence/absence patterns, and (2) comparisons of exon datasets at the levels of nucleotides and amino acids. The two approaches revealed contradictory phylogenetic signals, possibly due to a high level of ancestral incomplete lineage sorting. The positions of Eulipotyphla and Chiroptera as the first and second earliest divergences were consistent across the approaches. However, the phylogenetic relationships of Perissodactyla, Cetartiodactyla, and Ferae, were contradictory. While retrotransposon insertion analyses suggest a clade with Cetartiodactyla and Ferae, the exon dataset favoured Cetartiodactyla and Perissodactyla. Future analyses of hitherto unsampled laurasiatherian lineages and synergistic analyses of retrotransposon insertions, exon and conserved intron/intergenic sequences might unravel the conflicting patterns of relationships in this major mammalian clade.
|
15
|
Doronina L, Feigin CY, Schmitz J. OUP accepted manuscript. Syst Biol 2022; 71:1045-1053. [PMID: 35289914 PMCID: PMC9366447 DOI: 10.1093/sysbio/syac025] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/22/2021] [Revised: 03/09/2022] [Accepted: 03/11/2022] [Indexed: 11/29/2022] Open
Abstract
Although first posited to be of a single origin, the two superfamilies of phalangeriform marsupial possums (Phalangeroidea: brushtail possums and cuscuses and Petauroidea: possums and gliders) have long been considered, based on multiple sequencing studies, to have evolved from two separate origins. However, previous data from these sequence analyses suggested a variety of conflicting trees. Therefore, we reinvestigated these relationships by screening ∼200,000 orthologous short interspersed element (SINE) loci across the newly available whole-genome sequences of phalangeriform species and their relatives. Compared to sequence data, SINE presence/absence patterns are evolutionarily almost neutral molecular markers of the phylogenetic history of species. Their random and highly complex genomic insertion ensures their virtually homoplasy-free nature and enables one to compare hundreds of shared unique orthologous events to determine the true species tree. Here, we identify 106 highly reliable phylogenetic SINE markers whose presence/absence patterns within multiple Australasian possum genomes unexpectedly provide the first significant evidence for the reunification of Australasian possums into one monophyletic group. Together, our findings indicate that nucleotide homoplasy and ancestral incomplete lineage sorting have most likely driven the conflicting signal distributions seen in previous sequence-based studies. [Ancestral incomplete lineage sorting; possum genomes; possum monophyly; retrophylogenomics; SINE presence/absence.]
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Charles Y Feigin
- Department of Molecular Biology, Princeton University, 119 Lewis Thomas Laboratory, Washington Road, Princeton, NJ 08544-1014, USA
- School of BioSciences, The University of Melbourne, BioSciences 4, Royal Pde, Parkville, VIC 3010, Australia
| | - Jürgen Schmitz
- Correspondence to be sent to: Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany; E-mail:
|
16
|
Schull JK, Turakhia Y, Hemker JA, Dally WJ, Bejerano G. OUP accepted manuscript. Genome Biol Evol 2022; 14:6529394. [PMID: 35171243 PMCID: PMC8920512 DOI: 10.1093/gbe/evac013] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Accepted: 01/10/2022] [Indexed: 11/14/2022] Open
Abstract
We present Champagne, a whole-genome method for generating character matrices for phylogenomic analysis using large genomic indel events. By rigorously picking orthologous genes and locating large insertion and deletion events, Champagne delivers a character matrix that considerably reduces homoplasy compared with morphological and nucleotide-based matrices, on both established phylogenies and difficult-to-resolve nodes in the mammalian tree. Champagne provides ample evidence in the form of genomic structural variation to support incomplete lineage sorting and possible introgression in Paenungulata and human–chimp–gorilla which were previously inferred primarily through matrices composed of aligned single-nucleotide characters. Champagne also offers further evidence for Myomorpha as sister to Sciuridae and Hystricomorpha in the rodent tree. Champagne harbors distinct theoretical advantages as an automated method that produces nearly homoplasy-free character matrices on the whole-genome scale.
Affiliation(s)
- James K Schull
- Department of Computer Science, Stanford University, USA
| | - Yatish Turakhia
- Department of Electrical and Computer Engineering, University of California San Diego, USA
| | - James A Hemker
- Department of Computer Science, Stanford University, USA
| | - William J Dally
- Department of Computer Science, Stanford University, USA
- NVIDIA, Santa Clara, California, USA
- Department of Electrical Engineering, Stanford University, USA
| | - Gill Bejerano
- Department of Computer Science, Stanford University, USA
- Department of Developmental Biology, Stanford University, USA
- Department of Biomedical Data Science, Stanford University, USA
- Department of Pediatrics, Stanford University, USA
- Corresponding author: E-mail:
|
17
|
Molloy EK, Gatesy J, Springer MS. Theoretical and practical considerations when using retroelement insertions to estimate species trees in the anomaly zone. Syst Biol 2021; 71:721-740. [PMID: 34677617 DOI: 10.1093/sysbio/syab086] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 10/02/2020] [Accepted: 10/11/2021] [Indexed: 11/13/2022] Open
Abstract
A potential shortcoming of concatenation methods for species tree estimation is their failure to account for incomplete lineage sorting. Coalescent methods address this problem but make various assumptions that, if violated, can result in worse performance than concatenation. Given the challenges of analyzing DNA sequences with both concatenation and coalescent methods, retroelement insertions (RIs) have emerged as powerful phylogenomic markers for species tree estimation. Here, we show that two recently proposed quartet-based methods, SDPquartets and ASTRAL_BP, are statistically consistent estimators of the unrooted species tree topology under the coalescent when RIs follow a neutral infinite-sites model of mutation and the expected number of new RIs per generation is constant across the species tree. The accuracy of these (and other) methods for inferring species trees from RIs has yet to be assessed on simulated data sets, where the true species tree topology is known. Therefore, we evaluated eight methods given RIs simulated from four model species trees, all of which have short branches and at least three of which are in the anomaly zone. In our simulation study, ASTRAL_BP and SDPquartets always recovered the correct species tree topology when given a sufficiently large number of RIs, as predicted. A distance-based method (ASTRID_BP) and Dollo parsimony also performed well in recovering the species tree topology. In contrast, unordered, polymorphism, and Camin-Sokal parsimony typically fail to recover the correct species tree topology in anomaly zone situations with more than four ingroup taxa. Of the methods studied, only ASTRAL_BP automatically estimates internal branch lengths (in coalescent units) and support values (i.e. local posterior probabilities). We examined the accuracy of branch length estimation, finding that estimated lengths were accurate for short branches but upwardly biased otherwise. 
This led us to derive the maximum likelihood (branch length) estimate for when RIs are given as input instead of binary gene trees; this corrected formula produced accurate estimates of branch lengths in our simulation study, provided that a sufficiently large number of RIs were given as input. Lastly, we evaluated the impact of data quantity on species tree estimation by repeating the above experiments with input sizes varying from 100 to 100 000 parsimony-informative RIs. We found that, when given just 1 000 parsimony-informative RIs as input, ASTRAL_BP successfully reconstructed major clades (i.e. clades separated by branches > 0.3 CUs) with high support and identified rapid radiations (i.e. shorter connected branches), although not their precise branching order. The local posterior probability was effective for controlling false positive branches in these scenarios.
Affiliation(s)
- Erin K Molloy
- Department of Computer Science, University of Maryland, College Park, College Park, 20742, USA
| | - John Gatesy
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, 10024, USA
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, Riverside, 92521, USA
|
18
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
|
19
|
Murphy WJ, Foley NM, Bredemeyer KR, Gatesy J, Springer MS. Phylogenomics and the Genetic Architecture of the Placental Mammal Radiation. Annu Rev Anim Biosci 2020; 9:29-53. [PMID: 33228377 DOI: 10.1146/annurev-animal-061220-023149] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Indexed: 11/09/2022]
Abstract
The genomes of placental mammals are being sequenced at an unprecedented rate. Alignments of hundreds, and one day thousands, of genomes spanning the rich living and extinct diversity of species offer unparalleled power to resolve phylogenetic controversies, identify genomic innovations of adaptation, and dissect the genetic architecture of reproductive isolation. We highlight outstanding questions about the earliest phases of placental mammal diversification and the promise of newer methods, as well as remaining challenges, toward using whole genome data to resolve placental mammal phylogeny. The next phase of mammalian comparative genomics will see the completion and application of finished-quality, gapless genome assemblies from many ordinal lineages and closely related species. Interspecific comparisons between the most hypervariable genomic loci will likely reveal large, but heretofore mostly underappreciated, effects on population divergence, morphological innovation, and the origin of new species.
Affiliation(s)
- William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Nicole M Foley
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Kevin R Bredemeyer
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Mark S Springer
- Department of Evolution, Ecology and Organismal Biology, University of California, Riverside, California 92521, USA
|
20
|
Abstract
To effectively analyze the increasing amounts of available genomic data, improved comparative analytical tools that are accessible to and applicable by a broad scientific community are essential. We built the “2-n-way” software suite to provide a fundamental and innovative processing framework for revealing and comparing inserted elements among various genomes. The suite comprises two user-friendly web-based modules. The 2-way module generates pairwise whole-genome alignments of target and query species. The resulting genome coordinates of blocks (matching sequences) and gaps (missing sequences) from multiple 2-ways are then transferred to the n-way module and sorted into projects, in which user-defined coordinates from reference species are projected to the block/gap coordinates of orthologous loci in query species to provide comparative information about presence (blocks) or absence (gaps) patterns of targeted elements over many entire genomes and phylogroups. Thus, the 2-n-way software suite is ideal for performing multidirectional, non-ascertainment-biased screenings to extract all possible presence/absence data of user-relevant elements in orthologous sequences. To highlight its applicability and versatility, we used 2-n-way to expose approximately 100 lost introns in vertebrates, analyzed thousands of potential phylogenetically informative bat and whale retrotransposons, and novel human exons as well as thousands of human polymorphic retrotransposons.
|
21
|
Jebb D, Huang Z, Pippel M, Hughes GM, Lavrichenko K, Devanna P, Winkler S, Jermiin LS, Skirmuntt EC, Katzourakis A, Burkitt-Gray L, Ray DA, Sullivan KAM, Roscito JG, Kirilenko BM, Dávalos LM, Corthals AP, Power ML, Jones G, Ransome RD, Dechmann DKN, Locatelli AG, Puechmaille SJ, Fedrigo O, Jarvis ED, Hiller M, Vernes SC, Myers EW, Teeling EC. Six reference-quality genomes reveal evolution of bat adaptations. Nature 2020; 583:578-584. [PMID: 32699395 PMCID: PMC8075899 DOI: 10.1038/s41586-020-2486-3] [Citation(s) in RCA: 158] [Impact Index Per Article: 39.5] [Received: 10/14/2019] [Accepted: 06/09/2020] [Indexed: 11/08/2022]
Abstract
Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our 'Tool to infer Orthologs from Genome Alignments' (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease.
Affiliation(s)
- David Jebb
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Zixia Huang
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Graham M Hughes
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Ksenia Lavrichenko
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Paolo Devanna
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Lars S Jermiin
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
- Earth Institute, University College Dublin, Dublin, Ireland
| | - Emilia C Skirmuntt
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Aris Katzourakis
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Lucy Burkitt-Gray
- Conway Institute of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland
| | - David A Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Kevin A M Sullivan
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Juliana G Roscito
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Bogdan M Kirilenko
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Liliana M Dávalos
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
- Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY, USA
| | | | - Megan L Power
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Gareth Jones
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Roger D Ransome
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Dina K N Dechmann
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany
- Department of Biology, University of Konstanz, Konstanz, Germany
- Smithsonian Tropical Research Institute, Panama City, Panama
| | - Andrea G Locatelli
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Sébastien J Puechmaille
- ISEM, University of Montpellier, Montpellier, France
- Zoological Institute and Museum, University of Greifswald, Greifswald, Germany
| | - Olivier Fedrigo
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Michael Hiller
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
| | - Sonja C Vernes
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
- Faculty of Computer Science, Technical University Dresden, Dresden, Germany.
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland.
|
22
|
Reconstructing phylogenetic relationships based on repeat sequence similarities. Mol Phylogenet Evol 2020; 147:106766. [DOI: 10.1016/j.ympev.2020.106766] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 08/23/2019] [Revised: 12/09/2019] [Accepted: 02/12/2020] [Indexed: 12/25/2022]
|
23
|
Vitales D, Garcia S, Dodsworth S. Reconstructing phylogenetic relationships based on repeat sequence similarities. Mol Phylogenet Evol 2020; 147:106766. [PMID: 32119996 DOI: 10.1101/624064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/23/2019] [Revised: 12/09/2019] [Accepted: 02/12/2020] [Indexed: 05/18/2023]
Abstract
A recent phylogenetic method based on genome-wide abundance of different repeat types proved to be useful in reconstructing the evolutionary history of several plant and animal groups. Here, we demonstrate that an alternative information source from the repeatome can also be employed to infer phylogenetic relationships among taxa. Specifically, this novel approach makes use of the repeat sequence similarity matrices obtained from the comparative clustering analyses of RepeatExplorer 2, which are subsequently transformed to between-taxa distance matrices. These pairwise matrices are used to construct neighbour-joining trees for each of the top most-abundant clusters and they are finally summarized in a consensus network. This methodology was tested on three groups of angiosperms and one group of insects, resulting in congruent evolutionary hypotheses compared to more standard systematic analyses based on commonly used DNA markers. We propose that the combined application of these phylogenetic approaches based on repeat abundances and repeat sequence similarities could be helpful to understand mechanisms governing genome and repeatome evolution.
Affiliation(s)
- Daniel Vitales
  - Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Barcelona, Catalonia, Spain; Laboratori de Botànica (UB) - Unitat associada al CSIC, Facultat de Farmàcia i Ciències de l'Alimentació, Universitat de Barcelona, Av. Joan XXIII 27-31, 08028 Barcelona, Catalonia, Spain
- Sònia Garcia
  - Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Barcelona, Catalonia, Spain
- Steven Dodsworth
  - School of Life Sciences, University of Bedfordshire, Luton, United Kingdom
24
Borredá C, Pérez-Román E, Ibanez V, Terol J, Talon M. Reprogramming of Retrotransposon Activity during Speciation of the Genus Citrus. Genome Biol Evol 2020; 11:3478-3495. [PMID: 31710678 PMCID: PMC7145672 DOI: 10.1093/gbe/evz246] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Accepted: 11/04/2019] [Indexed: 12/13/2022] Open
Abstract
Speciation of the genus Citrus from a common ancestor has recently been established to begin ∼8 Ma during the late Miocene, a period of major climatic alterations. Here, we report the changes in activity of Citrus LTR retrotransposons during the process of diversification that gave rise to the current Citrus species. To reach this goal, we analyzed four pure species that diverged early during Citrus speciation, three recent admixtures derived from those species and an outgroup of the Citrus clade. More than 30,000 retrotransposons were grouped in ten lineages. Estimations of LTR insertion times revealed that retrotransposon activity followed a species-specific pattern of change that could be ascribed to one of three different models. In some genomes, the expected pattern of gradual transposon accumulation was suddenly arrested during the radiation of the ancestor that gave birth to the current Citrus species. The individualized analyses of retrotransposon lineages showed that in each and every species studied, not all lineages follow the general pattern of the species itself. For instance, in most of the genomes, the retrotransposon activity of elements from the SIRE lineage reached its highest level just before Citrus speciation, while for Retrofit elements, it has been steadily growing. Based on these observations, we propose that Citrus retrotransposons may respond to stressful conditions driving speciation as a part of the genetic response involved in adaptation. This proposal implies that the evolving conditions of each species interact with the internal regulatory mechanisms of the genome controlling the proliferation of mobile elements.
Affiliation(s)
- Carles Borredá
  - Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
- Estela Pérez-Román
  - Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
- Victoria Ibanez
  - Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
- Javier Terol
  - Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
- Manuel Talon
  - Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
25
Dogan M, Pouch M, Mandáková T, Hloušková P, Guo X, Winter P, Chumová Z, Van Niekerk A, Mummenhoff K, Al-Shehbaz IA, Mucina L, Lysak MA. Evolution of Tandem Repeats Is Mirroring Post-polyploid Cladogenesis in Heliophila (Brassicaceae). Front Plant Sci 2020; 11:607893. [PMID: 33510751 PMCID: PMC7835680 DOI: 10.3389/fpls.2020.607893] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/18/2020] [Accepted: 11/16/2020] [Indexed: 05/02/2023]
Abstract
The unigeneric tribe Heliophileae encompassing more than 100 Heliophila species is morphologically the most diverse Brassicaceae lineage. The tribe is endemic to southern Africa, confined chiefly to the southwestern South Africa, home of two biodiversity hotspots (Cape Floristic Region and Succulent Karoo). The monospecific Chamira (C. circaeoides), the only crucifer species with persistent cotyledons, is traditionally retrieved as the closest relative of Heliophileae. Our transcriptome analysis revealed a whole-genome duplication (WGD) ∼26.15-29.20 million years ago, presumably preceding the Chamira/Heliophila split. The WGD was then followed by genome-wide diploidization, species radiations, and cladogenesis in Heliophila. The expanded phylogeny based on nuclear ribosomal DNA internal transcribed spacer (ITS) uncovered four major infrageneric clades (A-D) in Heliophila and corroborated the sister relationship between Chamira and Heliophila. Herein, we analyzed how the diploidization process impacted the evolution of repetitive sequences through low-coverage whole-genome sequencing of 15 Heliophila species, representing the four clades, and Chamira. Despite the firmly established infrageneric cladogenesis and different ecological life histories (four perennials vs. 11 annual species), repeatome analysis showed overall comparable evolution of genome sizes (288-484 Mb) and repeat content (25.04-38.90%) across Heliophila species and clades. Among Heliophila species, long terminal repeat (LTR) retrotransposons were the predominant components of the analyzed genomes (11.51-22.42%), whereas tandem repeats had lower abundances (1.03-12.10%). In Chamira, the tandem repeat content (17.92%, 16 diverse tandem repeats) equals the abundance of LTR retrotransposons (16.69%). Among the 108 tandem repeats identified in Heliophila, only 16 repeats were found to be shared among two or more species; no tandem repeats were shared by Chamira and Heliophila genomes. Six "relic" tandem repeats were shared between any two different Heliophila clades by a common descent. Four and six clade-specific repeats shared among clade A and C species, respectively, support the monophyly of these two clades. Three repeats shared by all clade A species corroborate the recent diversification of this clade revealed by plastome-based molecular dating. Phylogenetic analysis based on repeat sequence similarities separated the Heliophila species to three clades [A, C, and (B+D)], mirroring the post-polyploid cladogenesis in Heliophila inferred from rDNA ITS and plastome sequences.
Affiliation(s)
- Mert Dogan
  - CEITEC, Masaryk University, Brno, Czechia
  - NCBR, Faculty of Science, Masaryk University, Brno, Czechia
- Milan Pouch
  - CEITEC, Masaryk University, Brno, Czechia
  - NCBR, Faculty of Science, Masaryk University, Brno, Czechia
- Terezie Mandáková
  - CEITEC, Masaryk University, Brno, Czechia
  - Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
- Xinyi Guo
  - CEITEC, Masaryk University, Brno, Czechia
- Pieter Winter
  - South African National Biodiversity Institute (SANBI), Kirstenbosch, Cape Town, South Africa
- Zuzana Chumová
  - Institute of Botany, Czech Academy of Sciences, Průhonice, Czechia
- Adriaan Van Niekerk
  - Department of Geography & Environmental Studies, Stellenbosch University, Stellenbosch, South Africa
- Klaus Mummenhoff
  - Department of Biology, Botany, Osnabrück University, Osnabrück, Germany
- Ladislav Mucina
  - Department of Geography & Environmental Studies, Stellenbosch University, Stellenbosch, South Africa
  - Harry Butler Institute, Murdoch University, Perth, WA, Australia
- Martin A. Lysak
  - CEITEC, Masaryk University, Brno, Czechia
  - NCBR, Faculty of Science, Masaryk University, Brno, Czechia
  - *Correspondence: Martin A. Lysak
26
Springer MS, Molloy EK, Sloan DB, Simmons MP, Gatesy J. ILS-Aware Analysis of Low-Homoplasy Retroelement Insertions: Inference of Species Trees and Introgression Using Quartets. J Hered 2019; 111:147-168. [DOI: 10.1093/jhered/esz076] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 07/13/2019] [Accepted: 12/12/2019] [Indexed: 12/20/2022] Open
Abstract
DNA sequence alignments have provided the majority of data for inferring phylogenetic relationships with both concatenation and coalescent methods. However, DNA sequences are susceptible to extensive homoplasy, especially for deep divergences in the Tree of Life. Retroelement insertions have emerged as a powerful alternative to sequences for deciphering evolutionary relationships because these data are nearly homoplasy-free. In addition, retroelement insertions satisfy the “no intralocus-recombination” assumption of summary coalescent methods because they are singular events and better approximate neutrality relative to DNA loci commonly sampled in phylogenomic studies. Retroelements have traditionally been analyzed with parsimony, distance, and network methods. Here, we analyze retroelement data sets for vertebrate clades (Placentalia, Laurasiatheria, Balaenopteroidea, Palaeognathae) with 2 ILS-aware methods that operate by extracting, weighting, and then assembling unrooted quartets into a species tree. The first approach constructs a species tree from retroelement bipartitions with ASTRAL, and the second method is based on split-decomposition with parsimony. We also develop a Quartet-Asymmetry test to detect hybridization using retroelements. Both ILS-aware methods recovered the same species-tree topology for each data set. The ASTRAL species trees for Laurasiatheria have consecutive short branch lengths in the anomaly zone whereas Palaeognathae is outside of this zone. For the Balaenopteroidea data set, which includes rorquals (Balaenopteridae) and gray whale (Eschrichtiidae), both ILS-aware methods resolved balaenopterids as paraphyletic. Application of the Quartet-Asymmetry test to this data set detected 19 different quartets of species for which historical introgression may be inferred. Evidence for introgression was not detected in the other data sets.
Affiliation(s)
- Mark S Springer
  - Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
- Erin K Molloy
  - Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
- Daniel B Sloan
  - Department of Biology, Colorado State University, Fort Collins, CO
- Mark P Simmons
  - Department of Biology, Colorado State University, Fort Collins, CO
- John Gatesy
  - Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY
27
Doronina L, Reising O, Clawson H, Ray DA, Schmitz J. True Homoplasy of Retrotransposon Insertions in Primates. Syst Biol 2019; 68:482-493. [PMID: 30445649 DOI: 10.1093/sysbio/syy076] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 08/17/2018] [Revised: 11/05/2018] [Accepted: 11/13/2018] [Indexed: 01/24/2023] Open
Abstract
How reliable are the presence/absence insertion patterns of the supposedly homoplasy-free retrotransposons, which were randomly inserted in the quasi infinite genomic space? To systematically examine this question in an up-to-date, multigenome comparison, we screened millions of primate transposed Alu SINE elements for incidences of homoplasious precise insertions and deletions. In genome-wide analyses, we identified and manually verified nine cases of precise parallel Alu insertions of apparently identical elements at orthologous positions in two ape lineages and twelve incidences of precise deletions of previously established SINEs. Correspondingly, eight precise parallel insertions and no exact deletions were detected in a comparison of lemuriform primate and human insertions spanning the range of primate diversity. With an overall frequency of homoplasious Alu insertions of only 0.01% (for human-chimpanzee-rhesus macaque) and 0.02-0.04% (for human-bushbaby-lemurs) and precise Alu deletions of 0.001-0.002% (for human-chimpanzee-rhesus macaque), real homoplasy is not considered to be a quantitatively relevant source of evolutionary noise. Thus, presence/absence patterns of Alu retrotransposons and, presumably, all LINE1-mobilized elements represent indeed the virtually homoplasy-free markers they are considered to be. Therefore, ancestral incomplete lineage sorting and hybridization remain the only serious sources of conflicting presence/absence patterns of retrotransposon insertions, and as such are detectable and quantifiable. [Homoplasy; precise deletions; precise parallel insertions; primates; retrotransposons.].
Affiliation(s)
- Liliya Doronina
  - Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
- Olga Reising
  - Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
- Hiram Clawson
  - Department of Biomolecular Engineering, University of California, 1156 High Street, Santa Cruz, CA, USA
- David A Ray
  - Department of Biological Sciences, Texas Tech University, 2901 Main Street, Lubbock, TX, USA
- Jürgen Schmitz
  - Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
28
Halliday TJD, dos Reis M, Tamuri AU, Ferguson-Gow H, Yang Z, Goswami A. Rapid morphological evolution in placental mammals post-dates the origin of the crown group. Proc Biol Sci 2019; 286:20182418. [PMID: 30836875 PMCID: PMC6458320 DOI: 10.1098/rspb.2018.2418] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 10/26/2018] [Accepted: 02/12/2019] [Indexed: 12/28/2022] Open
Abstract
Resolving the timing and pattern of early placental mammal evolution has been confounded by conflict among divergence date estimates from interpretation of the fossil record and from molecular-clock dating studies. Despite both fossil occurrences and molecular sequences favouring a Cretaceous origin for Placentalia, no unambiguous Cretaceous placental mammal has been discovered. Investigating the differing patterns of evolution in morphological and molecular data reveals a possible explanation for this conflict. Here, we quantified the relationship between morphological and molecular rates of evolution. We show that, independent of divergence dates, morphological rates of evolution were slow relative to molecular evolution during the initial divergence of Placentalia, but substantially increased during the origination of the extant orders. The rapid radiation of placentals into a highly morphologically disparate Cenozoic fauna is thus not associated with the origin of Placentalia, but post-dates superordinal origins. These findings predict that early members of major placental groups may not be easily distinguishable from one another or from stem eutherians on the basis of skeleto-dental morphology. This result supports a Late Cretaceous origin of crown placentals with an ordinal-level adaptive radiation in the early Paleocene, with the high relative rate permitting rapid anatomical change without requiring unreasonably fast molecular evolutionary rates. The lack of definitive Cretaceous placental mammals may be a result of morphological similarity among stem and early crown eutherians, providing an avenue for reconciling the fossil record with molecular divergence estimates for Placentalia.
Affiliation(s)
- Thomas J. D. Halliday
  - Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
  - School of Geography, Earth, and Environmental Science, University of Birmingham, Edgbaston B15 2TT, UK
- Mario dos Reis
  - School of Biological and Chemical Sciences, Queen Mary University London, Mile End Road, London E1 4NS, UK
- Asif U. Tamuri
  - Research IT Services, University College London, Gower Street, London WC1E 6BT, UK
  - European Molecular Biology Laboratory, European Bioinformatics, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
- Henry Ferguson-Gow
  - Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
- Ziheng Yang
  - Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
- Anjali Goswami
  - Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
  - Department of Earth Sciences, University College London, Gower Street, London WC1E 6BT, UK
  - Faculty of Life Sciences, Natural History Museum, Cromwell Road, London SW9 5DJ, UK
29
Sparwel M, Doronina L, Churakov G, Stegemann A, Brosius J, Robinson TJ, Schmitz J. The Volcano Rabbit in the Phylogenetic Network of Lagomorphs. Genome Biol Evol 2019; 11:11-16. [PMID: 30476046 PMCID: PMC6319600 DOI: 10.1093/gbe/evy257] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Accepted: 11/20/2018] [Indexed: 11/13/2022] Open
Abstract
The order Lagomorpha unifies pikas (Ochotonidae) and the hares plus rabbits (Leporidae). Phylogenetic reconstructions of the species within Leporidae based on traditional morphological or molecular sequence data provide support for conflicting hypotheses. The retroposon presence/absence patterns analyzed in this study revealed strong support for the broadly accepted splitting of lagomorphs into ochotonids and leporids with Pronolagus as the first divergence in the leporid tree. Furthermore, the retroposon presence/absence patterns nested the rare volcano rabbit, Romerolagus diazi, within an unresolved network of deeper leporid relationships and provide the first homoplasy-free image of incomplete lineage sorting and/or ancestral hybridization/introgression in rapidly radiated Leporidae. At the same time, the strongest retroposon presence/absence signal supports the volcano rabbit as a separate branch between the Pronolagus junction and a unified cluster of the remaining leporids.
Affiliation(s)
- Liliya Doronina
  - Institute of Experimental Pathology (ZMBE), University of Münster, Germany
- Gennady Churakov
  - Institute of Experimental Pathology (ZMBE), University of Münster, Germany
- Anja Stegemann
  - Institute of Experimental Pathology (ZMBE), University of Münster, Germany
- Jürgen Brosius
  - Institute of Experimental Pathology (ZMBE), University of Münster, Germany
  - Brandenburg Medical School (MHB), Neuruppin, Germany
- Terence J Robinson
  - Evolutionary Genomics Group, Department of Botany and Zoology, University of Stellenbosch, South Africa
- Jürgen Schmitz
  - Institute of Experimental Pathology (ZMBE), University of Münster, Germany
30
Tian R, Chen M, Chai S, Rong X, Chen B, Ren W, Xu S, Yang G. Divergent Selection of Pattern Recognition Receptors in Mammals with Different Ecological Characteristics. J Mol Evol 2018; 86:138-149. [PMID: 29455279 DOI: 10.1007/s00239-018-9832-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 10/12/2017] [Accepted: 02/05/2018] [Indexed: 12/22/2022]
Abstract
Pattern recognition receptors (PRRs) are specialized receptors that represent a key component of the host innate immune system. Whether molecular evolutionary history of different PRR classes have involved different genetic mechanisms underlying diverse pathogen environment in mammals, and whether distinct ecology of mammals may have imposed divergent selective pressures on the evolution of the PRRs, remained unknown. To test these hypotheses, we investigated the characterization of 20 genes belonging to four PRR classes in mammals. Evidence of positive selection was found in most (17 of 20) PRR genes examined, and most positively selected sites (84%) undergoing radical changes were found to fall in important functional regions, consistent with the co-evolutionary dynamics between the hosts and their microbial counterparts. We found different evolutionary patterns in different PRR classes, with the highest level of positive selection in C-type lectin receptor (CLR) family, suggesting that the capability of CLRs in response to a wide variety of ligands might explain their malleability to selection pressures. Tests using branch models that partitioned the data along habitat and social behavior found significant evidence of divergent selective pressures of PRRs among mammalian groups. Interestingly, species-specific evolution was detected on RIG-I-like helicase genes (RLRs) in cetaceans, suggesting that RLRs might play a critical role in the defense against widespread marine RNA viruses during their divergence and radiation into marine habitats. This study provides a comprehensive look at the evolutionary patterns and implications of mammalian PRRs, and highlights the importance of ecological influences in molecular adaptation.
Affiliation(s)
- Ran Tian
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Meixiu Chen
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Simin Chai
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Xinghua Rong
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Bingyao Chen
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Wenhua Ren
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Shixia Xu
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Guang Yang
  - Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
31
Feigin CY, Newton AH, Doronina L, Schmitz J, Hipsley CA, Mitchell KJ, Gower G, Llamas B, Soubrier J, Heider TN, Menzies BR, Cooper A, O'Neill RJ, Pask AJ. Genome of the Tasmanian tiger provides insights into the evolution and demography of an extinct marsupial carnivore. Nat Ecol Evol 2017; 2:182-192. [PMID: 29230027 DOI: 10.1038/s41559-017-0417-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Received: 02/07/2017] [Accepted: 11/16/2017] [Indexed: 12/30/2022]
Abstract
The Tasmanian tiger or thylacine (Thylacinus cynocephalus) was the largest carnivorous Australian marsupial to survive into the modern era. Despite last sharing a common ancestor with the eutherian canids ~160 million years ago, their phenotypic resemblance is considered the most striking example of convergent evolution in mammals. The last known thylacine died in captivity in 1936 and many aspects of the evolutionary history of this unique marsupial apex predator remain unknown. Here we have sequenced the genome of a preserved thylacine pouch young specimen to clarify the phylogenetic position of the thylacine within the carnivorous marsupials, reconstruct its historical demography and examine the genetic basis of its convergence with canids. Retroposon insertion patterns placed the thylacine as the basal lineage in Dasyuromorphia and suggest incomplete lineage sorting in early dasyuromorphs. Demographic analysis indicated a long-term decline in genetic diversity starting well before the arrival of humans in Australia. In spite of their extraordinary phenotypic convergence, comparative genomic analyses demonstrated that amino acid homoplasies between the thylacine and canids are largely consistent with neutral evolution. Furthermore, the genes and pathways targeted by positive selection differ markedly between these species. Together, these findings support models of adaptive convergence driven primarily by cis-regulatory evolution.
Affiliation(s)
- Charles Y Feigin
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
- Axel H Newton
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
  - Museums Victoria, Melbourne, Victoria, Australia
- Liliya Doronina
  - Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
- Jürgen Schmitz
  - Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
- Christy A Hipsley
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
  - Museums Victoria, Melbourne, Victoria, Australia
- Kieren J Mitchell
  - Australian Centre for Ancient DNA, University of Adelaide, Adelaide, South Australia, Australia
- Graham Gower
  - Australian Centre for Ancient DNA, University of Adelaide, Adelaide, South Australia, Australia
- Bastien Llamas
  - Australian Centre for Ancient DNA, University of Adelaide, Adelaide, South Australia, Australia
- Julien Soubrier
  - Australian Centre for Ancient DNA, University of Adelaide, Adelaide, South Australia, Australia
- Thomas N Heider
  - Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Brandon R Menzies
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
- Alan Cooper
  - Australian Centre for Ancient DNA, University of Adelaide, Adelaide, South Australia, Australia
- Rachel J O'Neill
  - Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Andrew J Pask
  - School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
  - Museums Victoria, Melbourne, Victoria, Australia
32
Dodt WG, Gallus S, Phillips MJ, Nilsson MA. Resolving kangaroo phylogeny and overcoming retrotransposon ascertainment bias. Sci Rep 2017; 7:16811. [PMID: 29196678 PMCID: PMC5711953 DOI: 10.1038/s41598-017-16148-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 10/25/2016] [Accepted: 10/26/2017] [Indexed: 01/31/2023] Open
Abstract
Reconstructing phylogeny from retrotransposon insertions is often limited by access to only a single reference genome, whereby support for clades that do not include the reference taxon cannot be directly observed. Here we have developed a new statistical framework that accounts for this ascertainment bias, allowing us to employ phylogenetically powerful retrotransposon markers to explore the radiation of the largest living marsupials, the kangaroos and wallabies of the genera Macropus and Wallabia. An exhaustive in silico screening of the tammar wallaby (Macropus eugenii) reference genome followed by experimental screening revealed 29 phylogenetically informative retrotransposon markers belonging to a family of endogenous retroviruses. We identified robust support for the enigmatic swamp wallaby (Wallabia bicolor) falling within a paraphyletic genus, Macropus. Our statistical approach provides a means to test for incomplete lineage sorting and introgression/hybridization in the presence of the ascertainment bias. Using retrotransposons as “molecular fossils”, we reveal one of the most complex patterns of hemiplasy yet identified, during the rapid diversification of kangaroos and wallabies. Ancestral state reconstruction incorporating the new retrotransposon phylogenetic information reveals multiple independent ecological shifts among kangaroos into more open habitats, coinciding with the Pliocene onset of increased aridification in Australia from ~3.6 million years ago.
Affiliation(s)
- William G Dodt
  - School of Earth, Environmental and Biological Sciences, Queensland University of Technology (QUT), 2 George Street, Brisbane, Australia
- Susanne Gallus
  - Senckenberg Biodiversity and Climate Research Centre (BiK-F) Frankfurt, Senckenberg Gesellschaft fuer Naturforschung, Senckenberganlage 25, Frankfurt am Main, Germany
- Matthew J Phillips
  - School of Earth, Environmental and Biological Sciences, Queensland University of Technology (QUT), 2 George Street, Brisbane, Australia
- Maria A Nilsson
  - Senckenberg Biodiversity and Climate Research Centre (BiK-F) Frankfurt, Senckenberg Gesellschaft fuer Naturforschung, Senckenberganlage 25, Frankfurt am Main, Germany
33
Esselstyn JA, Oliveros CH, Swanson MT, Faircloth BC. Investigating Difficult Nodes in the Placental Mammal Tree with Expanded Taxon Sampling and Thousands of Ultraconserved Elements. Genome Biol Evol 2017; 9:2308-2321. [PMID: 28934378 PMCID: PMC5604124 DOI: 10.1093/gbe/evx168] [Citation(s) in RCA: 79] [Impact Index Per Article: 11.3] [Accepted: 08/25/2017] [Indexed: 12/21/2022] Open
Abstract
The phylogeny of eutherian mammals contains some of the most recalcitrant nodes in the tetrapod tree of life. We combined comprehensive taxon and character sampling to explore three of the most debated interordinal relationships among placental mammals. We performed in silico extraction of ultraconserved element loci from 72 published genomes and in vitro enrichment and sequencing of ultraconserved elements from 28 additional mammals, resulting in alignments of 3,787 loci. We analyzed these data using concatenated and multispecies coalescent phylogenetic approaches, topological tests, and exploration of support among individual loci to identify the root of Eutheria and the sister groups of tree shrews (Scandentia) and horses (Perissodactyla). Individual loci provided weak, but often consistent support for topological hypotheses. Although many gene trees lacked accepted species-tree relationships, summary coalescent topologies were largely consistent with inferences from concatenation. At the root of Eutheria, we identified consistent support for a sister relationship between Xenarthra and Afrotheria (i.e., Atlantogenata). At the other nodes of interest, support was less consistent. We suggest Scandentia is the sister of Primatomorpha (Euarchonta), but we failed to reject a sister relationship between Scandentia and Glires. Similarly, we suggest Perissodactyla is sister to Cetartiodactyla (Euungulata), but a sister relationship between Perissodactyla and Chiroptera remains plausible.
Affiliation(s)
- Jacob A. Esselstyn
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Carl H. Oliveros
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Mark T. Swanson
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Brant C. Faircloth
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
| |
|
34
|
Esselstyn JA, Oliveros CH, Swanson MT, Faircloth BC. Investigating Difficult Nodes in the Placental Mammal Tree with Expanded Taxon Sampling and Thousands of Ultraconserved Elements. Genome Biol Evol 2017. [PMID: 28934378 DOI: 10.1093/gbe/evx168] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Indexed: 12/16/2022] Open
Abstract
The phylogeny of eutherian mammals contains some of the most recalcitrant nodes in the tetrapod tree of life. We combined comprehensive taxon and character sampling to explore three of the most debated interordinal relationships among placental mammals. We performed in silico extraction of ultraconserved element loci from 72 published genomes and in vitro enrichment and sequencing of ultraconserved elements from 28 additional mammals, resulting in alignments of 3,787 loci. We analyzed these data using concatenated and multispecies coalescent phylogenetic approaches, topological tests, and exploration of support among individual loci to identify the root of Eutheria and the sister groups of tree shrews (Scandentia) and horses (Perissodactyla). Individual loci provided weak but often consistent support for topological hypotheses. Although many gene trees lacked accepted species-tree relationships, summary coalescent topologies were largely consistent with inferences from concatenation. At the root of Eutheria, we identified consistent support for a sister relationship between Xenarthra and Afrotheria (i.e., Atlantogenata). At the other nodes of interest, support was less consistent. We suggest Scandentia is the sister of Primatomorpha (Euarchonta), but we failed to reject a sister relationship between Scandentia and Glires. Similarly, we suggest Perissodactyla is sister to Cetartiodactyla (Euungulata), but a sister relationship between Perissodactyla and Chiroptera remains plausible.
Affiliation(s)
- Jacob A Esselstyn, Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Carl H Oliveros, Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Mark T Swanson, Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
- Brant C Faircloth, Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge
|