1
|
Tule S, Foley G, Zhao C, Forbes M, Bodén M. Optimal phylogenetic reconstruction of insertion and deletion events. Bioinformatics 2024; 40:i277-i286. [PMID: 38940131 PMCID: PMC11211827 DOI: 10.1093/bioinformatics/btae254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION Insertions and deletions (indels) influence the genetic code in fundamentally distinct ways from substitutions, significantly impacting gene product structure and function. Despite their influence, the evolutionary history of indels is often neglected in phylogenetic tree inference and ancestral sequence reconstruction, hindering efforts to comprehend biological diversity determinants and engineer variants for medical and industrial applications. RESULTS We frame determining the optimal history of indel events as a single Mixed-Integer Programming (MIP) problem, across all branch points in a phylogenetic tree adhering to topological constraints, and all sites implied by a given set of aligned, extant sequences. By disentangling the impact on ancestral sequences at each branch point, this approach identifies the minimal indel events that jointly explain the diversity in sequences mapped to the tips of that tree. MIP can recover alternate optimal indel histories, if available. We evaluated MIP for indel inference on a dataset comprising 15 real phylogenetic trees associated with protein families ranging from 165 to 2000 extant sequences, and on 60 synthetic trees at comparable scales of data and reflecting realistic rates of mutation. Across relevant metrics, MIP outperformed alternative parsimony-based approaches and reported the fewest indel events, on par or below their occurrence in synthetic datasets. MIP offers a rational justification for indel patterns in extant sequences; importantly, it uniquely identifies global optima on complex protein data sets without making unrealistic assumptions of independence or evolutionary underpinnings, promising a deeper understanding of molecular evolution and aiding novel protein design. AVAILABILITY AND IMPLEMENTATION The implementation is available via GitHub at https://github.com/santule/indelmip.
Collapse
Affiliation(s)
- Sanjana Tule
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Chongting Zhao
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Michael Forbes
- School of Mathematics and Physics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
2
|
Echevarría LY, De la Riva I, Venegas PJ, Rojas-Runjaic FJM, R Dias I, Castroviejo-Fisher S. Total evidence and sensitivity phylogenetic analyses of egg-brooding frogs (Anura: Hemiphractidae). Cladistics 2021; 37:375-401. [PMID: 34478194 DOI: 10.1111/cla.12447] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/10/2020] [Indexed: 01/06/2023] Open
Abstract
We study the phylogenetic relationships of egg-brooding frogs, a group of 118 neotropical species, unique among anurans by having embryos with large bell-shaped gills and females carrying their eggs on the dorsum, exposed or inside a pouch. We assembled a total evidence dataset of published and newly generated data containing 51 phenotypic characters and DNA sequences of 20 loci for 143 hemiphractids and 127 outgroup terminals. We performed six analytical strategies combining different optimality criteria (parsimony and maximum likelihood), alignment methods (tree- and similarity-alignment), and three different indel coding schemes (fifth character state, unknown nucleotide, and presence/absence characters matrix). Furthermore, we analyzed a subset of the total evidence dataset to evaluate the impact of phenotypic characters on hemiphractid phylogenetic relationships. Our main results include: (i) monophyly of Hemiphractidae and its six genera for all our analyses, novel relationships among hemiphractid genera, and non-monophyly of Hemiphractinae according to our preferred phylogenetic hypothesis; (ii) non-monophyly of current supraspecific taxonomies of Gastrotheca, an updated taxonomy is provided; (iii) previous differences among studies were mainly caused by differences in analytical factors, not by differences in character/taxon sampling; (iv) optimality criteria, alignment method, and indel coding caused differences among optimal topologies, in that order of degree; (v) in most cases, parsimony analyses are more sensitive to the addition of phenotypic data than maximum likelihood analyses; (vi) adding phenotypic data resulted in an increase of shared clades for most analyses.
Collapse
Affiliation(s)
- Lourdes Y Echevarría
- Laboratório de Sistemática de Vertebrados, Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Av. Ipiranga 6681, Porto Alegre, RS, 90619-900, Brazil.,División de Herpetología-Centro de Ornitología y Biodiversidad (CORBIDI), Urb. Huertos de San Antonio, Santa Rita No. 105 Of. 202, Surco, Lima, Perú
| | - Ignacio De la Riva
- Museo Nacional de Ciencias Naturales-CSIC, C/José Gutiérrez Abascal 2, Madrid, 28006, Spain
| | - Pablo J Venegas
- División de Herpetología-Centro de Ornitología y Biodiversidad (CORBIDI), Urb. Huertos de San Antonio, Santa Rita No. 105 Of. 202, Surco, Lima, Perú
| | | | - Iuri R Dias
- Graduate Program in Zoology, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado, km 16, Ilhéus, Bahia, 45662-900, Brazil
| | - Santiago Castroviejo-Fisher
- Laboratório de Sistemática de Vertebrados, Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Av. Ipiranga 6681, Porto Alegre, RS, 90619-900, Brazil.,Department of Herpetology, American Museum of Natural History, New York, NY, 10024, USA
| |
Collapse
|
3
|
Watson DG, Pomeroy PP, Al-Tannak NF, Kennedy MW. Stockpiling by pups and self-sacrifice by their fasting mothers observed in birth to weaning serum metabolomes of Atlantic grey seals. Sci Rep 2020; 10:7465. [PMID: 32366923 PMCID: PMC7198541 DOI: 10.1038/s41598-020-64488-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/15/2020] [Indexed: 12/23/2022] Open
Abstract
During the uniquely short lactations of true seals, pups acquire a greater proportion of maternal body resources, at a greater rate, than in any other group of mammals. Mothers in many species enter a period of anorexia but must preserve sufficient reserves to fuel hunting and thermoregulation for return to cold seas. Moreover, pups may undergo a period of development after weaning during which they have no maternal care or nutrition. This nutritionally closed system presents a potentially extreme case of conflict between maternal survival and adequate provisioning of offspring, likely presenting strains on their metabolisms. We examined the serum metabolomes of five mother and pup pairs of Atlantic grey seals, Halichoerus grypus, from birth to weaning. Changes with time were particularly evident in pups, with indications of strain in the fat and energy metabolisms of both. Crucially, pups accumulate certain compounds to levels that are dramatically greater than in mothers. These include compounds that pups cannot synthesise themselves, such as pyridoxine/vitamin B6, taurine, some essential amino acids, and a conditionally essential amino acid and its precursor. Fasting mothers therefore appear to mediate stockpiling of critical metabolites in their pups, potentially depleting their own reserves and prompting cessation of lactation.
Collapse
Affiliation(s)
- David G Watson
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, 161 Cathedral Street, Glasgow, G4 0RE, Scotland, UK.
| | - Patrick P Pomeroy
- Sea Mammal Research Unit, Scottish Oceans Institute, University of St Andrews, St Andrews, Fife, Scotland, United Kingdom
| | - Naser F Al-Tannak
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, 161 Cathedral Street, Glasgow, G4 0RE, Scotland, UK.,Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Kuwait University, P.O. Box 23924, Safat, 13110, Kuwait City, Kuwait
| | - Malcolm W Kennedy
- Institute of Biodiversity, Animal Health & Comparative Medicine, Graham Kerr Building, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, Scotland, UK.
| |
Collapse
|
4
|
Liu JT, Corbett JL, Heslop JA, Duncan SA. Enhanced genome editing in human iPSCs with CRISPR-CAS9 by co-targeting ATP1a1. PeerJ 2020; 8:e9060. [PMID: 32391204 PMCID: PMC7197401 DOI: 10.7717/peerj.9060] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 04/04/2020] [Indexed: 12/11/2022] Open
Abstract
Genome editing in human induced pluripotent stem cells (iPSCs) provides the potential for disease modeling and cell therapy. By generating iPSCs with specific mutations, researchers can differentiate the modified cells to their lineage of interest for further investigation. However, the low efficiency of targeting in iPSCs has hampered the application of genome editing. In this study we used a CRISPR-Cas9 system that introduces a specific point substitution into the sequence of the Na+/K+-ATPase subunit ATP1A1. The introduced mutation confers resistance to cardiac glycosides, which can then be used to select successfully targeted cells. Using this system, we introduced different formats of donor DNA for homology-directed repair (HDR), including single-strand DNAs, double-strand DNAs, and plasmid donors. We achieved a 35-fold increase in HDR when using plasmid donor with a 400 bp repair template. We further co-targeted ATP1A1 and a second locus of interest to determine the enrichment of mutagenesis after cardiac glycoside selection. Through this approach, INDEL rate was increased after cardiac glycoside treatment, while HDR enrichment was only observed at certain loci. Collectively, these results suggest that a plasmid donor with a 400 bp repair template is an optimal donor DNA for targeted substitution and co-targeting ATP1A1 with the second locus enriches for mutagenesis events through cardiac glycoside selection in human iPSCs.
Collapse
Affiliation(s)
- Jui-Tung Liu
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - James L Corbett
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - James A Heslop
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - Stephen A Duncan
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| |
Collapse
|
5
|
Karin BR, Gamble T, Jackman TR. Optimizing Phylogenomics with Rapidly Evolving Long Exons: Comparison with Anchored Hybrid Enrichment and Ultraconserved Elements. Mol Biol Evol 2020; 37:904-922. [PMID: 31710677 PMCID: PMC7038749 DOI: 10.1093/molbev/msz263] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Marker selection has emerged as an important component of phylogenomic study design due to rising concerns of the effects of gene tree estimation error, model misspecification, and data-type differences. Researchers must balance various trade-offs associated with locus length and evolutionary rate among other factors. The most commonly used reduced representation data sets for phylogenomics are ultraconserved elements (UCEs) and Anchored Hybrid Enrichment (AHE). Here, we introduce Rapidly Evolving Long Exon Capture (RELEC), a new set of loci that targets single exons that are both rapidly evolving (evolutionary rate faster than RAG1) and relatively long in length (>1,500 bp), while at the same time avoiding paralogy issues across amniotes. We compare the RELEC data set to UCEs and AHE in squamate reptiles by aligning and analyzing orthologous sequences from 17 squamate genomes, composed of 10 snakes and 7 lizards. The RELEC data set (179 loci) outperforms AHE and UCEs by maximizing per-locus genetic variation while maintaining presence and orthology across a range of evolutionary scales. RELEC markers show higher phylogenetic informativeness than UCE and AHE loci, and RELEC gene trees show greater similarity to the species tree than AHE or UCE gene trees. Furthermore, with fewer loci, RELEC remains computationally tractable for full Bayesian coalescent species tree analyses. We contrast RELEC to and discuss important aspects of comparable methods, and demonstrate how RELEC may be the most effective set of loci for resolving difficult nodes and rapid radiations. We provide several resources for capturing or extracting RELEC loci from other amniote groups.
Collapse
Affiliation(s)
- Benjamin R Karin
- Department of Biology, Villanova University, Villanova, PA
- Museum of Vertebrate Zoology and Department of Integrative Biology, University of California, Berkeley, CA
| | - Tony Gamble
- Department of Biological Sciences, Marquette University, Milwaukee, WI
- Milwaukee Public Museum, Milwaukee, WI
- Bell Museum of Natural History, University of Minnesota, St. Paul, MN
| | - Todd R Jackman
- Department of Biology, Villanova University, Villanova, PA
| |
Collapse
|
6
|
Wilcox JS, Kerschner A, Hollocher H. Indel-informed Bayesian analysis suggests cryptic population structure between Plasmodium knowlesi of humans and long-tailed macaques (Macaca fascicularis) in Malaysian Borneo. INFECTION GENETICS AND EVOLUTION 2019; 75:103994. [PMID: 31421245 DOI: 10.1016/j.meegid.2019.103994] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 08/01/2019] [Accepted: 08/03/2019] [Indexed: 01/02/2023]
Abstract
Plasmodium knowlesi is an important causative agent of malaria in humans of Southeast Asia. Macaques are natural hosts for this parasite, but little is conclusively known about its patterns of transmission within and between these hosts. Here, we apply a comprehensive phylogenetic approach to test for patterns of cryptic population genetic structure between P. knowlesi isolated from humans and long-tailed macaques from the state of Sarawak in Malaysian Borneo. Our approach differs from previous investigations through our exhaustive use of archival 18S Small Subunit rRNA (18S) gene sequences from Plasmodium and Hepatocystis species, our inclusion of insertion and deletion information during phylogenetic inference, and our application of Bayesian phylogenetic inference to this problem. We report distinct clades of P. knowlesi that predominantly contained sequences from either human or macaque hosts for paralogous A-type and S-type 18S gene loci. We report significant partitioning of sequence distances between host species across both types of loci, and confirmed that sequences of the same locus type showed significantly biased assortment into different clades depending on their host species. Our results support the zoonotic potential of Plasmodium knowlesi, but also suggest that humans may be preferentially infected with certain strains of this parasite. Broadly, such patterns could arise through preferential zoonotic transmission of some parasite lineages or a disposition of parasites to transmit within, rather than between, human and macaque hosts. Available data are insufficient to address these hypotheses. Our results suggest that the epidemiology of P. knowlesi may be more complicated than previously assumed, and highlight the need for renewed and more vigorous explorations of transmission patterns in the fifth human malarial parasite.
Collapse
Affiliation(s)
- JustinJ S Wilcox
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA.
| | - Abigail Kerschner
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA
| | - Hope Hollocher
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA
| |
Collapse
|
7
|
Wilcox JJS, Kerschner A, Hollocher H. WITHDRAWN: Indel-informed bayesian analysis suggests cryptic divisions between Plasmodium knowlesi of humans and long-tailed macaques (Macaca fascicularis) in Malaysian Borneo. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2018:S1567-1348(18)30557-4. [PMID: 30481580 DOI: 10.1016/j.meegid.2018.11.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Revised: 10/24/2018] [Accepted: 11/23/2018] [Indexed: 06/09/2023]
Abstract
This article has been withdrawn at the request of the author(s) and/or editor. The Publisher apologizes for any inconvenience this may cause. The full Elsevier Policy on Article Withdrawal can be found at https://www.elsevier.com/about/our-business/policies/article-withdrawal.
Collapse
Affiliation(s)
- Justin J S Wilcox
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA.
| | - Abigail Kerschner
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA
| | - Hope Hollocher
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556-5688, USA
| |
Collapse
|
8
|
Donath A, Stadler PF. Split-inducing indels in phylogenomic analysis. Algorithms Mol Biol 2018; 13:12. [PMID: 30026791 PMCID: PMC6047143 DOI: 10.1186/s13015-018-0130-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2017] [Accepted: 06/16/2018] [Indexed: 11/13/2022] Open
Abstract
Background Most phylogenetic studies using molecular data treat gaps in multiple sequence alignments as missing data or even completely exclude alignment columns that contain gaps. Results Here we show that gap patterns in large-scale, genome-wide alignments are themselves phylogenetically informative and can be used to infer reliable phylogenies provided the gap data are properly filtered to reduce noise introduced by the alignment method. We introduce here the notion of split-inducing indels (splids) that define an approximate bipartition of the taxon set. We show both in simulated data and in case studies on real-life data that splids can be efficiently extracted from phylogenomic data sets. Conclusions Suitably processed gap patterns extracted from genome-wide alignment provide a surprisingly clear phylogenetic signal and an allow the inference of accurate phylogenetic trees. Electronic supplementary material The online version of this article (10.1186/s13015-018-0130-7) contains supplementary material, which is available to authorized users.
Collapse
|
9
|
Li K, Sun X, Chen M, Sun Y, Tian R, Wang Z, Xu S, Yang G. Evolutionary changes of Hox genes and relevant regulatory factors provide novel insights into mammalian morphological modifications. Integr Zool 2018; 13:21-35. [PMID: 28685945 PMCID: PMC5817400 DOI: 10.1111/1749-4877.12271] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The diversity of body plans of mammals accelerates the innovation of lifestyles and the extensive adaptation to different habitats, including terrestrial, aerial and aquatic habitats. However, the genetic basis of those phenotypic modifications, which have occurred during mammalian evolution, remains poorly explored. In the present study, we synthetically surveyed the evolutionary pattern of Hox clusters that played a powerful role in the morphogenesis along the head–tail axis of animal embryos and the main regulatory factors (Mll, Bmi1 and E2f6) that control the expression of Hox genes. A deflected density of repetitive elements and lineage‐specific radical mutations of Mll have been determined in marine mammals with morphological changes, suggesting that evolutionary changes may alter Hox gene expression in these lineages, leading to the morphological modification of these lineages. Although no positive selection was detected at certain ancestor nodes of lineages, the increased ω values of Hox genes implied the relaxation of functional constraints of these genes during the mammalian evolutionary process. More importantly, 49 positively‐selected sites were identified in mammalian lineages with phenotypic modifications, indicating adaptive evolution acting on Hox genes and regulatory factors. In addition, 3 parallel amino acid substitutions in some Hox genes were examined in marine mammals, which might be responsible for their streamlined body.
Collapse
Affiliation(s)
- Kui Li
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Xiaohui Sun
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Meixiu Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Yingying Sun
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Ran Tian
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Zhengfei Wang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Shixia Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Guang Yang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| |
Collapse
|
10
|
Springer MS, Gatesy J. Pinniped Diphyly and Bat Triphyly: More Homology Errors Drive Conflicts in the Mammalian Tree. J Hered 2017; 109:297-307. [DOI: 10.1093/jhered/esx089] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Accepted: 10/07/2017] [Indexed: 11/14/2022] Open
Affiliation(s)
- Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY
| |
Collapse
|
11
|
Ferreira AC, Tenreiro R, de Sá MIC, Dias R. Evolution and genome specialization of Brucella suis biovar 2 Iberian lineages. BMC Genomics 2017; 18:726. [PMID: 28899413 PMCID: PMC5596481 DOI: 10.1186/s12864-017-4113-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 09/04/2017] [Indexed: 01/06/2023] Open
Abstract
Background Swine brucellosis caused by B. suis biovar 2 is an emergent disease in domestic pigs in Europe. The emergence of this pathogen has been linked to the increase of extensive pig farms and the high density of infected wild boars (Sus scrofa). In Portugal and Spain, the majority of strains share specific molecular characteristics, which allowed establishing an Iberian clonal lineage. However, several strains isolated from wild boars in the North-East region of Spain are similar to strains isolated in different Central European countries. Results Comparative analysis of five newly fully sequenced B. suis biovar 2 strains belonging to the main circulating clones in Iberian Peninsula, with publicly available Brucella spp. genomes, revealed that strains from Iberian clonal lineage share 74% similarity with those reference genomes. Besides the 210 kb translocation event present in all biovar 2 strains, an inversion with 944 kb was presented in chromosome I of strains from the Iberian clone. At left and right crossover points, the inversion disrupted a TRAP dicarboxylate transporter, DctM subunit, and an integral membrane protein TerC. The gene dctM is well conserved in Brucella spp. except in strains from the Iberian clonal lineage. Intraspecies comparative analysis also exposed a number of biovar-, haplotype- and strain-specific insertion-deletion (INDELs) events and single nucleotide polymorphisms (SNPs) that could explain differences in virulence and host specificities. Most discriminative mutations were associated to membrane related molecules (29%) and enzymes involved in catabolism processes (20%). Molecular identification of both B. suis biovar 2 clonal lineages could be easily achieved using the target-PCR procedures established in this work for the evaluated INDELs. Conclusion Whole-genome analyses supports that the B. suis biovar 2 Iberian clonal lineage evolved from the Central-European lineage and suggests that the genomic specialization of this pathogen in the Iberian Peninsula is independent of a specific genomic event(s), but instead driven by allopatric speciation, resulting in the establishment of a new ecovar. Electronic supplementary material The online version of this article (10.1186/s12864-017-4113-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana Cristina Ferreira
- Instituto Nacional de Investigação Agrária e Veterinária, I.P. (INIAV, IP), Av. da República, Quinta do Marquês, 2780-157, Oeiras, Portugal. .,Universidade de Lisboa, Faculdade de Ciências, Biosystems and Integrative Sciences Institute (BioISI), Edificio TecLabs, Campus da FCUL, Campo Grande, 1749-016, Lisbon, Portugal.
| | - Rogério Tenreiro
- Universidade de Lisboa, Faculdade de Ciências, Biosystems and Integrative Sciences Institute (BioISI), Edificio TecLabs, Campus da FCUL, Campo Grande, 1749-016, Lisbon, Portugal
| | - Maria Inácia Corrêa de Sá
- Instituto Nacional de Investigação Agrária e Veterinária, I.P. (INIAV, IP), Av. da República, Quinta do Marquês, 2780-157, Oeiras, Portugal
| | - Ricardo Dias
- Universidade de Lisboa, Faculdade de Ciências, Biosystems and Integrative Sciences Institute (BioISI), Edificio TecLabs, Campus da FCUL, Campo Grande, 1749-016, Lisbon, Portugal
| |
Collapse
|
12
|
Feijoo M, Parada A. Macrosystematics of eutherian mammals combining HTS data to expand taxon coverage. Mol Phylogenet Evol 2017; 113:76-83. [PMID: 28487261 DOI: 10.1016/j.ympev.2017.05.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2016] [Revised: 05/04/2017] [Accepted: 05/04/2017] [Indexed: 02/04/2023]
Abstract
In the last few years high-throughput sequencing technologies have permitted significant advances in mammalian phylogenetic studies from a genomic perspective. However, these studies have been restricted to a sparse number of species with available reference genomes. Thus, several issues inside the eutherian mammals phylogeny remain unresolved. This may be due in part to limited taxon sampling, as taxonomic density is known to affect phylogenetic resolution. In this context, we present a protocol to increase taxon coverage using high-throughput sequencing data (RNA or DNA) generated for other biological studies and available in public databases. Following this procedure we addressed pending or controversial issues concerning the phylogenetic position of Dermoptera, Pholidota and Chiroptera, considering multiple and independent loci. Also for Chiroptera and Arctoidea we evaluated the relationships of the lineages that compose it. Although the maximum number of genes used is moderate (95), in some cases taxon coverage doubles that of previous related studies. Globally, all coalescent-based (STAR, MP-EST and ASTRAL) and concatenated (IQ-TREE and BEAST2) methods used for species tree reconstruction were consistent to each other and most of interrogated nodes received high statistical support.
Collapse
Affiliation(s)
- M Feijoo
- Departamento de Ecología y Evolución, Facultad de Ciencias, Universidad de la República, Montevideo CP 11400, Uruguay.
| | - A Parada
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia CP 5090000, Chile.
| |
Collapse
|
13
|
Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species. PLoS One 2016; 11:e0157183. [PMID: 27309527 PMCID: PMC4911064 DOI: 10.1371/journal.pone.0157183] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Accepted: 05/25/2016] [Indexed: 12/20/2022] Open
Abstract
The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support for the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium.
Collapse
|
14
|
Doronina L, Churakov G, Shi J, Brosius J, Baertsch R, Clawson H, Schmitz J. Exploring Massive Incomplete Lineage Sorting in Arctoids (Laurasiatheria, Carnivora). Mol Biol Evol 2015; 32:3194-204. [PMID: 26337548 DOI: 10.1093/molbev/msv188] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Freed from the competition of large raptors, Paleocene carnivores could expand their newly acquired habitats in search of prey. Such changing conditions might have led to their successful distribution and rapid radiation. Today, molecular evolutionary biologists are faced, however, with the consequences of such accelerated adaptive radiations, because they led to sequential speciation more rapidly than phylogenetic markers could be fixed. The repercussions being that current genealogies based on such markers are incongruent with species trees.Our aim was to explore such conflicting phylogenetic zones of evolution during the early arctoid radiation, especially to distinguish diagnostic from misleading phylogenetic signals, and to examine other carnivore-related speciation events. We applied a combination of high-throughput computational strategies to screen carnivore and related genomes in silico for randomly inserted retroposed elements that we then used to identify inconsistent phylogenetic patterns in the Arctoidea group, which is well known for phylogenetic discordances.Our combined retrophylogenomic and in vitro wet lab approach detected hundreds of carnivore-specific insertions, many of them confirming well-established splits or identifying and solving conflicting species distributions. Our systematic genome-wide screens for Long INterspersed Elements detected homoplasy-free markers with insertion-specific truncation points that we used to distinguish phylogenetically informative markers from conflicting signals. The results were independently confirmed by phylogenetic diagnostic Short INterspersed Elements. As statistical analysis ruled out ancestral hybridization, these doubly verified but still conflicting patterns were statistically determined to be genomic remnants from a time of ancestral incomplete lineage sorting that especially accompanied large parts of Arctoidea evolution.
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Jingjing Shi
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany Institute of Evolutionary and Medical Genomics, Brandenburg Medical School (MHB), Neuruppin, Germany
| | - Robert Baertsch
- Department of Biomolecular Engineering, University of California, Santa Cruz
| | - Hiram Clawson
- Department of Biomolecular Engineering, University of California, Santa Cruz
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| |
Collapse
|
15
|
A phylogenomic and molecular markers based analysis of the phylum Chlamydiae: proposal to divide the class Chlamydiia into two orders, Chlamydiales and Parachlamydiales ord. nov., and emended description of the class Chlamydiia. Antonie van Leeuwenhoek 2015; 108:765-81. [PMID: 26179278 DOI: 10.1007/s10482-015-0532-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 07/10/2015] [Indexed: 12/27/2022]
Abstract
The phylum Chlamydiae contains nine ecologically and genetically diverse families all placed within a single order. In this work, we have completed a comprehensive comparative analysis of 36 sequenced Chlamydiae genomes in order to identify shared molecular characteristics, namely conserved signature insertions/deletions (CSIs) and conserved signature proteins (CSPs), which can serve as distinguishing characteristics of supra-familial clusters within the phylum Chlamydiae. Our analysis has led to the identification of 32 CSIs which are specific to clusters within the phylum Chlamydiae at various phylogenetic depths. Importantly, 17 CSIs and 98 CSPs were found to be specific for the family Chlamydiaceae while another 3 CSI variants and 15 CSPs were specific for a grouping of the families Criblamydiaceae, Parachlamydiaceae, Simkaniaceae and Waddliaceae. These two clusters were also found to be distinguishable in 16S rRNA based phylogenetic trees, concatenated protein based phylogenetic trees, character compatibility based phylogenetic analyses, and on the basis of 16S rRNA gene sequence identity and average amino acid identity values. On the basis of the identified molecular characteristics, branching in phylogenetic trees, and the genetic distance between the two clusters within the phylum Chlamydiae we propose a division of the class Chlamydiia into two orders: an emended order Chlamydiales, containing the family Chlamydiaceae and the closely related Candidatus family Clavichlamydiaceae, and the novel order Parachlamydiales ord. nov. containing the families Parachlamydiaceae, Simkaniaceae and Waddliaceae and the Candidatus families Criblamydiaceae, Parilichlamydiaceae, Piscichlamydiaceae, and Rhabdochlamydiaceae. We also include a brief discussion of the reunification of the genera Chlamydia and Chlamydophila.
Collapse
|
16
|
Spielman SJ, Kumar K, Wilke CO. Comprehensive, structurally-informed alignment and phylogeny of vertebrate biogenic amine receptors. PeerJ 2015; 3:e773. [PMID: 25737813 PMCID: PMC4338800 DOI: 10.7717/peerj.773] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 01/26/2015] [Indexed: 01/29/2023] Open
Abstract
Biogenic amine receptors play critical roles in regulating behavior and physiology in both vertebrates and invertebrates, particularly within the central nervous system. Members of the G-protein coupled receptor (GPCR) family, these receptors interact with endogenous bioamine ligands such as dopamine, serotonin, and epinephrine, and are targeted by a wide array of pharmaceuticals. Despite the clear clinical and biological importance of these receptors, their evolutionary history remains poorly characterized. In particular, the relationships among biogenic amine receptors and any specific evolutionary constraints acting within distinct receptor subtypes are largely unknown. To advance and facilitate studies in this receptor family, we have constructed a comprehensive, high-quality sequence alignment of vertebrate biogenic amine receptors. In particular, we have integrated a traditional multiple sequence approach with robust structural domain predictions to ensure that alignment columns accurately capture the highly-conserved GPCR structural domains, and we demonstrate how ignoring structural information produces spurious inferences of homology. Using this alignment, we have constructed a structurally-partitioned maximum-likelihood phylogeny from which we deduce novel biogenic amine receptor relationships and uncover previously unrecognized lineage-specific receptor clades. Moreover, we find that roughly 1% of the 3039 sequences in our final alignment are either misannotated or unclassified, and we propose updated classifications for these receptors. We release our comprehensive alignment and its corresponding phylogeny as a resource for future research into the evolution and diversification of biogenic amine receptors.
Collapse
Affiliation(s)
| | - Keerthana Kumar
- Department of Integrative Biology, The University of Texas at Austin, Austin, USA
| | - Claus O. Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, USA
| |
Collapse
|
17
|
Jones KE, Smaers JB, Goswami A. Impact of the terrestrial-aquatic transition on disparity and rates of evolution in the carnivoran skull. BMC Evol Biol 2015; 15:8. [PMID: 25648618 PMCID: PMC4328284 DOI: 10.1186/s12862-015-0285-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Accepted: 01/15/2015] [Indexed: 11/25/2022] Open
Abstract
Background Which factors influence the distribution patterns of morphological diversity among clades? The adaptive radiation model predicts that a clade entering new ecological niche will experience high rates of evolution early in its history, followed by a gradual slowing. Here we measure disparity and rates of evolution in Carnivora, specifically focusing on the terrestrial-aquatic transition in Pinnipedia. We analyze fissiped (mostly terrestrial, arboreal, and semi-arboreal, but also including the semi-aquatic otter) and pinniped (secondarily aquatic) carnivorans as a case study of an extreme ecological transition. We used 3D geometric morphometrics to quantify cranial shape in 151 carnivoran specimens (64 fissiped, 87 pinniped) and five exceptionally-preserved fossil pinnipeds, including the stem-pinniped Enaliarctos emlongi. Range-based and variance-based disparity measures were compared between pinnipeds and fissipeds. To distinguish between evolutionary modes, a Brownian motion model was compared to selective regime shifts associated with the terrestrial-aquatic transition and at the base of Pinnipedia. Further, evolutionary patterns were estimated on individual branches using both Ornstein-Uhlenbeck and Independent Evolution models, to examine the origin of pinniped diversity. Results Pinnipeds exhibit greater cranial disparity than fissipeds, even though they are less taxonomically diverse and, as a clade nested within fissipeds, phylogenetically younger. Despite this, there is no increase in the rate of morphological evolution at the base of Pinnipedia, as would be predicted by an adaptive radiation model, and a Brownian motion model of evolution is supported. Instead basal pinnipeds populated new areas of morphospace via low to moderate rates of evolution in new directions, followed by later bursts within the crown-group, potentially associated with ecological diversification within the marine realm. Conclusion The transition to an aquatic habitat in carnivorans resulted in a shift in cranial morphology without an increase in rate in the stem lineage, contra to the adaptive radiation model. Instead these data suggest a release from evolutionary constraint model, followed by aquatic diversifications within crown families. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0285-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Katrina E Jones
- Center for Functional Anatomy and Evolution, Johns Hopkins University, Baltimore, MD, USA. .,Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA, 02138, USA.
| | - Jeroen B Smaers
- Department of Anthropology, Stony Brook University, Stony Brook, New York, NY, 11794-4364, USA.
| | - Anjali Goswami
- Department of Genetics, Evolution & Environment, University College London, Gower Street, London, WC1E 6BT, UK. .,Department of Earth Sciences, University College London, Gower Street, London, WC1E 6BT, UK.
| |
Collapse
|
18
|
Ashkenazy H, Cohen O, Pupko T, Huchon D. Indel reliability in indel-based phylogenetic inference. Genome Biol Evol 2014; 6:3199-209. [PMID: 25409663 PMCID: PMC4986452 DOI: 10.1093/gbe/evu252] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
It is often assumed that it is unlikely that the same insertion or deletion (indel) event occurred at the same position in two independent evolutionary lineages, and thus, indel-based inference of phylogeny should be less subject to homoplasy compared with standard inference which is based on substitution events. Indeed, indels were successfully used to solve debated evolutionary relationships among various taxonomical groups. However, indels are never directly observed but rather inferred from the alignment and thus indel-based inference may be sensitive to alignment errors. It is hypothesized that phylogenetic reconstruction would be more accurate if it relied only on a subset of reliable indels instead of the entire indel data. Here, we developed a method to quantify the reliability of indel characters by measuring how often they appear in a set of alternative multiple sequence alignments. Our approach is based on the assumption that indels that are consistently present in most alternative alignments are more reliable compared with indels that appear only in a small subset of these alignments. Using simulated and empirical data, we studied the impact of filtering and weighting indels by their reliability scores on the accuracy of indel-based phylogenetic reconstruction. The new method is available as a web-server at http://guidance.tau.ac.il/RELINDEL/.
Collapse
Affiliation(s)
- Haim Ashkenazy
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Ofir Cohen
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel Present address: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Tal Pupko
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Dorothée Huchon
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| |
Collapse
|
19
|
Papadopoulou A, Chesters D, Coronado I, De la Cadena G, Cardoso A, Reyes JC, Maes JM, Rueda RM, Gómez-Zurita J. Automated DNA-based plant identification for large-scale biodiversity assessment. Mol Ecol Resour 2014; 15:136-52. [PMID: 24666885 DOI: 10.1111/1755-0998.12256] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 02/17/2014] [Accepted: 02/22/2014] [Indexed: 11/29/2022]
Abstract
Rapid degradation of tropical forests urges to improve our efficiency in large-scale biodiversity assessment. DNA barcoding can assist greatly in this task, but commonly used phenetic approaches for DNA-based identifications rely on the existence of comprehensive reference databases, which are infeasible for hyperdiverse tropical ecosystems. Alternatively, phylogenetic methods are more robust to sparse taxon sampling but time-consuming, while multiple alignment of species-diagnostic, typically length-variable, markers can be problematic across divergent taxa. We advocate the combination of phylogenetic and phenetic methods for taxonomic assignment of DNA-barcode sequences against incomplete reference databases such as GenBank, and we developed a pipeline to implement this approach on large-scale plant diversity projects. The pipeline workflow includes several steps: database construction and curation, query sequence clustering, sequence retrieval, distance calculation, multiple alignment and phylogenetic inference. We describe the strategies used to establish these steps and the optimization of parameters to fit the selected psbA-trnH marker. We tested the pipeline using infertile plant samples and herbivore diet sequences from the highly threatened Nicaraguan seasonally dry forest and exploiting a valuable purpose-built resource: a partial local reference database of plant psbA-trnH. The selected methodology proved efficient and reliable for high-throughput taxonomic assignment, and our results corroborate the advantage of applying 'strict' tree-based criteria to avoid false positives. The pipeline tools are distributed as the scripts suite 'BAGpipe' (pipeline for Biodiversity Assessment using GenBank data), which can be readily adjusted to the purposes of other projects and applied to sequence-based identification for any marker or taxon.
Collapse
Affiliation(s)
- Anna Papadopoulou
- Animal Biodiversity and Evolution, Institut de Biologia Evolutiva (CSIC-Univ. Pompeu Fabra), 08003, Barcelona, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|