1
|
Perfilyeva A, Bespalova K, Kuzovleva Y, Mussabayev R, Begmanova М, Amirgalyeva A, Vishnyakova O, Nazarenko I, Zhaxsylykova A, Yerzhan A, Perfilyeva Y, Dzhaembaeva T, Khamchukova A, Plakhov K, Torekhanov A, Djansugurova L, Zhunussova G, Bekmanov B. Genetic diversity and origin of Kazakh Tobet Dogs. Sci Rep 2024; 14:23137. [PMID: 39367220 PMCID: PMC11452641 DOI: 10.1038/s41598-024-74061-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Accepted: 09/23/2024] [Indexed: 10/06/2024] Open
Abstract
The Kazakh Tobet is an indigenous Kazakh dog breed that has been used to guard livestock since ancient times. To understand the genetic structure and phylogenetic relationship of the Kazakh Tobet breed with other herding and livestock guarding dog breeds, we analysed short tandem repeat data of 107 Kazakh Tobet dogs from different regions of Kazakhstan and Mongolia, as well as whole genome sequencing data from two Kazakh Tobet dogs and 43 dogs from 24 working breeds. Our results indicate a high genetic diversity of the Kazakh Tobet, with the average number of alleles per locus ranging from 6.00 to 10.22 and observed heterozygosity ranging from 76 to 78%. The breed has a complex genetic structure characterised by seven different clusters. The neighbour-joining tree constructed based on 14,668,406 autosomal and the maximum likelihood tree based on mitochondrial D-loop sequences indicate a common genetic heritage between the Kazakh Tobet, the Central Asian Shepherd Dog and the Turkish Akbash. The presence of haplotype A18 in the Kazakh Tobets supports the hypothesis of the ancient origin of the breed, which was previously suggested by archaeological finds and written sources. These results provide an important genetic basis for the ongoing efforts to improve the Kazakh Tobet breed, to ensure its preservation as an independent genetic lineage and to recognise a breed on an international level.
Collapse
Affiliation(s)
- Anastassiya Perfilyeva
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Kira Bespalova
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan.
| | - Yelena Kuzovleva
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Rustam Mussabayev
- Laboratory of Informational Processes Analysis and Modelling, Institute of Information and Computational Technologies, Almaty, 050000, Kazakhstan
| | - Мamura Begmanova
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Almira Amirgalyeva
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Olga Vishnyakova
- Department of Сynology, Republican Federation of Public Associations of Hunters and Hunting Societies "Kansonar", Almaty, 050008, Kazakhstan
| | - Inna Nazarenko
- Department of Сynology, Republican Federation of Public Associations of Hunters and Hunting Societies "Kansonar", Astana, 020000, Kazakhstan
| | - Assel Zhaxsylykova
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Arailym Yerzhan
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Yuliya Perfilyeva
- Laboratory of Molecular Immunology and Immunobiotechnology, M.A. Aitkhozhin's Institute of Molecular Biology and Biochemistry, Almaty, 050012, Kazakhstan
| | | | - Anna Khamchukova
- Laboratory of Biocenology and Hunting management, Institute of Zoology, Almaty, 050060, Kazakhstan
| | - Konstantin Plakhov
- Laboratory of Biocenology and Hunting management, Institute of Zoology, Almaty, 050060, Kazakhstan
| | - Aibyn Torekhanov
- Kazakh Research Institute of Livestock and Fodder Production, Almaty, 050071, Kazakhstan
| | - Leyla Djansugurova
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Gulnur Zhunussova
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| | - Bakhytzhan Bekmanov
- Laboratory of Molecular Genetics, Institute of Genetics and Physiology, Almaty, 050060, Kazakhstan
| |
Collapse
|
2
|
The Two Domains of the Avian Double-β-Defensin AvBD11 Have Different Ancestors, Common with Potential Monodomain Crocodile and Turtle Defensins. BIOLOGY 2022; 11:biology11050690. [PMID: 35625418 PMCID: PMC9138766 DOI: 10.3390/biology11050690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 04/08/2022] [Accepted: 04/16/2022] [Indexed: 12/12/2022]
Abstract
Simple Summary Vertebrate defensins are a multigene family of antimicrobial peptides that evolved following a series of gene duplication and divergence events during the expansion of vertebrates. In birds, the repertoire of avian defensins contains an atypical defensin, namely AvBD11 (avian beta-defensin 11), which consists of two repeated but divergent defensin units (or domains) while most vertebrate defensins only possess one unit. In this study, we investigated the evolutionary scenario leading to the formation of this double defensin in birds by comparing each defensin unit of AvBD11 with other defensins from birds and closely related reptiles (crocodile, turtles) predicted to have a single defensin unit. Our most outstanding results suggest that the double defensin AvBD11 probably appeared following a fusion of two ancestral genes or from an ancestral double defensin, but not from a recent internal duplication as it can be observed in other types of proteins with domain repeats. Abstract Beta-defensins are an essential group of cysteine-rich host-defence peptides involved in vertebrate innate immunity and are generally monodomain. Among bird defensins, the avian β-defensin 11 (AvBD11) is unique because of its peculiar structure composed of two β-defensin domains. The reasons for the appearance of such ‘polydefensins’ during the evolution of several, but not all branches of vertebrates, still remain an open question. In this study, we aimed at exploring the origin and evolution of the bird AvBD11 using a phylogenetic approach. Although they are homologous, the N- and C-terminal domains of AvBD11 share low protein sequence similarity and possess different cysteine spacing patterns. Interestingly, strong variations in charge properties can be observed on the C-terminal domain depending on bird species but, despite this feature, no positive selection was detected on the AvBD11 gene (neither on site nor on branches). The comparison of AvBD11 protein sequences in different bird species, however, suggests that some amino acid residues may have undergone convergent evolution. The phylogenetic tree of avian defensins revealed that each domain of AvBD11 is distant from ovodefensins (OvoDs) and may have arisen from different ancestral defensins. Strikingly, our phylogenetic analysis demonstrated that each domain of AvBD11 has common ancestors with different putative monodomain β-defensins from crocodiles and turtles and are even more closely related with these reptilian defensins than with their avian paralogs. Our findings support that AvBD11′s domains, which differ in their cysteine spacing and charge distribution, do not result from a recent internal duplication but most likely originate from a fusion of two different ancestral genes or from an ancestral double-defensin arisen before the Testudines-Archosauria split.
Collapse
|
3
|
de Oliveira Martins L, Bloomfield S, Stoakes E, Grant AJ, Page AJ, Mather AE. Tatajuba: exploring the distribution of homopolymer tracts. NAR Genom Bioinform 2022; 4:lqac003. [PMID: 35118377 PMCID: PMC8808543 DOI: 10.1093/nargab/lqac003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 11/18/2021] [Accepted: 01/05/2022] [Indexed: 11/14/2022] Open
Abstract
Length variation of homopolymeric tracts, which induces phase variation, is known to regulate gene expression leading to phenotypic variation in a wide range of bacterial species. There is no specialized bioinformatics software which can, at scale, exhaustively explore and describe these features from sequencing data. Identifying these is non-trivial as sequencing and bioinformatics methods are prone to introducing artefacts when presented with homopolymeric tracts due to the decreased base diversity. We present tatajuba, which can automatically identify potential homopolymeric tracts and help predict their putative phenotypic impact, allowing for rapid investigation. We use it to detect all tracts in two separate datasets, one of Campylobacter jejuni and one of three Bordetella species, and to highlight those tracts that are polymorphic across samples. With this we confirm homopolymer tract variation with phenotypic impact found in previous studies and additionally find many more with potential variability. The software is written in C and is available under the open source licence GNU GPLv3.
Collapse
Affiliation(s)
| | - Samuel Bloomfield
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Emily Stoakes
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Andrew J Grant
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Andrew J Page
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Alison E Mather
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| |
Collapse
|
4
|
Cruaud A, Delvare G, Nidelet S, Sauné L, Ratnasingham S, Chartois M, Blaimer BB, Gates M, Brady SG, Faure S, van Noort S, Rossi JP, Rasplus JY. Ultra-Conserved Elements and morphology reciprocally illuminate conflicting phylogenetic hypotheses in Chalcididae (Hymenoptera, Chalcidoidea). Cladistics 2021; 37:1-35. [PMID: 34478176 DOI: 10.1111/cla.12416] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/15/2020] [Indexed: 11/30/2022] Open
Abstract
Recent technical advances combined with novel computational approaches have promised the acceleration of our understanding of the tree of life. However, when it comes to hyperdiverse and poorly known groups of invertebrates, studies are still scarce. As published phylogenies will be rarely challenged by future taxonomists, careful attention must be paid to potential analytical bias. We present the first molecular phylogenetic hypothesis for the family Chalcididae, a group of parasitoid wasps, with a representative sampling (144 ingroups and seven outgroups) that covers all described subfamilies and tribes, and 82% of the known genera. Analyses of 538 Ultra-Conserved Elements (UCEs) with supermatrix (RAxML and IQTREE) and gene tree reconciliation approaches (ASTRAL, ASTRID) resulted in highly supported topologies in overall agreement with morphology but reveal conflicting topologies for some of the deepest nodes. To resolve these conflicts, we explored the phylogenetic tree space with clustering and gene genealogy interrogation methods, analyzed marker and taxon properties that could bias inferences and performed a thorough morphological analysis (130 characters encoded for 40 taxa representative of the diversity). This joint analysis reveals that UCEs enable attainment of resolution between ancestry and convergent/divergent evolution when morphology is not informative enough, but also shows that a systematic exploration of bias with different analytical methods and a careful analysis of morphological features is required to prevent publication of artifactual results. We highlight a GC content bias for maximum-likelihood approaches, an artifactual mid-point rooting of the ASTRAL tree and a deleterious effect of high percentage of missing data (>85% missing UCEs) on gene tree reconciliation methods. Based on the results we propose a new classification of the family into eight subfamilies and ten tribes that lay the foundation for future studies on the evolutionary history of Chalcididae.
Collapse
Affiliation(s)
- Astrid Cruaud
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| | - Gérard Delvare
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France.,UMR CBGP, CIRAD, F-34398, Montpellier, France
| | - Sabine Nidelet
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| | - Laure Sauné
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| | | | - Marguerite Chartois
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| | | | - Michael Gates
- USDA, ARS, SEL, c/o Smithsonian Institution, National Museum of Natural History, Washington, DC, USA
| | - Seán G Brady
- Department of Entomology, Smithsonian Institution, National Museum of Natural History, Washington, DC, USA
| | - Sariana Faure
- Department of Zoology and Entomology, Rhodes University, Grahamstown, South Africa
| | - Simon van Noort
- Research and Exhibitions Department, South African Museum, Iziko Museums of South Africa, PO Box 61, Cape Town, 8000, South Africa.,Department of Biological Sciences, University of Cape Town, Private Bag, Rondebosch, 7701, Cape Town, South Africa
| | - Jean-Pierre Rossi
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| | - Jean-Yves Rasplus
- CBGP, CIRAD, INRAe, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, France
| |
Collapse
|
5
|
Barratt JLN, Sapp SGH. Machine learning-based analyses support the existence of species complexes for Strongyloides fuelleborni and Strongyloides stercoralis. Parasitology 2020; 147:1184-1195. [PMID: 32539880 PMCID: PMC7443747 DOI: 10.1017/s0031182020000979] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Human strongyloidiasis is a serious disease mostly attributable to Strongyloides stercoralis and to a lesser extent Strongyloides fuelleborni, a parasite mainly of non-human primates. The role of animals as reservoirs of human-infecting Strongyloides is ill-defined, and whether dogs are a source of human infection is debated. Published multi-locus sequence typing (MLST) studies attempt to elucidate relationships between Strongyloides genotypes, hosts, and distributions, but typically examine relatively few worms, making it difficult to identify population-level trends. Combining MLST data from multiple studies is often impractical because they examine different combinations of loci, eliminating phylogeny as a means of examining these data collectively unless hundreds of specimens are excluded. A recently-described machine learning approach that facilitates clustering of MLST data may offer a solution, even for datasets that include specimens sequenced at different combinations of loci. By clustering various MLST datasets as one using this procedure, we sought to uncover associations among genotype, geography, and hosts that remained elusive when examining datasets individually. Multiple datasets comprising hundreds of S. stercoralis and S. fuelleborni individuals were combined and clustered. Our results suggest that the commonly proposed 'two lineage' population structure of S. stercoralis (where lineage A infects humans and dogs, lineage B only dogs) is an over-simplification. Instead, S. stercoralis seemingly represents a species complex, including two distinct populations over-represented in dogs, and other populations vastly more common in humans. A distinction between African and Asian S. fuelleborni is also supported here, emphasizing the need for further resolving these taxonomic relationships through modern investigations.
Collapse
Affiliation(s)
- Joel L N Barratt
- Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Parasitic Diseases Branch, Atlanta, USA
- Oak Ridge Associated Universities, Oak Ridge, Tennessee, USA
| | - Sarah G H Sapp
- Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Parasitic Diseases Branch, Atlanta, USA
| |
Collapse
|
6
|
Boutte J, Fishbein M, Liston A, Straub SCK. NGS-Indel Coder: A pipeline to code indel characters in phylogenomic data with an example of its application in milkweeds (Asclepias). Mol Phylogenet Evol 2019; 139:106534. [PMID: 31212081 DOI: 10.1016/j.ympev.2019.106534] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 05/12/2019] [Accepted: 06/13/2019] [Indexed: 12/30/2022]
Abstract
Targeted genome sequencing approaches allow characterization of evolutionary relationships using a considerable number of nuclear genes and informative characters. However, most phylogenomic analyses only utilize single nucleotide polymorphisms (SNPs). Studies at the species level, especially in groups that have recently radiated, often recover low amounts of phylogenetically informative variation in coding regions, and require non-coding sequences, which are richer in indels, to resolve gene trees. Here, NGS-Indel Coder, a pipeline to detect and omit false positive indels inferred from assemblies of short read sequence data, was developed to resolve the relationships among and within major clades of the American milkweeds (Asclepias), which are the result of a rapid and recent evolutionary radiation, and whose phylogeny has been difficult to resolve. This pipeline was applied to a Hyb-Seq data set of 768 loci including targeted exons and flanking intron regions from 33 milkweed species. Robust species tree inference was improved by excluding small alignment partitions (<100 bp) that increased gene tree ambiguity and incongruence. To further investigate the robustness of indel coding, data sets that included small and large indels were explored, and species trees derived from concatenated loci versus coalescent methods based on gene trees were compared. The phylogeny of Asclepias obtained using nuclear data was well resolved, and phylogenetic information from indels improved resolution of specific nodes. The Temperate North American, Mexican Highland, and Incarnatae clades were well supported as monophyletic. Asclepias coulteri, which has been considered part of the Sonoran Desert clade based on plastome analyses, was placed as sister to all the other milkweed species studied here, rather than as a member of that clade. Two groups within the Temperate North American and Mexican clades were not resolved, and the inferred relationships strongly conflicted when comparing results based on data sets that did or did not include indel characters. This new pipeline represents a step forward in making maximal use of the information content in phylogenomic data sets.
Collapse
Affiliation(s)
- Julien Boutte
- Department of Biology, Hobart and William Smith Colleges, Geneva, NY, USA
| | - Mark Fishbein
- Department of Plant Biology, Ecology and Evolution, Oklahoma State University, Stillwater, OK, USA
| | - Aaron Liston
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Shannon C K Straub
- Department of Biology, Hobart and William Smith Colleges, Geneva, NY, USA.
| |
Collapse
|