1
|
Gorin I, Balanovsky O, Kozlov O, Koshel S, Kostryukova E, Zhabagin M, Agdzhoyan A, Pylev V, Balanovska E. Determining the Area of Ancestral Origin for Individuals From North Eurasia Based on 5,229 SNP Markers. Front Genet 2022; 13:902309. [PMID: 35651934 PMCID: PMC9149316 DOI: 10.3389/fgene.2022.902309] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 04/26/2022] [Indexed: 11/26/2022] Open
Abstract
Currently available genetic tools effectively distinguish between different continental origins. However, North Eurasia, which constitutes one-third of the world’s largest continent, remains severely underrepresented. The dataset used in this study represents 266 populations from 12 North Eurasian countries, including most of the ethnic diversity across Russia’s vast territory. A total of 1,883 samples were genotyped using the Illumina Infinium Omni5Exome-4 v1.3 BeadChip. Three principal components were computed for the entire dataset using three iterations for outlier removal. It allowed the merging of 266 populations into larger groups while maintaining intragroup homogeneity, so 29 ethnic geographic groups were formed that were genetically distinguishable enough to trace individual ancestry. Several feature selection methods, including the random forest algorithm, were tested to estimate the number of genetic markers needed to differentiate between the groups; 5,229 ancestry-informative SNPs were selected. We tested various classifiers supporting multiple classes and output values for each class that could be interpreted as probabilities. The logistic regression was chosen as the best mathematical model for predicting ancestral populations. The machine learning algorithm for inferring an ancestral ethnic geographic group was implemented in the original software “Homeland” fitted with the interface module, the prediction module, and the cartographic module. Examples of geographic maps showing the likelihood of geographic ancestry for individuals from different regions of North Eurasia are provided. Validating methods show that the highest number of ethnic geographic group predictions with almost absolute accuracy and sensitivity was observed for South and Central Siberia, Far East, and Kamchatka. The total accuracy of prediction of one of 29 ethnic geographic groups reached 71%. The proposed method can be employed to predict ancestries from the populations of Russia and its neighbor states. It can be used for the needs of forensic science and genetic genealogy.
Collapse
Affiliation(s)
- Igor Gorin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Moscow Institute of Physics and Technology, Dolgoprudny, Russia.,Research Centre for Medical Genetics, Moscow, Russia
| | - Oleg Balanovsky
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Research Centre for Medical Genetics, Moscow, Russia.,Biobank of North Eurasia, Moscow, Russia
| | - Oleg Kozlov
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Sergey Koshel
- Research Centre for Medical Genetics, Moscow, Russia.,Faculty of Geography, Lomonosov Moscow State University, Moscow, Russia
| | - Elena Kostryukova
- Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, Russia
| | - Maxat Zhabagin
- National Center for Biotechnology, Nur-Sultan, Kazakhstan
| | - Anastasiya Agdzhoyan
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Research Centre for Medical Genetics, Moscow, Russia
| | - Vladimir Pylev
- Research Centre for Medical Genetics, Moscow, Russia.,Biobank of North Eurasia, Moscow, Russia
| | - Elena Balanovska
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Research Centre for Medical Genetics, Moscow, Russia.,Biobank of North Eurasia, Moscow, Russia
| |
Collapse
|
2
|
Moshkov N, Smetanin A, Tatarinova TV. Local ancestry prediction with PyLAE. PeerJ 2021; 9:e12502. [PMID: 35003914 PMCID: PMC8679960 DOI: 10.7717/peerj.12502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 10/26/2021] [Indexed: 11/20/2022] Open
Abstract
Summary We developed PyLAE, a new tool for determining local ancestry along a genome using whole-genome sequencing data or high-density genotyping experiments. PyLAE can process an arbitrarily large number of ancestral populations (with or without an informative prior). Since PyLAE does not involve estimating many parameters, it can process thousands of genomes within a day. PyLAE can run on phased or unphased genomic data. We have shown how PyLAE can be applied to the identification of differentially enriched pathways between populations. The local ancestry approach results in higher enrichment scores compared to whole-genome approaches. We benchmarked PyLAE using the 1000 Genomes dataset, comparing the aggregated predictions with the global admixture results and the current gold standard program RFMix. Computational efficiency, minimal requirements for data pre-processing, straightforward presentation of results, and ease of installation make PyLAE a valuable tool to study admixed populations. Availability and implementation The source code and installation manual are available at https://github.com/smetam/pylae.
Collapse
Affiliation(s)
- Nikita Moshkov
- Doctoral School of Interdisciplinary Medicine, University of Szeged, Szeged, Hungary
- Synthetic and Systems Biology Unit, Biological Research Centre, Szeged, Hungary
- Atlas Biomed Group Limited, London, United Kingdom
- Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, Moscow, Russia
| | | | - Tatiana V. Tatarinova
- Department of Biology, University of La Verne, La Verne, CA, United States
- Siberian Federal University, Krasnoyarsk, Russia
- Institute of General Genetics, Moscow, Russia, Moscow, Russia
- Institute for Information Transmission Problems, Moscow, Russia, Moscow, Russia
| |
Collapse
|
3
|
Agranat-Tamir L, Waldman S, Rosen N, Yakir B, Carmi S, Carmel L. LINADMIX: Evaluating the effect of ancient admixture events on modern populations. Bioinformatics 2021; 37:4744-4755. [PMID: 34270685 DOI: 10.1093/bioinformatics/btab531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 06/25/2021] [Accepted: 07/15/2021] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION The rise in the number of genotyped ancient individuals provides an opportunity to estimate population admixture models for many populations. However, in models describing modern populations as mixtures of ancient ones, it is typically difficult to estimate the model mixing coefficients and to evaluate its fit to the data. RESULTS We present LINADMIX, designed to tackle this problem by solving a constrained linear model when both the ancient and the modern genotypes are represented in a low-dimensional space. LINADMIX estimates the mixing coefficients and their standard errors, and computes a p-value for testing the model fit to the data. We quantified the performance of LINADMIX using an extensive set of simulated studies. We show that LINADMIX can accurately estimate admixture coefficients, and is robust to factors such as population size, genetic drift, proportion of missing data, and various types of model misspecification. AVAILABILITY LINADMIX is available as a python code at https://github.com/swidler/linadmix. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lily Agranat-Tamir
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem 9190401, Israel.,Department of Statistics and Data Science, The Hebrew University of Jerusalem, Jerusalem 9190501, Israel
| | - Shamam Waldman
- Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Naomi Rosen
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem 9190401, Israel
| | - Benjamin Yakir
- Department of Statistics and Data Science, The Hebrew University of Jerusalem, Jerusalem 9190501, Israel
| | - Shai Carmi
- Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Liran Carmel
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem 9190401, Israel
| |
Collapse
|
4
|
Benítez-Burraco A, Chekalin E, Bruskin S, Tatarinova T, Morozova I. Recent selection of candidate genes for mammal domestication in Europeans and language change in Europe: a hypothesis. Ann Hum Biol 2021; 48:313-320. [PMID: 34241552 DOI: 10.1080/03014460.2021.1936634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
BACKGROUND AND AIM Human evolution resulted from changes in our biology, behaviour, and culture. One source of these changes has been hypothesised to be our self-domestication (that is, the development in humans of features commonly found in domesticated strains of mammals, seemingly as a result of selection for reduced aggression). Signals of domestication, notably brain size reduction, have increased in recent times. METHODS In this paper, we compare whole-genome data between the Late Neolithic/Bronze Age individuals and modern Europeans. RESULTS We show that genes associated with mammal domestication and with neural crest development and function are significantly differently enriched in nonsynonymous single nucleotide polymorphisms between these two groups. CONCLUSION We hypothesise that these changes might account for the increased features of self-domestication in modern humans and, ultimately, for subtle recent changes in human cognition and behaviour, including language.
Collapse
Affiliation(s)
- Antonio Benítez-Burraco
- Department of Spanish, Linguistics, and Theory of Literature, Faculty of Philology, University of Seville, Seville, Spain
| | - Evgeny Chekalin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Sergey Bruskin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Tatiana Tatarinova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Department of Biology, University of La Verne, La Verne, CA, USA.,A. A. Kharkevich Institute for Information Transmission Problems, Moscow, Russia.,Department of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, Russia
| | - Irina Morozova
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| |
Collapse
|
5
|
The Genomic History of the Bronze Age Southern Levant. Cell 2020; 181:1146-1157.e11. [DOI: 10.1016/j.cell.2020.04.024] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 02/10/2020] [Accepted: 04/15/2020] [Indexed: 01/27/2023]
|
6
|
Chekalin E, Rubanovich A, Tatarinova TV, Kasianov A, Bender N, Chekalina M, Staub K, Koepke N, Rühli F, Bruskin S, Morozova I. Changes in Biological Pathways During 6,000 Years of Civilization in Europe. Mol Biol Evol 2019; 36:127-140. [PMID: 30376122 DOI: 10.1093/molbev/msy201] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The beginning of civilization was a turning point in human evolution. With increasing separation from the natural environment, mankind stimulated new adaptive reactions in response to new environmental factors. In this paper, we describe direct signs of these reactions in the European population during the past 6,000 years. By comparing whole-genome data between Late Neolithic/Bronze Age individuals and modern Europeans, we revealed biological pathways that are significantly differently enriched in nonsynonymous single nucleotide polymorphisms in these two groups and which therefore could be shaped by cultural practices during the past six millennia. They include metabolic transformations, immune response, signal transduction, physical activity, sensory perception, reproduction, and cognitive functions. We demonstrated that these processes were influenced by different types of natural selection. We believe that our study opens new perspectives for more detailed investigations about when and how civilization has been modifying human genomes.
Collapse
Affiliation(s)
- Evgeny Chekalin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Alexandr Rubanovich
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Tatiana V Tatarinova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Department of Biology, University of La Verne, La Verne, CA.,A. A. Kharkevich Institute for Information Transmission Problems, Moscow, Russia.,Department of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, Russia
| | - Artem Kasianov
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Nicole Bender
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| | - Marina Chekalina
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Kaspar Staub
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| | - Nikola Koepke
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| | - Frank Rühli
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| | - Sergey Bruskin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Irina Morozova
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| |
Collapse
|
7
|
Das R, Upadhyai P. Application of the geographic population structure (GPS) algorithm for biogeographical analyses of wild and captive gorillas. BMC Bioinformatics 2019; 20:35. [PMID: 30717677 PMCID: PMC6362561 DOI: 10.1186/s12859-018-2568-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Background The utilization of high resolution genome data has important implications for the phylogeographical evaluation of non-human species. Biogeographical analyses can yield detailed understanding of their population biology and facilitate the geo-localization of individuals to promote their efficacious management, particularly when bred in captivity. The Geographic Population Structure (GPS) algorithm is an admixture based tool for inference of biogeographical affinities and has been employed for the geo-localization of various human populations worldwide. Here, we applied the GPS tool for biogeographical analyses and localization of the ancestral origins of wild and captive gorilla genomes, of unknown geographic source, available in the Great Ape Genome Project (GAGP), employing Gorillas with known ancestral origin as the reference data. Results Our findings suggest that GPS was successful in recapitulating the population history and estimating the geographic origins of all gorilla genomes queried and localized the wild gorillas with unknown geographical origin < 150 km of National Parks/Wildlife Reserves within the political boundaries of countries, considered as prominent modern-day abode for gorillas in the wild. Further, the GPS localization of most captive-born gorillas was congruent with their previously presumed ancestral homes. Conclusions Currently there is limited knowledge of the ancestral origins of most North American captive gorillas, and our study highlights the usefulness of GPS for inferring ancestry of captive gorillas. Determination of the native geographical source of captive gorillas can provide valuable information to guide breeding programs and ensure their appropriate management at the population level. Finally, our findings shine light on the broader applicability of GPS for protecting the genetic integrity of other endangered non-human species, where controlled breeding is a vital component of their conservation. Electronic supplementary material The online version of this article (10.1186/s12859-018-2568-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ranajit Das
- Manipal Centre for Natural Sciences (MCNS), Manipal Academy of Higher Education (MAHE), University building, Lab 11, Madhav Nagar, Manipal, Karnataka, 576104, India.
| | - Priyanka Upadhyai
- Department of Medical Genetics, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, Karnataka, India
| |
Collapse
|
8
|
Affiliation(s)
- Tatiana V Tatarinova
- Department of Biology, University of La Verne, La Verne, CA, USA
- Department of Fundamental Biology and Biotechnology, Siberian Federal University, 660074, Krasnoyarsk, Russia
- Vavilov Institute of General Genetics RAS, Moscow, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Ming Chen
- Zhejiang University, Hangzhou, 310058, China
| | - Yuriy L Orlov
- Institute of Cytology and Genetics SB RAS, 630090, Novosibirsk, Russia.
- Novosibirsk State University, 630090, Novosibirsk, Russia.
- A.O. Kovalevsky Institute of Marine Biological Research of RAS, 119334, Moscow, Russia.
| |
Collapse
|
9
|
Triska P, Chekanov N, Stepanov V, Khusnutdinova EK, Kumar GPA, Akhmetova V, Babalyan K, Boulygina E, Kharkov V, Gubina M, Khidiyatova I, Khitrinskaya I, Khrameeva EE, Khusainova R, Konovalova N, Litvinov S, Marusin A, Mazur AM, Puzyrev V, Ivanoshchuk D, Spiridonova M, Teslyuk A, Tsygankova S, Triska M, Trofimova N, Vajda E, Balanovsky O, Baranova A, Skryabin K, Tatarinova TV, Prokhortchouk E. Between Lake Baikal and the Baltic Sea: genomic history of the gateway to Europe. BMC Genet 2017; 18:110. [PMID: 29297395 PMCID: PMC5751809 DOI: 10.1186/s12863-017-0578-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND The history of human populations occupying the plains and mountain ridges separating Europe from Asia has been eventful, as these natural obstacles were crossed westward by multiple waves of Turkic and Uralic-speaking migrants as well as eastward by Europeans. Unfortunately, the material records of history of this region are not dense enough to reconstruct details of population history. These considerations stimulate growing interest to obtain a genetic picture of the demographic history of migrations and admixture in Northern Eurasia. RESULTS We genotyped and analyzed 1076 individuals from 30 populations with geographical coverage spanning from Baltic Sea to Baikal Lake. Our dense sampling allowed us to describe in detail the population structure, provide insight into genomic history of numerous European and Asian populations, and significantly increase quantity of genetic data available for modern populations in region of North Eurasia. Our study doubles the amount of genome-wide profiles available for this region. We detected unusually high amount of shared identical-by-descent (IBD) genomic segments between several Siberian populations, such as Khanty and Ket, providing evidence of genetic relatedness across vast geographic distances and between speakers of different language families. Additionally, we observed excessive IBD sharing between Khanty and Bashkir, a group of Turkic speakers from Southern Urals region. While adding some weight to the "Finno-Ugric" origin of Bashkir, our studies highlighted that the Bashkir genepool lacks the main "core", being a multi-layered amalgamation of Turkic, Ugric, Finnish and Indo-European contributions, which points at intricacy of genetic interface between Turkic and Uralic populations. Comparison of the genetic structure of Siberian ethnicities and the geography of the region they inhabit point at existence of the "Great Siberian Vortex" directing genetic exchanges in populations across the Siberian part of Asia. Slavic speakers of Eastern Europe are, in general, very similar in their genetic composition. Ukrainians, Belarusians and Russians have almost identical proportions of Caucasus and Northern European components and have virtually no Asian influence. We capitalized on wide geographic span of our sampling to address intriguing question about the place of origin of Russian Starovers, an enigmatic Eastern Orthodox Old Believers religious group relocated to Siberia in seventeenth century. A comparative reAdmix analysis, complemented by IBD sharing, placed their roots in the region of the Northern European Plain, occupied by North Russians and Finno-Ugric Komi and Karelian people. Russians from Novosibirsk and Russian Starover exhibit ancestral proportions close to that of European Eastern Slavs, however, they also include between five to 10 % of Central Siberian ancestry, not present at this level in their European counterparts. CONCLUSIONS Our project has patched the hole in the genetic map of Eurasia: we demonstrated complexity of genetic structure of Northern Eurasians, existence of East-West and North-South genetic gradients, and assessed different inputs of ancient populations into modern populations.
Collapse
MESH Headings
- Algorithms
- Asia
- DNA
- Datasets as Topic
- Emigration and Immigration/history
- Ethnicity/genetics
- Europe
- Female
- Genetic Variation
- Genetics, Population
- Genotyping Techniques
- History, 15th Century
- History, 16th Century
- History, 17th Century
- History, 18th Century
- History, 19th Century
- History, 20th Century
- History, 21st Century
- History, Ancient
- History, Medieval
- Humans
- Male
- Russia
Collapse
Affiliation(s)
- Petr Triska
- Children's Hospital Los Angeles, Los Angeles, CA, USA
| | - Nikolay Chekanov
- Federal State Institution "Federal Research Centre «Fundamentals of Biotechnology» of the Russian Academy of Sciences", Moscow, Russia
- "Genoanalytica" CJSC, Moscow, Russia
| | - Vadim Stepanov
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Elza K Khusnutdinova
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
- Bashkir State University, Ufa, Russia
| | | | - Vita Akhmetova
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
| | - Konstantin Babalyan
- Moscow Institute of Physics and Technology, Department of Molecular and Bio-Physics, Moscow, Russia
| | | | - Vladimir Kharkov
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Marina Gubina
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
| | - Irina Khidiyatova
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
- Bashkir State University, Ufa, Russia
| | - Irina Khitrinskaya
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Ekaterina E Khrameeva
- "Genoanalytica" CJSC, Moscow, Russia
- Skolkovo Institute of Science and Technology, Skolkovo Innovation Center, Moscow, Russia
| | - Rita Khusainova
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
- Bashkir State University, Ufa, Russia
| | | | - Sergey Litvinov
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
| | - Andrey Marusin
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Alexandr M Mazur
- Federal State Institution "Federal Research Centre «Fundamentals of Biotechnology» of the Russian Academy of Sciences", Moscow, Russia
| | - Valery Puzyrev
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Dinara Ivanoshchuk
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
| | - Maria Spiridonova
- Institute of Medical Genetics, Tomsk National Medical Research Center, Russian Academy of Sciences, Siberian Branch, Tomsk, Russia
| | - Anton Teslyuk
- Moscow Institute of Physics and Technology, Department of Molecular and Bio-Physics, Moscow, Russia
| | - Svetlana Tsygankova
- Moscow Institute of Physics and Technology, Department of Molecular and Bio-Physics, Moscow, Russia
| | - Martin Triska
- Children's Hospital Los Angeles, Los Angeles, CA, USA
| | - Natalya Trofimova
- Institute of Biochemistry and Genetics, Russian Academy of Sciences, Ufa Scientific Centre of Russian Academy of Sciences, Ufa, Russia
| | - Edward Vajda
- Department of Modern and Classical Languages, Western Washington University, Bellingham, WA, USA
| | - Oleg Balanovsky
- Research Centre for Medical Genetics, Moscow, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
| | - Ancha Baranova
- Research Centre for Medical Genetics, Moscow, Russia
- School of Systems Biology, George Mason University, Fairfax, VA, USA
- Atlas Biomed Group, Moscow, Russia
| | - Konstantin Skryabin
- Federal State Institution "Federal Research Centre «Fundamentals of Biotechnology» of the Russian Academy of Sciences", Moscow, Russia
- Russian Scientific Centre "Kurchatov Institute", Moscow, Russia
- Department of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Tatiana V Tatarinova
- Vavilov Institute of General Genetics, Moscow, Russia.
- School of Systems Biology, George Mason University, Fairfax, VA, USA.
- Atlas Biomed Group, Moscow, Russia.
- Department of Biology, University of La Verne, La Verne, CA, USA.
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia.
| | - Egor Prokhortchouk
- Federal State Institution "Federal Research Centre «Fundamentals of Biotechnology» of the Russian Academy of Sciences", Moscow, Russia.
- Department of Biology, Lomonosov Moscow State University, Moscow, Russia.
| |
Collapse
|
10
|
Ponomarenko P, Ryutov A, Maglinte DT, Baranova A, Tatarinova TV, Gai X. Clinical utility of the low-density Infinium QC genotyping Array in a genomics-based diagnostics laboratory. BMC Med Genomics 2017; 10:57. [PMID: 28985730 PMCID: PMC5639583 DOI: 10.1186/s12920-017-0297-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Accepted: 10/02/2017] [Indexed: 11/10/2022] Open
Abstract
Background With 15,949 markers, the low-density Infinium QC Array-24 BeadChip enables linkage analysis, HLA haplotyping, fingerprinting, ethnicity determination, mitochondrial genome variations, blood groups and pharmacogenomics. It represents an attractive independent QC option for NGS-based diagnostic laboratories, and provides cost-efficient means for determining gender, ethnic ancestry, and sample kinships, that are important for data interpretation of NGS-based genetic tests. Methods We evaluated accuracy and reproducibility of Infinium QC genotyping calls by comparing them with genotyping data of the same samples from other genotyping platforms, whole genome/exome sequencing. Accuracy and robustness of determining gender, provenance, and kinships were assessed. Results Concordance of genotype calls between Infinium QC and other platforms was above 99%. Here we show that the chip’s ancestry informative markers are sufficient for ethnicity determination at continental and sometimes subcontinental levels, with assignment accuracy varying with the coverage for a particular region and ethnic groups. Mean accuracies of provenance prediction at a regional level were varied from 81% for Asia, to 89% for Americas, 86% for Africa, 97% for Oceania, 98% for Europe, and 100% for India. Mean accuracy of ethnicity assignment predictions was 63%. Pairwise concordances of AFR samples with the samples from any other super populations were the lowest (0.39–0.43), while the concordances within the same population were relatively high (0.55–0.61). For all populations except African, cross-population comparisons were similar in their concordance ranges to the range of within-population concordances (0.54–0.57). Gender determination was correct in all tested cases. Conclusions Our results indicate that the Infinium QC Array-24 chip is suitable for cost-efficient, independent QC assaying in the settings of an NGS-based molecular diagnostic laboratory; hence, we recommend its integration into the standard laboratory workflow. Low-density chips can provide sample-specific measures for variant call accuracy, prevent sample mix-ups, validate self-reported ethnicities, and detect consanguineous cases. Integration of low-density chips into QC procedures aids proper interpretation of candidate sequence variants. To enhance utility of this low-density chip, we recommend expansion of ADME and mitochondrial markers. Inexpensive Infinium-like low-density human chips have a potential to become a “Swiss army knife” among genotyping assays suitable for many applications requiring high-throughput assays. Electronic supplementary material The online version of this article (10.1186/s12920-017-0297-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Petr Ponomarenko
- Department of Biology, University of La Verne, La Verne, CA, USA
| | - Alex Ryutov
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, CA, USA
| | - Dennis T Maglinte
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, CA, USA
| | - Ancha Baranova
- School of Systems Biology, George Mason University, Fairfax, VA, USA.,Research Center for Medical Genetics, Moscow, Russia.,Atlas Biomed Group, Moscow, Russia
| | - Tatiana V Tatarinova
- Department of Biology, University of La Verne, La Verne, CA, USA. .,School of Systems Biology, George Mason University, Fairfax, VA, USA. .,Atlas Biomed Group, Moscow, Russia.
| | - Xiaowu Gai
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, CA, USA. .,Department of Pathology and Laboratory Medicine, USC Keck School of Medicine, Los Angeles, CA, USA.
| |
Collapse
|
11
|
Flegontov P, Kassian A, Thomas MG, Fedchenko V, Changmai P, Starostin G. Pitfalls of the Geographic Population Structure (GPS) Approach Applied to Human Genetic History: A Case Study of Ashkenazi Jews. Genome Biol Evol 2016; 8:2259-65. [PMID: 27389685 PMCID: PMC4987117 DOI: 10.1093/gbe/evw162] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
In a recent interdisciplinary study, Das et al. have attempted to trace the homeland of Ashkenazi Jews and of their historical language, Yiddish (Das et al. 2016 Localizing Ashkenazic Jews to Primeval Villages in the Ancient Iranian Lands of Ashkenaz. Genome Biol Evol. 8:1132-1149). Das et al. applied the geographic population structure (GPS) method to autosomal genotyping data and inferred geographic coordinates of populations supposedly ancestral to Ashkenazi Jews, placing them in Eastern Turkey. They argued that this unexpected genetic result goes against the widely accepted notion of Ashkenazi origin in the Levant, and speculated that Yiddish was originally a Slavic language strongly influenced by Iranian and Turkic languages, and later remodeled completely under Germanic influence. In our view, there are major conceptual problems with both the genetic and linguistic parts of the work. We argue that GPS is a provenancing tool suited to inferring the geographic region where a modern and recently unadmixed genome is most likely to arise, but is hardly suitable for admixed populations and for tracing ancestry up to 1,000 years before present, as its authors have previously claimed. Moreover, all methods of historical linguistics concur that Yiddish is a Germanic language, with no reliable evidence for Slavic, Iranian, or Turkic substrata.
Collapse
Affiliation(s)
- Pavel Flegontov
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic Institute of Parasitology, Biology Centre, Czech Academy of Sciences, Ceske Budejovice, Czech Republic A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia ,
| | - Alexei Kassian
- Institute of Linguistics, Russian Academy of Sciences, Moscow, Russia Russian Presidential Academy of National Economy and Public Administration (RANEPA), Moscow, Russia ,
| | - Mark G Thomas
- Research Department of Genetics, Evolution and Environment, University College London, United Kingdom
| | | | - Piya Changmai
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic
| | - George Starostin
- Russian Presidential Academy of National Economy and Public Administration (RANEPA), Moscow, Russia Russian State University for the Humanities, Moscow, Russia
| |
Collapse
|
12
|
Morozova I, Flegontov P, Mikheyev AS, Bruskin S, Asgharian H, Ponomarenko P, Klyuchnikov V, ArunKumar G, Prokhortchouk E, Gankin Y, Rogaev E, Nikolsky Y, Baranova A, Elhaik E, Tatarinova TV. Toward high-resolution population genomics using archaeological samples. DNA Res 2016; 23:295-310. [PMID: 27436340 PMCID: PMC4991838 DOI: 10.1093/dnares/dsw029] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Accepted: 05/22/2016] [Indexed: 12/30/2022] Open
Abstract
The term ‘ancient DNA’ (aDNA) is coming of age, with over 1,200 hits in the PubMed database, beginning in the early 1980s with the studies of ‘molecular paleontology’. Rooted in cloning and limited sequencing of DNA from ancient remains during the pre-PCR era, the field has made incredible progress since the introduction of PCR and next-generation sequencing. Over the last decade, aDNA analysis ushered in a new era in genomics and became the method of choice for reconstructing the history of organisms, their biogeography, and migration routes, with applications in evolutionary biology, population genetics, archaeogenetics, paleo-epidemiology, and many other areas. This change was brought by development of new strategies for coping with the challenges in studying aDNA due to damage and fragmentation, scarce samples, significant historical gaps, and limited applicability of population genetics methods. In this review, we describe the state-of-the-art achievements in aDNA studies, with particular focus on human evolution and demographic history. We present the current experimental and theoretical procedures for handling and analysing highly degraded aDNA. We also review the challenges in the rapidly growing field of ancient epigenomics. Advancement of aDNA tools and methods signifies a new era in population genetics and evolutionary medicine research.
Collapse
Affiliation(s)
- Irina Morozova
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
| | - Pavel Flegontov
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic Bioinformatics Center, A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russian Federation
| | - Alexander S Mikheyev
- Ecology and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Sergey Bruskin
- Vavilov Institute of General Genetics RAS, Moscow, Russia
| | - Hosseinali Asgharian
- Department of Computational and Molecular Biology, University of Southern California, Los Angeles, CA, USA
| | - Petr Ponomarenko
- Center for Personalized Medicine, Children's Hospital Los Angeles, Los Angeles, CA, USA Spatial Sciences Institute, University of Southern California, Los Angeles, CA, USA
| | | | | | - Egor Prokhortchouk
- Research Center of Biotechnology RAS, Moscow, Russia Department of Biology, Lomonosov Moscow State University, Russia
| | | | - Evgeny Rogaev
- Vavilov Institute of General Genetics RAS, Moscow, Russia University of Massachusetts Medical School, Worcester, MA, USA
| | - Yuri Nikolsky
- Vavilov Institute of General Genetics RAS, Moscow, Russia F1 Genomics, San Diego, CA, USA School of Systems Biology, George Mason University, VA, USA
| | - Ancha Baranova
- School of Systems Biology, George Mason University, VA, USA Research Centre for Medical Genetics, Moscow, Russia Atlas Biomed Group, Moscow, Russia
| | - Eran Elhaik
- Department of Animal & Plant Sciences, University of Sheffield, Sheffield, South Yorkshire, UK
| | - Tatiana V Tatarinova
- Bioinformatics Center, A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russian Federation Center for Personalized Medicine, Children's Hospital Los Angeles, Los Angeles, CA, USA Spatial Sciences Institute, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
13
|
Flegontov P, Changmai P, Zidkova A, Logacheva MD, Altınışık NE, Flegontova O, Gelfand MS, Gerasimov ES, Khrameeva EE, Konovalova OP, Neretina T, Nikolsky YV, Starostin G, Stepanova VV, Travinsky IV, Tříska M, Tříska P, Tatarinova TV. Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry. Sci Rep 2016; 6:20768. [PMID: 26865217 PMCID: PMC4750364 DOI: 10.1038/srep20768] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 01/07/2016] [Indexed: 01/11/2023] Open
Abstract
The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal'ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal'ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed.
Collapse
Affiliation(s)
- Pavel Flegontov
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budĕjovice, Czech Republic
| | - Piya Changmai
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Anastassiya Zidkova
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Maria D. Logacheva
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
- Pirogov Russian National Research Medical University, Moscow, Russia
| | - N. Ezgi Altınışık
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Olga Flegontova
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budĕjovice, Czech Republic
| | - Mikhail S. Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Evgeny S. Gerasimov
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Ekaterina E. Khrameeva
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
| | - Olga P. Konovalova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Tatiana Neretina
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Yuri V. Nikolsky
- Biomedical Cluster, Skolkovo Foundation, Skolkovo, Russia
- George Mason University, Fairfax, VA, USA
| | - George Starostin
- Russian State University for the Humanities, Moscow, Russia
- Russian Presidential Academy (RANEPA), Moscow, Russia
| | - Vita V. Stepanova
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
| | | | - Martin Tříska
- Children’s Hospital Los Angeles, Los Angeles, CA, USA
| | - Petr Tříska
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), Porto, Portugal
- Instituto de Ciências Biomédicas da Universidade do Porto (ICBAS), Porto, Portugal
| | - Tatiana V. Tatarinova
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Children’s Hospital Los Angeles, Los Angeles, CA, USA
- Spatial Sciences Institute, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
14
|
Kozlov KN, Samsonov AM, Samsonova MG. The differential evolution entirely parallel method for model adaptation in systems biology. Biophysics (Nagoya-shi) 2015. [DOI: 10.1134/s0006350915060160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
15
|
Bromberg Y, Capriotti E. VarI-SIG 2014--From SNPs to variants: interpreting different types of genetic variants. BMC Genomics 2015; 16 Suppl 8:I1. [PMID: 26110281 PMCID: PMC4480323 DOI: 10.1186/1471-2164-16-s8-i1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
|