1
|
Ringbauer H, Huang Y, Akbari A, Mallick S, Olalde I, Patterson N, Reich D. Accurate detection of identity-by-descent segments in human ancient DNA. Nat Genet 2024; 56:143-151. [PMID: 38123640 PMCID: PMC10786714 DOI: 10.1038/s41588-023-01582-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 10/20/2023] [Indexed: 12/23/2023]
Abstract
Long DNA segments shared between two individuals, known as identity-by-descent (IBD), reveal recent genealogical connections. Here we introduce ancIBD, a method for identifying IBD segments in ancient human DNA (aDNA) using a hidden Markov model and imputed genotype probabilities. We demonstrate that ancIBD accurately identifies IBD segments >8 cM for aDNA data with an average depth of >0.25× for whole-genome sequencing or >1× for 1240k single nucleotide polymorphism capture data. Applying ancIBD to 4,248 ancient Eurasian individuals, we identify relatives up to the sixth degree and genealogical connections between archaeological groups. Notably, we reveal long IBD sharing between Corded Ware and Yamnaya groups, indicating that the Yamnaya herders of the Pontic-Caspian Steppe and the Steppe-related ancestry in various European Corded Ware groups share substantial co-ancestry within only a few hundred years. These results show that detecting IBD segments can generate powerful insights into the growing aDNA record, both on a small scale relevant to life stories and on a large scale relevant to major cultural-historical events.
Collapse
Affiliation(s)
- Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
| | - Yilei Huang
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, Universität Leipzig, Leipzig, Germany
| | - Ali Akbari
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Iñigo Olalde
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- BIOMICs Research Group, University of the Basque Country, Vitoria-Gasteiz, Spain
- Ikerbasque-Basque Foundation of Science, Bilbao, Spain
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
- Department of Genetics, Harvard Medical School, Boston, MA, USA.
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
2
|
Harney É, Micheletti S, Bruwelheide KS, Freyman WA, Bryc K, Akbari A, Jewett E, Comer E, Louis Gates H, Heywood L, Thornton J, Curry R, Ancona Esselmann S, Barca KG, Sedig J, Sirak K, Olalde I, Adamski N, Bernardos R, Broomandkhoshbacht N, Ferry M, Qiu L, Stewardson K, Workman JN, Zalzala F, Mallick S, Micco A, Mah M, Zhang Z, Rohland N, Mountain JL, Owsley DW, Reich D. The genetic legacy of African Americans from Catoctin Furnace. Science 2023; 381:eade4995. [PMID: 37535739 PMCID: PMC10958645 DOI: 10.1126/science.ade4995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 06/20/2023] [Indexed: 08/05/2023]
Abstract
Few African Americans have been able to trace family lineages back to ancestors who died before the 1870 United States Census, the first in which all Black people were listed by name. We analyzed 27 individuals from Maryland's Catoctin Furnace African American Cemetery (1774-1850), identifying 41,799 genetic relatives among consenting research participants in 23andMe, Inc.'s genetic database. One of the highest concentrations of close relatives is in Maryland, suggesting that descendants of the Catoctin individuals remain in the area. We find that many of the Catoctin individuals derived African ancestry from the Wolof or Kongo groups and European ancestry from Great Britain and Ireland. This study demonstrates the power of joint analysis of historical DNA and large datasets generated through direct-to-consumer ancestry testing.
Collapse
Affiliation(s)
- Éadaoin Harney
- 23andMe, Inc.; Sunnyvale, CA 94043, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
| | | | - Karin S. Bruwelheide
- Department of Anthropology, National Museum of Natural History, Smithsonian Institution; Washington DC 20560, USA
| | | | | | - Ali Akbari
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
| | | | - Elizabeth Comer
- Catoctin Furnace Historical Society; Thurmont, MD, 21788, USA
| | - Henry Louis Gates
- Hutchins Center for African and African American Research, Harvard University; Cambridge, MA 02138, USA
| | - Linda Heywood
- Department of History/African American Studies, Boston University; Brookline, MA 02446, USA
| | - John Thornton
- Department of History/African American Studies, Boston University; Brookline, MA 02446, USA
| | - Roslyn Curry
- 23andMe, Inc.; Sunnyvale, CA 94043, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
| | | | - Kathryn G. Barca
- Department of Anthropology, National Museum of Natural History, Smithsonian Institution; Washington DC 20560, USA
| | - Jakob Sedig
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
| | - Kendra Sirak
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
| | - Iñigo Olalde
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
- BIOMICs Research Group, Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Vitoria-Gasteiz, Spain
- Ikerbasque—Basque Foundation of Science, Bilbao, Spain
| | - Nicole Adamski
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Rebecca Bernardos
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Nasreen Broomandkhoshbacht
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Matthew Ferry
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Lijun Qiu
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Kristin Stewardson
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - J. Noah Workman
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Fatma Zalzala
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
| | - Shop Mallick
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
- Broad Institute of MIT and Harvard; Cambridge, MA, 02142, USA
| | - Adam Micco
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Broad Institute of MIT and Harvard; Cambridge, MA, 02142, USA
| | - Matthew Mah
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
- Broad Institute of MIT and Harvard; Cambridge, MA, 02142, USA
| | - Zhao Zhang
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
| | | | - Nadin Rohland
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
| | | | - Douglas W. Owsley
- Department of Anthropology, National Museum of Natural History, Smithsonian Institution; Washington DC 20560, USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA, 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA, 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School; Boston, MA, 02115, USA
- Broad Institute of MIT and Harvard; Cambridge, MA, 02142, USA
| |
Collapse
|
3
|
Ferrando-Bernal M. Ancient DNA suggests anaemia and low bone mineral density as the cause for porotic hyperostosis in ancient individuals. Sci Rep 2023; 13:6968. [PMID: 37117261 PMCID: PMC10147686 DOI: 10.1038/s41598-023-33405-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 04/12/2023] [Indexed: 04/30/2023] Open
Abstract
Porotic hyperostosis (PH) is a disease that had high prevalence during the Neolithic. Several hypotheses have been suggested to explain the origin of the disease, such as an iron deficiency diet, low B12 intake, malaria caused by Plasmodium spp., low haemoglobin levels or low vitamin D levels. None of these hypotheses have been tested genetically. Here, I calculated different genetic scores to test each hypothesis. Additionally, I calculated a genetic score of bone mineral density as it is a phenotype that seems to be selected in ancient Europeans. I apply these genetic scores on 80 ancient samples, 33 with diagnosed PH. The results seem to suggest anaemia and low bone mineral density as the main cause for this disease. Additionally, Neolithic individuals show the lowest genetic risk score for bone mineral density of all other periods tested here, which may explain the highest prevalence of the porotic hyperostosis during this age.
Collapse
|
4
|
Ringbauer H, Huang Y, Akbari A, Mallick S, Patterson N, Reich D. ancIBD - Screening for identity by descent segments in human ancient DNA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.08.531671. [PMID: 36945531 PMCID: PMC10028887 DOI: 10.1101/2023.03.08.531671] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Long DNA sequences shared between two individuals, known as Identical by descent (IBD) segments, are a powerful signal for identifying close and distant biological relatives because they only arise when the pair shares a recent common ancestor. Existing methods to call IBD segments between present-day genomes cannot be straightforwardly applied to ancient DNA data (aDNA) due to typically low coverage and high genotyping error rates. We present ancIBD, a method to identify IBD segments for human aDNA data implemented as a Python package. Our approach is based on a Hidden Markov Model, using as input genotype probabilities imputed based on a modern reference panel of genomic variation. Through simulation and downsampling experiments, we demonstrate that ancIBD robustly identifies IBD segments longer than 8 centimorgan for aDNA data with at least either 0.25x average whole-genome sequencing (WGS) coverage depth or at least 1x average depth for in-solution enrichment experiments targeting a widely used aDNA SNP set ('1240k'). This application range allows us to screen a substantial fraction of the aDNA record for IBD segments and we showcase two downstream applications. First, leveraging the fact that biological relatives up to the sixth degree are expected to share multiple long IBD segments, we identify relatives between 10,156 ancient Eurasian individuals and document evidence of long-distance migration, for example by identifying a pair of two approximately fifth-degree relatives who were buried 1410km apart in Central Asia 5000 years ago. Second, by applying ancIBD, we reveal new details regarding the spread of ancestry related to Steppe pastoralists into Europe starting 5000 years ago. We find that the first individuals in Central and Northern Europe carrying high amounts of Steppe-ancestry, associated with the Corded Ware culture, share high rates of long IBD (12-25 cM) with Yamnaya herders of the Pontic-Caspian steppe, signaling a strong bottleneck and a recent biological connection on the order of only few hundred years, providing evidence that the Yamnaya themselves are a main source of Steppe ancestry in Corded Ware people. We also detect elevated sharing of long IBD segments between Corded Ware individuals and people associated with the Globular Amphora culture (GAC) from Poland and Ukraine, who were Copper Age farmers not yet carrying Steppe-like ancestry. These IBD links appear for all Corded Ware groups in our analysis, indicating that individuals related to GAC contexts must have had a major demographic impact early on in the genetic admixtures giving rise to various Corded Ware groups across Europe. These results show that detecting IBD segments in aDNA can generate new insights both on a small scale, relevant to understanding the life stories of people, and on the macroscale, relevant to large-scale cultural-historical events.
Collapse
Affiliation(s)
- Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Yilei Huang
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, Universität Leipzig, Leipzig, Germanÿ
| | - Ali Akbari
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
5
|
Revealing the recent demographic history of Europe via haplotype sharing in the UK Biobank. Proc Natl Acad Sci U S A 2022; 119:e2119281119. [PMID: 35696575 PMCID: PMC9233301 DOI: 10.1073/pnas.2119281119] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Haplotype-based analyses have recently been leveraged to interrogate the fine-scale structure in specific geographic regions, notably in Europe, although an equivalent haplotype-based understanding across the whole of Europe with these tools is lacking. Furthermore, study of identity-by-descent (IBD) sharing in a large sample of haplotypes across Europe would allow a direct comparison between different demographic histories of different regions. The UK Biobank (UKBB) is a population-scale dataset of genotype and phenotype data collected from the United Kingdom, with established sampling of worldwide ancestries. The exact content of these non-UK ancestries is largely uncharacterized, where study could highlight valuable intracontinental ancestry references with deep phenotyping within the UKBB. In this context, we sought to investigate the sample of European ancestry captured in the UKBB. We studied the haplotypes of 5,500 UKBB individuals with a European birthplace; investigated the population structure and demographic history in Europe, showing in parallel the variety of footprints of demographic history in different genetic regions around Europe; and expand knowledge of the genetic landscape of the east and southeast of Europe. Providing an updated map of European genetics, we leverage IBD-segment sharing to explore the extent of population isolation and size across the continent. In addition to building and expanding upon previous knowledge in Europe, our results show the UKBB as a source of diverse ancestries beyond Britain. These worldwide ancestries sampled in the UKBB may complement and inform researchers interested in specific communities or regions not limited to Britain.
Collapse
|
6
|
Kivisild T, Saag L, Hui R, Biagini SA, Pankratov V, D'Atanasio E, Pagani L, Saag L, Rootsi S, Mägi R, Metspalu E, Valk H, Malve M, Irdt K, Reisberg T, Solnik A, Scheib CL, Seidman DN, Williams AL, Tambets K, Metspalu M. Patterns of genetic connectedness between modern and medieval Estonian genomes reveal the origins of a major ancestry component of the Finnish population. Am J Hum Genet 2021; 108:1792-1806. [PMID: 34411538 DOI: 10.1016/j.ajhg.2021.07.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 07/23/2021] [Indexed: 11/20/2022] Open
Abstract
The Finnish population is a unique example of a genetic isolate affected by a recent founder event. Previous studies have suggested that the ancestors of Finnic-speaking Finns and Estonians reached the circum-Baltic region by the 1st millennium BC. However, high linguistic similarity points to a more recent split of their languages. To study genetic connectedness between Finns and Estonians directly, we first assessed the efficacy of imputation of low-coverage ancient genomes by sequencing a medieval Estonian genome to high depth (23×) and evaluated the performance of its down-sampled replicas. We find that ancient genomes imputed from >0.1× coverage can be reliably used in principal-component analyses without projection. By searching for long shared allele intervals (LSAIs; similar to identity-by-descent segments) in unphased data for >143,000 present-day Estonians, 99 Finns, and 14 imputed ancient genomes from Estonia, we find unexpectedly high levels of individual connectedness between Estonians and Finns for the last eight centuries in contrast to their clear differentiation by allele frequencies. High levels of sharing of these segments between Estonians and Finns predate the demographic expansion and late settlement process of Finland. One plausible source of this extensive sharing is the 8th-10th centuries AD migration event from North Estonia to Finland that has been proposed to explain uniquely shared linguistic features between the Finnish language and the northern dialect of Estonian and shared Christianity-related loanwords from Slavic. These results suggest that LSAI detection provides a computationally tractable way to detect fine-scale structure in large cohorts.
Collapse
Affiliation(s)
- Toomas Kivisild
- Department of Human Genetics, KU Leuven, Leuven 3000, Belgium; Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia; McDonald Institute for Archaeological Research, University of Cambridge, Cambridge CB2 3ER, UK.
| | - Lehti Saag
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Research Department of Genetics, Evolution, and Environment, University College London, London WC1E 6BT, UK
| | - Ruoyun Hui
- McDonald Institute for Archaeological Research, University of Cambridge, Cambridge CB2 3ER, UK; The Alan Turing Institute, British Library, 96 Euston Road, London NW1 2DB, UK
| | | | - Vasili Pankratov
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Eugenia D'Atanasio
- Instituto di Biologia e Patologia Molecolari, Consiglio Nazionale delle Ricerche, Rome, Italy
| | - Luca Pagani
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Department of Biology, University of Padova, 35131 Padova, Italy
| | - Lauri Saag
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Siiri Rootsi
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Reedik Mägi
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Ene Metspalu
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Heiki Valk
- Department of Archaeology, Institute of History and Archaeology, University of Tartu, Tartu 51014, Estonia
| | - Martin Malve
- Department of Archaeology, Institute of History and Archaeology, University of Tartu, Tartu 51014, Estonia
| | - Kadri Irdt
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Tuuli Reisberg
- Core Facility, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Anu Solnik
- Core Facility, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Christiana L Scheib
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia; McDonald Institute for Archaeological Research, University of Cambridge, Cambridge CB2 3ER, UK; St John's College, University of Cambridge, Cambridge CB2 1TP, UK
| | - Daniel N Seidman
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Amy L Williams
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Kristiina Tambets
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Mait Metspalu
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| |
Collapse
|