1
|
Flores M, Ho E, Ly C, Ceberio N, Guardado M, Felix K, Thorner HM, Paunovich M, Godek C, Kalaydjian C, Rohlfs RV. Decreased accuracy of forensic DNA mixture analysis for groups with lower genetic diversity. iScience 2024; 27:111067. [PMID: 39507254 PMCID: PMC11539586 DOI: 10.1016/j.isci.2024.111067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/01/2024] [Accepted: 09/25/2024] [Indexed: 11/08/2024] Open
Abstract
Forensic investigation of DNA samples from multiple contributors has become commonplace. These complex analyses use statistical frameworks accounting for multiple levels of uncertainty in allelic contributions from different individuals, particularly for samples containing few molecules of DNA. These methods have been thoroughly tested along some axes of variation, but less attention has been paid to accuracy across human genetic variation. Here, we quantify the accuracy of DNA mixture analysis over 83 human groups. We find higher false inclusion rates for mixtures with more contributors and for groups with lower genetic diversity. Even for three-contributor mixtures where two contributors are known and the reference group is correctly specified, false inclusion rates are 1e-5 or higher for 36 out of 83 groups. This means that, depending on multiple testing, some false inclusions may be expected. These false positives could be lessened with more selective and conservative use of DNA mixture analysis.
Collapse
Affiliation(s)
- Maria Flores
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- University of California, Los Angeles, Department of Molecular, Cell and Developmental Biology, Los Angeles, CA 90095, USA
| | - Evan Ho
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- University of California, Santa Cruz, Department of Ecology and Evolutionary Biology, Santa Cruz, CA 95060, USA
| | - Cara Ly
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- Yale University, Department of Genetics, New Haven, CT, USA
| | - Niquo Ceberio
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- Northern Arizona University, Department of Applied Physics and Materials Science, Flagstaff, AZ, USA
| | - Miguel Guardado
- University of California, San Francisco, Biological and Medical Informatics Graduate Program, San Francisco, CA 94143, USA
| | - Kamillah Felix
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- University of Oregon, Department of Data Science, Eugene, OR 97403, USA
| | - Hannah Mariko Thorner
- George Washington University, Department of Forensic Sciences - Forensic Molecular Biology, Washington, DC 20007, USA
| | - Matt Paunovich
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
| | - Chris Godek
- San Francisco State University, Department of Mathematics, San Francisco, CA 94132, USA
| | - Carina Kalaydjian
- San Francisco State University, Department of Mathematics, San Francisco, CA 94132, USA
| | - Rori V. Rohlfs
- San Francisco State University, Department of Biology, San Francisco, CA 94132, USA
- University of Oregon, Department of Data Science, Eugene, OR 97403, USA
| |
Collapse
|
2
|
Flores M, Ly C, Ho E, Ceberio N, Felix K, Thorner HM, Guardado M, Paunovich M, Godek C, Kalaydjian C, Rohlfs R. Decreased accuracy of forensic DNA mixture analysis for groups with lower genetic diversity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.25.554311. [PMID: 37745566 PMCID: PMC10515773 DOI: 10.1101/2023.08.25.554311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Forensic investigation of DNA samples from multiple contributors has become commonplace. These complex analyses use statistical frameworks accounting for multiple levels of uncertainty in allelic contributions from different individuals, particularly for samples containing few molecules of DNA. These methods have been thoroughly tested along some axes of variation, but less attention has been paid to accuracy across human genetic variation. Here, we quantify the accuracy of DNA mixture analysis over 244 human groups. We find higher false inclusion rates for mixtures with more contributors, and for groups with lower genetic diversity. Even for two-contributor mixtures where one contributor is known and the reference group is correctly specified, false inclusion rates are 1e-5 or higher for 56 out of 244 groups. This means that, depending on multiple testing, some false inclusions may be expected. These false positives could be lessened with more selective and conservative use of DNA mixture analysis.
Collapse
Affiliation(s)
- Maria Flores
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
- University of California, Los Angeles; Department of Molecular, Cell and Developmental Biology; Los Angeles, CA, 90095, USA
| | - Cara Ly
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
| | - Evan Ho
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
| | - Niquo Ceberio
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
| | - Kamillah Felix
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
| | - Hannah Mariko Thorner
- George Washington University; Department of Forensic Sciences - Forensic Molecular Biology; Washington, DC, 20007, USA
| | - Miguel Guardado
- University of California, San Francisco; Biological and Medical Informatics Graduate Program; San Francisco CA, 94143, USA
| | - Matt Paunovich
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
| | - Chris Godek
- San Francisco State University; Department of Mathematics; San Francisco, CA, 94132, USA
| | - Carina Kalaydjian
- San Francisco State University; Department of Mathematics; San Francisco, CA, 94132, USA
| | - Rori Rohlfs
- San Francisco State University; Department of Biology; San Francisco, CA, 94132, USA
- University of Oregon; Department of Data Science; Eugene, OR, 97403, USA
| |
Collapse
|
3
|
Aguilar-Velázquez JA, Stephenson-Ojea MM, García-King MD, Rangel-Villalobos H. Genetic diversity, structure, and admixture in Mayans from Guatemala and Mexico based on 15 short tandem repeats. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2021; 175:238-250. [PMID: 33522608 DOI: 10.1002/ajpa.24203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Revised: 11/14/2020] [Accepted: 11/30/2020] [Indexed: 11/05/2022]
Abstract
OBJECTIVE To analyze the genetic origin, relationships, structure, and admixture in Mayan Native American groups from Guatemala and Mexico based on 15 autosomal short tandem repeats (STRs) loci commonly used in human identification (HID). METHODS We genotyped 513 unrelated Mayan samples from Guatemala based on 15 STR loci (AmpFlSTR® Identifiler kit). Moreover, we included 4408 genotypes previously reported, as following: Mayas from Guatemala and Mexico (n = 1666) and from Latin American, European, and African (n = 2742) populations. Forensic parameters, genetic distances, admixture, and population structure were assessed. RESULTS Forensic parameters of the 15 STRs in different Mayan groups from Guatemala were reported. Low (Fst = 0.78%; p = 0.000) and non-significant differentiation (Fst = 1.8%; p = 0.108) were observed in Mayas from Guatemala and Mexico, respectively. The relative homogeneity observed among Mayan groups supported theories of extensive pre-Columbian gene flow and trade throughout the Mayan Empire. The distribution of the three Native American ancestries among these Mayan groups did not support the presumable Guatemalan origin of Tojolabal and Lacandon people (South, Mexico). The nonsignificant differentiation between Ladinos and Mayas suggests a relative panmixia in Guatemala. Mestizos from southeastern Mexico and Guatemala constitute a core of Native American ancestry in Latin America related to the Mayan Empire in Central America. CONCLUSIONS The higher European admixture and homogeneity in Mexican Mayas of the Yucatan Peninsula suggest more intensive post-Columbian gene flow in this region than in Guatemalan Mayas.
Collapse
Affiliation(s)
- José Alonso Aguilar-Velázquez
- Instituto de Investigación en Genética Molecular, Departamento de Ciencias Médicas y de la Vida, Universidad de Guadalajara, Av. Universidad #1115, Col, Lindavista, Ocotlán, Jalisco, 47810, Mexico.,Doctorado en Genética Humana, Centro Universitario de Ciencias de la Salud (CUCS-UdeG), Universidad de Guadalajara, Sierra Mojada #950, Guadalajara, Jalisco, Mexico
| | - Mishel Marie Stephenson-Ojea
- Laboratorio de Genética Forense, Fundación de Antropología Forense de Guatemala (FAFG), 1a Calle 1-53, Guatemala, Zona 2, Guatemala
| | - Marco David García-King
- Laboratorio de Genética Forense, Fundación de Antropología Forense de Guatemala (FAFG), 1a Calle 1-53, Guatemala, Zona 2, Guatemala
| | - Héctor Rangel-Villalobos
- Instituto de Investigación en Genética Molecular, Departamento de Ciencias Médicas y de la Vida, Universidad de Guadalajara, Av. Universidad #1115, Col, Lindavista, Ocotlán, Jalisco, 47810, Mexico
| |
Collapse
|
4
|
Forensic and phylogenetic characterization of 15 autosomal STRs in Hazara population of Pakistan. Leg Med (Tokyo) 2020; 47:101786. [PMID: 32942206 DOI: 10.1016/j.legalmed.2020.101786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 06/14/2020] [Accepted: 09/02/2020] [Indexed: 11/23/2022]
Abstract
In the current study, 217 unrelated individuals of the Hazara population were genotyped for 15 autosomal short tandem repeats to generate parentage and forensic efficacy parameters. Hazaras belong to the Shi'a sect and are recognized by their Turko-Mogholi features. We found that D2S1338 was the most discriminatory locus with a maximum power of exclusion and high value of polymorphism information content. Whilst the Combined Power of Discrimination (CPD), Combined Matching Probability (CMP) and Combined Power of Exclusion (CPE) were 0.999999999999999, 2.76796338879E-17 and 0.999999040733479 respectively. Furthermore, the pattern of genetic affinity with genetically assumed related populations was demonstrated through Heat Map and Phylogenetic analysis, which revealed a great level of genetic closeness of Hazaras with Mongol population and descendants of Genghis Khan. The resulting data can be used for forensic applications and anthropological studies.
Collapse
|
5
|
Characterization of the extra copy of TPOX locus with tri-allelic pattern. BMC Genet 2019; 20:18. [PMID: 30764755 PMCID: PMC6376737 DOI: 10.1186/s12863-019-0723-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Accepted: 02/11/2019] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND An STR locus with tri-allelic pattern is occasionally observed in routine forensic casework. The extra copy of TPOX locus with tri-allelic pattern in populations has been assumed to be inserted into an X chromosome, which took place forth before the Bantu expansion in Africa. Nonetheless, the exact location of the duplication and the form of rearrangement in the human genome has not been clarified yet. RESULTS In this study, we investigated the extra copy of type 2 tri-allelic pattern at TPOX in various populations. While allele 10 is the major third allele in Africa, allele 11 appears more frequent in America and overwhelming in Chinese and Korean populations, which might attribute to the population substructures. Results from the investigation of family cases showed that the transmission of the extra allele had a similar genetic pattern of autosomal genes. Furthermore, a whole-genome sequencing followed by bioinformatics analysis revealed that the intact form of chromosomal duplication and rearrangement occurred ~ 407 kb away from the authentic TPOX locus on chromosome 2 in two cases. The breakpoints of the insertion were further validated in most other tri-allelic subjects, which can imply the identical origin from the ancestral extra copy. Nevertheless, de novo chromosomal duplication and rearrangement at thyroid peroxidase gene occur in populations. CONCLUSIONS Instead of the extra allele 10 in African populations, the main third allele at TPOX with tri-allelic pattern is allele 11 in Chinese and Korean populations. The insertion of the extra copy into chromosome 2 occurs in most subjects with tri-allelic pattern at TPOX and demonstrates the transmission of the third allele from parents to offspring. The breakpoints of the ancestral extra copy are defined, which shows evidence of its inheritance from African populations. In addition, the simple validation method would help improve tri-allelic pattern calling, distinguish de novo chromosomal rearrangements, and also count the frequencies among different geographic regions. Therefore, the statistical interpretation of tri-allelic pattern at TPOX could be enhanced during forensic practice.
Collapse
|
6
|
GlobalFiler ® Express DNA amplification kit in South Africa: Extracting the past from the present. Forensic Sci Int Genet 2016; 24:194-201. [DOI: 10.1016/j.fsigen.2016.07.007] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Revised: 06/10/2016] [Accepted: 07/11/2016] [Indexed: 01/23/2023]
|
7
|
Picanço JB, Raimann PE, Motta CHASD, Rodenbusch R, Gusmão L, Alho CS. Identification of the third/extra allele for forensic application in cases with TPOX tri-allelic pattern. Forensic Sci Int Genet 2015; 16:88-93. [DOI: 10.1016/j.fsigen.2014.11.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2013] [Revised: 10/24/2014] [Accepted: 11/19/2014] [Indexed: 10/24/2022]
|
8
|
Salazar-Flores J, Zuñiga-Chiquette F, Rubi-Castellanos R, Álvarez-Miranda J, Zetina-Hérnandez A, Martínez-Sevilla V, González-Andrade F, Corach D, Vullo C, Álvarez J, Lorente J, Sánchez-Diz P, Herrera R, Cerda-Flores R, Muñoz-Valle J, Rangel-Villalobos H. Admixture and genetic relationships of Mexican Mestizos regarding Latin American and Caribbean populations based on 13 CODIS-STRs. HOMO-JOURNAL OF COMPARATIVE HUMAN BIOLOGY 2015; 66:44-59. [DOI: 10.1016/j.jchb.2014.08.005] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/25/2013] [Accepted: 08/28/2014] [Indexed: 12/21/2022]
|
9
|
Picanço JB, Raimann PE, Paskulin GA, Alvarez L, Amorim A, Batista dos Santos SE, Alho CS. Tri-allelic pattern at the TPOX locus: A familial study. Gene 2014; 535:353-8. [DOI: 10.1016/j.gene.2013.10.019] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2013] [Revised: 10/01/2013] [Accepted: 10/10/2013] [Indexed: 10/26/2022]
|
10
|
Yoo SY, Cho NS, Park MJ, Seong KM, Hwang JH, Song SB, Han MS, Lee WT, Chung KW. A large population genetic study of 15 autosomal short tandem repeat loci for establishment of Korean DNA profile database. Mol Cells 2011; 32:15-9. [PMID: 21597912 PMCID: PMC3887661 DOI: 10.1007/s10059-011-2288-4] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Revised: 03/31/2011] [Accepted: 04/26/2011] [Indexed: 10/18/2022] Open
Abstract
Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10(-17). This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications.
Collapse
Affiliation(s)
- Seong Yeon Yoo
- Department of Forensic Medicine, Central District Office, National Forensic Service, Daejeon 305-348, Korea
- Department of Biological Science and Department of Bioinformatics, Kongju National University, Gongju 314-701, Korea
- These authors contributed equally to this work
| | - Nam Soo Cho
- Department of Forensic Medicine, Central District Office, National Forensic Service, Daejeon 305-348, Korea
- These authors contributed equally to this work
| | - Myung Jin Park
- Department of Forensic Medicine and Brain Korea 21 Project for Medical Science, Yonsei University College of Medicine, Seoul 120-752, Korea
| | - Ki Min Seong
- Department of Forensic Medicine, Central District Office, National Forensic Service, Daejeon 305-348, Korea
- Department of Biological Science and Department of Bioinformatics, Kongju National University, Gongju 314-701, Korea
| | - Jung Ho Hwang
- Department of Forensic Medicine, Central District Office, National Forensic Service, Daejeon 305-348, Korea
| | - Seok Bean Song
- Institute of Drug Research and Development, College of Pharmacy, Chungnam National University, Daejeon 305-764, Korea
| | - Myun Soo Han
- DNA Analysis Center, National Forensic Service, Seoul 158-707, Korea
| | - Won Tae Lee
- Department of Forensic Medicine, Eulji University, Daejeon 301-746, Korea
| | - Ki Wha Chung
- Department of Biological Science and Department of Bioinformatics, Kongju National University, Gongju 314-701, Korea
| |
Collapse
|
11
|
Muñoz A, Albeza MV, Acreche N, Castro JA, Ramon MM, Picornell A. Allele frequencies of 15 STRs in the Calchaqui Valleys population (North-Western Argentina). Forensic Sci Int Genet 2011; 6:e58-60. [PMID: 21641291 DOI: 10.1016/j.fsigen.2011.05.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2011] [Revised: 05/05/2011] [Accepted: 05/07/2011] [Indexed: 10/18/2022]
Abstract
Allele frequencies for 15 short tandem repeat (STR) loci were obtained from a sample of 110 individuals from the Calchaqui Valleys population (North-Western Argentina). The combined power of exclusion and combined power of discriminating for the 15 tested STR loci were 0.999964 and 0.9999999999999998, respectively. Matching probability was 1 in 4.58 × 10(15). Therefore, it may be concluded that the set of 15 STRs included in the AmpF STR Identifiler kit, represents a powerful tool for forensic applications, paternity testing and population genetics studies in the Calchaqui Valleys population.
Collapse
|
12
|
Martins JA, de Freitas Figueiredo R, Yoshizaki CS, Paneto GG, Cicarelli RMB. Genetic data of 15 autosomal STR loci: an analysis of the Araraquara population colonization (São Paulo, Brazil). Mol Biol Rep 2011; 38:5397-403. [DOI: 10.1007/s11033-011-0693-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2010] [Accepted: 02/26/2011] [Indexed: 11/30/2022]
|
13
|
PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile. Int J Legal Med 2010; 125:629-36. [DOI: 10.1007/s00414-010-0472-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2010] [Accepted: 05/17/2010] [Indexed: 11/25/2022]
|
14
|
15 STR loci frequencies in the population from Paraná, Southern Brazil. Forensic Sci Int Genet 2009; 4:e23-4. [DOI: 10.1016/j.fsigen.2009.01.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 10/28/2008] [Accepted: 01/02/2009] [Indexed: 11/20/2022]
|
15
|
de Assis Poiares L, de Sá Osorio P, Spanhol FA, Coltre SC, Rodenbusch R, Gusmão L, Largura A, Sandrini F, da Silva CMD. Allele frequencies of 15 STRs in a representative sample of the Brazilian population. Forensic Sci Int Genet 2009; 4:e61-3. [PMID: 20129458 DOI: 10.1016/j.fsigen.2009.05.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2008] [Revised: 03/26/2009] [Accepted: 05/22/2009] [Indexed: 11/30/2022]
Abstract
Allele frequencies for 15 short tandem repeat (STR) loci were obtained from a sample of 12,030 individuals undergoing paternity testing. This sample includes individuals from all States in Brazil, combined according to the current country division into five regions (North, Northeast, Central West, Southeast, and South). The most polymorphic loci were D2S1338 and D18S51. All the analysed loci meet Hardy-Weinberg equilibrium expectations. Combined power of discrimination and combined power of exclusion for the 15 tested STR loci were 0.999999999999990 and 0.9999992, respectively. Comparative analysis between populations from different Brazilian macroregions as well as between Brazil and other relevant populations are presented.
Collapse
Affiliation(s)
- Lilian de Assis Poiares
- Laboratório Alvaro, Centro de Análises e Pesquisas Clínicas, Rua General Osório, 3212, Bairro Centro, Cascavel, Paraná, Brazil
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
15 STR loci frequencies in the population from Santa Catarina, Southern Brazil. Forensic Sci Int Genet 2008; 3:e129-31. [PMID: 19647698 DOI: 10.1016/j.fsigen.2008.10.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2008] [Revised: 10/28/2008] [Accepted: 10/28/2008] [Indexed: 11/22/2022]
Abstract
Allele frequencies for 15 short tandem repeats (STR) loci were determined with a sample of 3000 unrelated individuals from the population of Santa Catarina, Southern Brazil. The loci are most commonly used in forensic and paternity testing, being analyzed by the Powerplex 16 (Promega) commercial kit. The data shows that most polymorphic loci were Penta E and FGA. The distributions of the genotypes in the evaluated loci are in Hardy-Weinberg equilibrium. Comparative analyses between our population data and other Brazilian populations are presented. The calculated forensic parameters showed that the loci are useful for the solution of forensic problems in Brazilian Southern region.
Collapse
|