1
|
Yang SB, Lee JE, Lee HY. Forensic genetic analysis of single-nucleotide polymorphisms and microhaplotypes in Koreans through next-generation sequencing using precision ID identity panel. Genes Genomics 2023; 45:1281-1293. [PMID: 37440105 DOI: 10.1007/s13258-023-01424-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 06/26/2023] [Indexed: 07/14/2023]
Abstract
BACKGROUND Forensic DNA analysis has seen remarkable advancements with the advent of Next Generation Sequencing (NGS). In particular, NGS analysis of single nucleotide polymorphisms (SNPs) offers significant advantages in the analysis of challenging samples compared to conventional STR analysis. OBJECTIVE This study aimed to investigate the SNPs of the Precision ID Identity Panel, a commercially available NGS panel for personal identification, by generating genetic profiles of 298 Koreans and comparing them with other global populations. METHODS A total of 124 SNPs, including 90 autosomal and 34 Y-SNPs, were analyzed using the Precision ID Identity Panel, and forensic parameters, microhaplotypes, and population differences were investigated. RESULTS The NGS data were successfully obtained from 298 Koreans. The analysis of forensic parameters exhibited a low combined match probability of 1.532 × 10- 34, which is comparable to that obtained from commonly used STR analysis. Additionally, the microhaplotype analysis revealed that the use of 16 microhaplotypes provided higher discriminatory power compared to single target SNPs. Furthermore, the adoption of microhaplotype data resulted in an increase of over 20% in expected heterozygosity at five loci. Inter-population analysis showed a close genetic relationship between Koreans and individuals from China and Myanmar in East and Southeast Asia, which are geographically adjacent to Korea. CONCLUSIONS The results of this study show that the Precision ID Identity panel can be a useful alternative where traditional STR typing is not feasible. Also, the data from our study will be useful as a reference for Koreans in forensic investigations and the prosecution of criminal justice.
Collapse
Affiliation(s)
- Soo-Bin Yang
- Department of Forensic Medicine, Seoul National University College of Medicine, Seoul, Korea
| | - Ji Eun Lee
- Department of Forensic Medicine, Seoul National University College of Medicine, Seoul, Korea
| | - Hwan Young Lee
- Department of Forensic Medicine, Seoul National University College of Medicine, Seoul, Korea.
- Institute of Forensic and Anthropological Science, Seoul National University College of Medicine, Seoul, Korea.
| |
Collapse
|
2
|
Pilli E, Morelli S, Poggiali B, Alladio E. Biogeographical ancestry, variable selection, and PLS-DA method: a new panel to assess ancestry in forensic samples via MPS technology. Forensic Sci Int Genet 2023; 62:102806. [PMID: 36399972 DOI: 10.1016/j.fsigen.2022.102806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 11/14/2022]
Abstract
As evidenced by the large number of articles recently published in the literature, forensic scientists are making great efforts to infer externally visible features and biogeographical ancestry (BGA) from DNA analysis. Just as phenotypic, ancestry information obtained from DNA can provide investigative leads to identify the victims (missing/unidentified persons, crime/armed conflict/mass disaster victims) or trace their perpetrators when no matches were found with the reference profile or in the database. Recently, the advent of Massively Parallel Sequencing technologies associated with the possibility of harnessing high-throughput genetic data allowed us to investigate the associations between phenotypic and genomic variations in worldwide human populations and develop new BGA forensic tools capable of simultaneously analyzing up to millions of markers if for example the ancient DNA approach of hybridization capture was adopted to target SNPs of interest. In the present study, a selection of more than 3000 SNPs was performed to create a new BGA panel and the accuracy of the new panel to infer ancestry from unknown samples was evaluated by the PLS-DA method. Subsequently, the panel created was assessed using three variable selection techniques (Backward variable elimination, Genetic Algorithm and Regularized elimination procedure), and the best SNPs in terms of inferring bio-geographical ancestry at inter- and intra-continental level were selected to obtain panels to predict BGA with a reduced number of selected markers to be applied in routine forensic cases where PCR amplification is the best choice to target SNPs.
Collapse
Affiliation(s)
- Elena Pilli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Stefania Morelli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Brando Poggiali
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | | |
Collapse
|
3
|
Carratto TMT, Moraes VMS, Recalde TSF, Oliveira MLGD, Teixeira Mendes-Junior C. Applications of massively parallel sequencing in forensic genetics. Genet Mol Biol 2022; 45:e20220077. [PMID: 36121926 PMCID: PMC9514793 DOI: 10.1590/1678-4685-gmb-2022-0077] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 07/15/2022] [Indexed: 11/22/2022] Open
Abstract
Massively parallel sequencing, also referred to as next-generation sequencing, has positively changed DNA analysis, allowing further advances in genetics. Its capability of dealing with low quantity/damaged samples makes it an interesting instrument for forensics. The main advantage of MPS is the possibility of analyzing simultaneously thousands of genetic markers, generating high-resolution data. Its detailed sequence information allowed the discovery of variations in core forensic short tandem repeat loci, as well as the identification of previous unknown polymorphisms. Furthermore, different types of markers can be sequenced in a single run, enabling the emergence of DIP-STRs, SNP-STR haplotypes, and microhaplotypes, which can be very useful in mixture deconvolution cases. In addition, the multiplex analysis of different single nucleotide polymorphisms can provide valuable information about identity, biogeographic ancestry, paternity, or phenotype. DNA methylation patterns, mitochondrial DNA, mRNA, and microRNA profiling can also be analyzed for different purposes, such as age inference, maternal lineage analysis, body-fluid identification, and monozygotic twin discrimination. MPS technology also empowers the study of metagenomics, which analyzes genetic material from a microbial community to obtain information about individual identification, post-mortem interval estimation, geolocation inference, and substrate analysis. This review aims to discuss the main applications of MPS in forensic genetics.
Collapse
Affiliation(s)
- Thássia Mayra Telles Carratto
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | - Vitor Matheus Soares Moraes
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | | | | | - Celso Teixeira Mendes-Junior
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| |
Collapse
|
4
|
Phillips C, de la Puente M, Ruiz-Ramirez J, Staniewska A, Ambroa-Conde A, Freire-Aradas A, Mosquera-Miguel A, Rodriguez A, Lareu MV. Eurasiaplex-2: Shifting the focus to SNPs with high population specificity increases the power of forensic ancestry marker sets. Forensic Sci Int Genet 2022; 61:102780. [PMID: 36174251 DOI: 10.1016/j.fsigen.2022.102780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 09/16/2022] [Accepted: 09/18/2022] [Indexed: 11/27/2022]
Abstract
To compile a new South Asian-informative panel of forensic ancestry SNPs, we changed the strategy for selecting the most powerful markers for this purpose by targeting polymorphisms with near absolute specificity - when the South Asian-informative allele identified is absent from all other populations or present at frequencies below 0.001 (one in a thousand). More than 120 candidate SNPs were identified from 1000 Genomes datasets satisfying an allele frequency screen of ≥ 0.1 (10 % or more) allele frequency in South Asians, and ≤ 0.001 (0.1 % or less) in African, East Asian, and European populations. From the candidate pool of markers, a final panel of 36 SNPs, widely distributed across most autosomes, were selected that had allele frequencies in the five 1000 Genomes South Asian populations ranging from 0.4 to 0.15. Slightly lower average allele frequencies, but consistent patterns of informativeness were observed in gnomAD South Asian datasets used to validate the 1000 Genomes variant annotations. We named the panel of 36 South Asian-specific SNPs Eurasiaplex-2, and the informativeness of the panel was evaluated by compiling worldwide population data from 4097 samples in four genome variation databases that largely complement the global sampling of 1000 Genomes. Consistent patterns of allele frequency distribution, which were specific to South Asia, were observed in all populations in, or closely sited to, the Indian sub-continent. Pakistani populations from the HGDP-CEPH panel had markedly lower allele frequencies, highlighting the need to develop a statistical system to evaluate the ancestry inference value of counting the number of population-specific alleles present in an individual.
Collapse
Affiliation(s)
- C Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain; Institute of Anthropology and Ethnology, Adam Mickiewicz University in Poznań, Poland..
| | - M de la Puente
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - J Ruiz-Ramirez
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - A Staniewska
- Institute of Anthropology and Ethnology, Adam Mickiewicz University in Poznań, Poland
| | - A Ambroa-Conde
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - A Freire-Aradas
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - A Mosquera-Miguel
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - A Rodriguez
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - M V Lareu
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| |
Collapse
|
5
|
Laurent FX, Fischer A, Oldt RF, Kanthaswamy S, Buckleton JS, Hitchin S. Streamlining the decision-making process for international DNA kinship matching using Worldwide allele frequencies and tailored cutoff log 10LR thresholds. Forensic Sci Int Genet 2021; 57:102634. [PMID: 34871915 DOI: 10.1016/j.fsigen.2021.102634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 10/13/2021] [Accepted: 11/15/2021] [Indexed: 11/30/2022]
Abstract
The identification of human remains belonging to missing persons is one of the main challenges for forensic genetics. Although other means of identification can be applied to missing person investigations, DNA is often extremely valuable to further support or refute potential associations. When reference DNA samples cannot be collected from personal items belonging to a missing person, a direct DNA identification cannot be carried out. However, identifications can be made indirectly using DNA from the missing person's relatives. The ranking of likelihood ratio (LR) values, which measure the fit of a missing person for any given pedigree, is often the first step in selecting candidates in a DNA database. Although implementing DNA kinship matching in a national environment is feasible, many challenges need to be resolved before applying this method to an international configuration. In this study, we present an innovative and intuitive method to perform international DNA kinship matching and facilitate the comparison of DNA profiles when the ancestry is unknown or unsure and/or when different marker sets are used. This straightforward method, which is based on calculations performed with the DNA matching software BONAPARTE, Worldwide allele frequencies and tailored cutoff log10LR thresholds, allows for the classification of potential candidates according to the strength of the DNA evidence and the predicted proportion of adventitious matches. This is a powerful method for streamlining the decision-making process in missing person investigations and DVI processes, especially when there are low numbers of overlapping typed STRs. Intuitive interpretation tables and a decision tree will help strengthen international data comparison for the identification of reported missing individuals discovered outside their national borders.
Collapse
Affiliation(s)
- François-Xavier Laurent
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| | - Andrea Fischer
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France; Landeskriminalamt Baden-Württemberg, Taubenheimstr. 85, 70372 Stuttgart, Germany
| | - Robert F Oldt
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - Sree Kanthaswamy
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - John S Buckleton
- University of Auckland, Department of Statistics, Private Bag, 92019 Auckland, New Zealand
| | - Susan Hitchin
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| |
Collapse
|
6
|
A novel computational strategy to predict the value of the evidence in the SNP-based forensic mixtures. PLoS One 2021; 16:e0247344. [PMID: 34653182 PMCID: PMC8519470 DOI: 10.1371/journal.pone.0247344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 09/30/2021] [Indexed: 11/24/2022] Open
Abstract
This study introduces a methodology for inferring the weight of the evidence (WoE) in the single nucleotide polymorphism (SNP)-typed DNA mixtures of forensic interest. First, we redefined some algebraic formulae to approach the semi-continuous calculation of likelihoods and likelihood ratios (LRs). To address the allelic dropouts, a peak height ratio index (“h,” an index of heterozygous state plausibility) was incorporated into semi-continuous formulae to act as a proxy for the “split-drop” model of calculation. Second, the original ratio at which a person of interest (POI) has entered into the mixture was inferred by evaluating the DNA amounts conferred by unique genotypes to any possible permutation of any locus of the typing protocol (unique genotypes are genotypes that appear just once in the relevant permutation). We compared this expected ratio (MRex) to all the mixing ratios emerging at all other permutations of the mixture (MRobs) using several (1 - χ2) tests to evaluate the probability of each permutation to exist in the mixture according to quantitative criteria. At the level of each permutation state, we multiplied the (1 - χ2) value to the genotype frequencies and the h index. All the products of all the permutation states were finally summed to give a likelihood value that accounts for three independent properties of the mixtures. Owing to the (1 - χ2) index and the h index, this approach qualifies as a fully continuous methodology of LR calculation. We compared the MRs and LRs emerging from our methodology to those generated by the EuroForMix software ver. 3.0.3. When the true contributors were tested as POIs, our procedure generated highly discriminant LRs that, unlike EuroForMix, never overcame the corresponding single-source LRs. When false contributors were tested as POIs, we obtained a much lower LR value than that from EuroForMix. These two findings indicate that our computational method is more reliable and realistic than EuroForMix.
Collapse
|