1
|
Wen Y, Liu J, Su Y, Chen X, Hou Y, Liao L, Wang Z. Forensic biogeographical ancestry inference: recent insights and current trends. Genes Genomics 2023; 45:1229-1238. [PMID: 37081293 DOI: 10.1007/s13258-023-01387-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/01/2023] [Indexed: 04/22/2023]
Abstract
BACKGROUND As a powerful complement to the paradigmatic DNA profiling strategy, biogeographical ancestry inference (BGAI) plays a significant part in human forensic investigation especially when a database hit or eyewitness testimony are not available. It indicates one's biogeographical profile based on known population-specific genetic variations, and thus is crucial for guiding authority investigations to find unknown individuals. Forensic biogeographical ancestry testing exploits much of the recent advances in the understanding of human genomic variation and improving of molecular biology. OBJECTIVE In this review, recent development of prospective ancestry informative markers (AIMs) and the statistical approaches of inferring biogeographic ancestry from AIMs are elucidated and discussed. METHODS We highlight the research progress of three potential AIMs (i.e., single nucleotide polymorphisms, microhaplotypes, and Y or mtDNA uniparental markers) and discuss the prospects and challenges of two methods that are commonly used in BGAI. CONCLUSION While BGAI for forensic purposes has been thriving in recent years, important challenges, such as ethics and responsibilities, data completeness, and ununified standards for evaluation, remain for the use of biogeographical ancestry information in human forensic investigations. To address these issues and fully realize the value of BGAI in forensic investigation, efforts should be made not only by labs/institutions around the world independently, but also by inter-lab/institution collaborations.
Collapse
Affiliation(s)
- Yufeng Wen
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
- School of Life Sciences, Jilin University, Changchun, 130012, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yonglin Su
- Department of Rehabilitation Medicine, West China Hospital Sichuan University, Chengdu, 610041, China
| | - Xiacan Chen
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yiping Hou
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Linchuan Liao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| | - Zheng Wang
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China.
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
2
|
Pilli E, Morelli S, Poggiali B, Alladio E. Biogeographical ancestry, variable selection, and PLS-DA method: a new panel to assess ancestry in forensic samples via MPS technology. Forensic Sci Int Genet 2023; 62:102806. [PMID: 36399972 DOI: 10.1016/j.fsigen.2022.102806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 11/14/2022]
Abstract
As evidenced by the large number of articles recently published in the literature, forensic scientists are making great efforts to infer externally visible features and biogeographical ancestry (BGA) from DNA analysis. Just as phenotypic, ancestry information obtained from DNA can provide investigative leads to identify the victims (missing/unidentified persons, crime/armed conflict/mass disaster victims) or trace their perpetrators when no matches were found with the reference profile or in the database. Recently, the advent of Massively Parallel Sequencing technologies associated with the possibility of harnessing high-throughput genetic data allowed us to investigate the associations between phenotypic and genomic variations in worldwide human populations and develop new BGA forensic tools capable of simultaneously analyzing up to millions of markers if for example the ancient DNA approach of hybridization capture was adopted to target SNPs of interest. In the present study, a selection of more than 3000 SNPs was performed to create a new BGA panel and the accuracy of the new panel to infer ancestry from unknown samples was evaluated by the PLS-DA method. Subsequently, the panel created was assessed using three variable selection techniques (Backward variable elimination, Genetic Algorithm and Regularized elimination procedure), and the best SNPs in terms of inferring bio-geographical ancestry at inter- and intra-continental level were selected to obtain panels to predict BGA with a reduced number of selected markers to be applied in routine forensic cases where PCR amplification is the best choice to target SNPs.
Collapse
Affiliation(s)
- Elena Pilli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Stefania Morelli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Brando Poggiali
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | | |
Collapse
|
3
|
Evaluation of a SNP-STR haplotype panel for forensic genotype imputation. Forensic Sci Int Genet 2023; 62:102801. [PMID: 36272212 DOI: 10.1016/j.fsigen.2022.102801] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 10/08/2022] [Accepted: 10/14/2022] [Indexed: 11/07/2022]
Abstract
Short tandem repeat polymorphism (STR)-based individual identification is a popular and reliable method in many forensic applications. However, STRs still frequently fail to find any matched records. In such cases, if known STRs could provide more information, it would be very helpful to solve specific problems. Genotype imputation has long been used in the study of single nucleotide polymorphisms (SNPs) and has recently been introduced into forensic fields. The idea is that, through a reference haplotype panel containing SNPs and STRs, we can obtain unknown genetic information through genotype imputation based on known STR or SNP genotypes. Several recent studies have already demonstrated this exciting idea, and a 1000 Genomes SNP-STR haplotype panel has also been released. To further study the performance of genotype imputation in forensic fields, we collected STR, microhaplotype (MH) and SNP array genotypes from Chinese Han population individuals and then performed genotype imputation analysis based on the released reference panel. As a result, the average locus imputation accuracy was ∼83 % (or ∼70 %) when SNPs in the SNP array (or MH SNPs) were imputed from STRs, and was ∼30 % when highly polymorphic markers (STRs and MHs) were imputed from each other. When STRs were imputed from SNP array, the average locus imputation accuracy increased to ∼48 %. After analyzing the match scores between real STRs and the STRs imputed from SNPs, ∼80 % of studied STR records can be connected to corresponding SNP records, which may help for individual identification. Our results indicate that genotype imputation has great potential for forensic applications.
Collapse
|
4
|
Study on Gaseous Chlorobenzene Treatment by a Bio-Trickling Filter: Degradation Mechanism and Microbial Community. Processes (Basel) 2022. [DOI: 10.3390/pr10081483] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Large-flow waste gas generated from the pharmaceutical and chemical industry usually contains low concentrations of VOCs (volatile organic compounds), and it is also the key factor that presents challenges in terms of disposal. To date, due to the limitations of mass transfer rate and microbial degradation ability, the degradation performance of VOCs using the biological method has not been ideal. Therefore, in this study, the sludge from a chlorobenzene-containing wastewater treatment plant was inoculated into our experimental bio-trickling filter (BTF) to explore the feasibility of domestication and degradation of gaseous chlorobenzene by highly active microorganisms. The kinetics of its mass transfer reaction and microbial community dynamics were also discussed. Moreover, the main process parameters of BTF for chlorobenzene degradation were optimized. The results showed that the degradation effect of chlorobenzene reached more than 85% at an inlet concentration of chlorobenzene 700 mg·m−3, oxygen concentration of 10%, and an empty bed retention time (EBRT) of 80 s. The mass transfer kinetic analysis indicated that the process of chlorobenzene degradation in the BTF occurred between the zero-stage reaction and the first-stage reaction. This BTF contributed significantly to the biodegradability of chlorobenzene, overcoming the limitation of gas-to-liquid/solid mass transfer of chlorobenzene. The analysis of the species diversity showed that Thermomonas, Petrimona, Comana, and Ottowia were typical organic-matter-degrading bacteria that degraded chlorobenzene efficiently with xylene present.
Collapse
|
5
|
Alladio E, Poggiali B, Cosenza G, Pilli E. Multivariate statistical approach and machine learning for the evaluation of biogeographical ancestry inference in the forensic field. Sci Rep 2022; 12:8974. [PMID: 35643723 PMCID: PMC9148302 DOI: 10.1038/s41598-022-12903-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 04/13/2022] [Indexed: 11/24/2022] Open
Abstract
The biogeographical ancestry (BGA) of a trace or a person/skeleton refers to the component of ethnicity, constituted of biological and cultural elements, that is biologically determined. Nowadays, many individuals are interested in exploring their genealogy, and the capability to distinguish biogeographic information about population groups and subgroups via DNA analysis plays an essential role in several fields such as in forensics. In fact, for investigative and intelligence purposes, it is beneficial to inference the biogeographical origins of perpetrators of crimes or victims of unsolved cold cases when no reference profile from perpetrators or database hits for comparative purposes are available. Current approaches for biogeographical ancestry estimation using SNPs data are usually based on PCA and Structure software. The present study provides an alternative method that involves multivariate data analysis and machine learning strategies to evaluate BGA discriminating power of unknown samples using different commercial panels. Starting from 1000 Genomes project, Simons Genome Diversity Project and Human Genome Diversity Project datasets involving African, American, Asian, European and Oceania individuals, and moving towards further and more geographically restricted populations, powerful multivariate techniques such as Partial Least Squares-Discriminant Analysis (PLS-DA) and machine learning techniques such as XGBoost were employed, and their discriminating power was compared. PLS-DA method provided more robust classifications than XGBoost method, showing that the adopted approach might be an interesting tool for forensic experts to infer BGA information from the DNA profile of unknown individuals, but also highlighting that the commercial forensic panels could be inadequate to discriminate populations at intra-continental level.
Collapse
Affiliation(s)
- Eugenio Alladio
- Department of Chemistry, University of Turin, Turin, Italy.,Centro Regionale Antidoping e di Tossicologia "A. Bertinaria", Orbassano, Torino, Italy
| | - Brando Poggiali
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Giulia Cosenza
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Elena Pilli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy.
| |
Collapse
|
6
|
Development and validation of a novel 133-plex forensic STR panel (52 STRs and 81 Y-STRs) using single-end 400 bp massive parallel sequencing. Int J Legal Med 2021; 136:447-464. [PMID: 34741666 DOI: 10.1007/s00414-021-02738-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 10/25/2021] [Indexed: 12/15/2022]
Abstract
Short tandem repeats (STRs) are the preferred genetic markers in forensic DNA analysis, routinely measured by capillary electrophoresis (CE) method based on the fragment length features. While, the massive parallel sequencing (MPS) technology could simultaneously target a large number of intriguing forensic STRs, bypassing the intrinsic limitations of amplicon size separation and accessible fluorophores in CE, which is efficient and promising for enabling the identification of forensic biological evidence. Here, we developed a novel MPS-based Forensic Analysis System Multiplecues SetB Kit of 133-plex forensic STR markers (52 STRs and 81 Y-STRs) and one Y-InDel (M175) based on multiplex PCR and single-end 400 bp sequencing strategy. This panel was subjected to developmental validation studies according to the SWGDAM Validation Guidelines. Approximately 2185 MPS-based reactions using 6 human DNA standards and 8 male donors were conducted for substrate studies (filter paper, gauze, cotton swab, four different types of FTA cards, peripheral venous blood, saliva, and exfoliated cells), sensitivity studies (from 2 ng down to 0.0625 ng), mixture studies (two-person DNA mixtures), PCR inhibitor studies (seven commonly encountered PCR inhibitors), species specificity studies (11 non-human species), and repeatability studies. Results of concordance studies (413 Han males and 6 human DNA standards) generated by STRait Razor and in-house Python scripts indicated 99.98% concordance rate in STR calling relative to CE for STRs between 41,900 genotypes at 100 STR markers. Moreover, the limitations of present studies, the nomenclature rules and forensic MPS applications were also described. In conclusion, the validation studies based on ~ 2200 MPS-based and ~ 2500 CE-based DNA profiles demonstrated that the novel MPS-based panel meets forensic DNA quality assurance guidelines with robust, reliable, and reproducible performance on samples of various quantities and qualities, and the STR nomenclature rules should be further regulated to integrate the inconformity between MPS-based and CE-based methods.
Collapse
|
7
|
Truelsen D, Freire-Aradas A, Nazari M, Aliferi A, Ballard D, Phillips C, Morling N, Pereira V, Børsting C. Evaluation of a custom QIAseq targeted DNA panel with 164 ancestry informative markers sequenced with the Illumina MiSeq. Sci Rep 2021; 11:21040. [PMID: 34702940 PMCID: PMC8548529 DOI: 10.1038/s41598-021-99933-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 09/29/2021] [Indexed: 11/08/2022] Open
Abstract
Introduction of new methods requires meticulous evaluation before they can be applied to forensic genetic case work. Here, a custom QIAseq Targeted DNA panel with 164 ancestry informative markers was assessed using the MiSeq sequencing platform. Concordance, sensitivity, and the capability for analysis of mixtures were tested. The assay gave reproducible and nearly concordant results with an input of 10 and 2 ng DNA. Lower DNA input led to an increase in both locus and allele drop-outs, and a higher variation in heterozygote balance. Locus or allele drop-outs in the samples with less than 2 ng DNA input were not necessarily associated with the overall performance of a locus. Thus, the QIAseq assay will be difficult to implement in a forensic genetic setting where the sample material is often scarce and of poor quality. With equal or near equal mixture ratios, the mixture DNA profiles were easily identified by an increased number of imbalanced heterozygotes. For more skewed mixture ratios, the mixture DNA profiles were identified by an increased noise level. Lastly, individuals from Great Britain and the Middle East were investigated. The Middle Eastern individuals showed a greater affinity with South European populations compared to North European populations.
Collapse
Affiliation(s)
- D Truelsen
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark.
| | - A Freire-Aradas
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - M Nazari
- Faculty of Life Sciences and Medicine, King's College, London, UK
| | - A Aliferi
- Faculty of Life Sciences and Medicine, King's College, London, UK
| | - D Ballard
- Faculty of Life Sciences and Medicine, King's College, London, UK
| | - C Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - N Morling
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
- Department of Mathematical Sciences, Aalborg University, 9220, Aalborg, Denmark
| | - V Pereira
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| | - C Børsting
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| |
Collapse
|