1
|
Ferreira MR, Carratto TMT, Frontanilla TS, Bonadio RS, Jain M, de Oliveira SF, Castelli EC, Mendes-Junior CT. Advances in forensic genetics: Exploring the potential of long read sequencing. Forensic Sci Int Genet 2024; 74:103156. [PMID: 39427416 DOI: 10.1016/j.fsigen.2024.103156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 10/04/2024] [Accepted: 10/06/2024] [Indexed: 10/22/2024]
Abstract
DNA-based technologies have been used in forensic practice since the mid-1980s. While PCR-based STR genotyping using Capillary Electrophoresis remains the gold standard for generating DNA profiles in routine casework worldwide, the research community is continually seeking alternative methods capable of providing additional information to enhance discrimination power or contribute with new investigative leads. Oxford Nanopore Technologies (ONT) and PacBio third-generation sequencing have revolutionized the field, offering real-time capabilities, single-molecule resolution, and long-read sequencing (LRS). ONT, the pioneer of nanopore sequencing, uses biological nanopores to analyze nucleic acids in real-time. Its devices have revolutionized sequencing and may represent an interesting alternative for forensic research and routine casework, given that it offers unparalleled flexibility in a portable size: it enables sequencing approaches that range widely from PCR-amplified short target regions (e.g., CODIS STRs) to PCR-free whole transcriptome or even ultra-long whole genome sequencing. Despite its higher error rate compared to Illumina sequencing, it can significantly improve accuracy in read alignment against a reference genome or de novo genome assembly. This is achieved by generating long contiguous sequences that correctly assemble repetitive sections and regions with structural variation. Moreover, it allows real-time determination of DNA methylation status from native DNA without the need for bisulfite conversion. LRS enables the analysis of thousands of markers at once, providing phasing information and eliminating the need for multiple assays. This maximizes the information retrieved from a single invaluable sample. In this review, we explore the potential use of LRS in different forensic genetics approaches.
Collapse
Affiliation(s)
- Marcel Rodrigues Ferreira
- Molecular Genetics and Bioinformatics Laboratory, Experimental Research Unit - Unipex, School of Medicine, São Paulo State University - Unesp, Botucatu, São Paulo, Brazil
| | - Thássia Mayra Telles Carratto
- Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP 14040-901, Brazil
| | - Tamara Soledad Frontanilla
- Departamento de Genética, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP 14049-900, Brazil
| | - Raphael Severino Bonadio
- Depto Genética e Morfologia, Instituto de Ciências Biológicas, Universidade de Brasília, Brasília, DF, Brazil
| | - Miten Jain
- Department of Bioengineering, Department of Physics, Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
| | | | - Erick C Castelli
- Molecular Genetics and Bioinformatics Laboratory, Experimental Research Unit - Unipex, School of Medicine, São Paulo State University - Unesp, Botucatu, São Paulo, Brazil; Pathology Department, School of Medicine, São Paulo State University - Unesp, Botucatu, São Paulo, Brazil
| | - Celso Teixeira Mendes-Junior
- Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP 14040-901, Brazil.
| |
Collapse
|
2
|
Kulhankova L, Bindels E, Kayser M, Mulugeta E. Deconvoluting multi-person biological mixtures and accurate characterization and identification of separated contributors using non-targeted single-cell DNA sequencing. Forensic Sci Int Genet 2024; 71:103030. [PMID: 38513339 DOI: 10.1016/j.fsigen.2024.103030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 02/16/2024] [Accepted: 03/04/2024] [Indexed: 03/23/2024]
Abstract
The genetic characterization and identification of individuals who contributed to biological mixtures are complex and mostly unresolved tasks. These tasks are relevant in various fields, particularly in forensic investigations, which frequently encounters crime scene stains generated by more than one person. Currently, forensic mixture deconvolution is mostly performed subsequent to forensic DNA profiling at the level of the mixed DNA profiles, which comes with several limitations. Some previous studies attempted at separating single cells prior to forensic DNA profiling. However, these approaches are biased at selection of the cells and, due to their targeted DNA analysis on low template DNA, provide incomplete and unreliable forensic DNA profiles. We recently demonstrated the feasibility of performing mixture deconvolution prior to forensic DNA profiling through the utilization of a non-targeted single-cell transcriptome sequencing (scRNA-seq). In addition to individual-specific mixture deconvolution, this approach also allowed accurate characterisation of biological sex, biogeographic ancestry and individual identification of the separated mixture contributors. However, RNA has the forensic disadvantage of being prone to degradation, and sequencing RNA - focussing on coding regions - limits the number of single nucleotide polymorphisms (SNPs) utilized for genetic mixture deconvolution, characterization, and identification. These limitations can be overcome by performing single-cell sequencing on the level of DNA instead of RNA. Here, for the first time, we applied non-targeted single-cell DNA sequencing (scDNA-seq) by applying the scATAC-seq (Assay for Transposase-Accessible Chromatin with sequencing) technique to address the challenges of mixture deconvolution in the forensic context. We demonstrated that scATAC-seq, together with our recently developed De-goulash data analysis pipeline, is capable of deconvoluting complex blood mixtures of five individuals from both sexes with varying biogeographic ancestries. We further showed that our approach achieved correct genetic characterization of the biological sex and the biogeographic ancestry of each of the separated mixture contributors and established their identity. Furthermore, by analysing in-silico generated scATAC-seq data mixtures, we demonstrated successful individual-specific mixture deconvolution of i) highly complex mixtures of 11 individuals, ii) balanced mixtures containing as few as 20 cells (10 per each individual), and iii) imbalanced mixtures with a ratio as low as 1:80. Overall, our proof-of-principle study demonstrates the general feasibility of scDNA-seq in general, and scATAC-seq in particular, for mixture deconvolution, genetic characterization and individual identification of the separated mixture contributors. Furthermore, it shows that compared to scRNA-seq, scDNA-seq detects more SNPs from fewer cells, providing higher sensitivity, that is valuable in forensic genetics.
Collapse
Affiliation(s)
- Lucie Kulhankova
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Eric Bindels
- Department of Haematology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Manfred Kayser
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Eskeatnaf Mulugeta
- Department of Cell Biology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| |
Collapse
|
3
|
Tang X, Wen D, Jin X, Wang C, Xu W, Qu W, Xu R, Jia H, Liu Y, Li X, Chen S, Fu X, Liang B, Li J, Liu Y, Zha L. A preliminary study on identification of the blood donor in a body fluid mixture using a novel compound genetic marker blood-specific methylation-microhaplotype. Forensic Sci Int Genet 2024; 70:103031. [PMID: 38493735 DOI: 10.1016/j.fsigen.2024.103031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 03/06/2024] [Accepted: 03/06/2024] [Indexed: 03/19/2024]
Abstract
Blood-containing mixtures are frequently encountered at crime scenes involving violence and murder. However, the presence of blood, and the association of blood with a specific donor within these mixtures present significant challenges in forensic analysis. In light of these challenges, this study sought to address these issues by leveraging blood-specific methylation sites and closely linked microhaplotype sites, proposing a novel composite genetic marker known as "blood-specific methylation-microhaplotype". This marker was designed to the detection of blood and the determination of blood donor within blood-containing mixtures. According to the selection criteria mentioned in the Materials and Methods section, we selected 10 blood-specific methylation-microhaplotype loci for inclusion in this study. Among these loci, eight exhibited blood-specific hypomethylation, while the remaining two displayed blood-specific hypermethylation. Based on data obtained from 124 individual samples in our study, the combined discrimination power (CPD) of these 10 successfully sequenced loci was 0.999999298. The sample allele methylation rate (Ram) was obtained from massive parallel sequencing (MPS), which was defined as the proportion of methylated reads to the total clustered reads that were genotyped to a specific allele. To develop an allele type classification model capable of identifying the presence of blood and the blood donor, we used the Random Forest algorithm. This model was trained and evaluated using the Ram distribution of individual samples and the Ram distribution of simulated shared alleles. Subsequently, we applied the developed allele type classification model to predict alleles within actual mixtures, trying to exclude non-blood-specific alleles, ultimately allowing us to identify the presence of blood and the blood donor in the blood-containing mixtures. Our findings demonstrate that these blood-specific methylation-microhaplotype loci have the capability to not only detect the presence of blood but also accurately associate blood with the true donor in blood-containing mixtures with the mixing ratios of 1:29, 1:19, 1:9, 1:4, 1:2, 2:1, 7:1, 8:1, 31:1 and 36:1 (blood:non-blood) by DNA mixture interpretation methods. In addition, the presence of blood and the true blood donor could be identified in a mixture containing four body fluids (blood:vaginal fluid:semen:saliva = 1:1:1:1). It is important to note that while these loci exhibit great potential, the impact of allele dropouts and alleles misidentification must be considered when interpreting the results. This is a preliminary study utilising blood-specific methylation-microhaplotype as a complementary tool to other well-established genetic markers (STR, SNP, microhaplotype, etc.) for the analysis in blood-containing mixtures.
Collapse
Affiliation(s)
- Xuan Tang
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Dan Wen
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Xin Jin
- Department of Public Security of Hainan Province, Haikou, Hainan Province, PR China
| | - Chudong Wang
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Wei Xu
- Central Laboratory, Hunan Provincal People's Hospital (The First Affiliated Hospitak of Hunan Normal University), Changsha, Hunan Province 410000, PR China
| | - Weifeng Qu
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Ruyi Xu
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Hongtao Jia
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Yi Liu
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Xue Li
- Department of Forensic Medicine, School of Basic Medical Sciences, Xinjiang Medical University, Urumqi, Xinjiang 830017, PR China
| | - Siqi Chen
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Xiaoyi Fu
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Bin Liang
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Jienan Li
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China
| | - Ying Liu
- Xiangya Stomatological Collage, Central South University, No72. Xiangya Road, Changsha, Hunan 410013, PR China.
| | - Lagabaiyila Zha
- Department of Forensic Medicine, School of Basic Medical Sciences, Central South University, No172. Tongzipo Road, Changsha, Hunan 410013, PR China; Hebei Key Laboratory of Forensic Medicine, School of Forensic Medicine, Hebei Medical University, Shijiazhuang, PR China.
| |
Collapse
|
4
|
Wang X, Muenzler M, King J, Liu M, Li H, Budowle B, Ge J. A complete pipeline enables haplotyping and phasing macrohaplotype in long sequencing reads for polyploidy samples and a multi-source DNA mixture. Electrophoresis 2024; 45:877-884. [PMID: 38196015 DOI: 10.1002/elps.202300143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 11/19/2023] [Accepted: 11/30/2023] [Indexed: 01/11/2024]
Abstract
Macrohaplotype combines multiple types of phased DNA variants, increasing forensic discrimination power. High-quality long-sequencing reads, for example, PacBio HiFi reads, provide data to detect macrohaplotypes in multiploidy and DNA mixtures. However, the bioinformatics tools for detecting macrohaplotypes are lacking. In this study, we developed a bioinformatics software, MacroHapCaller, in which targeted loci (i.e., short TRs [STRs], single nucleotide polymorphisms, and insertion and deletions) are genotyped and combined with novel algorithms to call macrohaplotypes from long reads. MacroHapCaller uses physical phasing (i.e., read-backed phasing) to identify macrohaplotypes, and thus it can detect multi-allelic macrohaplotypes for a given sample. MacroHapCaller was validated with data generated from our designed targeted PacBio HiFi sequencing pipeline, which sequenced ∼8-kb amplicon regions harboring 20 core forensic STR loci in human benchmark samples HG002 and HG003. MacroHapCaller also was validated in whole-genome long-read sequencing data. Robust and accurate genotyping and phased macrohaplotypes were obtained with MacroHapCaller compared with the known ground truth. MacroHapCaller achieved a higher or consistent genotyping accuracy and faster speed than existing tools HipSTR and DeepVar. MacroHapCaller enables efficient macrohaplotype analysis from high-throughput sequencing data and supports applications using discriminating macrohaplotypes.
Collapse
Affiliation(s)
- Xuewen Wang
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Melissa Muenzler
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Jonathan King
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Muyi Liu
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Hongmin Li
- College of Science, Cal State East Bay, Hayward, California, USA
| | - Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Science Institute, Radford University, Radford, Virginia, USA
| | - Jianye Ge
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| |
Collapse
|