1
Kulhankova L, Bindels E, Kayser M, Mulugeta E. Deconvoluting multi-person biological mixtures and accurate characterization and identification of separated contributors using non-targeted single-cell DNA sequencing. Forensic Sci Int Genet 2024; 71:103030. [PMID: 38513339 DOI: 10.1016/j.fsigen.2024.103030] [Received: 08/22/2023] [Revised: 02/16/2024] [Accepted: 03/04/2024] [Indexed: 03/23/2024]
Abstract
The genetic characterization and identification of individuals who contributed to biological mixtures are complex and mostly unresolved tasks. These tasks are relevant in various fields, particularly in forensic investigations, which frequently encounter crime scene stains generated by more than one person. Currently, forensic mixture deconvolution is mostly performed subsequent to forensic DNA profiling at the level of the mixed DNA profiles, which comes with several limitations. Some previous studies attempted to separate single cells prior to forensic DNA profiling. However, these approaches are biased in their selection of cells and, due to their targeted DNA analysis of low-template DNA, provide incomplete and unreliable forensic DNA profiles. We recently demonstrated the feasibility of performing mixture deconvolution prior to forensic DNA profiling by using non-targeted single-cell transcriptome sequencing (scRNA-seq). In addition to individual-specific mixture deconvolution, this approach also allowed accurate characterisation of biological sex and biogeographic ancestry, and individual identification of the separated mixture contributors. However, RNA has the forensic disadvantage of being prone to degradation, and sequencing RNA, which focuses on coding regions, limits the number of single nucleotide polymorphisms (SNPs) available for genetic mixture deconvolution, characterization, and identification. These limitations can be overcome by performing single-cell sequencing at the level of DNA instead of RNA. Here, for the first time, we applied non-targeted single-cell DNA sequencing (scDNA-seq), using the scATAC-seq (Assay for Transposase-Accessible Chromatin with sequencing) technique, to address the challenges of mixture deconvolution in the forensic context.
We demonstrated that scATAC-seq, together with our recently developed De-goulash data analysis pipeline, is capable of deconvoluting complex blood mixtures of five individuals of both sexes with varying biogeographic ancestries. We further showed that our approach achieved correct genetic characterization of the biological sex and the biogeographic ancestry of each of the separated mixture contributors and established their identity. Furthermore, by analysing in-silico generated scATAC-seq data mixtures, we demonstrated successful individual-specific mixture deconvolution of i) highly complex mixtures of 11 individuals, ii) balanced mixtures containing as few as 20 cells (10 per individual), and iii) imbalanced mixtures with a ratio as low as 1:80. Overall, our proof-of-principle study demonstrates the feasibility of scDNA-seq in general, and scATAC-seq in particular, for mixture deconvolution, genetic characterization and individual identification of the separated mixture contributors. Furthermore, it shows that, compared to scRNA-seq, scDNA-seq detects more SNPs from fewer cells, providing higher sensitivity, which is valuable in forensic genetics.
Affiliation(s)
- Lucie Kulhankova
  - Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
- Eric Bindels
  - Department of Haematology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
- Manfred Kayser
  - Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
- Eskeatnaf Mulugeta
  - Department of Cell Biology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
2
Schulte J, Caliebe A, Marciano M, Neuschwander P, Seiberle I, Scheurer E, Schulz I. DEPArray™ single-cell technology: A validation study for forensic applications. Forensic Sci Int Genet 2024; 70:103026. [PMID: 38412740 DOI: 10.1016/j.fsigen.2024.103026] [Received: 10/12/2023] [Revised: 01/17/2024] [Accepted: 02/14/2024] [Indexed: 02/29/2024]
Abstract
In forensic investigations, it is common to encounter biological mixtures consisting of homogeneous or heterogeneous components from multiple individuals and with different genetic contributions. One promising mixture deconvolution strategy is the DEPArray™ technology, which enables the separation of cell populations before genetic analysis. While technological advances are fundamental, their reliable validation is crucial for successful implementation and use in casework. Thus, this study aimed to 1) systematically validate the DEPArray™ system concerning specificity, sensitivity, repeatability, and contamination occurrences for blood, epithelial, and sperm cells, and 2) evaluate its potential for single-cell analysis in the field of forensic science. Our findings confirmed the effective identification of different cell types and the correct assignment of successfully genotyped single cells to their respective donor(s). Using the NGM Detect™ Amplification Kit, the average profile completeness for diploid cells was approximately 80%, with ∼ 290 RFUs. In contrast, haploid sperm analysis yielded an average completeness of 51% relative to the haploid reference profile, accompanied by mean peak heights of ∼ 176 RFUs. Although certain alleles of heterozygous loci in diploid cells showed strong imbalances, the overall peak balances yielded acceptable values of ≥ 60%, with a mean value of 72% ± 0.21 and a median of 77%, but with a maximum imbalance of 9% between heterozygous peaks. Locus dropouts were considered stochastic events, exhibiting variations among donors and cell types, with a notable failure incidence observed for TH01. In the wet-lab validation experiments with >500 single cells, profiling was performed using the consensus approach, where profiles were selected randomly from all data to better mirror real casework results.
Nevertheless, complete profiles could be achieved with as few as three diploid cells, while the average success rate increased to 100% when using profiles of 6-10 cells. For sperm cells, however, a consensus profile with completeness >90% of the autosomal diploid genotype could be attained using ≥15 cells. In addition, the robustness of the consensus approach was evaluated in the absence of the respective reference profile without severe deterioration. Here, increased stutter peaks (≥ 15%) were found as the main artifact in single-cell profiles, while contamination and drop-ins were ascertained as rare events. Lastly, the technique's potential and limitations are discussed, and practical guidance is provided, particularly valuable for cold cases, multiple perpetrator rapes, and analyses of homogeneous mixed evidence.
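The abstract does not spell out how the consensus profile is built, so the following is only a minimal sketch under a stated assumption: an allele enters the consensus when it is seen in at least `min_cells` single-cell profiles. The threshold rule, the function names, and the example alleles are all hypothetical and are not taken from the study.

```python
from collections import Counter
from typing import Dict, List, Set

Profile = Dict[str, Set[str]]  # locus name -> set of allele labels


def consensus_profile(cells: List[Profile], min_cells: int = 2) -> Profile:
    """Keep an allele if at least `min_cells` single-cell profiles show it.

    ASSUMPTION: this counting rule is illustrative, not the study's rule.
    """
    counts: Dict[str, Counter] = {}
    for cell in cells:
        for locus, alleles in cell.items():
            # Each cell contributes at most one observation per allele.
            counts.setdefault(locus, Counter()).update(alleles)
    return {
        locus: {allele for allele, n in c.items() if n >= min_cells}
        for locus, c in counts.items()
    }


def completeness(profile: Profile, reference: Profile) -> float:
    """Fraction of reference alleles that are present in `profile`."""
    expected = sum(len(alleles) for alleles in reference.values())
    found = sum(
        len(profile.get(locus, set()) & alleles)
        for locus, alleles in reference.items()
    )
    return found / expected


# Hypothetical data: three cells from one donor, with allele dropout in
# cells 0 to 2 and a single drop-in allele ("17") in cell 2.
reference = {"TH01": {"6", "9.3"}, "D3S1358": {"15", "16"}}
cells = [
    {"TH01": {"6"}, "D3S1358": {"15", "16"}},
    {"TH01": {"6", "9.3"}, "D3S1358": {"15"}},
    {"TH01": {"9.3"}, "D3S1358": {"15", "16", "17"}},
]
consensus = consensus_profile(cells, min_cells=2)
```

In this toy case each individual cell is incomplete, yet the three-cell consensus recovers every reference allele and discards the drop-in, which is the qualitative behaviour the abstract reports for small numbers of diploid cells.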
Affiliation(s)
- Janine Schulte
  - Institute of Forensic Medicine, University Basel, Pestalozzistrasse 22, Basel 4056, Switzerland
- Amke Caliebe
  - Institute of Medical Informatics and Statistics, Kiel University and University-Hospital Schleswig-Holstein, Brunswiker Str. 10, Kiel 24105, Germany
- Michael Marciano
  - Forensic & National Security Sciences Institute, Syracuse University, 900 S Crouse Ave, Syracuse, NY 13244, USA
- Pia Neuschwander
  - Department of Clinical Research, c/o Universitätsspital Basel, Spitalstrasse 8/12, Basel 4031, Switzerland
- Ilona Seiberle
  - Institute of Forensic Medicine, University Basel, Pestalozzistrasse 22, Basel 4056, Switzerland
- Eva Scheurer
  - Institute of Forensic Medicine, University Basel, Pestalozzistrasse 22, Basel 4056, Switzerland
- Iris Schulz
  - Institute of Forensic Medicine, University Basel, Pestalozzistrasse 22, Basel 4056, Switzerland
3
Grgicak CM, Bhembe Q, Slooten K, Sheth NC, Duffy KR, Lun DS. Single-cell investigative genetics: Single-cell data produces genotype distributions concentrated at the true genotype across all mixture complexities. Forensic Sci Int Genet 2024; 69:103000. [PMID: 38199167 DOI: 10.1016/j.fsigen.2023.103000] [Received: 08/21/2023] [Revised: 11/07/2023] [Accepted: 12/12/2023] [Indexed: 01/12/2024]
Abstract
In the absence of a suspect the forensic aim is investigative, and the focus is one of discerning what genotypes best explain the evidence. In traditional systems, the list of candidate genotypes may become vast if the sample contains DNA from many donors or the information from a minor contributor is swamped by that of major contributors, leading to lower evidential value for a true donor's contribution and, as a result, possibly overlooked or inefficient investigative leads. Recent developments in single-cell analysis offer a way forward, by producing data capable of discriminating genotypes. This is accomplished by first clustering single-cell data by similarity without reference to a known genotype. With good clustering it is reasonable to assume that the scEPGs in a cluster are of a single contributor. With that assumption we determine the probability of a cluster's content given each possible genotype at each locus, which is then used to determine the posterior probability mass distribution for all genotypes by application of Bayes' rule. A decision criterion is then applied such that the sum of the ranked probabilities of all genotypes falling in the set is at least 1-α. This is the credible genotype set and is used to inform database search criteria. Within this work we demonstrate the salience of single-cell analysis by performance testing a set of 630 previously constructed admixtures containing up to 5 donors of balanced and unbalanced contributions. We use scEPGs that were generated by isolating single cells, employing a direct-to-PCR extraction treatment, amplifying STRs that are compliant with existing national databases and applying post-PCR treatments that elicit a detection limit of one DNA copy. We determined that, for these test data, 99.3% of the true genotypes are included in the 99.8% credible set, regardless of the number of donors that comprised the mixture. 
We also determined that the most probable genotype was the true genotype for 97% of the loci when the number of cells in a cluster was at least two. Since efficient investigative leads will be borne by posterior mass distributions that are narrow and concentrated at the true genotype, we report that, for this test set, 47,900 (86%) loci returned only one credible genotype and of these 47,551 (99%) were the true genotype. When determining the LR for true contributors, 91% of the clusters rendered LR > 10^18, showing the potential of single-cell data to positively affect investigative reporting.
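The credible genotype set described in this abstract can be sketched in a few lines: apply Bayes' rule over the candidate genotypes at one locus, rank the posterior probabilities, and keep the top-ranked genotypes until their summed mass reaches at least 1-α. The code below is an illustration of that ranking rule only; the genotype labels, the uniform prior, and the likelihood values are made up for the example and do not come from the paper, whose likelihood model is not reproduced here.

```python
from typing import Dict, List, Tuple


def posterior(prior: Dict[str, float], likelihood: Dict[str, float]) -> Dict[str, float]:
    """Bayes' rule over a finite set of candidate genotypes at one locus."""
    unnormalised = {g: prior[g] * likelihood[g] for g in prior}
    total = sum(unnormalised.values())
    if total == 0:
        raise ValueError("no candidate genotype explains the cluster")
    return {g: v / total for g, v in unnormalised.items()}


def credible_set(post: Dict[str, float], alpha: float) -> List[Tuple[str, float]]:
    """Smallest top-ranked set of genotypes whose posterior mass is >= 1 - alpha."""
    ranked = sorted(post.items(), key=lambda item: item[1], reverse=True)
    chosen: List[Tuple[str, float]] = []
    mass = 0.0
    for genotype, probability in ranked:
        chosen.append((genotype, probability))
        mass += probability
        if mass >= 1 - alpha:
            break
    return chosen


# Hypothetical numbers: three candidate genotypes with a uniform prior and
# invented probabilities of the cluster's content given each genotype.
prior = {"9,9": 1 / 3, "9,10": 1 / 3, "10,10": 1 / 3}
likelihood = {"9,9": 0.001, "9,10": 0.9, "10,10": 0.05}
post = posterior(prior, likelihood)

wide = [g for g, _ in credible_set(post, alpha=0.002)]   # 99.8% credible set
narrow = [g for g, _ in credible_set(post, alpha=0.1)]   # 90% credible set
```

With these invented inputs the 99.8% set needs two genotypes while the 90% set contains only the most probable one, which shows how a smaller α widens the set used to inform a database search.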
Affiliation(s)
- Catherine M Grgicak
  - Department of Chemistry, Rutgers University, Camden, NJ 08102, USA; Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA; Program in Biomedical Forensic Sciences, Boston University, Boston, MA 02118, USA
- Qhawe Bhembe
  - Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA
- Klaas Slooten
  - Netherlands Forensic Institute, P.O. Box 24044, 2490 AA The Hague, the Netherlands; VU University Amsterdam, De Boelelaan 1081, 1081 HV Amsterdam, the Netherlands
- Nidhi C Sheth
  - Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA
- Ken R Duffy
  - Department of Mathematics, Northeastern University, Boston, MA 02115, USA; Department of Electrical and Computer Engineering, Northeastern University, Boston, MA 02115, USA; Hamilton Institute, Maynooth University, Ireland
- Desmond S Lun
  - Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA; Department of Computer Science, Rutgers University, Camden, NJ 08102, USA
4
Huffman K, Ballantyne J. Single cell genomics applications in forensic science: Current state and future directions. iScience 2023; 26:107961. [PMID: 37876804 PMCID: PMC10590970 DOI: 10.1016/j.isci.2023.107961] [Indexed: 10/26/2023]
Abstract
Standard methods of mixture analysis involve subjecting a dried crime scene sample to a "bulk" DNA extraction method such that the resulting isolate comprises a homogenized DNA mixture from the individual donors. If, however, instead of bulk DNA extraction, a sufficient number of individual cells from the mixed stain are subsampled prior to genetic analysis, then it should be possible to recover highly probative single-source, non-mixed scDNA profiles from each of the donors. This approach can detect minor donors present at low DNA levels who would otherwise not be identified using standard methods. It can also resolve rare mixtures comprising first-degree relatives and thereby prevent the false inclusion of non-donor relatives. This literature landscape review and associated commentary reports on the history of, and increasing interest in, current and potential future applications of scDNA in forensic genomics, and critically evaluates opportunities and impediments to further progress.
Affiliation(s)
- Kaitlin Huffman
  - Graduate Program in Chemistry, Department of Chemistry, University of Central Florida, PO Box 162366, Orlando, FL 32816-2366, USA
- Jack Ballantyne
  - National Center for Forensic Science, PO Box 162367, Orlando, FL 32816-2367, USA
  - Department of Chemistry, University of Central Florida, PO Box 162366, Orlando, FL 32816-2366, USA
5
Fu J, Song B, Qian J, He T, Chen H, Cheng J, Fu J. Genetic Polymorphism Analysis of 24 Y-STRs in a Han Chinese Population in Luzhou, Southwest China. Genes (Basel) 2023; 14:1904. [PMID: 37895253 PMCID: PMC10606688 DOI: 10.3390/genes14101904] [Received: 09/10/2023] [Revised: 09/27/2023] [Accepted: 09/30/2023] [Indexed: 10/29/2023]
Abstract
Han is the largest of China's 56 ethnic groups and the most populous ethnic group in the world. The Luzhou region is located in southwest China, at the junction of three provinces. Unique historical factors contribute to its genetic polymorphism. Short tandem repeats (STRs) are highly polymorphic, but the polymorphism of Y-chromosomal STR (Y-STR) loci in the Luzhou region is still unclear. It is therefore of great significance to provide Y-STR genetic data for the Han population of the Luzhou area of southwest China. A total of 910 unrelated male individuals of the Han population from the Luzhou area were recruited, and 24 Y-STRs were analyzed. The population structure and phylogenetic relationships were compared with those of another 11 related Han populations. A total of 893 different haplotypes were observed in the 910 samples, of which 877 (98.21%) were unique. Haplotype diversity and discrimination capacity were 0.999956 and 0.981319, respectively. Genetic diversity was lowest for DYS437 (0.4321) and highest for DYS385a/b (0.9642). Pairwise genetic distances and relative probability values indicate that the Luzhou Han are close to the Sichuan, Guangdong, and Hunan Han, which is consistent with geographical distribution, historical influence, and economic development. The 24 Y-STR markers were highly polymorphic in the Luzhou Han population of southwest China; these data provide genetic polymorphism information and enrich the population genetic database. They are therefore of great value for forensic applications and population genetics research.
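The summary statistics in this abstract can be checked with a short calculation. The sketch below assumes the commonly used definitions, namely haplotype diversity HD = n/(n-1) · (1 - Σ pᵢ²) and discrimination capacity DC = (number of distinct haplotypes)/n; the abstract itself does not state which formulas were applied, so this is an assumption. The haplotype labels are placeholders. The count distribution, however, follows from the reported totals: 893 - 877 = 16 shared haplotypes must cover 910 - 877 = 33 men, and since each shared haplotype occurs at least twice, the only possibility is 15 haplotypes seen twice and one seen three times.

```python
from collections import Counter
from typing import Dict, Hashable, Iterable


def haplotype_stats(haplotypes: Iterable[Hashable]) -> Dict[str, float]:
    """Summary statistics for a sample of haplotypes (one entry per individual)."""
    counts = Counter(haplotypes)
    n = sum(counts.values())            # number of individuals
    distinct = len(counts)              # number of different haplotypes
    unique = sum(1 for c in counts.values() if c == 1)
    sum_p2 = sum((c / n) ** 2 for c in counts.values())
    return {
        "n": n,
        "distinct": distinct,
        "unique": unique,
        # ASSUMED formula: unbiased (Nei-style) haplotype diversity.
        "haplotype_diversity": n / (n - 1) * (1 - sum_p2),
        # ASSUMED formula: distinct haplotypes divided by sample size.
        "discrimination_capacity": distinct / n,
    }


# Placeholder labels reproducing the count distribution implied by the totals:
# 877 singletons, 15 haplotypes shared by two men, 1 shared by three men.
sample = (
    [f"U{i}" for i in range(877)]
    + [f"P{i}" for i in range(15) for _ in range(2)]
    + ["T0"] * 3
)
stats = haplotype_stats(sample)
hd = round(stats["haplotype_diversity"], 6)
dc = round(stats["discrimination_capacity"], 6)
unique_share = round(100 * stats["unique"] / stats["distinct"], 2)
```

Under these assumed definitions the rounded values agree with the figures reported in the abstract (0.999956, 0.981319, and 98.21%).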
Affiliation(s)
- Jiewen Fu
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
  - School for Basic Medicine, Southwest Medical University, Luzhou 646000, China
  - Laboratory of Forensic DNA, The Judicial Authentication Center, Southwest Medical University, Luzhou 646000, China
- Binghui Song
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
  - Laboratory of Forensic DNA, The Judicial Authentication Center, Southwest Medical University, Luzhou 646000, China
- Jie Qian
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
- Ting He
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
- Hanchun Chen
  - Department of Biochemistry and Molecular Biology, School of Life Sciences, Central South University, Changsha 410013, China
- Jingliang Cheng
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
  - Laboratory of Forensic DNA, The Judicial Authentication Center, Southwest Medical University, Luzhou 646000, China
- Junjiang Fu
  - Key Laboratory of Epigenetics and Oncology, The Research Center for Preclinical Medicine, Southwest Medical University, Luzhou 646000, China
  - Laboratory of Forensic DNA, The Judicial Authentication Center, Southwest Medical University, Luzhou 646000, China
6
Huffman K, Kruijver M, Ballantyne J, Taylor D. Carrying out common DNA donor analysis using DBLR™ on two or five-cell mini-mixture subsamples for improved discrimination power in complex DNA mixtures. Forensic Sci Int Genet 2023; 66:102908. [PMID: 37402330 DOI: 10.1016/j.fsigen.2023.102908] [Received: 04/13/2023] [Revised: 06/13/2023] [Accepted: 06/15/2023] [Indexed: 07/06/2023]
Abstract
Probabilistic genotyping systems are able to analyse complex mixed DNA profiles and show good power to discriminate contributors from non-contributors. However, the abilities of the statistical analyses are still unavoidably bound by the quality of information being analysed. If a profile has a high number of contributors, or a contributor that is present in trace amounts, then the amount of information about those individuals in the DNA profile is limited. Recent work has shown the ability to gain better resolution of the genotypes of contributors to complex profiles using cell subsampling. This is the process of taking many sets of a limited number of cells and individually profiling each set. These 'mini-mixtures' can provide greater information about the genotypes of underlying contributors. In our work we take the resulting profiles from multiple subsamplings of complex DNA profiles in equal amounts and show how testing for, and then assuming, a common DNA donor can further improve the ability to resolve the genotypes of contributors. Using direct cell sub-sampling and statistical analysis software DBLR™, we were able to recover single source profiles of uploadable quality from five out of the six contributors of an equally proportioned mixture. Through the analysis of mixtures in this work we provide a template for carrying out common donor analysis for maximum effect.
Affiliation(s)
- Kaitlin Huffman
  - Graduate Program in Chemistry, Department of Chemistry, University of Central Florida, P.O. Box 162366, Orlando, FL 32816-2366, USA
- Maarten Kruijver
  - Institute of Environmental Science and Research Limited, Private Bag 92021, Auckland 1142, New Zealand
- Jack Ballantyne
  - Graduate Program in Chemistry, Department of Chemistry, University of Central Florida, P.O. Box 162366, Orlando, FL 32816-2366, USA; National Center for Forensic Science, P.O. Box 162367, Orlando, FL 32816-2367, USA
- Duncan Taylor
  - Forensic Science SA, GPO Box 2790, Adelaide, SA 5001, Australia; School of Biological Sciences, Flinders University, GPO Box 2100, Adelaide, SA 5001, Australia