1
|
Using unique molecular identifiers to improve allele calling in low-template mixtures. Forensic Sci Int Genet 2023; 63:102807. [PMID: 36462297 DOI: 10.1016/j.fsigen.2022.102807] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 10/20/2022] [Accepted: 11/18/2022] [Indexed: 11/27/2022]
Abstract
PCR artifacts are an ever-present challenge in sequencing applications. These artifacts can seriously limit the analysis and interpretation of low-template samples and mixtures, especially with respect to a minor contributor. In medicine, molecular barcoding techniques have been employed to decrease the impact of PCR error and to allow the examination of low-abundance somatic variation. In principle, it should be possible to apply the same techniques to the forensic analysis of mixtures. To that end, several short tandem repeat loci were selected for targeted sequencing, and a bioinformatic pipeline for analyzing the sequence data was developed. The pipeline notes the relevant unique molecular identifiers (UMIs) attached to each read and, using machine learning, filters the noise products out of the set of potential alleles. To evaluate this pipeline, DNA from pairs of individuals were mixed at different ratios (1-1, 1-9) and sequenced with different starting amounts of DNA (10, 1 and 0.1 ng). Naïvely using the information in the molecular barcodes led to increased performance, with the machine learning resulting in an additional benefit. In concrete terms, using the UMI data results in less noise for a given amount of drop out. For instance, if thresholds are selected that filter out a quarter of the true alleles, using read counts accepts 2381 noise alleles and using raw UMI counts accepts 1726 noise alleles, while the machine learning approach only accepts 307.
Collapse
|
2
|
Agudo MM, Aanes H, Roseth A, Albert M, Gill P, Bleka Ø. A comprehensive characterization of MPS-STR stutter artefacts. Forensic Sci Int Genet 2022; 60:102728. [DOI: 10.1016/j.fsigen.2022.102728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/03/2022] [Accepted: 05/24/2022] [Indexed: 11/04/2022]
|
3
|
Sharma V, Young B, Armogida L, Khan A, Wurmbach E. Evaluation of ArmedXpert software tools, MixtureAce and Mixture Interpretation, to analyze MPS-STR data. Forensic Sci Int Genet 2021; 56:102603. [PMID: 34673336 DOI: 10.1016/j.fsigen.2021.102603] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 09/28/2021] [Accepted: 10/06/2021] [Indexed: 12/12/2022]
Abstract
Massively parallel sequencing (MPS) technologies have revolutionized studies of genomic variations and transformed DNA analysis in multiple fields. Assays based on MPS must be capable of discriminating variations introduced by the method, i.e. artifacts from true polymorphisms. In PCR-MPS methods targeting microsatellite markers, artifacts can arise from PCR mis-incorporation, PCR strand slippage (stutter), and sequencing error. Reliable detection of artifacts in mixed DNA samples is a significant challenge that must be addressed in forensic DNA analysis. The ArmedXpert (NicheVision) software tools, MixtureAce™ and Mixture Interpretation, can analyze MPS data by categorizing sequence reads in alleles, stutter, and non-stutter artifacts and analyzing autosomal STR loci of mixed samples. In this study, we evaluated the ArmedXpert tools for the analysis of STR profiles of single-sourced and mixed samples generated by the ForenSeq™ DNA Signature Prep kit (Verogen). Data from eight experimental runs (240 samples) were analyzed: one benchmark run, two runs testing sensitivity with down to 50 pg DNA input, one run testing artificially degraded samples and DNA derived from bones, blood cards and teeth, as well as four runs with mixed DNA samples of varying ratios, sex, and different number of contributors (two to six). The MixtureAce stutter thresholds were initially set following the recommendations from Verogen, plus a non-stutter artifact threshold was set at 5% of allele read counts. A benchmark run, of 30 samples, plus two controls, containing 2310 total alleles, revealed over 5000 artifacts, above an analytical threshold of 10. A total of 4869 artifacts were correctly classified, while 435 were mis-classified as alleles due to exceedance of initial threshold settings. False positives must be resolved by an analyst, which can be time consuming. Stutter thresholds were adjusted based on the benchmark data and the samples were re-tested, resulting in only 57 false positive allele calls. The revised settings were then used in the analysis of the remaining seven experimental runs. Results show that MixtureAce can accurately classify artifacts and alleles when laboratory-specific threshold settings are used. The Mixture Interpretation tool was applied on two- and three-person mixtures. This tool utilized the analyzed data from MixtureAce to calculate, based on the number of alleles at a locus and their read counts, possible deconvolution outcomes with their respective ratios.
Collapse
Affiliation(s)
- Vishakha Sharma
- New York City Office of Chief Medical Examiner, Department of Forensic Biology, 421 East 26th Street, New York, NY 10016, USA
| | - Brian Young
- NicheVision Forensics, LLC., 526 South Main Street, Akron, OH 44311, USA
| | - Luigi Armogida
- NicheVision Forensics, LLC., 526 South Main Street, Akron, OH 44311, USA
| | - Amber Khan
- New York City Public Health Laboratory, Department of Health and Mental Hygiene, 455 East 26th Street, New York, NY 10016, USA
| | - Elisa Wurmbach
- New York City Office of Chief Medical Examiner, Department of Forensic Biology, 421 East 26th Street, New York, NY 10016, USA.
| |
Collapse
|
4
|
Moura-Neto R, King JL, Mello I, Dias V, Crysup B, Woerner AE, Budowle B, Silva R. Evaluation of Promega PowerSeq™ Auto/Y systems prototype on an admixed sample of Rio de Janeiro, Brazil: Population data, sensitivity, stutter and mixture studies. Forensic Sci Int Genet 2021; 53:102516. [PMID: 33878618 DOI: 10.1016/j.fsigen.2021.102516] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 03/06/2021] [Accepted: 03/29/2021] [Indexed: 02/01/2023]
Abstract
Forensic DNA typing typically relies on the length-based (LB) separation of PCR products containing short tandem repeat loci (STRs). Massively parallel sequencing (MPS) elucidates an additional level of STR motif and flanking region variation. Also, MPS enables simultaneous analysis of different marker-types - autosomal STRs, SNPs for lineage and identification purposes, reducing both the amount of sample used and the turn-around-time of analysis. Therefore, MPS methodologies are being considered as an additional tool in forensic genetic casework. The PowerSeq™ Auto/Y System (Promega Corp), a multiplex forensic kit for MPS, enables analysis of the 22 autosomal STR markers (plus Amelogenin) from the PowerPlex® Fusion 6C kit and 23 Y-STR markers from the PowerPlex® Y23 kit. Population data were generated from 140 individuals from an admixed sample from Rio de Janeiro, Brazil. All samples were processed according to the manufacturers' recommended protocols. Raw data (FastQ) were generated for each indexed sample and analyzed using STRait Razor v2s and PowerSeqv2.config file. The subsequent population data showed the largest increase in expected heterozygosity (23%), from LB to sequence-based (SB) analyses at the D5S818 locus. Unreported allele was found at the D21S11 locus. The random match probability across all loci decreased from 5.9 × 10-28 to 7.6 × 10-33. Sensitivity studies using 1, 0.25, 0.062 and 0.016 ng of DNA input were analyzed in triplicate. Full Y-STR profiles were detected in all samples, and no autosomal allele drop-out was observed with 62 pg of input DNA. For mixture studies, 1 ng of genomic DNA from a male and female sample at 1:1, 1:4, 1:9, 1:19 and 1:49 proportions were analyzed in triplicate. Clearly resolvable alleles (i.e., no stacking or shared alleles) were obtained at a 1:19 male to female contributor ratio. The minus one stutter (-1) increased with the longest uninterrupted stretch (LUS) allele size reads and according to simple or compound/complex repeats. The haplotype-specific stutter rates add more information for mixed samples interpretation. These data support the use of the PowerSeqTM Auto/Y systems prototype kit (22 autosomal STR loci, 23 Y-STR loci and Amelogenin) for forensic genetics applications.
Collapse
Affiliation(s)
- Rodrigo Moura-Neto
- Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Isadora Mello
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Victor Dias
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Rosane Silva
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| |
Collapse
|
5
|
Woerner AE, Mandape S, King JL, Muenzler M, Crysup B, Budowle B. Reducing noise and stutter in short tandem repeat loci with unique molecular identifiers. Forensic Sci Int Genet 2020; 51:102459. [PMID: 33429137 DOI: 10.1016/j.fsigen.2020.102459] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 10/28/2020] [Accepted: 12/21/2020] [Indexed: 12/24/2022]
Abstract
Unique molecular identifiers (UMIs) are a promising approach to contend with errors generated during PCR and massively parallel sequencing (MPS). With UMI technology, random molecular barcodes are ligated to template DNA molecules prior to PCR, allowing PCR and sequencing error to be tracked and corrected bioinformatically. UMIs have the potential to be particularly informative for the interpretation of short tandem repeats (STRs). Traditional MPS approaches may simply lead to the observation of alleles that are consistent with the hypotheses of stutter, while with UMIs stutter products bioinformatically may be re-associated with their parental alleles and subsequently removed. Herein, a bioinformatics pipeline named strumi is described that is designed for the analysis of STRs that are tagged with UMIs. Unlike other tools, strumi is an alignment-free machine learning driven algorithm that clusters individual MPS reads into UMI families, infers consensus super-reads that represent each family and provides an estimate the resulting haplotype's accuracy. Super-reads, in turn, approximate independent measurements not of the PCR products, but of the original template molecules, both in terms of quantity and sequence identity. Provisional assessments show that naïve threshold-based approaches generate super-reads that are accurate (∼97 % haplotype accuracy, compared to ∼78 % when UMIs are not used), and the application of a more nuanced machine learning approach increases the accuracy to ∼99.5 % depending on the level of certainty desired. With these features, UMIs may greatly simplify probabilistic genotyping systems and reduce uncertainty. However, the ability to interpret alleles at trace levels also permits the interpretation, characterization and quantification of contamination as well as somatic variation (including somatic stutter), which may present newfound challenges.
Collapse
Affiliation(s)
- August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA.
| | - Sammed Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Melissa Muenzler
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| |
Collapse
|
6
|
Wu H, Ji AC, Liu YC, Kang KL, Zhang C, Li ZW, Ji AQ, Ye J, Nie SJ, Wang L. Massively parallel sequencing of STRs using a 29-plex panel reveals stutter sequence characteristics. Electrophoresis 2020; 41:2029-2035. [PMID: 32770833 DOI: 10.1002/elps.202000093] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 06/20/2020] [Accepted: 08/05/2020] [Indexed: 12/21/2022]
Abstract
Massively parallel sequencing of forensic STRs simultaneously provides length-based genotypes and core repeat sequences as well as flanking sequence variations. Here, we report primer sequences and concentrations of a next-generation sequencing (NGS)-based in-house panel covering 28 autosomal STR loci (CSF1PO, D1GATA113, D1S1627, D1S1656, D1S1677, D2S441, D2S1776, D3S3053, D5S818, D6S474, D6S1017, D6S1043, D8S1179, D9S2157, D10S1435, D11S4463, D13S317, D14S1434, D16S539, D18S51, D18S853, D20S482, D20S1082, D22S1045, FGA, TH01, TPOX, and vWA) and the sex determinant locus Amelogenin. Preliminary evaluation experiments showed that the panel yielded intralocus- and interlocus-balanced sequencing data with a sensitivity as low as 62.5 pg input DNA. A total of 203 individuals from Yunnan Bai population were sequenced with this panel. Comparative forensic genetic analyses showed that sequence-based matching probability of this 29-plex panel reached 2.37 × 10-29 , which was 23 times lower than the length-based data. Compound stutter sequences of eight STRs were compared with parental alleles. For seven loci, repeat motif insertions or deletions occurred in the longest uninterrupted repeat sequences (LUS). However, LUS and non-LUS stutters co-existed in the locus D6S474 with different sequencing depth ratios. These results supplemented our current knowledge of forensic STR stutters, and provided a sound basis for DNA mixture deconvolution.
Collapse
Affiliation(s)
- Hao Wu
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Ai-Cen Ji
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China.,School of Forensic Medicine, Kunming Medical University, Kunming, P. R. China
| | - Yi-Cheng Liu
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Ke-Lai Kang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Chi Zhang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Zhi-Wen Li
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China.,Public Security Bureau of Ximeng County in Pu'er City, Pu'er, P. R. China
| | - An-Quan Ji
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Jian Ye
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| | - Sheng-Jie Nie
- School of Forensic Medicine, Kunming Medical University, Kunming, P. R. China
| | - Le Wang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, P. R. China
| |
Collapse
|
7
|
Li R, Wu R, Li H, Zhang Y, Peng D, Wang N, Shen X, Wang Z, Sun H. Characterizing stutter variants in forensic STRs with massively parallel sequencing. Forensic Sci Int Genet 2019; 45:102225. [PMID: 31865225 DOI: 10.1016/j.fsigen.2019.102225] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 11/19/2019] [Accepted: 12/08/2019] [Indexed: 11/28/2022]
Abstract
Despite improvements in characterizing stutters of short tandem repeats (STRs), the relationships among the amounts of stutter variants and the relationships among motifs are not well understood yet. In the present study, 750 peripheral blood samples from human subjects were included to characterize the stutters of 58 STRs via the ForenSeq DNA Signature Prep Kit on a MiSeq FGx instrument. Alleles and corresponding stutter products were identified with a sequence simplification procedure. After screening, 26,921 alleles were included, that resulted in over 50 million reads, among which 8.69% were stutter products. Among these stutter products, 83.44% were N-1 stutters. Additionally, N-4, N-3, N-2, N0, N + 1, and N + 2 variants accounted for 0.11%, 0.77%, 6.45%, 3.01%, 5.95%, and 0.25% of the stutter products, respectively. For backward stutters, stutter products correlated best with the corresponding one-unit-longer stutter (or parental allele), which may represent a good predictor for backward stutters. For forward stutters, the N + 2 stutter correlated best with the N + 1 stutter, whereas the N + 1 stutter correlated best with the N-1 stutter rather than the expected parental allele, which indicated that the patterns were more complex for forward stutters. Additionally, some interesting findings were obtained for D21S11. For two adjacent contiguous motifs, co-stuttering patterns were observed where one motif tended to increase one repeat unit while the other motif decreased one repeat unit, whereas the inter-motif dependency was not significant for interrupted motifs. In conclusion, with massively parallel sequencing technology and our sequence simplification strategy, sequence variations within alleles and stutter products were identified, which was useful to determine the origin of stutters, identify more stutter variants, and explore the relationships among motifs. These findings may be helpful for allele designation, a deeper understanding of the mechanism of stutter, and improving resolution in forensic mixture analyses.
Collapse
Affiliation(s)
- Ran Li
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Riga Wu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Haixia Li
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Yinming Zhang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Dan Peng
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Nana Wang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Xuefeng Shen
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China
| | - Zhiyuan Wang
- Forensic Science Center of Foshan Municipal Public Security Department, Foshan 528000, Guangdong, People's Republic of China
| | - Hongyu Sun
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, Guangdong, People's Republic of China; Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510089, Guangdong, People's Republic of China.
| |
Collapse
|
8
|
Compound stutter in D2S1338 and D12S391. Forensic Sci Int Genet 2018; 39:50-56. [PMID: 30554027 DOI: 10.1016/j.fsigen.2018.12.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 11/06/2018] [Accepted: 12/06/2018] [Indexed: 11/22/2022]
Abstract
Compound repeats provide a testing ground for evaluating hypotheses on the causes and consequences of stutter. As compound repeats have two repeating motifs, each motif may produce stutter variants; thus as the repeat within the locus is compound, so is the resultant stutter. Further, the rates of stutter formation between these motifs may not be independent. This lack of independence may complicate modeling strategies, thus contributing to the challenges of mixture interpretation that rely on nucleotide sequence. This study evaluates compound stutter in two STR loci: D2S1338 and D12S391. The effects of flanking variation, as well as possible interactions between the two different uninterrupted stretches (US) and their respective stutter variants are assessed. Multivariate multiple linear regression (MMLR) was used to show that, as with simple repeats, the rate of stutter product formation of a particular repeating motif is not solely a function of the US of that repeat. The nucleotides adjacent to the repeating motif also appear to influence the rate of stutter formation of that motif, with those nucleotides sometimes including the other motif. MMLR was used to estimate the size of these effects and to construct an example of a two-dimensional (thus, a compound) stutter prediction. This example may merit further investigation in the application of massively parallel sequencing data to mixture interpretation and probabilistic genotyping.
Collapse
|
9
|
Houston R, Mayes C, King JL, Hughes-Stamm S, Gangitano D. Massively parallel sequencing of 12 autosomal STRs in Cannabis sativa. Electrophoresis 2018; 39:2906-2911. [PMID: 30221375 DOI: 10.1002/elps.201800152] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2018] [Revised: 09/06/2018] [Accepted: 09/07/2018] [Indexed: 01/15/2023]
Abstract
Massively parallel sequencing (MPS) is an emerging technology in the field of forensic genetics that provides distinct advantages compared to capillary electrophoresis. This study offers a proof of concept that MPS technologies can be applied to genotype autosomal STRs in Cannabis sativa. A custom panel for MPS was designed to interrogate 12 cannabis-specific STR loci by sequence rather than size. A simple workflow was implemented to integrate the custom PCR multiplex into a workflow compatible with the Ion Plus Fragment Library Kit, Ion™ Chef, and Ion™ S5 System. For data sorting and sequence analysis, a custom configuration file was designed for STRait Razor v3 to parse and extract STR sequence data. This study represents a preliminary investigation of sequence variation for 12 autosomal STR loci in 16 cannabis samples. Full concordance was observed between the MPS and CE data. Results revealed intra-repeat variation in eight loci where the nominal or size-based allele was identical, but variances were discovered in the sequence of the flanking region. Although only a small number of cannabis samples were evaluated, this study demonstrates that more informative STR data can be obtained via MPS.
Collapse
Affiliation(s)
- Rachel Houston
- Department of Forensic Science, College of Criminal Justice, Sam Houston State University, Huntsville, TX, USA
| | - Carrie Mayes
- Department of Forensic Science, College of Criminal Justice, Sam Houston State University, Huntsville, TX, USA
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Sheree Hughes-Stamm
- Department of Forensic Science, College of Criminal Justice, Sam Houston State University, Huntsville, TX, USA.,School of Biomedical Sciences, University of Queensland, Brisbane, Queensland, Australia
| | - David Gangitano
- Department of Forensic Science, College of Criminal Justice, Sam Houston State University, Huntsville, TX, USA
| |
Collapse
|
10
|
Ganschow S, Silvery J, Kalinowski J, Tiemann C. toaSTR: A web application for forensic STR genotyping by massively parallel sequencing. Forensic Sci Int Genet 2018; 37:21-28. [PMID: 30071493 DOI: 10.1016/j.fsigen.2018.07.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 07/04/2018] [Accepted: 07/05/2018] [Indexed: 01/29/2023]
Abstract
Massively parallel sequencing (MPS) is emerging within the forensic community as a promising technique for high-resolution short tandem repeat (STR) genotyping, discovering both length and sequence polymorphisms. Conversely, the application of MPS to routine casework poses new challenges to the DNA analyst in view of the complex sequence data that is generated with this technology. We developed the web application toaSTR to help forensic experts work with MPS data simply and efficiently. An intuitive graphical user interface guides through the STR genotyping workflow. This versatile software handles data from various popular MPS platforms and supports both commercial and in-house multiplex PCR kits. Users can define locus-specific stutter thresholds and create custom sets of STR markers to be analyzed. toaSTR's innovative sequence-based stutter model predicts and identifies common stutter variants. The algorithm automatically differentiates biological (iso-)alleles from stutter and other artefacts to assist the interpretation of mixed samples. toaSTR features a comprehensive data visualization with interactive diagrams and a dynamic tabular overview of sequence observations. The software provides an interface to biostatistics tools and enables PDF result export in compliance with the sequence notation recommended by the International Society for Forensic Genetics (ISFG). An initial compatibility and concordance study confirmed the software's independent functionality and precise allele calling with data of different MPS platforms, STR amplification kits, and library preparation chemistries. Discussion of genotyping results for single source and mixed samples demonstrates toaSTR's advantages and includes suggestions for future MPS software development. The beta version of toaSTR is freely accessible at www.toastr.online.
Collapse
Affiliation(s)
- Sebastian Ganschow
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany.
| | - Janine Silvery
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany
| | - Jörn Kalinowski
- Center for Biotechnology (CeBiTec), Bielefeld University, Sequenz 1, 33615 Bielefeld, Germany
| | - Carsten Tiemann
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany
| |
Collapse
|
11
|
Flanking Variation Influences Rates of Stutter in Simple Repeats. Genes (Basel) 2017; 8:genes8110329. [PMID: 29149052 PMCID: PMC5704242 DOI: 10.3390/genes8110329] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 11/07/2017] [Accepted: 11/07/2017] [Indexed: 12/03/2022] Open
Abstract
It has been posited that the longest uninterrupted stretch (LUS) of tandem repeats, as defined by the number of exactly matching repeating motif units, is a better predictor of rates of stutter than the parental allele length (PAL). While there are cases where this hypothesis is likely correct, such as the 9.3 allele in the TH01 locus, there can be situations where it may not apply as well. For example, the PAL may capture flanking indel variations while remaining insensitive to polymorphisms in the repeat, and these haplotypic changes may impact the stutter rate. To address this, rates of stutter were contrasted against the LUS as well as the PAL on different flanking haplotypic backgrounds. This study shows that rates of stutter can vary substantially depending on the flanking haplotype, and while there are cases where the LUS is a better predictor of stutter than the PAL, examples to the contrary are apparent in commonly assayed forensic markers. Further, flanking variation that is 7 bp from the repeat region can impact rates of stutter. These findings suggest that non-proximal effects, such as DNA secondary structure, may be impacting the rates of stutter in common forensic short tandem repeat markers.
Collapse
|
12
|
Elwick K, Zeng X, King J, Budowle B, Hughes-Stamm S. Comparative tolerance of two massively parallel sequencing systems to common PCR inhibitors. Int J Legal Med 2017; 132:983-995. [DOI: 10.1007/s00414-017-1693-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 09/18/2017] [Indexed: 10/18/2022]
|