1
|
Silva DSBS, Scheible MK, Bailey SF, Williams CL, Allwood JS, Just RS, Schuetter J, Skomrock N, Minard-Smith A, Barker-Scoggins N, Eichman C, Meiklejohn K, Faith SA. Sequence-based autosomal STR characterization in four US populations using PowerSeq™ Auto/Y system. Forensic Sci Int Genet 2020; 48:102311. [PMID: 32531758 DOI: 10.1016/j.fsigen.2020.102311] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 04/21/2020] [Accepted: 05/14/2020] [Indexed: 11/30/2022]
Abstract
The forensic science community is poised to utilize modern advances in massively parallel sequencing (MPS) technologies to better characterize biological samples with higher resolution. A critical component towards the advancement of forensic DNA analysis with these technologies is a comprehensive understanding of the diversity and population distribution of sequence-based short tandem repeat (STR) alleles. Here we analyzed 786 samples of individuals from different population groups, including four of the mostly commonly encountered in forensic casework in the USA. DNA samples were amplified with the PowerSeq™ Auto/Y System Prototype Kit (Promega Corp.), and sequencing was performed on an Illumina® MiSeq instrument. Sequence data were analyzed using a bioinformatics processing tool, Altius. For additional data analysis and profile comparison, capillary electrophoresis (CE) size-based STR genotypes were generated for a subset of individuals, and where possible, also with a second commercially available MPS STR assay. Autosomal STR loci were analyzed and frequencies were calculated based on sequence composition. Also, population genetics studies were performed, with Hardy-Weinberg equilibrium, polymorphic information content (PIC), and observed and expected heterozygosity all assessed. Overall, sequence-based allelic variants of the repeat region were observed in 20 out of 22 different STR loci commonly used in forensic DNA genotyping, with the highest number of sequence variation observed at locus D12S391. The highest increase in allelic diversity and in PIC through sequence-based genotyping was observed at loci D3S1358 and D8S1179. Such detailed sequence analysis, as the one performed in the present study, is important to help understand the diversity of sequence-based STR alleles across different populations and to demonstrate how such allelic variation can improve statistics used for forensic casework.
Collapse
Affiliation(s)
- Deborah S B S Silva
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA; NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA.
| | - Melissa K Scheible
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA; NC State University, Population Health and Pathobiology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Sarah F Bailey
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Christopher L Williams
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Julia S Allwood
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Rebecca S Just
- DNA Support Unit, Federal Bureau of Investigation Laboratory, 2501 Investigation Parkway, Quantico, VA, 22135, USA
| | - Jared Schuetter
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA
| | - Nicholas Skomrock
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA
| | | | - Nicole Barker-Scoggins
- NC State University, College of Veterinary Medicine, Office of Information Technology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Christopher Eichman
- NC State University, College of Veterinary Medicine, Office of Information Technology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Kelly Meiklejohn
- NC State University, Population Health and Pathobiology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Seth A Faith
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA; NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA; Penn State University Forensic Science Program, 329 Whitmore Laboratory, University Park, PA, 16802, USA
| |
Collapse
|
2
|
Bailey SF, Scheible MK, Williams C, Silva DS, Hoggan M, Eichman C, Faith SA. Secure and robust cloud computing for high-throughput forensic microsatellite sequence analysis and databasing. Forensic Sci Int Genet 2017; 31:40-47. [DOI: 10.1016/j.fsigen.2017.08.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 07/24/2017] [Accepted: 08/06/2017] [Indexed: 01/03/2023]
|
3
|
Just RS, Scheible MK, Fast SA, Sturk-Andreaggi K, Röck AW, Bush JM, Higginbotham JL, Peck MA, Ring JD, Huber GE, Xavier C, Strobl C, Lyons EA, Diegoli TM, Bodner M, Fendt L, Kralj P, Nagl S, Niederwieser D, Zimmermann B, Parson W, Irwin JA. Full mtGenome reference data: Development and characterization of 588 forensic-quality haplotypes representing three U.S. populations. Forensic Sci Int Genet 2015; 14:141-55. [DOI: 10.1016/j.fsigen.2014.09.021] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Revised: 09/08/2014] [Accepted: 09/26/2014] [Indexed: 11/26/2022]
|
4
|
Just RS, Scheible MK, Fast SA, Sturk-Andreaggi K, Higginbotham JL, Lyons EA, Bush JM, Peck MA, Ring JD, Diegoli TM, Röck AW, Huber GE, Nagl S, Strobl C, Zimmermann B, Parson W, Irwin JA. Development of forensic-quality full mtGenome haplotypes: success rates with low template specimens. Forensic Sci Int Genet 2014; 10:73-79. [PMID: 24637383 DOI: 10.1016/j.fsigen.2014.01.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 12/12/2013] [Accepted: 01/28/2014] [Indexed: 11/18/2022]
Abstract
Forensic mitochondrial DNA (mtDNA) testing requires appropriate, high quality reference population data for estimating the rarity of questioned haplotypes and, in turn, the strength of the mtDNA evidence. Available reference databases (SWGDAM, EMPOP) currently include information from the mtDNA control region; however, novel methods that quickly and easily recover mtDNA coding region data are becoming increasingly available. Though these assays promise to both facilitate the acquisition of mitochondrial genome (mtGenome) data and maximize the general utility of mtDNA testing in forensics, the appropriate reference data and database tools required for their routine application in forensic casework are lacking. To address this deficiency, we have undertaken an effort to: (1) increase the large-scale availability of high-quality entire mtGenome reference population data, and (2) improve the information technology infrastructure required to access/search mtGenome data and employ them in forensic casework. Here, we describe the application of a data generation and analysis workflow to the development of more than 400 complete, forensic-quality mtGenomes from low DNA quantity blood serum specimens as part of a U.S. National Institute of Justice funded reference population databasing initiative. We discuss the minor modifications made to a published mtGenome Sanger sequencing protocol to maintain a high rate of throughput while minimizing manual reprocessing with these low template samples. The successful use of this semi-automated strategy on forensic-like samples provides practical insight into the feasibility of producing complete mtGenome data in a routine casework environment, and demonstrates that large (>2kb) mtDNA fragments can regularly be recovered from high quality but very low DNA quantity specimens. Further, the detailed empirical data we provide on the amplification success rates across a range of DNA input quantities will be useful moving forward as PCR-based strategies for mtDNA enrichment are considered for targeted next-generation sequencing workflows.
Collapse
Affiliation(s)
- Rebecca S Just
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States; University of Maryland, College Park, 8082 Baltimore Ave., College Park, MD 20740, United States.
| | - Melissa K Scheible
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Spence A Fast
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Kimberly Sturk-Andreaggi
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Jennifer L Higginbotham
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Elizabeth A Lyons
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Jocelyn M Bush
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Michelle A Peck
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Joseph D Ring
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Toni M Diegoli
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| | - Alexander W Röck
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria
| | - Gabriela E Huber
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria
| | - Simone Nagl
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria
| | - Christina Strobl
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria
| | - Bettina Zimmermann
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria
| | - Walther Parson
- Institute of Legal Medicine, Innsbruck Medical University, Müllerstrasse 44, Innsbruck, Austria; Penn State Eberly College of Science, 517 Thomas Building, University Park, PA 16802, United States
| | - Jodi A Irwin
- Armed Forces DNA Identification Laboratory, 115 Purple Heart Dr., Dover AFB, DE 19902, United States; American Registry of Pathology, 120A Old Camden Rd., Camden, DE 19934, United States
| |
Collapse
|
5
|
Lyons EA, Scheible MK, Sturk-Andreaggi K, Irwin JA, Just RS. A high-throughput Sanger strategy for human mitochondrial genome sequencing. BMC Genomics 2013; 14:881. [PMID: 24341507 PMCID: PMC3878621 DOI: 10.1186/1471-2164-14-881] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 11/19/2013] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND A population reference database of complete human mitochondrial genome (mtGenome) sequences is needed to enable the use of mitochondrial DNA (mtDNA) coding region data in forensic casework applications. However, the development of entire mtGenome haplotypes to forensic data quality standards is difficult and laborious. A Sanger-based amplification and sequencing strategy that is designed for automated processing, yet routinely produces high quality sequences, is needed to facilitate high-volume production of these mtGenome data sets. RESULTS We developed a robust 8-amplicon Sanger sequencing strategy that regularly produces complete, forensic-quality mtGenome haplotypes in the first pass of data generation. The protocol works equally well on samples representing diverse mtDNA haplogroups and DNA input quantities ranging from 50 pg to 1 ng, and can be applied to specimens of varying DNA quality. The complete workflow was specifically designed for implementation on robotic instrumentation, which increases throughput and reduces both the opportunities for error inherent to manual processing and the cost of generating full mtGenome sequences. CONCLUSIONS The described strategy will assist efforts to generate complete mtGenome haplotypes which meet the highest data quality expectations for forensic genetic and other applications. Additionally, high-quality data produced using this protocol can be used to assess mtDNA data developed using newer technologies and chemistries. Further, the amplification strategy can be used to enrich for mtDNA as a first step in sample preparation for targeted next-generation sequencing.
Collapse
Affiliation(s)
| | | | | | | | - Rebecca S Just
- American Registry of Pathology, 120A Old Camden Rd,, Camden DE 19934, USA.
| |
Collapse
|
6
|
Tomas C, Axler-DiPerte G, Budimlija ZM, Børsting C, Coble MD, Decker AE, Eisenberg A, Fang R, Fondevila M, Fredslund SF, Gonzalez S, Hansen AJ, Hoff-Olsen P, Haas C, Kohler P, Kriegel AK, Lindblom B, Manohar F, Maroñas O, Mogensen HS, Neureuther K, Nilsson H, Scheible MK, Schneider PM, Sonntag ML, Stangegaard M, Syndercombe-Court D, Thacker CR, Vallone PM, Westen AA, Morling N. Autosomal SNP typing of forensic samples with the GenPlex™ HID System: results of a collaborative study. Forensic Sci Int Genet 2010; 5:369-75. [PMID: 20650697 DOI: 10.1016/j.fsigen.2010.06.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2010] [Revised: 06/24/2010] [Accepted: 06/24/2010] [Indexed: 10/19/2022]
Abstract
The GenPlex™ HID System (Applied Biosystems - AB) offers typing of 48 of the 52 SNPforID SNPs and amelogenin. Previous studies have shown a high reproducibility of the GenPlex™ HID System using 250-500pg DNA of good quality. An international exercise was performed by 14 laboratories (9 in Europe and 5 in the US) in order to test the robustness and reliability of the GenPlex™ HID System on forensic samples. Three samples with partly degraded DNA and 10 samples with low amounts of DNA were analyzed in duplicates using various amounts of DNA. In order to compare the performance of the GenPlex™ HID System with the most commonly used STR kits, 500pg of partly degraded DNA from three samples was typed by the laboratories using one or more STR kits. The median SNP typing success rate was 92.3% with 500pg of partly degraded DNA. Three of the fourteen laboratories counted for more than two thirds of the locus dropouts. The median percentage of discrepant results was 0.2% with 500pg degraded DNA. An increasing percentage of locus dropouts and discrepant results were observed when lower amounts of DNA were used. Different success rates were observed for the various SNPs. The rs763869 SNP was the least successful. With the exception of the MiniFiler™ kit (AB), GenPlex™ HID performed better than five other tested STR kits. When partly degraded DNA was analyzed, GenPlex™ HID showed a very low mean mach probability, while all STR kits except MiniFiler™ had very limited discriminatory power.
Collapse
Affiliation(s)
- C Tomas
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health Sciences, University of Copenhagen, 11 Frederik V's Vej, DK-2100 Copenhagen, Denmark.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|