1
|
Antunes J, Walichiewicz P, Forouzmand E, Barta R, Didier M, Han Y, Perez JC, Snedecor J, Zlatkov C, Padmabandu G, Devesse L, Radecke S, Holt CL, Kumar SA, Budowle B, Stephens KM. Developmental validation of the ForenSeq® Kintelligence kit, MiSeq FGx® sequencing system and ForenSeq Universal Analysis Software. Forensic Sci Int Genet 2024; 71:103055. [PMID: 38762965 DOI: 10.1016/j.fsigen.2024.103055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 04/23/2024] [Accepted: 04/25/2024] [Indexed: 05/21/2024]
Abstract
Forensic Investigative Genetic Genealogy, a recent sub discipline of forensic genomics, leverages the high throughput and sensitivity of detection of next generation sequencing and established genetic and genealogical approaches to support the identification of human remains from missing persons investigations and investigative lead generation in violent crimes. To facilitate forensic DNA evidence analysis, the ForenSeq® Kintelligence multiplex, consisting of 10,230 SNPs, was developed. Design of the ForenSeq Kintelligence Kit, the MiSeq FGx® Sequencing System and the ForenSeq Universal Analysis Software is described. Developmental validation in accordance with SWGDAM guidelines and forensic quality assurance standards, using single source samples, is reported for the end-to-end workflow from library preparation to data interpretation. Performance metrics support the conclusion that more genetic information can be obtained from challenging samples compared to other commercially available forensic targeted DNA assays developed for capillary electrophoresis (CE) or other current next generation sequencing (NGS) kits due to the higher number of markers, the overall shorter amplicon sizes (97.8% <150 bp), and kit design. Data indicate that the multiplex is robust and fit for purpose for a wide range of quantity and quality samples. The ForenSeq Kintelligence Kit and the Universal Analysis Software allow transfer of the genetic component of forensic investigative genetic genealogy to the operational forensic laboratory.
Collapse
Affiliation(s)
- Joana Antunes
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Paulina Walichiewicz
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Elmira Forouzmand
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Richelle Barta
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Meghan Didier
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Yonmee Han
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Juan Carlos Perez
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - June Snedecor
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Clare Zlatkov
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Gothami Padmabandu
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Laurence Devesse
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Sarah Radecke
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Cydne L Holt
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Swathi A Kumar
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Bruce Budowle
- University of Helsinki, Department of Forensic Medicine, Haartmaninkatu 8, P.O. Box 63, Helsinki 00014, Finland; Forensic Science Institute, Radford University, Radford, VA 24142, USA
| | - Kathryn M Stephens
- Verogen, Inc., now a QIAGEN company, 11111 Flintkote Ave., San Diego, CA 92121, USA.
| |
Collapse
|
2
|
Budowle B, Sajantila A. Short tandem repeats - how microsatellites became the currency of forensic genetics. Nat Rev Genet 2024:10.1038/s41576-024-00721-1. [PMID: 38467785 DOI: 10.1038/s41576-024-00721-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.
- Forensic Science Institute, Radford University, Radford, VA, USA.
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| |
Collapse
|
3
|
Wang X, Muenzler M, King J, Liu M, Li H, Budowle B, Ge J. A complete pipeline enables haplotyping and phasing macrohaplotype in long sequencing reads for polyploidy samples and a multi-source DNA mixture. Electrophoresis 2024. [PMID: 38196015 DOI: 10.1002/elps.202300143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 11/19/2023] [Accepted: 11/30/2023] [Indexed: 01/11/2024]
Abstract
Macrohaplotype combines multiple types of phased DNA variants, increasing forensic discrimination power. High-quality long-sequencing reads, for example, PacBio HiFi reads, provide data to detect macrohaplotypes in multiploidy and DNA mixtures. However, the bioinformatics tools for detecting macrohaplotypes are lacking. In this study, we developed a bioinformatics software, MacroHapCaller, in which targeted loci (i.e., short TRs [STRs], single nucleotide polymorphisms, and insertion and deletions) are genotyped and combined with novel algorithms to call macrohaplotypes from long reads. MacroHapCaller uses physical phasing (i.e., read-backed phasing) to identify macrohaplotypes, and thus it can detect multi-allelic macrohaplotypes for a given sample. MacroHapCaller was validated with data generated from our designed targeted PacBio HiFi sequencing pipeline, which sequenced ∼8-kb amplicon regions harboring 20 core forensic STR loci in human benchmark samples HG002 and HG003. MacroHapCaller also was validated in whole-genome long-read sequencing data. Robust and accurate genotyping and phased macrohaplotypes were obtained with MacroHapCaller compared with the known ground truth. MacroHapCaller achieved a higher or consistent genotyping accuracy and faster speed than existing tools HipSTR and DeepVar. MacroHapCaller enables efficient macrohaplotype analysis from high-throughput sequencing data and supports applications using discriminating macrohaplotypes.
Collapse
Affiliation(s)
- Xuewen Wang
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Melissa Muenzler
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Jonathan King
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Muyi Liu
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| | - Hongmin Li
- College of Science, Cal State East Bay, Hayward, California, USA
| | - Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Science Institute, Radford University, Radford, Virginia, USA
| | - Jianye Ge
- Health Science Center, University of North Texas, Fort Worth, Texas, USA
| |
Collapse
|
4
|
Ceresa L, Chavez J, Bus MM, Budowle B, Kitchner E, Kimball J, Gryczynski I, Gryczynski Z. Multi intercalators FRET enhanced detection of minute amounts of DNA. Eur Biophys J 2023; 52:593-605. [PMID: 37140595 DOI: 10.1007/s00249-023-01655-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 04/15/2023] [Indexed: 05/05/2023]
Abstract
A novel approach is presented that increases sensitivity and specificity for detecting minimal traces of DNA in liquid and on solid samples. Förster Resonance Energy Transfer (FRET) from YOYO to Ethidium Bromide (EtBr) substantially increases the signal from DNA-bound EtBr highly enhancing sensitivity and specificity for DNA detection. The long fluorescence lifetime of the EtBr acceptor, when bound to DNA, allows for multi-pulse pumping with time gated (MPPTG) detection, which highly increases the detectable signal of DNA-bound EtBr. A straightforward spectra/image subtraction eliminates sample background and allows for a huge increase in the overall detection sensitivity. Using a combination of FRET and MPPTG detection an amount as small as 10 pg of DNA in a microliter sample can be detected without any additional sample purification/manipulation or use of amplification technologies. This amount of DNA is comparable to the DNA content of a one to two human cells. Such a detection method based on simple optics opens the potential for robust, highly sensitive DNA detection/imaging in the field, quick evaluation/sorting (i.e., triaging) of collected DNA samples, and can support various diagnostic assays.
Collapse
Affiliation(s)
- Luca Ceresa
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA.
| | - Jose Chavez
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA
| | - Magdalena M Bus
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
- Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Science Institute, Radford University, Radford, VA, USA
| | - Emma Kitchner
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA
| | - Joseph Kimball
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA
| | - Ignacy Gryczynski
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA
| | - Zygmunt Gryczynski
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX, 76109, USA
| |
Collapse
|
5
|
Vuorio A, Budowle B, Raal F, Kovanen PT. Wildfire smoke exposure and cardiovascular disease-should statins be recommended to prevent cardiovascular events? Front Cardiovasc Med 2023; 10:1259162. [PMID: 37781301 PMCID: PMC10537918 DOI: 10.3389/fcvm.2023.1259162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 08/28/2023] [Indexed: 10/03/2023] Open
Affiliation(s)
- Alpo Vuorio
- Mehiläinen, Airport Health Center, Vantaa, Finland
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
| | - Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
| | - Frederick Raal
- Faculty of Health Sciences, University of Witwatersrand, Johannesburg, South Africa
| | - Petri T. Kovanen
- Cardiovascular Research, Wihuri Research Institute, Helsinki, Finland
| |
Collapse
|
6
|
Budowle B. Unraveling crime scenes strand by strand: the forensic odyssey of Bruce Budowle. Biotechniques 2023; 75:85-89. [PMID: 37622321 DOI: 10.2144/btn-2023-0069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/26/2023] Open
Abstract
Bruce Budowle speaks to Ashling Cannon, Journal Development Editor for BioTechniques, about advancements & challenges in forensic science. Budowle completed his doctorate in genetics at Virginia Tech (VA, USA) formally known as Virginia Polytechnic Institute and State University. He then went on to complete a postdoctoral fellowship at the University of Alabama at Birmingham (AL, USA) to study genetic risk factors for acute lymphocytic leukemia, diabetes and melanoma. Budowle was early in his career and hadn't spent much time in forensics at this stage, but in 1982 an advert caught his eye for a job with the FBI to develop genetic marker systems to identify people who have left biological evidence at crime scenes. Budowle spent 26 years with the FBI and helped develop a plethora of genetic analysis methods. In 1985, it became a reality that DNA could be a signature for identifying people, and there were huge developments in DNA forensic analysis. In 2009, Budowle moved into academia and went to the University of North Texas Health Science Center (TX, USA), eventually becoming the Director of the Center for Human Identification, where he oversaw missing person and traditional crime cases, taught students and carried out fundamental and applied research. Budowle feels incredibly lucky to have had the resources, opportunities and academic infrastructure to learn and develop his knowledge. Budowle recently retired from academia and now spends his time building capacity for DNA forensics applications in Africa through the Department of Justice, with a well-established program known as the International Criminal Investigative Training Assistance Program (ICITAP) as well as with the non-government organization (NGO) DNAforAfrica.
Collapse
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Radford University Forensic Science Institute, Radford University, Radford, VA, USA
| |
Collapse
|
7
|
Budowle B, Arnette A, Sajantila A. A cost-benefit analysis for use of large SNP panels and high throughput typing for forensic investigative genetic genealogy. Int J Legal Med 2023; 137:1595-1614. [PMID: 37341834 PMCID: PMC10421786 DOI: 10.1007/s00414-023-03029-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 05/16/2023] [Indexed: 06/22/2023]
Abstract
Next-generation sequencing (NGS), also known as massively sequencing, enables large dense SNP panel analyses which generate the genetic component of forensic investigative genetic genealogy (FIGG). While the costs of implementing large SNP panel analyses into the laboratory system may seem high and daunting, the benefits of the technology may more than justify the investment. To determine if an infrastructural investment in public laboratories and using large SNP panel analyses would reap substantial benefits to society, a cost-benefit analysis (CBA) was performed. This CBA applied the logic that an increase of DNA profile uploads to a DNA database due to a sheer increase in number of markers and a greater sensitivity of detection afforded with NGS and a higher hit/association rate due to large SNP/kinship resolution and genealogy will increase investigative leads, will be more effective for identifying recidivists which in turn reduces future victims of crime, and will bring greater safety and security to communities. Analyses were performed for worst case/best case scenarios as well as by simulation sampling the range spaces with multiple input values simultaneously to generate best estimate summary statistics. This study shows that the benefits, both tangible and intangible, over the lifetime of an advanced database system would be huge and can be projected to be for less than $1 billion per year (over a 10-year period) investment can reap on average > $4.8 billion in tangible and intangible cost-benefits per year. More importantly, on average > 50,000 individuals need not become victims if FIGG were employed, assuming investigative associations generated were acted upon. The benefit to society is immense making the laboratory investment a nominal cost. The benefits likely are underestimated herein. There is latitude in the estimated costs, and even if they were doubled or tripled, there would still be substantial benefits gained with a FIGG-based approach. While the data used in this CBA are US centric (primarily because data were readily accessible), the model is generalizable and could be used by other jurisdictions to perform relevant and representative CBAs.
Collapse
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.
- Radford University Forensic Science Institute, Radford University, Radford, VA, USA.
| | - Andrew Arnette
- Department of Business Information Technology, Virginia Tech, Blacksburg, VA, USA
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| |
Collapse
|
8
|
Wang X, Huang M, Budowle B, Ge J. TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads. Front Genet 2023; 14:1227176. [PMID: 37533432 PMCID: PMC10390829 DOI: 10.3389/fgene.2023.1227176] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 06/13/2023] [Indexed: 08/04/2023] Open
Abstract
Calling tandem repeat (TR) variants from DNA sequences is of both theoretical and practical significance. Some bioinformatics tools have been developed for detecting or genotyping TRs. However, little study has been done to genotyping TR alleles from long-read sequencing data, and the accuracy of genotyping TR alleles from next-generation sequencing data still needs to be improved. Herein, a novel algorithm is described to retrieve TR regions from sequence alignment, and a software program TRcaller has been developed and integrated into a web portal to call TR alleles from both short- and long-read sequences, both whole genome and targeted sequences generated from multiple sequencing platforms. All TR alleles are genotyped as haplotypes and the robust alleles will be reported, even multiple alleles in a DNA mixture. TRcaller could provide substantially higher accuracy (>99% in 289 human individuals) in detecting TR alleles with magnitudes faster (e.g., ∼2 s for 300x human sequence data) than the mainstream software tools. The web portal preselected 119 TR loci from forensics, genealogy, and disease related TR loci. TRcaller is validated to be scalable in various applications, such as DNA forensics and disease diagnosis, which can be expanded into other fields like breeding programs. Availability: TRcaller is available at https://www.trcaller.com/SignIn.aspx.
Collapse
Affiliation(s)
- Xuewen Wang
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Meng Huang
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| |
Collapse
|
9
|
Ge J, Crysup B, Peters D, Franco R, Liu M, Wang X, Huang M, Budowle B. MPKin-YSTR: Interpretation of Y chromosome STR haplotypes for missing persons cases. Electrophoresis 2023; 44:1080-1087. [PMID: 37016479 DOI: 10.1002/elps.202200225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 03/22/2023] [Accepted: 03/30/2023] [Indexed: 04/06/2023]
Abstract
Y chromosome Short Tandem Repeat (STR) haplotypes have been used in assisting forensic investigations primarily for identification and male lineage determination. The current SWGDAM interpretation guidelines for Y-STR typing provide helpful guidance on those purposes but do not address the issue of kinship analysis with Y-STR haplotypes. Because of the high mutation rate of Y-STRs, there are complex missing person cases in which inconsistent Y-STR haplotypes between true paternal lineage relatives will arise and cases with two or more male references in the same lineage and yet differ in their haplotypes. Therefore, more useful methods are needed for interpreting the Y-STR haplotype data. Computational methods and interpretation guidelines have been developed specifically addressing this issue, either using a mismatch-based counting method or a pedigree likelihood ratio method. In this study, a software program, MPKin-YSTR, was developed by implementing those more sophisticated methods. This software should be able to improve the interpretation of complex cases with Y-STR haplotype evidence. Thus, more biological evidence will be interpreted, which in turn will result in more investigation leads to help solve crimes.
Collapse
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Dixie Peters
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Romy Franco
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Muyi Liu
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Xuewen Wang
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Meng Huang
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| |
Collapse
|
10
|
Buhmeida A, Assidi M, Budowle B. Current Healthcare Systems in Light of Hyperendemic NCDs and the COVID-19 Pandemic: Time to Change. Healthcare (Basel) 2023; 11:healthcare11101382. [PMID: 37239667 DOI: 10.3390/healthcare11101382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 05/01/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023] Open
Abstract
Despite the significant achievements of current healthcare systems (CHCSs) in curing or treating several acute conditions, there has been far less success coping with noncommunicable diseases (NCDs), which have complex roots and nonconventional transmission vectors. Owing to the impact of the invisible hyperendemic NCDs and the COVID-19 pandemic, the limitations of CHCSs have been exposed. In contrast, the advent of omics-based technologies and big data science has raised global hope of curing or treating NCDs and improving overall healthcare outcomes. However, challenges related to their use and effectiveness must be addressed. Additionally, while such advancements intend to improve quality of life, they can also contribute the ever-increasing health disparity among vulnerable populations, such as low/middle-income populations, poorly educated people, gender-based violence victims, and minority and indigenous peoples, to name a few. Among five health determinants, the contribution of medical care to individual health does not exceed 11%. Therefore, it is time to implement a new well-being-oriented system complementary or parallel to CHCSs that incorporates all five health determinants to tackle NCDs and unforeseen diseases of the future, as well as to promote cost-effective, accessible, and sustainable healthy lifestyle choices that can reduce the current level of healthcare inequity.
Collapse
Affiliation(s)
- Abdelbaset Buhmeida
- Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Mourad Assidi
- Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Universitetsgatan 2, 00100 Helsinki, Finland
- Forensic Science Institute, Radford University, Radford, 24142 VA, USA
| |
Collapse
|
11
|
Stephens KM, Barta R, Fleming K, Perez JC, Wu SF, Snedecor J, Holt CL, LaRue B, Budowle B. Developmental validation of the ForenSeq MainstAY kit, MiSeq FGx sequencing system and ForenSeq Universal Analysis Software. Forensic Sci Int Genet 2023; 64:102851. [PMID: 36907074 DOI: 10.1016/j.fsigen.2023.102851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 02/24/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023]
Abstract
For human identification purposes, forensic genetics has primarily relied upon a core set of autosomal (and to a lesser extent Y chromosome) short tandem repeat (STR) markers that are enriched by amplification using the polymerase chain reaction (PCR) that are subsequently separated and detected using capillary electrophoresis (CE). While STR typing conducted in this manner is well-developed and robust, advances in molecular biology that have occurred over the last 15 years, in particular massively parallel sequencing (MPS) [1-7], offer certain advantages as compared to CE-based typing. First and foremost is the high throughput capacity of MPS. Current bench top high throughput sequencers enable larger batteries of markers to be multiplexed and multiple samples to be sequenced simultaneously (e.g., millions to billions of nucleotides can be sequenced in one run). Second, compared to the length-based CE approach, sequencing STRs increases discrimination power, enhances sensitivity of detection, reduces noise due to instrumentation, and improves mixture interpretation [4,8-23]. Third, since detection of STRs is based on sequence and not fluorescence, amplicons can be designed that are shorter in length and of similar lengths among loci, where possible, which can improve amplification efficiency and analysis of degraded samples. Lastly, MPS offers a single format approach that can be applied to analysis of a wide variety of genetic markers of forensic interest (e.g., STRs, mitochondrial DNA, single nucleotide polymorphisms, insertion/deletions). These features make MPS a desirable technology for casework [14,15,24,25-48]. The developmental validation of the ForenSeq MainstAY library preparation kit with the MiSeq FGx Sequencing System and ForenSeq Universal Software is reported here to assist with validation of this MPS system for casework [49]. The results show that the system is sensitive, accurate and precise, specific, and performs well with mixtures and mock case-type samples.
Collapse
Affiliation(s)
| | - Richelle Barta
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Keenan Fleming
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | | | - Shan-Fu Wu
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - June Snedecor
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Cydne L Holt
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Bobby LaRue
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Bruce Budowle
- University of Helsinki, Department of Forensic Medicine, Haartmaninkatu 8, P.O. Box 63, Helsinki 00014, Finland
| |
Collapse
|
12
|
Budowle B, Sajantila A. Revisiting informed consent in forensic genomics in light of current technologies and the times. Int J Legal Med 2023; 137:551-565. [PMID: 36642749 PMCID: PMC9902322 DOI: 10.1007/s00414-023-02947-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 12/14/2022] [Indexed: 01/17/2023]
Abstract
Informed consent is based on basic ethical principles that should be considered when conducting biomedical and behavioral research involving human subjects. These principles-respect, beneficence, and justice-form the foundations of informed consent which in itself is grounded on three fundamental elements: information, comprehension, and voluntary participation. While informed consent has focused on human subjects and research, the practice has been adopted willingly in the forensic science arena primarily to acquire reference samples from family members to assist in identifying missing persons. With advances in molecular biology technologies, data mining, and access to metadata, it is important to assess whether the past informed consent process and in particular associated risks are concomitant with these increased capabilities. Given the state-of-the-art, areas in which informed consent may need to be modified and augmented are as follows: reference samples from family members in missing persons or unidentified human remains cases; targeted analysis of an individual(s) during forensic genetic genealogy cases to reduce an investigative burden; donors who provide their samples for validation studies (to include population studies and entry into databases that would be applied to forensic statistical calculations) to support implementation of procedures and operations of the forensic laboratory; family members that may contribute samples or obtain genetic information from a molecular autopsy; and use of medical and other acquired samples that could be informative for identification purposes. The informed consent process should cover (1) purpose for collection of samples; (2) process to analyze the samples (to include type of data); (3) benefits (to donor, target, family, community, etc. as applicable); (4) risks (to donor, target, family, community, etc. as applicable); (5) access to data/reports by the donor; (6) sample disposition; (7) removal of data process (i.e., expungement); (8) process to ask questions/assessment of comprehension; (9) follow-up processes; and (10) voluntary, signed, and dated consent. Issues surrounding these topics are discussed with an emphasis on addressing risk factors. Addressing informed consent will allow human subjects to make decisions voluntarily and with autonomy as well as secure the use of samples for intended use.
Collapse
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| |
Collapse
|
13
|
Wang X, Budowle B, Ge J. USAT: a bioinformatic toolkit to facilitate interpretation and comparative visualization of tandem repeat sequences. BMC Bioinformatics 2022; 23:497. [PMID: 36402991 PMCID: PMC9675219 DOI: 10.1186/s12859-022-05021-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 10/29/2022] [Indexed: 11/21/2022] Open
Abstract
Background Tandem repeats (TR), highly variable genomic variants, are widely used in individual identification, disease diagnostics, and evolutionary studies. The recent advances in sequencing technologies and bioinformatic tools facilitate calling TR haplotypes genome widely. Both length-based and sequence-based TR alleles are used in different applications. However, sequence-based TR alleles could provide the highest precision in characterizing TR haplotypes. The need to identify the differences at the single nucleotide level between or among TR haplotypes with an easy-use bioinformatic tool is essential. Results In this study, we developed a Universal STR Allele Toolkit (USAT) for TR haplotype analysis, which takes TR haplotype output from existing tools to perform allele size conversion, sequence comparison of haplotypes, figure plotting, comparison for allele distribution, and interactive visualization. An exemplary application of USAT for analysis of the CODIS core STR loci for DNA forensics with benchmarking human individuals demonstrated the capabilities of USAT. USAT has user-friendly graphic interfaces and runs fast in major computing operating systems with parallel computing enabled. Conclusion USAT is a user-friendly bioinformatics software for interpretation, visualization, and comparisons of TRs. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-05021-1.
Collapse
Affiliation(s)
- Xuewen Wang
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA
| | - Bruce Budowle
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA ,grid.266871.c0000 0000 9765 6057Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX USA
| | - Jianye Ge
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA ,grid.266871.c0000 0000 9765 6057Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX USA
| |
Collapse
|
14
|
Woerner AE, Mandape S, Kapema KB, Duque TM, Smuts A, King JL, Crysup B, Wang X, Huang M, Ge J, Budowle B. Optimized variant calling for estimating kinship. Forensic Sci Int Genet 2022; 61:102785. [DOI: 10.1016/j.fsigen.2022.102785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 08/07/2022] [Accepted: 09/29/2022] [Indexed: 11/16/2022]
|
15
|
Huang M, Liu M, Li H, King J, Smuts A, Budowle B, Ge J. A machine learning approach for missing persons cases with high genotyping errors. Front Genet 2022; 13:971242. [PMID: 36263419 PMCID: PMC9573995 DOI: 10.3389/fgene.2022.971242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 09/16/2022] [Indexed: 11/22/2022] Open
Abstract
Estimating the relationships between individuals is one of the fundamental challenges in many fields. In particular, relationship.ip estimation could provide valuable information for missing persons cases. The recently developed investigative genetic genealogy approach uses high-density single nucleotide polymorphisms (SNPs) to determine close and more distant relationships, in which hundreds of thousands to tens of millions of SNPs are generated either by microarray genotyping or whole-genome sequencing. The current studies usually assume the SNP profiles were generated with minimum errors. However, in the missing person cases, the DNA samples can be highly degraded, and the SNP profiles generated from these samples usually contain lots of errors. In this study, a machine learning approach was developed for estimating the relationships with high error SNP profiles. In this approach, a hierarchical classification strategy was employed first to classify the relationships by degree and then the relationship types within each degree separately. As for each classification, feature selection was implemented to gain better performance. Both simulated and real data sets with various genotyping error rates were utilized in evaluating this approach, and the accuracies of this approach were higher than individual measures; namely, this approach was more accurate and robust than the individual measures for SNP profiles with genotyping errors. In addition, the highest accuracy could be obtained by providing the same genotyping error rates in train and test sets, and thus estimating genotyping errors of the SNP profiles is critical to obtaining high accuracy of relationship estimation.
Collapse
Affiliation(s)
- Meng Huang
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Muyi Liu
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Hongmin Li
- Department of Computer Science, College of Science, California State University, East Bay, Hayward, CA, United States
| | - Jonathan King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Amy Smuts
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
- *Correspondence: Jianye Ge,
| |
Collapse
|
16
|
Neyra-Rivera C, Budowle B. Genetic variation of different Peruvian populations using 23 autosomal STR markers. Forensic Science International: Genetics Supplement Series 2022. [DOI: 10.1016/j.fsigss.2022.10.055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
17
|
Mandape SN, Kapema KB, Duque T, Smuts A, King JL, Crysup B, Ge J, Budowle B, Woerner AE. Evaluating probabilistic genotyping for low-pass DNA sequencing. Forensic Science International: Genetics Supplement Series 2022. [DOI: 10.1016/j.fsigss.2022.10.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
18
|
Budowle B, Ge J, Sajantila A. A prospective cost-benefit analysis for nylon 4N6FLOQSwabs®: example of the process and potential benefits. Int J Legal Med 2022; 136:1541-1549. [PMID: 36057692 PMCID: PMC9576645 DOI: 10.1007/s00414-022-02884-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Accepted: 08/24/2022] [Indexed: 01/28/2023]
Abstract
Laboratories and their criminal justice systems are confronted with challenges for implementing new technologies, practices, and policies even when there appears to be demonstrative benefits to operational performance. Impacting decisions are the often higher costs associated with, for example, new technologies, limited current budgets, and making hard decisions on what to sacrifice to take on the seemingly better approach. A prospective cost–benefit analysis (CBA) could help an agency better formulate its strategies and plans and more importantly delineate how a relatively small increase to take on, for example, a new technology can have large impact on the system (e.g., the agency, other agencies, victims and families, and taxpayers). To demonstrate the process and potential value a CBA was performed on the use of an alternate and more expensive swab with reported better DNA yield and being certified human DNA free (i.e., nylon 4N6FLOQSwabs®), versus the traditional less costly swab (i.e., cotton swab). Assumptions are described, potential underestimates and overestimates noted, different values applied (for low and modest to high), and potential benefits (monetary and qualitative) presented. The overall outcome is that the cost of using the more expensive technology pales compared with the potential tangible and intangible benefits. This approach could be a guide for laboratories (and associated criminal justice systems) worldwide to support increased funding, although the costs and benefits may vary locally and for different technologies, practices, and policies. With well-developed CBAs, goals of providing the best services to support the criminal justice system and society can be attained.
Collapse
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.
| | - Jianye Ge
- Center for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.,Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| |
Collapse
|
19
|
Stephens K, Snedecor J, Budowle B. Calculation and implementation of sample-wide stochastic thresholds for forensic genetic analysis of STRs and SNPs for massively parallel sequencing platforms. Forensic Science International: Genetics Supplement Series 2022. [DOI: 10.1016/j.fsigss.2022.09.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
20
|
Budowle B, Arnette A, Ge J, Sajantila A. A Monte Carlo simulation and sensitivity cost benefit analysis for use of nylon 4N6FLOQSwabs®. Forensic Science International: Genetics Supplement Series 2022. [DOI: 10.1016/j.fsigss.2022.09.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
21
|
Nagraj V, Scholz M, Jessa S, Ge J, Woerner AE, Huang M, Budowle B, Turner SD. vcferr: Development, validation, and application of a single nucleotide polymorphism genotyping error simulation framework. F1000Res 2022. [DOI: 10.12688/f1000research.122840.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Motivation: Genotyping error can impact downstream single nucleotide polymorphism (SNP)-based analyses. Simulating various modes and levels of error can help investigators better understand potential biases caused by miscalled genotypes. Methods: We have developed and validated vcferr, a tool to probabilistically simulate genotyping error and missingness in variant call format (VCF) files. We demonstrate how vcferr could be used to address a research question by introducing varying levels of error of different type into a sample in a simulated pedigree, and assessed how kinship analysis degrades as a function of the kind and type of error. Software availability: vcferr is available for installation via PyPi (https://pypi.org/project/vcferr/) or conda (https://anaconda.org/bioconda/vcferr). The software is released under the MIT license with source code available on GitHub (https://github.com/signaturescience/vcferr)
Collapse
|
22
|
Woerner AE, Crysup B, Hewitt FC, Gardner MW, Freitas MA, Budowle B. Techniques for estimating genetically variable peptides and semi-continuous likelihoods from massively parallel sequencing data. Forensic Sci Int Genet 2022; 59:102719. [DOI: 10.1016/j.fsigen.2022.102719] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 04/25/2022] [Accepted: 05/01/2022] [Indexed: 11/25/2022]
|
23
|
Turner SD, Nagraj V, Scholz M, Jessa S, Acevedo C, Ge J, Woerner AE, Budowle B. Evaluating the Impact of Dropout and Genotyping Error on SNP-Based Kinship Analysis With Forensic Samples. Front Genet 2022; 13:882268. [PMID: 35846115 PMCID: PMC9282869 DOI: 10.3389/fgene.2022.882268] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 05/16/2022] [Indexed: 11/13/2022] Open
Abstract
Technological advances in sequencing and single nucleotide polymorphism (SNP) genotyping microarray technology have facilitated advances in forensic analysis beyond short tandem repeat (STR) profiling, enabling the identification of unknown DNA samples and distant relationships. Forensic genetic genealogy (FGG) has facilitated the identification of distant relatives of both unidentified remains and unknown donors of crime scene DNA, invigorating the use of biological samples to resolve open cases. Forensic samples are often degraded or contain only trace amounts of DNA. In this study, the accuracy of genome-wide relatedness methods and identity by descent (IBD) segment approaches was evaluated in the presence of challenges commonly encountered with forensic data: missing data and genotyping error. Pedigree whole-genome simulations were used to estimate the genotypes of thousands of individuals with known relationships using multiple populations with different biogeographic ancestral origins. Simulations were also performed with varying error rates and types. Using these data, the performance of different methods for quantifying relatedness was benchmarked across these scenarios. When the genotyping error was low (<1%), IBD segment methods outperformed genome-wide relatedness methods for close relationships and are more accurate at distant relationship inference. However, with an increasing genotyping error (1–5%), methods that do not rely on IBD segment detection are more robust and outperform IBD segment methods. The reduced call rate had little impact on either class of methods. These results have implications for the use of dense SNP data in forensic genomics for distant kinship analysis and FGG, especially when the sample quality is low.
Collapse
Affiliation(s)
- Stephen D. Turner
- Signature Science, LLC., Austin, TX, United States
- *Correspondence: Stephen D. Turner,
| | - V.P. Nagraj
- Signature Science, LLC., Austin, TX, United States
| | | | | | | | - Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - August E. Woerner
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, United States
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, United States
| |
Collapse
|
24
|
Abstract
Motivation: SNP-based kinship analysis with genome-wide relationship estimation and IBD segment analysis methods produces results that often require further downstream process- ing and manipulation. A dedicated software package that consistently and intuitively imple- ments this analysis functionality is needed. Results: Here we present the skater R package for SNP-based kinship analysis, testing, and evaluation with R. The skater package contains a suite of well-documented tools for importing, parsing, and analyzing pedigree data, performing relationship degree inference, benchmarking relationship degree classification, and summarizing IBD segment data. Availability: The skater package is implemented as an R package and is released under the MIT license at https://github.com/signaturescience/skater. Documentation is available at https://signaturescience.github.io/skater.
Collapse
Affiliation(s)
| | - V P Nagraj
- Signature Science, LLC., Austin, TX, 78759, USA
| | | | | | | | - Jianye Ge
- Center for Human Identification, Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - August E Woerner
- Center for Human Identification, Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - Bruce Budowle
- Center for Human Identification, Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| |
Collapse
|
25
|
Crysup B, Budowle B, Woerner AE. ProSynAR: a reference aware read merger. Bioinformatics 2022; 38:2052-2053. [PMID: 35020788 DOI: 10.1093/bioinformatics/btac022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 01/05/2022] [Accepted: 01/07/2022] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Read-merging algorithms that look solely at the reads can misalign and mis-merge the reads (especially near repetitive sequences). RESULTS The C++ program ProSynAR has been written to take the reads' position in the reference into account when performing (and deciding whether to perform) a merge. AVAILABILITY *Nix users can retrieve the source from GitHub (https://github.com/Benjamin-Crysup/prosynar). Windows binary available at https://github.com/Benjamin-Crysup/prosynar/releases/download/1.0/prosynar.zip. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Benjamin Crysup
- Center for Human Identification, University of North Texas, Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas, Fort Worth, TX 76107, USA
| | - August E Woerner
- Center for Human Identification, University of North Texas, Fort Worth, TX 76107, USA
| |
Collapse
|
26
|
Ceresa L, Chavez J, Bus MM, Budowle B, Kitchner E, Kimball J, Gryczynski I, Gryczynski Z. Förster Resonance Energy Transfer-Enhanced Detection of Minute Amounts of DNA. Anal Chem 2022; 94:5062-5068. [PMID: 35286067 DOI: 10.1021/acs.analchem.1c05275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This article presents a novel approach to increase the detection sensitivity of trace amounts of DNA in a sample by employing Förster resonance energy transfer (FRET) between intercalating dyes. Two intercalators that present efficient FRET were used to enhance sensitivity and improve specificity in detecting minute amounts of DNA. Comparison of steady-state acceptor emission spectra with and without the donor allows for simple and specific detection of DNA (acceptor bound to DNA) down to 100 pg/μL. When utilizing as an acceptor a dye with a significantly longer lifetime (e.g., ethidium bromide bound to DNA), multipulse pumping and time-gated detection enable imaging/visualization of picograms of DNA present in a microliter of an unprocessed sample or DNA collected on a swab or other substrate materials.
Collapse
Affiliation(s)
- Luca Ceresa
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| | - Jose Chavez
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| | - Magdalena M Bus
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, Texas 76107, United States.,Department of Microbiology, Immunology & Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, Texas 76107, United States
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, Texas 76107, United States.,Department of Microbiology, Immunology & Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, Texas 76107, United States
| | - Emma Kitchner
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| | - Joseph Kimball
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| | - Ignacy Gryczynski
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| | - Zygmunt Gryczynski
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, Texas 76109, United States
| |
Collapse
|
27
|
Vuorio A, Budowle B, Kovanen PT. Airborne particles and cardiovascular morbidity in severe inherited hypercholesterolemia: Vulnerable endothelium under multiple attacks. Bioessays 2021; 44:e2100273. [PMID: 34967031 DOI: 10.1002/bies.202100273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Revised: 12/17/2021] [Accepted: 12/20/2021] [Indexed: 11/07/2022]
Abstract
Despite recent advances in the research related to air pollution and associated adverse cardiovascular events, the combined effects of air pollution, climate change, and SARS-CoV-2 infection on cardiovascular health need to be researched further. This Commentary addresses their impacts on cardiovascular health in the approximately 25 million people with a severe form of inherited hypercholesterolemia, called familial hypercholesterolemia (FH). The arterial endothelium in these individuals is potentially under multiple attacks caused by particles of both endogenous and exogenous origin. Thus, they have a lifelong highly elevated level of circulating low density lipoprotein (LDL) cholesterol which drives premature atherosclerosis. The high levels of LDL particles, often associated with an elevated level of circulating lipoprotein(a) particles, are both capable of inducing and maintaining endothelial dysfunction. Such pre-existing endothelial dysfunction can be exacerbated by exposure to SARS-CoV-2 viral particles, by exposure to fine particulate matter generated by climate change-associated wildfires, and by dehydration during deadly heatwaves linked to the globally rising temperatures. The external factors can severely worsen the pre-existing endothelial dysfunction, and thereby significantly increase the risk of a cardiovascular event in the exposed FH patients.
Collapse
Affiliation(s)
- Alpo Vuorio
- Mehiläinen Airport Health Centre, Occupational Health Unit, Vantaa, Finland.,Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Petri T Kovanen
- Wihuri Research Institute, Cardiovascular Research Laboratory, Helsinki, Finland
| |
Collapse
|
28
|
Ge J, King J, Mandape S, Budowle B. Enhanced mixture interpretation with macrohaplotypes based on long-read DNA sequencing. Int J Legal Med 2021; 135:2189-2198. [PMID: 34378071 DOI: 10.1007/s00414-021-02679-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 07/30/2021] [Indexed: 12/18/2022]
Abstract
Deconvoluting mixture samples is one of the most challenging problems confronting DNA forensic laboratories. Efforts have been made to provide solutions regarding mixture interpretation. The probabilistic interpretation of Short Tandem Repeat (STR) profiles has increased the number of complex mixtures that can be analyzed. A portion of complex mixture profiles, particularly for mixtures with a high number of contributors, are still being deemed uninterpretable. Novel forensic markers, such as Single Nucleotide Variants (SNV) and microhaplotypes, also have been proposed to allow for better mixture interpretation. However, these markers have both a lower discrimination power compared with STRs and are not compatible with CODIS or other national DNA databanks worldwide. The short-read sequencing (SRS) technologies can facilitate mixture interpretation by identifying intra-allelic variations within STRs. Unfortunately, the short size of the amplicons containing STR markers and sequence reads limit the alleles that can be attained per STR. The latest long-read sequencing (LRS) technologies can overcome this limitation in some samples in which larger DNA fragments (including both STRs and SNVs) with definitive phasing are available. Based on the LRS technologies, this study developed a novel CODIS compatible forensic marker, called a macrohaplotype, which combines a CODIS STR and flanking variants to offer extremely high number of haplotypes and hence very high discrimination power per marker. The macrohaplotype will substantially improve mixture interpretation capabilities. Based on publicly accessible data, a panel of 20 macrohaplotypes with sizes of ~ 8 k bp and the maximum high discrimination powers were designed. The statistical evaluation demonstrates that these macrohaplotypes substantially outperform CODIS STRs for mixture interpretation, particularly for mixtures with a high number of contributors, as well as other forensic applications. Based on these results, efforts should be undertaken to build a complete workflow, both wet-lab and bioinformatics, to precisely call the variants and generate the macrohaplotypes based on the LRS technologies.
Collapse
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA.
| | - Jonathan King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Sammed Mandape
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| |
Collapse
|
29
|
Ceresa L, Kitchner E, Seung M, Bus MM, Budowle B, Chavez J, Gryczynski I, Gryczynski Z. A novel approach to imaging and visualization of minute amounts of DNA in small volume samples. Analyst 2021; 146:6520-6527. [PMID: 34559174 DOI: 10.1039/d1an01391b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
This report presents a novel approach for detecting and visualizing small to trace amounts of DNA in a sample. By utilizing both the change in emission spectrum and change in fluorescence lifetime, there is a significant increase in detection sensitivity allowing for the imaging/visualizing of a picograms amount of DNA in a microliters volume. As in the previous reports, one of the oldest DNA intercalators, Ethidium Bromide (EtBr), is employed as a model system. With this new approach, it is feasible to visualize just a few hundred picograms of DNA without the need for prior DNA amplification. The sensitivity can later be largely improved by using an intercalator that exhibits a higher affinity to DNA and a larger fluorescence change upon binding to DNA (e.g., ethidium homodimer, YOYO, or Diamond nucleic acid dyes).
Collapse
Affiliation(s)
- Luca Ceresa
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| | - Emma Kitchner
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| | - Michael Seung
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| | - Magdalena M Bus
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX 76107, USA.,Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Ft Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX 76107, USA.,Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Ft Worth, TX 76107, USA
| | - Jose Chavez
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| | - Ignacy Gryczynski
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| | - Zygmunt Gryczynski
- Department of Physics and Astronomy, Texas Christian University, 2800 S. University, Dr. Fort Worth, Texas, 76129, USA.
| |
Collapse
|
30
|
Ge J, King JL, Smuts A, Budowle B. Precision DNA Mixture Interpretation with Single-Cell Profiling. Genes (Basel) 2021; 12:1649. [PMID: 34828255 PMCID: PMC8623868 DOI: 10.3390/genes12111649] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 10/14/2021] [Accepted: 10/14/2021] [Indexed: 11/16/2022] Open
Abstract
Wet-lab based studies have exploited emerging single-cell technologies to address the challenges of interpreting forensic mixture evidence. However, little effort has been dedicated to developing a systematic approach to interpreting the single-cell profiles derived from the mixtures. This study is the first attempt to develop a comprehensive interpretation workflow in which single-cell profiles from mixtures are interpreted individually and holistically. In this approach, the genotypes from each cell are assessed, the number of contributors (NOC) of the single-cell profiles is estimated, followed by developing a consensus profile of each contributor, and finally the consensus profile(s) can be used for a DNA database search or comparing with known profiles to determine their potential sources. The potential of this single-cell interpretation workflow was assessed by simulation with various mixture scenarios and empirical allele drop-out and drop-in rates, the accuracies of estimating the NOC, the accuracies of recovering the true alleles by consensus, and the capabilities of deconvolving mixtures with related contributors. The results support that the single-cell based mixture interpretation can provide a precision that cannot beachieved with current standard CE-STR analyses. A new paradigm for mixture interpretation is available to enhance the interpretation of forensic genetic casework.
Collapse
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX 76107, USA; (J.L.K.); (A.S.); (B.B.)
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX 76107, USA
| | - Jonathan L. King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX 76107, USA; (J.L.K.); (A.S.); (B.B.)
| | - Amy Smuts
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX 76107, USA; (J.L.K.); (A.S.); (B.B.)
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX 76107, USA; (J.L.K.); (A.S.); (B.B.)
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX 76107, USA
| |
Collapse
|
31
|
Mandape SN, Smart U, King JL, Muenzler M, Kapema KB, Budowle B, Woerner AE. MMDIT: A tool for the deconvolution and interpretation of mitochondrial DNA mixtures. Forensic Sci Int Genet 2021; 55:102568. [PMID: 34416654 DOI: 10.1016/j.fsigen.2021.102568] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 06/22/2021] [Accepted: 08/03/2021] [Indexed: 01/01/2023]
Abstract
Short tandem repeats of the nuclear genome have been the preferred markers for analyzing forensic DNA mixtures. However, when nuclear DNA in a sample is degraded or limited, mitochondrial DNA (mtDNA) markers provide a powerful alternative. Though historically considered challenging, the interpretation and analysis of mtDNA mixtures have recently seen renewed interest with the advent of massively parallel sequencing. However, there are only a few software tools available for mtDNA mixture interpretation. To address this gap, the Mitochondrial Mixture Deconvolution and Interpretation Tool (MMDIT) was developed. MMDIT is an interactive application complete with a graphical user interface that allows users to deconvolve mtDNA (whole or partial genomes) mixtures into constituent donor haplotypes and estimate random match probabilities on these resultant haplotypes. In cases where deconvolution might not be feasible, the software allows mixture analysis directly within a binary framework (i.e. qualitatively, only using data on allele presence/absence). This paper explains the functionality of MMDIT, using an example of an in vitro two-person mtDNA mixture with a ratio of 1:4. The uniqueness of MMDIT lies in its ability to resolve mixtures into complete donor haplotypes using a statistical phasing framework before mixture analysis and evaluating statistical weights employing a novel graph algorithm approach. MMDIT is the first available open-source software that can automate mtDNA mixture deconvolution and analysis. The MMDIT web application can be accessed online at https://www.unthsc.edu/mmdit/. The source code is available at https://github.com/SammedMandape/MMDIT_UI and archived on zenodo (https://doi.org/10.5281/zenodo.4770184).
Collapse
Affiliation(s)
- Sammed N Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA
| | - Utpal Smart
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA
| | - Melissa Muenzler
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA
| | - Kapema Bupe Kapema
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA.
| |
Collapse
|
32
|
Neyra Rivera CD, Delgado Ramos E, Díaz Soria F, Quispe Ramírez JS, Ge J, Budowle B. Genetic study with autosomal STR markers in people of the Peruvian jungle for human identification purposes. Canadian Society of Forensic Science Journal 2021. [DOI: 10.1080/00085030.2021.1933811] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
| | - Edgardo Delgado Ramos
- Universidad Nacional Mayor de San Marcos, Ciudad Universitaria Cercado de Lima, Lima, Perú
| | - Fabiola Díaz Soria
- Instituto Nacional de Salud, Centro de Investigación en Enfermedades Tropicales “Máxime Kuczynski”, Iquitos, Perú
| | | | - Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Ft Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Ft Worth, TX, USA
| |
Collapse
|
33
|
Smart U, Cihlar JC, Budowle B. International Wildlife Trafficking: A perspective on the challenges and potential forensic genetics solutions. Forensic Sci Int Genet 2021; 54:102551. [PMID: 34134047 DOI: 10.1016/j.fsigen.2021.102551] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 06/03/2021] [Accepted: 06/04/2021] [Indexed: 12/29/2022]
Abstract
International wildlife trafficking (IWT) is a thriving and pervasive illegal enterprise that adversely affects modern societies. Yet, despite being globally recognized as a threat to biodiversity, national security, economy, and biosecurity, IWT remains largely unabated and is proliferating at an alarming rate. The increase in IWT is generally attributed to a lack of prioritization to curb wildlife crime through legal and scientific infrastructure. This review: (1) lays out the damaging scope and influence of IWT; (2) discusses the potential of DNA marker systems, barcodes, and emerging molecular technologies, such as long-read portable sequencing, to facilitate rapid, in situ identification of species and individuals; and (3) encourages initiatives that promote quality and innovation. Interdisciplinary collaboration promises to be one of the most effective ways forward to surmounting the complex scientific and legal challenges posed by IWT.
Collapse
Affiliation(s)
- Utpal Smart
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA.
| | - Jennifer Churchill Cihlar
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| |
Collapse
|
34
|
Santos CGM, Rolim-Filho NG, Domingues CA, Dornelas-Ribeiro M, King JL, Budowle B, Moura-Neto RS, Silva R. Association of whole mtDNA, an NADPH G11914A variant, and haplogroups with high physical performance in an elite military troop. ACTA ACUST UNITED AC 2021; 54:e10317. [PMID: 33909855 PMCID: PMC8075130 DOI: 10.1590/1414-431x202010317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Accepted: 12/29/2020] [Indexed: 11/22/2022]
Abstract
Physical performance is a multifactorial and complex trait influenced by environmental and hereditary factors. Environmental factors alone have been insufficient to characterize all outstanding phenotypes. Recent advances in genomic technologies have enabled the investigation of whole nuclear and mitochondrial genome sequences, increasing our ability to understand interindividual variability in physical performance. Our objective was to evaluate the association of mitochondrial polymorphic loci with physical performance in Brazilian elite military personnel. Eighty-eight male military personnel who participated in the Command Actions Course of the Army were selected. Total DNA was obtained from blood samples and a complete mitochondrial genome (mtDNA) was sequenced using Illumina MiSeq platform. Twenty-nine subjects completed the training program (FINISHED, 'F'), and fifty-nine failed to complete (NOT_FINISHED, 'NF'). The mtDNA from NF was slightly more similar to genomes from African countries frequently related to endurance level. Twenty-two distinct mtDNA haplogroups were identified corroborating the intense genetic admixture of the Brazilian population, but their distribution was similar between the two groups (FST=0.0009). Of 745 polymorphisms detected in the mtDNA, the position G11914A within the NADPH gene component of the electron transport chain, was statistically different between F and NF groups (P=0.011; OR: 4.286; 95%CI: 1.198-16.719), with a higher frequency of the G allele in group F individuals). The high performance of military personnel may be mediated by performance-related genomic traits. Thus, mitochondrial genetic markers such as the ND4 gene may play an important role on physical performance variability.
Collapse
Affiliation(s)
- C G M Santos
- Instituto de Biologia do Exército, Rio de Janeiro, RJ, Brasil
| | - N G Rolim-Filho
- Centro de Instrução de Operações Especiais do Exército Brasileiro, Rio de Janeiro, RJ, Brasil
| | - C A Domingues
- Centro de Instrução de Operações Especiais do Exército Brasileiro, Rio de Janeiro, RJ, Brasil
| | | | - J L King
- Center for Human Identification, Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - B Budowle
- Center for Human Identification, Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - R S Moura-Neto
- Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brasil
| | - R Silva
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brasil
| |
Collapse
|
35
|
Li R, Budowle B, Sun H, Ge J. Linkage and linkage disequilibrium among the markers in the forensic MPS panels. J Forensic Sci 2021; 66:1637-1646. [PMID: 33885147 DOI: 10.1111/1556-4029.14724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 03/11/2021] [Accepted: 03/22/2021] [Indexed: 11/28/2022]
Abstract
For the past two to three decades, forensic DNA evidence has been analyzed with a limited number of short tandem repeats (STRs), and these STRs are usually assumed to be independent for statistical calculations. With the development and implementation of the MPS technologies, more autosomal markers, both single nucleotide polymorphisms (SNPs) and STRs, can be analyzed. A number of these markers are physically very close to each other, and it may not be appropriate to assume all these markers are genetically unlinked or in linkage equilibrium. In this study, publicly accessible genomic data from five representative populations were used to evaluate the genetic linkage and linkage disequilibrium (LD) between autosomal markers represented in six major commercial panels (in total, 362 markers). Among the 3041 syntenic marker pairs, 1524 pairs had sex-average genetic distances <50 cM, and thus, these marker pairs can be considered as genetically linked. Among the 143 marker pairs with physical distances <1 Mb, 19 LD haplotype blocks (comprising 39 SNPs in total) were detected for at least one of the tested populations. Statistical methods for interpreting linked markers and/or markers in LD were suggested for various case scenarios.
Collapse
Affiliation(s)
- Ran Li
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China.,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Hongyu Sun
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China.,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China
| | - Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| |
Collapse
|
36
|
Moura-Neto R, King JL, Mello I, Dias V, Crysup B, Woerner AE, Budowle B, Silva R. Evaluation of Promega PowerSeq™ Auto/Y systems prototype on an admixed sample of Rio de Janeiro, Brazil: Population data, sensitivity, stutter and mixture studies. Forensic Sci Int Genet 2021; 53:102516. [PMID: 33878618 DOI: 10.1016/j.fsigen.2021.102516] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 03/06/2021] [Accepted: 03/29/2021] [Indexed: 02/01/2023]
Abstract
Forensic DNA typing typically relies on the length-based (LB) separation of PCR products containing short tandem repeat loci (STRs). Massively parallel sequencing (MPS) elucidates an additional level of STR motif and flanking region variation. Also, MPS enables simultaneous analysis of different marker-types - autosomal STRs, SNPs for lineage and identification purposes, reducing both the amount of sample used and the turn-around-time of analysis. Therefore, MPS methodologies are being considered as an additional tool in forensic genetic casework. The PowerSeq™ Auto/Y System (Promega Corp), a multiplex forensic kit for MPS, enables analysis of the 22 autosomal STR markers (plus Amelogenin) from the PowerPlex® Fusion 6C kit and 23 Y-STR markers from the PowerPlex® Y23 kit. Population data were generated from 140 individuals from an admixed sample from Rio de Janeiro, Brazil. All samples were processed according to the manufacturers' recommended protocols. Raw data (FastQ) were generated for each indexed sample and analyzed using STRait Razor v2s and PowerSeqv2.config file. The subsequent population data showed the largest increase in expected heterozygosity (23%), from LB to sequence-based (SB) analyses at the D5S818 locus. Unreported allele was found at the D21S11 locus. The random match probability across all loci decreased from 5.9 × 10-28 to 7.6 × 10-33. Sensitivity studies using 1, 0.25, 0.062 and 0.016 ng of DNA input were analyzed in triplicate. Full Y-STR profiles were detected in all samples, and no autosomal allele drop-out was observed with 62 pg of input DNA. For mixture studies, 1 ng of genomic DNA from a male and female sample at 1:1, 1:4, 1:9, 1:19 and 1:49 proportions were analyzed in triplicate. Clearly resolvable alleles (i.e., no stacking or shared alleles) were obtained at a 1:19 male to female contributor ratio. The minus one stutter (-1) increased with the longest uninterrupted stretch (LUS) allele size reads and according to simple or compound/complex repeats. The haplotype-specific stutter rates add more information for mixed samples interpretation. These data support the use of the PowerSeqTM Auto/Y systems prototype kit (22 autosomal STR loci, 23 Y-STR loci and Amelogenin) for forensic genetics applications.
Collapse
Affiliation(s)
- Rodrigo Moura-Neto
- Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Isadora Mello
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Victor Dias
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Rosane Silva
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| |
Collapse
|
37
|
Neyra-Rivera CD, Ticona Arenas A, Delgado Ramos E, Velasquez Reinoso MRE, Caceres Rey OA, Budowle B. Population data of 27 Y-chromosome STRS in Aymara population from Peru. AUST J FORENSIC SCI 2021. [DOI: 10.1080/00450618.2021.1882571] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
| | - Andres Ticona Arenas
- Posgrado de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Ciudad Universitaria Cercado de Lima, Lima, Perú
| | - Edgardo Delgado Ramos
- Universidad Nacional Mayor de San Marcos, Ciudad Universitaria Cercado de Lima, Lima, Perú
| | | | | | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA
| |
Collapse
|
38
|
Crysup B, Woerner AE, King JL, Budowle B. Graph Algorithms for Mixture Interpretation. Genes (Basel) 2021; 12:genes12020185. [PMID: 33514030 PMCID: PMC7911948 DOI: 10.3390/genes12020185] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 01/22/2021] [Accepted: 01/26/2021] [Indexed: 12/19/2022] Open
Abstract
The scale of genetic methods are presently being expanded: forensic genetic assays previously were limited to tens of loci, but now technologies allow for a transition to forensic genomic approaches that assess thousands to millions of loci. However, there are subtle distinctions between genetic assays and their genomic counterparts (especially in the context of forensics). For instance, forensic genetic approaches tend to describe a locus as a haplotype, be it a microhaplotype or a short tandem repeat with its accompanying flanking information. In contrast, genomic assays tend to provide not haplotypes but sequence variants or differences, variants which in turn describe how the alleles apparently differ from the reference sequence. By the given construction, mitochondrial genetic assays can be thought of as genomic as they often describe genetic differences in a similar way. The mitochondrial genetics literature makes clear that sequence differences, unlike the haplotypes they encode, are not comparable to each other. Different alignment algorithms and different variant calling conventions may cause the same haplotype to be encoded in multiple ways. This ambiguity can affect evidence and reference profile comparisons as well as how “match” statistics are computed. In this study, a graph algorithm is described (and implemented in the MMDIT (Mitochondrial Mixture Database and Interpretation Tool) R package) that permits the assessment of forensic match statistics on mitochondrial DNA mixtures in a way that is invariant to both the variant calling conventions followed and the alignment parameters considered. The algorithm described, given a few modest constraints, can be used to compute the “random man not excluded” statistic or the likelihood ratio. The performance of the approach is assessed in in silico mitochondrial DNA mixtures.
Collapse
Affiliation(s)
- Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; (A.E.W.); (J.L.K.); (B.B.)
- Correspondence:
| | - August E. Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; (A.E.W.); (J.L.K.); (B.B.)
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Jonathan L. King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; (A.E.W.); (J.L.K.); (B.B.)
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; (A.E.W.); (J.L.K.); (B.B.)
- Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| |
Collapse
|
39
|
Smart U, Cihlar JC, Mandape SN, Muenzler M, King JL, Budowle B, Woerner AE. A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures. Genes (Basel) 2021; 12:128. [PMID: 33498312 PMCID: PMC7909279 DOI: 10.3390/genes12020128] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 01/14/2021] [Accepted: 01/15/2021] [Indexed: 11/16/2022] Open
Abstract
Despite the benefits of quantitative data generated by massively parallel sequencing, resolving mitotypes from mixtures occurring in certain ratios remains challenging. In this study, a bioinformatic mixture deconvolution method centered on population-based phasing was developed and validated. The method was first tested on 270 in silico two-person mixtures varying in mixture proportions. An assortment of external reference panels containing information on haplotypic variation (from similar and different haplogroups) was leveraged to assess the effect of panel composition on phasing accuracy. Building on these simulations, mitochondrial genomes from the Human Mitochondrial DataBase were sourced to populate the panels and key parameter values were identified by deconvolving an additional 7290 in silico two-person mixtures. Finally, employing an optimized reference panel and phasing parameters, the approach was validated with in vitro two-person mixtures with differing proportions. Deconvolution was most accurate when the haplotypes in the mixture were similar to haplotypes present in the reference panel and when the mixture ratios were neither highly imbalanced nor subequal (e.g., 4:1). Overall, errors in haplotype estimation were largely bounded by the accuracy of the mixture's genotype results. The proposed framework is the first available approach that automates the reconstruction of complete individual mitotypes from mixtures, even in ratios that have traditionally been considered problematic.
Collapse
Affiliation(s)
- Utpal Smart
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
| | - Jennifer Churchill Cihlar
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Sammed N. Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
| | - Melissa Muenzler
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
| | - Jonathan L. King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - August E. Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp, Bowie Blvd., Fort Worth, TX 76107, USA; (U.S.); (J.C.C.); (S.N.M.); (M.M.); (J.L.K.); (B.B.)
- Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| |
Collapse
|
40
|
Crysup B, Budowle B, Woerner AE. ProDerAl: Reference Position Dependent Alignment. Bioinformatics 2021; 37:2479-2480. [PMID: 33459758 DOI: 10.1093/bioinformatics/btab008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 12/11/2020] [Accepted: 01/04/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Current read-mapping software uses a singular specification of alignment parameters with respect to the reference. In the presence of varying reference structures (such as the repetitive regions of the human genome), alignments can be improved if those parameters are allowed vary. RESULTS To that end, the C ++ program ProDerAl was written to refine previously generated alignments using varying parameters for these problematic regions. Synthetic benchmarks show that this realignment can result in an order of magnitude fewer misaligned bases. AVAILABILITY *Nix users can retrieve the source from GitHub (https://github.com/Benjamin-Crysup/proderal.git). Windows binary available at https://github.com/Benjamin-Crysup/proderal/releases/download/v1.1/proderal.zip. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Benjamin Crysup
- Center for Human Identification, University of North Texas, Fort Worth Texas
| | - Bruce Budowle
- Center for Human Identification, University of North Texas, Fort Worth Texas
| | - August E Woerner
- Center for Human Identification, University of North Texas, Fort Worth Texas
| |
Collapse
|
41
|
King JL, Woerner AE, Mandape SN, Kapema KB, Moura-Neto RS, Silva R, Budowle B. STRait Razor Online: An enhanced user interface to facilitate interpretation of MPS data. Forensic Sci Int Genet 2021; 52:102463. [PMID: 33493821 DOI: 10.1016/j.fsigen.2021.102463] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/06/2020] [Accepted: 12/29/2020] [Indexed: 12/17/2022]
Abstract
Since 2013, STRait Razor has enabled analysis of massively parallel sequencing (MPS) data from various marker systems such as short tandem repeats, single nucleotide polymorphisms, insertion/deletions, and mitochondrial DNA. In this paper, STRait Razor Online (SRO), available at https://www.unthsc.edu/straitrazor, is introduced as an interactive, Shiny-based user interface for primary analysis of MPS data and secondary analysis of STRait Razor haplotype pileups. This software can be accessed from any common browser via desktop, tablet, or smartphone device. SRO is available also as a standalone application and open-source R script available at https://github.com/ExpectationsManaged/STRaitRazorOnline. The local application is capable of batch processing of both fastq files and primary analysis output. Processed batches generate individual report folders and summary reports at the locus- and haplotype-level in a matter of minutes. For example, the processing of data from ∼700 samples generated with the ForenSeq Signature Preparation Kit from allsequences.txt to a final table can be performed in ∼40 min whereas the Excel-based workbooks can take 35-60 h to compile a subset of the tables generated by SRO. To facilitate analysis of single-source, reference samples, a preliminary triaging system was implemented that calls potential alleles and flags loci suspected of severe heterozygote imbalance. When compared to published, manually curated data sets, 98.72 % of software-assigned allele calls without manual interpretation were consistent with curated data sets, 0.99 % loci were presented to the user for interpretation due to heterozygote imbalance, and the remaining 0.29 % of loci were inconsistent due to the analytical thresholds used across the studies.
Collapse
Affiliation(s)
- Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA.
| | - August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Sammed N Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Kapema Bupe Kapema
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | | | - Rosane Silva
- Instituto de Biofisica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| |
Collapse
|
42
|
Kitchner E, Chavez J, Ceresa L, Bus MM, Budowle B, Gryczynski Z. A novel approach for visualization and localization of small amounts of DNA on swabs to improve DNA collection and recovery process. Analyst 2021; 146:1198-1206. [PMID: 33393553 DOI: 10.1039/d0an02043e] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this report, a simple and practical procedure is proposed for DNA localization on a solid matrix e.g., a collection swab. The approach is straightforward and employs spectrum decomposition using a model DNA intercalator Ethidium Bromide (EtBr). The proposed approach can detect picograms of DNA in solution and nanograms of DNA on solid surfaces (swabs) without the need for PCR amplification. The proposed technology offers the possibility for developing an inexpensive, sensitive, rapid, and practical method for localizing and recovering DNA deposited on collection swabs during routine DNA screening. Improved detection of low DNA concentrations is needed and, if feasible, will allow for better decision making in clinical medicine, biological and environmental research, and human identification in forensic investigations.
Collapse
Affiliation(s)
- Emma Kitchner
- Department of Physics and Astronomy, Texas Christian University, Fort Worth, TX 76129, USA.
| | | | | | | | | | | |
Collapse
|
43
|
Guevara EK, Palo JU, Översti S, King JL, Seidel M, Stoljarova M, Wendt FR, Bus MM, Guengerich A, Church WB, Guillén S, Roewer L, Budowle B, Sajantila A. Genetic assessment reveals no population substructure and divergent regional and sex-specific histories in the Chachapoyas from northeast Peru. PLoS One 2020; 15:e0244497. [PMID: 33382772 PMCID: PMC7774974 DOI: 10.1371/journal.pone.0244497] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 12/10/2020] [Indexed: 12/31/2022] Open
Abstract
Many native populations in South America have been severely impacted by two relatively recent historical events, the Inca and the Spanish conquest. However decisive these disruptive events may have been, the populations and their gene pools have been shaped markedly also by the history prior to the conquests. This study focuses mainly on the Chachapoya peoples that inhabit the montane forests on the eastern slopes of the northern Peruvian Andes, but also includes three distinct neighboring populations (the Jívaro, the Huancas and the Cajamarca). By assessing mitochondrial, Y-chromosomal and autosomal diversity in the region, we explore questions that have emerged from archaeological and historical studies of the regional culture (s). These studies have shown, among others, that Chachapoyas was a crossroads for Coast-Andes-Amazon interactions since very early times. In this study, we examine the following questions: 1) was there pre-Hispanic genetic population substructure in the Chachapoyas sample? 2) did the Spanish conquest cause a more severe population decline on Chachapoyan males than on females? 3) can we detect different patterns of European gene flow in the Chachapoyas region? and, 4) did the demographic history in the Chachapoyas resemble the one from the Andean area? Despite cultural differences within the Chachapoyas region as shown by archaeological and ethnohistorical research, genetic markers show no significant evidence for past or current population substructure, although an Amazonian gene flow dynamic in the northern part of this territory is suggested. The data also indicates a bottleneck c. 25 generations ago that was more severe among males than females, as well as divergent population histories for populations in the Andean and Amazonian regions. In line with previous studies, we observe high genetic diversity in the Chachapoyas, despite the documented dramatic population declines. The diverse topography and great biodiversity of the northeastern Peruvian montane forests are potential contributing agents in shaping and maintaining the high genetic diversity in the Chachapoyas region.
Collapse
Affiliation(s)
- Evelyn K. Guevara
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- * E-mail: (EKG); (AS)
| | - Jukka U. Palo
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Genetics Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| | - Sanni Översti
- Department of Biosciences, University of Helsinki, Helsinki, Finland
| | - Jonathan L. King
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| | - Maria Seidel
- Department of Forensic Genetics, Institute of Legal Medicine and Forensic Sciences, Charité – Universitätsmedizin Berlin, Berlin, Germany
| | - Monika Stoljarova
- Department of Chemistry and Biotechnology, Tallinn University of Technology, Tallinn, Estonia
| | - Frank R. Wendt
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
- Department of Psychiatry, Yale University School of Medicine and VA Connecticut Healthcare System, West Haven, Connecticut, United States of America
- Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| | - Magdalena M. Bus
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
- Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| | - Anna Guengerich
- Eckerd College, Saint Petersburg, Florida, United States of America
| | - Warren B. Church
- Department of Earth and Space Sciences, Columbus State University, Columbus, Georgia, United States of America
| | | | - Lutz Roewer
- Department of Forensic Genetics, Institute of Legal Medicine and Forensic Sciences, Charité – Universitätsmedizin Berlin, Berlin, Germany
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
- Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
- * E-mail: (EKG); (AS)
| |
Collapse
|
44
|
Woerner AE, Mandape S, King JL, Muenzler M, Crysup B, Budowle B. Reducing noise and stutter in short tandem repeat loci with unique molecular identifiers. Forensic Sci Int Genet 2020; 51:102459. [PMID: 33429137 DOI: 10.1016/j.fsigen.2020.102459] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 10/28/2020] [Accepted: 12/21/2020] [Indexed: 12/24/2022]
Abstract
Unique molecular identifiers (UMIs) are a promising approach to contend with errors generated during PCR and massively parallel sequencing (MPS). With UMI technology, random molecular barcodes are ligated to template DNA molecules prior to PCR, allowing PCR and sequencing error to be tracked and corrected bioinformatically. UMIs have the potential to be particularly informative for the interpretation of short tandem repeats (STRs). Traditional MPS approaches may simply lead to the observation of alleles that are consistent with the hypotheses of stutter, while with UMIs stutter products bioinformatically may be re-associated with their parental alleles and subsequently removed. Herein, a bioinformatics pipeline named strumi is described that is designed for the analysis of STRs that are tagged with UMIs. Unlike other tools, strumi is an alignment-free machine learning driven algorithm that clusters individual MPS reads into UMI families, infers consensus super-reads that represent each family and provides an estimate the resulting haplotype's accuracy. Super-reads, in turn, approximate independent measurements not of the PCR products, but of the original template molecules, both in terms of quantity and sequence identity. Provisional assessments show that naïve threshold-based approaches generate super-reads that are accurate (∼97 % haplotype accuracy, compared to ∼78 % when UMIs are not used), and the application of a more nuanced machine learning approach increases the accuracy to ∼99.5 % depending on the level of certainty desired. With these features, UMIs may greatly simplify probabilistic genotyping systems and reduce uncertainty. However, the ability to interpret alleles at trace levels also permits the interpretation, characterization and quantification of contamination as well as somatic variation (including somatic stutter), which may present newfound challenges.
Collapse
Affiliation(s)
- August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA.
| | - Sammed Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Melissa Muenzler
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Benjamin Crysup
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA; Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX 76107, USA
| |
Collapse
|
45
|
Ge J, Budowle B. Forensic investigation approaches of searching relatives in DNA databases. J Forensic Sci 2020; 66:430-443. [PMID: 33136341 DOI: 10.1111/1556-4029.14615] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 09/11/2020] [Accepted: 10/05/2020] [Indexed: 11/29/2022]
Abstract
There are several indirect database searching approaches to identify the potential source of a forensic biological sample. These DNA-based approaches are familial searching, Y-STR database searching, and investigative genetic genealogy (IGG). The first two strategies use forensic DNA databases managed by the government, and the latter uses databases managed by private citizens or companies. Each of these search strategies relies on DNA testing to identify relatives of the donor of the crime scene sample, provided such profiles reside in the DNA database(s). All three approaches have been successfully used to identify the donor of biological evidence, which assisted in solving criminal cases or identifying unknown human remains. This paper describes and compares these approaches in terms of genotyping technologies, searching methods, database structures, searching efficiency, data quality, data security, and costs, and raises some potential privacy and legal considerations for further discussion by stakeholders and scientists. Y-STR database searching and IGG are advantageous since they are able to assist in more cases than familial searching readily identifying distant relatives. In contrast, familial searching can be performed more readily with existing laboratory systems. Every country or state may have its own unique economic, technical, cultural, and legal considerations and should decide the best approach(es) to fit those circumstances. Regardless of the approach, the ultimate goal should be the same: generate investigative leads and solve active and cold criminal cases to public safety, under stringent policies and security practices designed to protect the privacy of its citizenry.
Collapse
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| |
Collapse
|
46
|
Buckleton JS, Pugh SN, Bright JA, Taylor DA, Curran JM, Kruijver M, Gill P, Budowle B, Cheng K. Are low LRs reliable? Forensic Sci Int Genet 2020; 49:102350. [DOI: 10.1016/j.fsigen.2020.102350] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Revised: 06/09/2020] [Accepted: 06/26/2020] [Indexed: 12/20/2022]
|
47
|
Neyra-Rivera CD, Ticona Arenas A, Delgado Ramos E, Velasquez Reinoso MRE, Budowle B. Allelic frequencies with 23 autosomic STRS in the Aymara population of Peru. Int J Legal Med 2020; 135:779-781. [PMID: 33089341 DOI: 10.1007/s00414-020-02448-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Accepted: 10/14/2020] [Indexed: 11/29/2022]
Abstract
Population data of the Aymara in the province of Puno were established for 23 autosomal STR markers. DNA was obtained from unrelated individuals (n = 190) who reside in three areas of the Floating Islands of Lake Titicaca, residents on the border with Bolivia and residents who are not from the border with Bolivia. The PENTA E marker presented the highest PD (0.9738), PIC (0.8793), and PM (0.7847) values. The combined PD was greater than 0.99999999 and the combined PE was 0.99999994. The largest distance, based on Fst values, was between the Aymara population and the Ashaninca population (0.04022), and the smallest distance was with the populations of Bolivia (0.00136) and Peru (0.00525).
Collapse
Affiliation(s)
| | - Andres Ticona Arenas
- Posgrado de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Ciudad Universitaria Cercado de Lima,, 15081, Lima, Peru
| | - Edgardo Delgado Ramos
- Universidad Nacional Mayor de San Marcos, Ciudad Universitaria Cercado de Lima,, 15081, Lima, Peru
| | | | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Ft Worth, TX, 76107, USA
| |
Collapse
|
48
|
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
- * E-mail:
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| |
Collapse
|
49
|
Budowle B, Bus MM, Josserand MA, Peters DL. A standalone humanitarian DNA identification database system to increase identification of human remains of foreign nationals. Int J Legal Med 2020; 134:2039-2044. [PMID: 32767019 DOI: 10.1007/s00414-020-02396-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 08/03/2020] [Indexed: 11/30/2022]
Abstract
The identification of missing persons and human remains is a worldwide problem which has been exacerbated with increased migrations and rampant human trafficking and smuggling cases. DNA typing and DNA databases are primary tools and resources used to help identify human remains and missing persons. The foundation of most, if not all, national DNA database systems, e.g., CODIS, is law enforcement identification. With such database systems, compliance with statutory and operational requirements is necessary to ensure the integrity of the databases. However, because of conditions in their homelands, relatives of missing persons at times may not trust the government and may be reluctant to contact a law enforcement agency, making it difficult to satisfy the law enforcement nexus necessary for entry into a national DNA database. A potential solution to increase the identification of unidentified human remains found within the USA, such as those that may be of foreign nationals, the University of North Texas Center for Human Identification (UNTCHI) has created a Humanitarian DNA Identification DNA Database (HDID) that enables family reference sample DNA profiles from non-US citizens to be compared with the DNA profiles from unidentified human remains within its local database system. This short communication describes the needs, basis, policies, and practices to inform the scientific, investigative, and legal communities and the public so that various entities may become aware and consider submitting family reference sample (FRS) profiles from foreign nationals for the purpose of searching against UNTCHI's HDID. It is our hope that by creating this HDID, another vehicle is available to support identification of human remains within the USA and to bring much needed answers to the family members of missing persons. The HDID will merge high forensic quality and best practices with the broader accessibility for non-US families to voluntarily donate DNA profiles for searching for missing loved ones.
Collapse
Affiliation(s)
- Bruce Budowle
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX, 76107, USA. .,Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Ft Worth, TX, 76107, USA.
| | - Magdalena M Bus
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX, 76107, USA.,Department of Microbiology, Immunology and Genetics, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, Ft Worth, TX, 76107, USA
| | - Melody A Josserand
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX, 76107, USA
| | - Dixie L Peters
- Center for Human Identification, Graduate School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, CBH-250, Ft Worth, TX, 76107, USA
| |
Collapse
|
50
|
Woerner AE, Cihlar JC, Smart U, Budowle B. Numt identification and removal with RtN! Bioinformatics 2020; 36:5115-5116. [DOI: 10.1093/bioinformatics/btaa642] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 06/15/2020] [Accepted: 07/18/2020] [Indexed: 12/30/2022] Open
Abstract
Abstract
Motivation
Assays in mitochondrial genomics rely on accurate read mapping and variant calling. However, there are known and unknown nuclear paralogs that have fundamentally different genetic properties than that of the mitochondrial genome. Such paralogs complicate the interpretation of mitochondrial genome data and confound variant calling.
Results
Remove the Numts! (RtN!) was developed to categorize reads from massively parallel sequencing data not based on the expected properties and sequence identities of paralogous nuclear encoded mitochondrial sequences, but instead using sequence similarity to a large database of publicly available mitochondrial genomes. RtN! removes low-level sequencing noise and mitochondrial paralogs while not impacting variant calling, while competing methods were shown to remove true variants from mitochondrial mixtures.
Availability and implementation
https://github.com/Ahhgust/RtN
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- August E Woerner
- Department of Microbiology, Immunology and Genetics
- Center for Human Identification, University of North Texas Health Science Center (UNTHSC), Fort Worth, TX 76107, USA
| | - Jennifer Churchill Cihlar
- Department of Microbiology, Immunology and Genetics
- Center for Human Identification, University of North Texas Health Science Center (UNTHSC), Fort Worth, TX 76107, USA
| | - Utpal Smart
- Center for Human Identification, University of North Texas Health Science Center (UNTHSC), Fort Worth, TX 76107, USA
| | - Bruce Budowle
- Department of Microbiology, Immunology and Genetics
- Center for Human Identification, University of North Texas Health Science Center (UNTHSC), Fort Worth, TX 76107, USA
| |
Collapse
|