1
|
Fan H, Xu Y, Zhao Y, Feng K, Hong L, Zhao Q, Lu X, Shi M, Li H, Wang L, Wen S. Development and validation of YARN: A novel SE-400 MPS kit for East Asian paternal lineage analysis. Forensic Sci Int Genet 2024; 71:103029. [PMID: 38518712 DOI: 10.1016/j.fsigen.2024.103029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 02/12/2024] [Accepted: 03/03/2024] [Indexed: 03/24/2024]
Abstract
Y-chromosomal short tandem repeat polymorphisms (Y-STRs) and Y-chromosomal single nucleotide polymorphisms (Y-SNPs) are valuable genetic markers used in paternal lineage identification and population genetics. Currently, there is a lack of an effective panel that integrates Y-STRs and Y-SNPs for studying paternal lineages, particularly in East Asian populations. Hence, we developed a novel Y-chromosomal targeted panel called YARN (Y-chromosome Ancestry and Region Network) based on multiplex PCR and a single-end 400 massive parallel sequencing (MPS) strategy, consisting of 44 patrilineage Y-STRs and 260 evolutionary Y-SNPs. A total of 386 reactions were validated for the effectiveness and applicability of YARN according to SWGDAM validation guidelines, including sensitivity (with a minimum input gDNA of 0.125 ng), mixture identification (ranging from 1:1-1:10), PCR inhibitor testing (using substances such as 50 μM hematin, 100 μM hemoglobin, 100 μM humic acid, and 2.5 mM indigo dye), species specificity (successfully distinguishing humans from other animals), repeatability study (achieved 100% accuracy), and concordance study (with 99.91% accuracy for 1121 Y-STR alleles). Furthermore, we conducted a pilot study using YARN in a cohort of 484 Han Chinese males from Huaiji County, Zhaoqing City, Guangdong, China (GDZQHJ cohort). In this cohort, we identified 52 different Y-haplogroups and 73 different surnames. We found weak to moderate correlations between the Y-haplogroups, Chinese surnames, and geographical locations of the GDZQHJ cohort (with λ values ranging from 0.050 to 0.340). However, when we combined two different categories into a new independent variable, we observed stronger correlations (with λ values ranging from 0.617 to 0.754). Overall, the YARN panel, which combines Y-STR and Y-SNP genetic markers, meets forensic DNA quality assurance guidelines and holds potential for East Asian geographical origin inference and paternal lineage analysis.
Collapse
Affiliation(s)
- Haoliang Fan
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; School of Forensic Medicine, Shanxi Medical University, Taiyuan 030001, China.
| | - Yiran Xu
- Institute of Archaeological Science, Fudan University, Shanghai 200433, China.
| | - Yutao Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Kai Feng
- Duanzhou Branch of Zhaoqing Public Security Bureau, Zhaoqing 526060, China.
| | - Liuxi Hong
- Sihui Public Security Bureau of Guangdong Province, Zhaoqing 526299, China.
| | - Qiancheng Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Xiaoyu Lu
- Deepreads Biotech Company Limited, Guangzhou 510663, China.
| | - Meisen Shi
- Criminal Justice College of China University of Political Science and Law, Beijing 100088, China.
| | - Haiyan Li
- Criminal Technology Center of Guangdong Provincial Public Security Department, Guangzhou 510050, China.
| | - Lingxiang Wang
- MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| | - Shaoqing Wen
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; Institute of Archaeological Science, Fudan University, Shanghai 200433, China; MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| |
Collapse
|
2
|
Agudo MM, Aanes H, Albert M, Janssen K, Gill P, Bleka Ø. An overview of autosomal STRs and identity SNPs in a Norwegian population using massively parallel sequencing. Forensic Sci Int Genet 2024; 71:103057. [PMID: 38733649 DOI: 10.1016/j.fsigen.2024.103057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 02/27/2024] [Accepted: 04/28/2024] [Indexed: 05/13/2024]
Abstract
In recent years, probabilistic genotyping software has been adapted for the analysis of massively parallel sequencing (MPS) forensic data. Likelihood ratios (LR) are based on allele frequencies selected from populations of interest. This study provides an outline of sequence-based (SB) allele frequencies for autosomal short tandem repeats (aSTRs) and identity single nucleotide polymorphisms (iSNPs) in 371 individuals from Southern Norway. 27 aSTRs and 94 iSNPs were previously analysed with the ForenSeq™ DNA Signature Prep Kit (Verogen). The number of alleles with frequencies less than 0.05 for sequenced-based alleles was 4.6 times higher than for length-based alleles. Consistent with previous studies, it was observed that sequence-based data (both with and without flanks) exhibited higher allele diversity compared to length-based (LB) data; random match probabilities were lower for SB alleles confirming their advantage to discriminate between individuals. Two alleles in markers D22S1045 and Penta D were observed with SNPs in the 3´ flanking region, which have not been reported before. Also, a novel SNP with a minor allele frequency (MAF) of 0.001, was found in marker TH01. The impact of the sample size on minor allele frequency (MAF) values was studied in 88 iSNPs from Southern Norway (n = 371). The findings were then compared to a larger Norwegian population dataset (n = 15,769). The results showed that the smaller Southern Norway dataset provided similar results, and it was a representative sample. Population structure was analyzed for regions within Southern Norway; FST estimates for aSTR and iSNPs did not indicate any genetic structure. Finally, we investigated the genetic differences between Southern Norway and two other populations: Northern Norway and Denmark. Allele frequencies between these populations were compared, and we found no significant frequency differences (p-values > 0.0001). We also calculated the pairwise FST values per marker and comparisons between Southern and Northern Norway showed small differences. In contrast, the comparisons between Southern Norway and Denmark showed higher FST values for some markers, possibly driven by distinct alleles that were present in only one of the populations. In summary, we propose that allele frequencies from each population considered in this study could be used interchangeably to calculate genotype probabilities.
Collapse
Affiliation(s)
- Maria Martin Agudo
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Håvard Aanes
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Michel Albert
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Kirstin Janssen
- Centre for Forensic Genetics, UiT The Arctic University of Norway, Norway
| | - Peter Gill
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Øyvind Bleka
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
3
|
Guo F, Liu Z, Long G, Zhang B, Liu D, Yu S. Performance and characterization of 94 identity-informative SNPs in Northern Han Chinese using ForenSeq ™ DNA signature prep kit. J Forensic Leg Med 2024; 103:102678. [PMID: 38522119 DOI: 10.1016/j.jflm.2024.102678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 03/05/2024] [Accepted: 03/17/2024] [Indexed: 03/26/2024]
Abstract
Target and flanking region (FR) variation at 94 identity-informative SNPs (iSNPs) are investigated in 635 Northern Han Chinese using the ForenSeq DNA Signature Prep Kit on the MiSeq FGx Forensic Genomics System. The dataset presents the following performance characteristics (average values): ≥60% bases with a quality score of 20 or higher (%≥ Q20); >700 × of depth of coverage (DoC) from both Sample Details Reports and Flanking Region Reports; >80% of effective reads; ≥60% of allele coverage ratio (ACR); and ≥70% of inter-locus balance, while some stable low-performance characteristics are also observed: low DoC at rs1736442, rs1031825, rs7041158, rs338882, rs2920816, rs1493232, rs719366, and rs2342747; high noise at rs891700; and imbalanced ACR at rs6955448 and rs338882. The average amplicon length is 69 bp, suitable for detecting degraded samples. Bioinformatic concordance achieves 99.99% between the ForenSeq Universal Analysis Software (UAS) and the Integrative Genomic Viewer (IGV) inspection. Discordance results from flanking region deletions of rs10776839, rs8078417, rs2831700, and rs1454361. Due to FR variants within amplicons detected by massively parallel sequencing (MPS), the increases in the number of unique alleles, effective alleles (Ae), and observed heterozygosity (Hobs) are 46.81%, 4.51%, and 3.29%, respectively. Twelve FR variants are first reported to dbSNP, such as rs1252699848, rs1665500714, rs1771121532, rs2097285015, rs1851671415, rs2045669877, rs2046758811, rs2044248635, rs1251308240, rs1968822112, rs1981638299, and rs1341756746. All 94 iSNPs from target and amplicon data are in Hardy-Weinberg equilibrium (HWE) and independent within autosomes. As expected, forensic parameters from the amplicon data increase significantly on the combined power of discrimination (CPD = 1 - 3.9876 × 10-38) and the combined power of exclusion (CPE = 1 - 6.6690 × 10-8). Additionally, the power of the system effectiveness (CPD = 1 - 6.7054 × 10-72 and CPE = 1 - 4.4719 × 10-20) with sequence-based 27 autosomal STRs and 94 iSNP amplicons in combination is substantially improved compared to one type of marker alone. In conclusion, we have established a traditional length-based and current sequence-based reference database with 58 STRs and 94 iSNPs in the Northern Han Chinese population. We hope these data can serve as a solid reference and foundation for forensic practice.
Collapse
Affiliation(s)
- Fei Guo
- Shenyang Medical College, Shenyang, Liaoning, 110034, PR China; Key Laboratory of Human Ethnic Specificity and Phenomics of Critical Illness in Liaoning Province, Shenyang, Liaoning, 110034, PR China; Key Laboratory of Phenomics in Shenyang City, Shenyang, Liaoning, 110034, PR China.
| | - Ze Liu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning, 110002, PR China
| | - Guannan Long
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning, 110002, PR China
| | - Biao Zhang
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning, 110002, PR China
| | - Dahua Liu
- Department of Forensic Medicine, Jinzhou Medical University, Jinzhou, Liaoning, 121001, PR China
| | - Shaobo Yu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning, 110002, PR China.
| |
Collapse
|
4
|
Tao R, Dong X, Zhen X, Xia R, Qu Y, Liu S, Zhang S, Li C. Population genetic analyses of Eastern Chinese Han nationality using ForenSeq™ DNA Signature Prep Kit. Mol Genet Genomics 2024; 299:9. [PMID: 38374461 DOI: 10.1007/s00438-024-02121-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 01/31/2024] [Indexed: 02/21/2024]
Abstract
Currently, the most commonly used method for human identification and kinship analysis in forensic genetics is the detection of length polymorphism in short tandem repeats (STRs) using polymerase chain reaction (PCR) and capillary electrophoresis (CE). However, numerous studies have shown that considerable sequence variations exist in the repeat and flanking regions of the STR loci, which cannot be identified by CE detection. Comparatively, massively parallel sequencing (MPS) technology can capture these sequence differences, thereby enhancing the identification capability of certain STRs. In this study, we used the ForenSeq™ DNA Signature Prep Kit to sequence 58 STRs and 94 individual identification SNPs (iiSNPs) in a sample of 220 unrelated individuals from the Eastern Chinese Han population. Our aim is to obtain MPS-based STR and SNP data, providing further evidence for the study of population genetics and forensic applications. The results showed that the MPS method, utilizing sequence information, identified a total of 486 alleles on autosomal STRs (A-STRs), 97 alleles on X-chromosome STRs (X-STRs), and 218 alleles on Y-chromosome STRs (Y-STRs). Compared with length polymorphism, we observed an increase of 260 alleles (157, 31, and 72 alleles on A-STRs, X-STRs, and Y-STRs, respectively) across 36 STRs. The most substantial increments were observed in DYF387S1 and DYS389II, with increases of 287.5% and 250%, respectively. The most increment in the number of alleles was found at DYF387S1 and DYS389II (287.5% and 250%, respectively). The length-based (LB) and sequence-based (SB) combined random match probability (RMP) of 27 A-STRs were 6.05E-31 and 1.53E-34, respectively. Furthermore, other forensic parameters such as total discrimination power (TDP), cumulative probability of exclusion of trios (CPEtrio), and duos (CPEduo) were significantly improved when using the SB data, and informative data were obtained for the 94 iiSNPs. Collectively, these findings highlight the advantages of MPS technology in forensic genetics, and the Eastern Chinese Han genetic data generated in this study could be used as a valuable reference for future research in this field.
Collapse
Affiliation(s)
- Ruiyang Tao
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China
| | - Xinyu Dong
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China
- Minhang Branch of Shanghai Public Security Bureau, Shanghai, 201108, China
| | - Xiaoyuan Zhen
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China
- Department of Forensic Science, Medical School of Soochow University, Suzhou, 215123, China
| | - Ruocheng Xia
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China
| | - Yiling Qu
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Shiquan Liu
- Institute of Evidence Law and Forensic Science, China University of Political Science and Law, Beijing, 100088, China.
| | - Suhua Zhang
- Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| | - Chengtao Li
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Ministry of Justice, Academy of Forensic Sciences, Key Laboratory of Forensic Science, Shanghai, 200063, China.
- Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
5
|
Feng Y, Zhao Y, Lu X, Li H, Zhao K, Shi M, Wen S. Forensic analysis and sequence variation of 133 STRs in the Hakka population. Front Genet 2024; 15:1347868. [PMID: 38317659 PMCID: PMC10839782 DOI: 10.3389/fgene.2024.1347868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 01/05/2024] [Indexed: 02/07/2024] Open
Abstract
Introduction: Short Tandem Repeats (STRs) are highly valuable genetic markers in forensic science. However, the conventional PCR-CE technique has limitations, and the emergence of massively parallel sequencing (MPS) technology presents new opportunities for STR analysis. Yet, there is limited research on Chinese population diversity using MPS. Methods: In this study, we obtained genotype data for 52 A-STRs and 81 Y-STRs from the Hakka population in Meizhou, Guangdong, China, using the Forensic Analysis System Multiplecues SetB Kit on the MGISEQ-2000 platform. Results: Our findings demonstrate that these 133 STRs are highly efficient for forensic applications within the Meizhou Hakka population. Statistical analysis revealed Hobs values ranging from 0.61306 to 0.91083 and Hexp values ranging from 0.59156 to 0.91497 for A-STRs based on length polymorphism. For sequence polymorphism, Hobs values ranged from 0.61306 to 0.94586, and Hexp values fluctuated between 0.59156 and 0.94487. The CPE values were 1-5.0779620E-21 and 1-3.257436E-24 for length and sequence polymorphism, respectively, while the CPD values were 1-1.727007E-59 and 1-5.517015E-66, respectively. Among the 80 Y-STR loci, the HD values for length and sequence polymorphism were 0.99764282 and 0.99894195, respectively. The HMP values stood at 0.00418102 and 0.00288427, respectively, and the DC values were 0.75502742 and 0.83363803, respectively. For the 52 A-STR loci, we identified 554 and 989 distinct alleles based on length and sequence polymorphisms, respectively. For the 81 Y-STR loci, 464 and 652 unique alleles were detected at the length and sequence level, respectively. Population genetic analysis revealed that the Meizhou Hakka population has a close kinship relationship with the Asian populations THI and KOR based on length polymorphism data of A-STRs. Conversely, based on length polymorphism data of Y-STRs, the Meizhou Hakka population has the closest kinship relationship with the Henan Han population. Discussion: Overall, the variation information of repeat region sequences significantly enhances the forensic identification efficacy of STR genetic markers, providing an essential database for forensic individual and paternity testing in this region. Additionally, the data generated by our study will serve as a vital resource for research into the genetic structure and historical origins of the Meizhou Hakka population.
Collapse
Affiliation(s)
- Yuhang Feng
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | - Yutao Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing, China
| | - Xiaoyu Lu
- Deepreads Biotech Company Limited, Guangzhou, China
| | - Haiyan Li
- Criminal Technology Center of Guangdong Provincial Public Security Department, Guangzhou, China
| | - Kai Zhao
- Criminal Technology Center of Guangdong Provincial Public Security Department, Guangzhou, China
| | - Meisen Shi
- Criminal Justice College of China University of Political Science and Law, Beijing, China
| | - Shaoqing Wen
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
- Institute of Archaeological Science, Fudan University, Shanghai, China
- MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai, China
| |
Collapse
|
6
|
Aguilar-Velázquez JA, Llamas-de-Dios BJ, Córdova-Mercado MF, Coronado-Ávila CE, Salas-Salas O, López-Quintero A, Ramos-González B, Rangel-Villalobos H. Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeq TM DNA Signature Prep. Genes (Basel) 2023; 14:genes14051120. [PMID: 37239480 DOI: 10.3390/genes14051120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 05/14/2023] [Accepted: 05/18/2023] [Indexed: 05/28/2023] Open
Abstract
Forensic genomic systems allow simultaneously analyzing identity informative (iiSNPs), ancestry informative (aiSNPs), and phenotype informative (piSNPs) genetic markers. Among these kits, the ForenSeq DNA Signature prep (Verogen) analyzes identity STRs and SNPs as well as 24 piSNPs from the HIrisPlex system to predict the hair and eye color. We report herein these 24 piSNPs in 88 samples from Monterrey City (Northeast, Mexico) based on the ForenSeq DNA Signature prep. Phenotypes were predicted by genotype results with both Universal Analysis Software (UAS) and the web tool of the Erasmus Medical Center (EMC). We observed predominantly brown eyes (96.5%) and black hair (75%) phenotypes, whereas blue eyes, and blond and red hair were not observed. Both UAS and EMC showed high performance in eye color prediction (p ≥ 96.6%), but a lower accuracy was observed for hair color prediction. Overall, UAS hair color predictions showed better performance and robustness than those obtained with the EMC web tool (when hair shade is excluded). Although we employed a threshold (p > 70%), we suggest using the EMC enhanced approach to avoid the exclusion of a high number of samples. Finally, although our results are helpful to employ these genomic tools to predict eye color, caution is suggested for hair color prediction in Latin American (admixed) populations such as those studied herein, principally when no black color is predicted.
Collapse
Affiliation(s)
- José Alonso Aguilar-Velázquez
- Instituto de Investigación en Ciencias Biomédicas, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
- Departamento de Morfología, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
| | - Blanca Jeannete Llamas-de-Dios
- Licenciatura en Ciencias Forenses, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
| | - Miranda Fabiola Córdova-Mercado
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey 64720, Nuevo León, Mexico
| | - Carolina Elena Coronado-Ávila
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey 64720, Nuevo León, Mexico
| | - Orlando Salas-Salas
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey 64720, Nuevo León, Mexico
| | - Andrés López-Quintero
- Instituto de Nutrigenética y Nutrigenómica Traslacional, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
| | - Benito Ramos-González
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey 64720, Nuevo León, Mexico
| | - Héctor Rangel-Villalobos
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara (CUCI-UdeG), Ocotlán 47820, Jalisco, Mexico
| |
Collapse
|
7
|
Kiesler KM, Borsuk LA, Steffen CR, Vallone PM, Gettings KB. US Population Data for 94 Identity-Informative SNP Loci. Genes (Basel) 2023; 14:genes14051071. [PMID: 37239431 DOI: 10.3390/genes14051071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 05/05/2023] [Accepted: 05/09/2023] [Indexed: 05/28/2023] Open
Abstract
The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10-38 for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes.
Collapse
Affiliation(s)
- Kevin M Kiesler
- National Institute of Standards and Technology, 100 Bureau Drive, Mailstop 8314, Gaithersburg, MD 20899, USA
| | - Lisa A Borsuk
- National Institute of Standards and Technology, 100 Bureau Drive, Mailstop 8314, Gaithersburg, MD 20899, USA
| | - Carolyn R Steffen
- National Institute of Standards and Technology, 100 Bureau Drive, Mailstop 8314, Gaithersburg, MD 20899, USA
| | - Peter M Vallone
- National Institute of Standards and Technology, 100 Bureau Drive, Mailstop 8314, Gaithersburg, MD 20899, USA
| | - Katherine B Gettings
- National Institute of Standards and Technology, 100 Bureau Drive, Mailstop 8314, Gaithersburg, MD 20899, USA
| |
Collapse
|
8
|
Guo F, Liu Z, Long G, Zhang B, Dong X, Liu D, Yu S. High-resolution genotyping of 58 STRs in 635 Northern Han Chinese with MiSeq FGx ® Forensic Genomics System. Forensic Sci Int Genet 2023; 65:102879. [PMID: 37150076 DOI: 10.1016/j.fsigen.2023.102879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 04/16/2023] [Accepted: 04/22/2023] [Indexed: 05/09/2023]
Abstract
Sequence polymorphisms were characterized at 27 autosomal STRs (A-STRs), 7 X chromosomal STRs (X-STRs), and 24 Y chromosomal STRs (Y-STRs) in 635 Northern Han Chinese with the ForenSeq DNA Signature Prep Kit on the MiSeq FGx Forensic Genomics System. Since repeat region (RR) and flanking region (FR) variation can be detected by massively parallel sequencing (MPS), the increase in the number of unique alleles and the average of gene diversity was 78.18% and 3.51% between sequence and length, respectively. A total of 74 novel RR variants were identified at 33 STRs compared with STRSeq and previous studies, and 13 FR variants (rs1770275883, rs2053373277, rs2082557941, rs1925525766, rs1926380862, rs1569322793, rs2051848492, rs2051848696, rs2016239814, rs2053269960, rs2044518192, rs2044536444, and rs2089968964) were first submitted to dbSNP. Also, 99.94% of alleles were concordant between the ForenSeq DNA Signature Prep Kit and commercial CE kits. Discordance resulted from the low performance at D22S1045 and occasionally at DYS392, flanking region deletions at D7S820 and DXS10074, and the strict alignment algorithm at DXS7132. Null alleles at DYS505 and DYS448 and multialleles at DYS387S1a/b, DYS385a/b, DYS448, DYS505, DXS7132, and HPRTB were validated with other MPS and CE kits. Thus, a high-resolution sequence-based (SB) and length-based (LB) allele frequencies dataset from Northern Han Chinese has been established already. As expected, forensic parameters increased significantly on combined power of discrimination (PD) and combined power of exclusion (PE) at A-STRs, mildly on combined PD and combined mean exclusion chance (MEC) at X-STRs, and barely on discrimination capacity (DC) at Y-STRs. Additionally, MiSeq FGx quality metrics and MPS performance were evaluated in this study, which presented the high-quality of the dataset at 20 consecutive runs, such as ≥ 60% bases with a quality score of 20 or higher (%≥ Q20), > 60% of effective reads, > 2000 × of depth of coverage (DoC), ≥ 60% of allele coverage ratio (ACR) or heterozygote balance, ≥ 70% of inter-locus balance, and ≤ 0.4 of the absolute value of observed minus expected heterozygosity (|Hexp - Hobs|). In conclusion, MiSeq FGx can help us generate a high-resolution and high-quality dataset for human identification and population genetic studies.
Collapse
Affiliation(s)
- Fei Guo
- School of Forensic Science and Technology, Criminal Investigation Police University of China, Shenyang, Liaoning 110854, PR China.
| | - Ze Liu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Guannan Long
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Biao Zhang
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Xinyu Dong
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi 030619, PR China
| | - Dahua Liu
- Department of Forensic Medicine, Jinzhou Medical University, Jinzhou, Liaoning 121001, PR China
| | - Shaobo Yu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China.
| |
Collapse
|
9
|
Riman S, Ghemrawi M, Borsuk LA, Mahfouz R, Walsh S, Vallone PM. Sequence-based allelic variations and frequencies for 22 autosomal STR loci in the Lebanese population. Forensic Sci Int Genet 2023; 65:102872. [PMID: 37068444 DOI: 10.1016/j.fsigen.2023.102872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 04/06/2023] [Accepted: 04/08/2023] [Indexed: 04/19/2023]
Abstract
This is the first study that characterizes the sequence-based allelic variations of 22 autosomal Short Tandem Repeat (aSTR) loci in a population dataset collected from Lebanon. Genomic DNA extracts from 195 unrelated Lebanese individuals were amplified with PowerSeq 46GY System Prototype. Targeted amplicons were subjected to DNA library preparation and sequenced on the Verogen MiSeq FGx Sequencing System. Raw FASTQ data files were processed by STRait Razor v3. Sequence strings were annotated according to the considerations of the DNA Commission of the International Society for Forensic Genetics (ISFG) and tabulated herein with their respective allelic frequencies and GeneBank accession and version numbers. The sequenced Lebanese dataset resulted in 429 distinct allelic sequences as compared to the 236 alleles identified by length only. The increase in the number of alleles was observed at 18 out of 22 aSTR loci and was attributed to the sequence variations residing in both the STR repeat motifs and flanking regions. The study uncovered 25 novel aSTR allelic sequences across 12 loci for which GenBank records did not previously exist in the STRSeq BioProject, PRJNA380127. For a concordance check, the length-based allelic calls derived from the full sequences were compared to those genotyped using capillary electrophoresis (CE) methods. Population genetic parameters relevant to the evaluation of forensic DNA evidence were assessed for the sequence-based data and compared to the parameters generated from the length-based information. Using the sequence-based data, Analysis of MOlecular VAriance (AMOVA), genetic distances, and population genetic structure were evaluated for 1231 individuals sampled from the Lebanese and four U.S. populations (African American, Asian, Caucasian, and Hispanic). The results were tabulated and visualized in a population tree, multidimensional scaling scatter plots, and bar plots. This newly established sequence-based database for the Lebanese population can be beneficial for extending NGS applicability to casework or paternity testing and assessing the strength of evidence for NGS-STR profiles. The described novel sequence variants at certain loci can further help in the effort to characterize the sequence diversity of STR markers from different populations around the world.
Collapse
Affiliation(s)
- Sarah Riman
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA.
| | - Mirna Ghemrawi
- Department of Chemistry and Biochemistry and International Forensic Research Institute, Florida International University, Miami, FL 33199, USA
| | - Lisa A Borsuk
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA
| | - Rami Mahfouz
- Department of Pathology and Laboratory Medicine, American University of Beirut Medical Center, Beirut, Lebanon
| | - Susan Walsh
- Department of Biology, Indiana University Purdue University Indianapolis, Indianapolis, IN 46202, USA
| | - Peter M Vallone
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA
| |
Collapse
|
10
|
Stephens KM, Barta R, Fleming K, Perez JC, Wu SF, Snedecor J, Holt CL, LaRue B, Budowle B. Developmental validation of the ForenSeq MainstAY kit, MiSeq FGx sequencing system and ForenSeq Universal Analysis Software. Forensic Sci Int Genet 2023; 64:102851. [PMID: 36907074 DOI: 10.1016/j.fsigen.2023.102851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 02/24/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023]
Abstract
For human identification purposes, forensic genetics has primarily relied upon a core set of autosomal (and to a lesser extent Y chromosome) short tandem repeat (STR) markers that are enriched by amplification using the polymerase chain reaction (PCR) that are subsequently separated and detected using capillary electrophoresis (CE). While STR typing conducted in this manner is well-developed and robust, advances in molecular biology that have occurred over the last 15 years, in particular massively parallel sequencing (MPS) [1-7], offer certain advantages as compared to CE-based typing. First and foremost is the high throughput capacity of MPS. Current bench top high throughput sequencers enable larger batteries of markers to be multiplexed and multiple samples to be sequenced simultaneously (e.g., millions to billions of nucleotides can be sequenced in one run). Second, compared to the length-based CE approach, sequencing STRs increases discrimination power, enhances sensitivity of detection, reduces noise due to instrumentation, and improves mixture interpretation [4,8-23]. Third, since detection of STRs is based on sequence and not fluorescence, amplicons can be designed that are shorter in length and of similar lengths among loci, where possible, which can improve amplification efficiency and analysis of degraded samples. Lastly, MPS offers a single format approach that can be applied to analysis of a wide variety of genetic markers of forensic interest (e.g., STRs, mitochondrial DNA, single nucleotide polymorphisms, insertion/deletions). These features make MPS a desirable technology for casework [14,15,24,25-48]. The developmental validation of the ForenSeq MainstAY library preparation kit with the MiSeq FGx Sequencing System and ForenSeq Universal Software is reported here to assist with validation of this MPS system for casework [49]. The results show that the system is sensitive, accurate and precise, specific, and performs well with mixtures and mock case-type samples.
Collapse
Affiliation(s)
| | - Richelle Barta
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Keenan Fleming
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | | | - Shan-Fu Wu
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - June Snedecor
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Cydne L Holt
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Bobby LaRue
- Verogen, Inc., 11111 Flintkote Ave., San Diego, CA 92121, USA
| | - Bruce Budowle
- University of Helsinki, Department of Forensic Medicine, Haartmaninkatu 8, P.O. Box 63, Helsinki 00014, Finland
| |
Collapse
|
11
|
Davenport L, Devesse L, Syndercombe Court D, Ballard D. Forensic identity SNPs: Characterisation of flanking region variation using massively parallel sequencing. Forensic Sci Int Genet 2023; 64:102847. [PMID: 36863275 DOI: 10.1016/j.fsigen.2023.102847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 02/03/2023] [Accepted: 02/09/2023] [Indexed: 02/16/2023]
Abstract
Single nucleotide polymorphisms (SNPs) can be analysed for identity or kinship applications in forensic genetics to either provide an adjunct to traditional STR typing or as a stand-alone approach. The advent of massively parallel sequencing technology (MPS) has provided a useful opportunity to more easily deploy SNP typing in a forensic context, given the ability to simultaneously amplify a large number of markers. Furthermore, MPS also provides valuable sequence data for the targeted regions, which enables the detection of any additional variation seen in the flanking regions of amplicons. In this study we genotyped 977 samples across five UK-relevant population groups (White British, East Asian, South Asian, North-East African and West African) for 94 identity-informative SNP markers using the ForenSeq DNA Signature Prep Kit. Examination of flanking region variation allowed for the identification of 158 additional alleles across all populations studied. Here we present allele frequencies for all 94 identity-informative SNPs, both including and excluding the flanking region sequence of these markers. We also present information on the configuration of these SNPs in the ForenSeq DNA Signature Prep Kit, including performance metrics for the markers and investigation of bioinformatic and chemistry-based discordances. Overall, the inclusion of flanking region variation in the analysing workflow for these markers reduced the average combined match probability 2175 times across all populations, with a maximum reduction of 675,000-fold in the West African population. The gain due to flanking region-based discrimination increased the heterozygosity of some loci above that of some of the least useful forensic STR loci; thus demonstrating the benefit of enhanced analysis of currently targeted SNP markers for forensic applications.
Collapse
Affiliation(s)
- Lucinda Davenport
- King's Forensics, Department of Analytical, Environmental and Forensic Sciences, Faculty of Life Sciences and Medicine, King's College London, London, United Kingdom.
| | - Laurence Devesse
- King's Forensics, Department of Analytical, Environmental and Forensic Sciences, Faculty of Life Sciences and Medicine, King's College London, London, United Kingdom
| | - Denise Syndercombe Court
- King's Forensics, Department of Analytical, Environmental and Forensic Sciences, Faculty of Life Sciences and Medicine, King's College London, London, United Kingdom
| | - David Ballard
- King's Forensics, Department of Analytical, Environmental and Forensic Sciences, Faculty of Life Sciences and Medicine, King's College London, London, United Kingdom
| |
Collapse
|
12
|
Ohuchi T, Guan X, Hirai E, Hashiyada M, Manabe S, Akane A, Adachi N, Tamaki K, Funayama M. Allele frequencies of 31 autosomal short tandem repeat (auSTR) loci obtained using the Precision ID GlobalFiler™ NGS STR Panel v2 in 322 individuals from the Japanese population. Leg Med (Tokyo) 2022; 59:102151. [PMID: 36191412 DOI: 10.1016/j.legalmed.2022.102151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 08/18/2022] [Accepted: 09/22/2022] [Indexed: 10/31/2022]
Abstract
In human identification methods that target short tandem repeats (STRs), massively parallel sequencing (MPS) technology has made it possible to genotype at the level of the specific sequence itself. This allows for the detection of repeat unit variants and single nucleotide polymorphisms (SNPs) adjacent to the STRs. Using the GlobalFiler™ NGS STR Panel v2, Ion S5, and Converge software, this study constructed a Japanese database of 31 autosomal STRs (auSTRs) and two sex markers from 322 individuals. After excluding some sequence errors and stutters, a total of 31 novel alleles were identified. Additionally, using the allele frequencies of 31 auSTR loci, the match probabilities for the length-based and sequence-based data were calculated to be 1.433 × 10-34 and 9.163 × 10-38, respectively. These values are at least nine orders of magnitude higher than that obtained from 21 auSTR loci in the Japanese population using the conventional capillary electrophoresis method. The database generated in this study is expected to be implemented in forensic practice and used to solve difficult casework.
Collapse
Affiliation(s)
- Tsukasa Ohuchi
- Department of Forensic Medicine, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8575, Japan.
| | - Xueting Guan
- Department of Forensic Medicine, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8575, Japan
| | - Eriko Hirai
- Department of Forensic Medicine, Kyoto University Graduate School of Medicine, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
| | - Masaki Hashiyada
- Department of Legal Medicine, Kansai Medical University, 2-5-1 Shin-machi, Hirakata, Osaka 573-1010, Japan
| | - Sho Manabe
- Department of Legal Medicine, Kansai Medical University, 2-5-1 Shin-machi, Hirakata, Osaka 573-1010, Japan
| | - Atsushi Akane
- Department of Legal Medicine, Kansai Medical University, 2-5-1 Shin-machi, Hirakata, Osaka 573-1010, Japan
| | - Noboru Adachi
- Department of Legal Medicine, University of Yamanashi, 1110 Shimokato, Chuo, Yamanashi 409-3898, Japan
| | - Keiji Tamaki
- Department of Forensic Medicine, Kyoto University Graduate School of Medicine, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
| | - Masato Funayama
- Department of Forensic Medicine, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8575, Japan
| |
Collapse
|
13
|
Aguilar-Velázquez JA, Duran-Salazar MÁ, Córdoba-Mercado MF, Coronado-Avila CE, Salas-Salas O, Martinez-Cortés G, Casals F, Calafell F, Ramos-González B, Rangel-Villalobos H. Characterization of 58 STRs and 94 SNPs with the ForenSeq™ DNA signature prep kit in Mexican-Mestizos from the Monterrey city (Northeast, Mexico). Mol Biol Rep 2022; 49:7601-7609. [PMID: 35657453 DOI: 10.1007/s11033-022-07575-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 05/06/2022] [Indexed: 10/18/2022]
Abstract
BACKGROUND STR allele frequency databases from populations are necessary to take full advantage of the increased power of discrimination offered by massively parallel sequencing (MPS) platforms. MATERIAL AND METHODS For this reason, we sequenced 58 STRs (aSTRs, X-STRs, and Y-STRs) and 94 identity informative SNPs (iiSNPs) on 105 Mestizo (admixed) individuals from Monterrey City (Northeast, Mexico), with the Primer Set-A of the ForenSeq™ DNA Signature Prep Kit. RESULTS Most of the STR markers were in Hardy Weinberg equilibrium, with a few exceptions. We found 346 different length-based alleles for these 58 STRs; nevertheless, they became 528 alleles when the sequence was assessed. The combined power of discrimination from autosomal STRs (aSTRs) was -virtually- 100% in both length and sequence-based alleles, while the power of exclusion was 99.9999999976065 and 99.9999999999494%, respectively. Haplotypes based on X-STRs and Y-STRs showed 100% of discriminatory capacity. CONCLUSIONS These results provide -for the first time- forensic genomic population data from Mexico necessary for interpretation in kinship and criminal analyses.
Collapse
Affiliation(s)
- José Alonso Aguilar-Velázquez
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad #1115, CP 47810, Ocotlán, Jalisco, Mexico.,Instituto de Investigación en Ciencias Biomédicas, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara, Guadalajara, Jalisco, Mexico
| | - Miguel Ángel Duran-Salazar
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad #1115, CP 47810, Ocotlán, Jalisco, Mexico.,Programa de Doctorado en Biociencias, Centro Universitario de los Altos, Universidad de uadalajara, Tepatitlán de Morelos, Jalisco, Mexico
| | - Miranda Fabiola Córdoba-Mercado
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey, Nuevo León, Mexico
| | - Carolina Elena Coronado-Avila
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey, Nuevo León, Mexico
| | - Orlando Salas-Salas
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey, Nuevo León, Mexico
| | - Gabriela Martinez-Cortés
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad #1115, CP 47810, Ocotlán, Jalisco, Mexico
| | - Ferrán Casals
- Servei de Genómica, Universitat Pampeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Spain.,Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
| | - Francesc Calafell
- Institut de Biologia Evolutiva, Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Spain
| | - Benito Ramos-González
- Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey, Nuevo León, Mexico.
| | - Héctor Rangel-Villalobos
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad #1115, CP 47810, Ocotlán, Jalisco, Mexico.
| |
Collapse
|
14
|
Lan Q, Zhao C, Chen C, Xu H, Fang Y, Yao H, Zhu B. Forensic Feature Exploration and Comprehensive Genetic Insights Into Yugu Ethnic Minority and Northern Han Population via a Novel NGS-Based Marker Set. Front Genet 2022; 13:816737. [PMID: 35601485 PMCID: PMC9121381 DOI: 10.3389/fgene.2022.816737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 03/08/2022] [Indexed: 12/02/2022] Open
Abstract
The MPS technology has expanded the potential applications of DNA markers and increased the discrimination power of the targeted loci by taking variations in their flanking regions into consideration. Here, a collection of nuclear and extranuclear DNA markers (totally six kinds of nuclear genetic markers and mtDNA hypervariable region variations) were comprehensively and systematically assessed for polymorphism detections, further employed to dissect the population backgrounds in the Yugu ethnic group from Gansu province (Yugu) and Han population from the Inner Mongolia Autonomous Region (NMH) of China. The elevated efficiencies of the marker set in separating full sibling and challenging half sibling determination cases in parentage tests (iiSNPs), as well as predicting ancestry origins of unknown individuals from at least four continental populations (aiSNPs) and providing informative characteristic-related clues for Chinese populations (piSNPs) are highlighted in the present study. To sum up, different sets of DNA markers revealed sufficient effciencies to serve as promising tools in forensic applications. Genetic insights from the perspectives of autosomal DNA, Y chromosomal DNA, and mtDNA variations yielded that the Yugu ethnic group was genetically close related to the Han populations of the northern region. But we admit that more reference populations (like Mongolian, Tibetan, Hui, and Tu) should be incorporated to gain a refined genetic background landscape of the Yugu group in future studies.
Collapse
Affiliation(s)
- Qiong Lan
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Congying Zhao
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Chong Chen
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- Clinical Research Center of Shaanxi Province for Dental and Maxillofacial Diseases, College of Stomatology, Xi’an Jiaotong University, Xi’an, China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Yating Fang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Hongbing Yao
- Belt and Road Research Center for Forensic Molecular Anthropology Gansu University of Political Science and Law, Lanzhou, China
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
- Clinical Research Center of Shaanxi Province for Dental and Maxillofacial Diseases, College of Stomatology, Xi’an Jiaotong University, Xi’an, China
- *Correspondence: Bofeng Zhu,
| |
Collapse
|
15
|
Laurent FX, Fischer A, Oldt RF, Kanthaswamy S, Buckleton JS, Hitchin S. Streamlining the decision-making process for international DNA kinship matching using Worldwide allele frequencies and tailored cutoff log 10LR thresholds. Forensic Sci Int Genet 2021; 57:102634. [PMID: 34871915 DOI: 10.1016/j.fsigen.2021.102634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 10/13/2021] [Accepted: 11/15/2021] [Indexed: 11/30/2022]
Abstract
The identification of human remains belonging to missing persons is one of the main challenges for forensic genetics. Although other means of identification can be applied to missing person investigations, DNA is often extremely valuable to further support or refute potential associations. When reference DNA samples cannot be collected from personal items belonging to a missing person, a direct DNA identification cannot be carried out. However, identifications can be made indirectly using DNA from the missing person's relatives. The ranking of likelihood ratio (LR) values, which measure the fit of a missing person for any given pedigree, is often the first step in selecting candidates in a DNA database. Although implementing DNA kinship matching in a national environment is feasible, many challenges need to be resolved before applying this method to an international configuration. In this study, we present an innovative and intuitive method to perform international DNA kinship matching and facilitate the comparison of DNA profiles when the ancestry is unknown or unsure and/or when different marker sets are used. This straightforward method, which is based on calculations performed with the DNA matching software BONAPARTE, Worldwide allele frequencies and tailored cutoff log10LR thresholds, allows for the classification of potential candidates according to the strength of the DNA evidence and the predicted proportion of adventitious matches. This is a powerful method for streamlining the decision-making process in missing person investigations and DVI processes, especially when there are low numbers of overlapping typed STRs. Intuitive interpretation tables and a decision tree will help strengthen international data comparison for the identification of reported missing individuals discovered outside their national borders.
Collapse
Affiliation(s)
- François-Xavier Laurent
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| | - Andrea Fischer
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France; Landeskriminalamt Baden-Württemberg, Taubenheimstr. 85, 70372 Stuttgart, Germany
| | - Robert F Oldt
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - Sree Kanthaswamy
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - John S Buckleton
- University of Auckland, Department of Statistics, Private Bag, 92019 Auckland, New Zealand
| | - Susan Hitchin
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| |
Collapse
|
16
|
Development and validation of a novel 133-plex forensic STR panel (52 STRs and 81 Y-STRs) using single-end 400 bp massive parallel sequencing. Int J Legal Med 2021; 136:447-464. [PMID: 34741666 DOI: 10.1007/s00414-021-02738-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 10/25/2021] [Indexed: 12/15/2022]
Abstract
Short tandem repeats (STRs) are the preferred genetic markers in forensic DNA analysis, routinely measured by capillary electrophoresis (CE) method based on the fragment length features. While, the massive parallel sequencing (MPS) technology could simultaneously target a large number of intriguing forensic STRs, bypassing the intrinsic limitations of amplicon size separation and accessible fluorophores in CE, which is efficient and promising for enabling the identification of forensic biological evidence. Here, we developed a novel MPS-based Forensic Analysis System Multiplecues SetB Kit of 133-plex forensic STR markers (52 STRs and 81 Y-STRs) and one Y-InDel (M175) based on multiplex PCR and single-end 400 bp sequencing strategy. This panel was subjected to developmental validation studies according to the SWGDAM Validation Guidelines. Approximately 2185 MPS-based reactions using 6 human DNA standards and 8 male donors were conducted for substrate studies (filter paper, gauze, cotton swab, four different types of FTA cards, peripheral venous blood, saliva, and exfoliated cells), sensitivity studies (from 2 ng down to 0.0625 ng), mixture studies (two-person DNA mixtures), PCR inhibitor studies (seven commonly encountered PCR inhibitors), species specificity studies (11 non-human species), and repeatability studies. Results of concordance studies (413 Han males and 6 human DNA standards) generated by STRait Razor and in-house Python scripts indicated 99.98% concordance rate in STR calling relative to CE for STRs between 41,900 genotypes at 100 STR markers. Moreover, the limitations of present studies, the nomenclature rules and forensic MPS applications were also described. In conclusion, the validation studies based on ~ 2200 MPS-based and ~ 2500 CE-based DNA profiles demonstrated that the novel MPS-based panel meets forensic DNA quality assurance guidelines with robust, reliable, and reproducible performance on samples of various quantities and qualities, and the STR nomenclature rules should be further regulated to integrate the inconformity between MPS-based and CE-based methods.
Collapse
|
17
|
Chen C, Jin X, Zhang X, Zhang W, Guo Y, Tao R, Chen A, Xu Q, Li M, Yang Y, Zhu B. Comprehensive Insights Into Forensic Features and Genetic Background of Chinese Northwest Hui Group Using Six Distinct Categories of 231 Molecular Markers. Front Genet 2021; 12:705753. [PMID: 34721519 PMCID: PMC8555763 DOI: 10.3389/fgene.2021.705753] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 09/07/2021] [Indexed: 11/13/2022] Open
Abstract
The Hui minority is predominantly composed of Chinese-speaking Islamic adherents distributed throughout China, of which the individuals are mainly concentrated in Northwest China. In the present study, we employed the length and sequence polymorphisms-based typing system of 231 molecular markers, i.e., amelogenin, 22 phenotypic-informative single nucleotide polymorphisms (PISNPs), 94 identity-informative single nucleotide polymorphisms (IISNPs), 24 Y-chromosomal short tandem repeats (Y-STRs), 56 ancestry-informative single nucleotide polymorphisms (AISNPs), 7 X-chromosomal short tandem repeats (X-STRs), and 27 autosomal short tandem repeats (A-STRs), into 90 unrelated male individuals from the Chinese Northwest Hui group to comprehensively explore its forensic characteristics and genetic background. Total of 451 length-based and 652 sequence-based distinct alleles were identified from 58 short tandem repeats (STRs) in 90 unrelated Northwest Hui individuals, denoting that the sequence-based genetic markers could pronouncedly provide more genetic information than length-based markers. The forensic characteristics and efficiencies of STRs and IISNPs were estimated, both of which externalized high polymorphisms in the Northwest Hui group and could be further utilized in forensic investigations. No significant departure from the Hardy-Weinberg equilibrium (HWE) expectation was observed after the Bonferroni correction. Additionally, four group sets of reference population data were exploited to dissect the genetic background of the Northwest Hui group separately from different perspectives, which contained 26 populations for 93 IISNPs, 58 populations for 17 Y-STRs, 26 populations for 55 AISNPs (raw data), and 109 populations for 55 AISNPs (allele frequencies). As a result, the analyses based on the Y-STRs indicated that the Northwest Hui group primarily exhibited intimate genetic relationships with reference Hui groups from Chinese different regions except for the Sichuan Hui group and secondarily displayed close genetic relationships with populations from Central and West Asia, as well as several Chinese groups. However, the AISNP analyses demonstrated that the Northwest Hui group shared more intimate relationships with current East Asian populations apart from reference Hui group, harboring the large proportion of ancestral component contributed by East Asia.
Collapse
Affiliation(s)
- Chong Chen
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China.,Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Xiaoye Jin
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China
| | - Xingru Zhang
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China.,Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Wenqing Zhang
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China
| | - Yuxin Guo
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China
| | - Ruiyang Tao
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, Shanghai, China
| | - Anqi Chen
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, Shanghai, China.,Department of Forensic Medicine, Shanghai Medical College of Fudan University, Shanghai, China
| | - Qiannan Xu
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, Shanghai, China.,Institute of Forensic Medicine, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
| | - Min Li
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, Shanghai, China.,Institute of Forensic Medicine, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
| | - Yue Yang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, Shanghai, China.,School of Basic Medicine, Inner Mongolia Medical University, Hohhot, China
| | - Bofeng Zhu
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China.,Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China.,Department of Forensic Genetics, Multi-Omics Innovative Research Center of Forensic Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| |
Collapse
|
18
|
Pham NS, Tran HL, Nguyen THT, Nguyen VH, Hoang H, Tung QN, Phi QT. The First Autosomal STR Population Data of Kinh Ethinic Group in Vietnam by Using Massively Parallel Sequencing. RUSS J GENET+ 2021. [DOI: 10.1134/s102279542108010x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
19
|
Li H, Zhang C, Song G, Ma K, Cao Y, Zhao X, Yang Q, Xie J. Concordance and characterization of massively parallel sequencing at 58 STRs in a Tibetan population. Mol Genet Genomic Med 2021; 9:e1626. [PMID: 33630413 PMCID: PMC8123751 DOI: 10.1002/mgg3.1626] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 01/17/2021] [Accepted: 02/10/2021] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Massively parallel sequencing (MPS) is a promising supplementary method for forensic casework in short tandem repeats (STRs) genotyping, owing to several advantageous features in comparison to traditional capillary electrophoresis (CE). However, the application of MPS in casework requires accessible datasets from the worldwide population to enrich the allele frequencies of sequence-based STR genotypes. METHODS In this study, we report the characterization of sequence-based allele frequencies of 58 STRs from a Tibetan population comprising 120 unrelated individuals using the ForenSeq™ DNA Signature Prep Kit. A concordance study evaluating MPS and CE allele data was performed to ensure that MPS is compatible with current CE-based forensic databases. The diversity of observed alleles, allele frequencies, and forensic parameters per locus by length (LB), sequence without flanking region (RSB), and sequence with flanking region (FSB) were analyzed and compared. RESULTS The concordance study demonstrated a concordance rate exceeding 99%. The combined random match probability (RMP) for the 26 A-STRs was 2.04 × 10-29 , 1.93 × 10-31 , and 9.56 × 10-33 for LB, RSB, and FSB, respectively. Similar trends were observed in other forensic parameters resulting from the increase in the number of unique alleles available. A total of 111 and 113 unique haplotypes in the Y-STR loci were observed when using length-based and sequence-based alleles, respectively. In addition, we identified 35 novel alleles at 25 loci and 25 polymorphisms in the flanking regions at 17 STRs. CONCLUSIONS Our data suggest that MPS- and CE-derived alleles are compatible. MPS-based analysis of the STR data substantially increased the allele diversity and improved the forensic parameters, which clearly demonstrated the advantages of MPS in comparison to CE. With more pooled data and larger-scale validation, MPS could play a valuable role in forensic genetics and might be an additional tool for routine casework.
Collapse
Affiliation(s)
- Hui Li
- Department of Forensic MedicineSchool of Basic Medical SciencesFudan UniversityShanghaiChina
- Shanghai Key Laboratory of Crime Scene EvidenceShanghai Research Institute of Criminal Science and TechnologyShanghai Municipal Public Security BureauShanghaiChina
| | - Cheng Zhang
- Center for Material EvidenceTibet Autonomous Region Public SecurityLhasa, TibetChina
| | - Guoqing Song
- Laboratory of Forensic Evidence and Science TechnologyXianyang Municipal Public Security BureauXianyangShaanxiChina
| | - Ke Ma
- Shanghai Key Laboratory of Crime Scene EvidenceShanghai Research Institute of Criminal Science and TechnologyShanghai Municipal Public Security BureauShanghaiChina
| | - Yu Cao
- Shanghai Key Laboratory of Crime Scene EvidenceShanghai Research Institute of Criminal Science and TechnologyShanghai Municipal Public Security BureauShanghaiChina
| | - Xueying Zhao
- National Demonstration Center for Experimental Biology EducationSchool of Life SciencesFudan UniversityShanghaiChina
| | - Qinrui Yang
- Department of Forensic MedicineSchool of Basic Medical SciencesFudan UniversityShanghaiChina
| | - Jianhui Xie
- Department of Forensic MedicineSchool of Basic Medical SciencesFudan UniversityShanghaiChina
| |
Collapse
|
20
|
Autosomal STR and SNP characterization of populations from the Northeastern Peruvian Andes with the ForenSeq™ DNA Signature Prep Kit. Forensic Sci Int Genet 2021; 52:102487. [PMID: 33640735 DOI: 10.1016/j.fsigen.2021.102487] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 02/16/2021] [Accepted: 02/20/2021] [Indexed: 11/24/2022]
Abstract
Autosomal DNA data from Peru for human identity testing purposes are scarce in the scientific literature, which hinders obtaining an appropriate portrait of the genetic variation of the resident populations. In this study we genetically characterize five populations from the Northeastern Peruvian Andes (Chachapoyas, Awajún, Wampís, Huancas and Cajamarca). Autosomal short tandem repeat (aSTR) and identity informative single nucleotide polymorphism (iiSNP) data from a total of 233 unrelated individuals are provided, and forensic genetic parameters are calculated for each population and for the combined set Northeastern Peruvian Andes. After correction for multiple testing in the whole dataset of the Northeastern Peruvian Andes, the only departure from Hardy-Weinberg equilibrium was observed in locus rs2111980. Twenty one out of 27 aSTR loci exhibited an increased number of alleles due to sequence variation in the repeat motif and flanking regions. For iiSNPs 33% of the loci displayed flanking region variation. The combined random match probability (RMP), assuming independence of all loci (aSTRs and iiSNPs), in the Chachapoyas, the population with the largest samples size (N = 172), was 8.14 × 10-62 for length-based data while for sequence-based was 4.15 × 10-67. In the merged dataset (Northeastern Peruvian Andes; N = 233), the combined RMP when including all markers were 2.96 × 10-61 (length-based) and 3.21 × 10-66 (sequence-based). These new data help to fill up some of the gaps in the genetic canvas of South America and provide essential length- and sequence-based background information for other forensic genetic studies in Peru.
Collapse
|
21
|
Lee JH, Kim J, Kim H, Kim HS, Kim E. Massively parallel sequencing of 25 short tandem repeat loci including the SE33 marker in Koreans. Genes Genomics 2021; 43:133-140. [PMID: 33481226 DOI: 10.1007/s13258-020-01033-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 12/19/2020] [Indexed: 11/30/2022]
Abstract
BACKGROUND Massively parallel sequencing (MPS) technology has recently been introduced in research, clinical diagnostics, and forensics. MPS enables determination of the genotypes of multiple short tandem repeat (STR) markers and to determine nucleotide sequence variations, additionally. OBJECTIVE To improve STR analysis and a paternity index, a new, smaller-sized STR panel was designed that includes the SE33 locus. METHODS This study performed MPS using an STR panel including the SE33 marker in 101 Koreans. The concordance study was conducted by comparing the data obtained from the MPS assay with the results of a capillary electrophoresis (CE)-based method. RESULTS In this study, an in-house MPS panel is designed that incorporates the 20 Combined DNA Index System (CODIS) loci and the Penta D, Penta E, and SE33 markers for enhanced discriminatory ability. The data obtained via MPS analysis were compared with CE data to confirm concordance. Fifty previously unreported alleles were detected through the MPS analysis. Three new SNP variations in the flanking region were also identified. Statistical analysis demonstrated that the SE33 marker was most effectively determined the match probability (PM) and typical paternity index (TPI). In the sensitivity study, concentrations as low as 80 pg could be used to obtain full and concordant profiles. CONCLUSIONS We designed a new, smaller-sized STR panel that includes the SE33 locus to improve STR analysis and the paternity index. Various new alleles were identified in SE33, indicating a high degree of polymorphism. The panel is expected to provide valid data for discrimination of unidentified bodies.
Collapse
Affiliation(s)
- Ja Hyun Lee
- Forensic DNA Division, National Forensic Service, Wonju, 26460, Korea
| | - Jeongyong Kim
- Forensic DNA Division, National Forensic Service, Wonju, 26460, Korea
| | - Hyojeong Kim
- Forensic DNA Division, National Forensic Service, Wonju, 26460, Korea
| | - Hyo Sook Kim
- Forensic DNA Division, National Forensic Service, Wonju, 26460, Korea
| | - Eungsoo Kim
- DNA Analysis Division, National Forensic Service Seoul Institute, Seoul, 08036, Korea.
| |
Collapse
|
22
|
Borsuk LA, Steffen CR, Kiesler KM, Vallone PM, Gettings KB. Sequence-based U.S. population data for 7 X-STR loci. FORENSIC SCIENCE INTERNATIONAL: REPORTS 2020. [DOI: 10.1016/j.fsir.2020.100160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
23
|
Devesse L, Davenport L, Borsuk L, Gettings K, Mason-Buck G, Vallone PM, Syndercombe Court D, Ballard D. Classification of STR allelic variation using massively parallel sequencing and assessment of flanking region power. Forensic Sci Int Genet 2020; 48:102356. [PMID: 32712568 DOI: 10.1016/j.fsigen.2020.102356] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 06/24/2020] [Accepted: 07/09/2020] [Indexed: 12/14/2022]
Abstract
The application of massively parallel sequencing (MPS) to forensic genetics has led to improvements in multiple aspects of DNA analysis, however, additional complexities are concurrently associated with these advances. In relation to short tandem repeat (STR) typing, the move to sequence rather than length-based methodologies has highlighted the extent to which previous allelic variation was masked - both within and outside of the repeat regions (the flanking regions). In order to fully implement MPS for autosomal STR analysis, sequence-based allelic frequencies must be available, and concordance with previous typing techniques needs to be assessed. In this work, a series of samples (n = 1007) from five different population groups were genotyped using the MiSeq FGx™ Forensic Genomics System. Results were compared to those obtained using capillary electrophoresis (CE), and sequence variation has been characterised both within and outside STR repeat regions, with allelic frequencies provided for all variants observed within this database. Analysing and characterising flanking region sequence is currently less straightforward than studying repeat region variation alone, and the added value of doing so remains largely unexplored - this paper provides data to show that the gain in polymorphism achieved when analysing flanking regions is less than might be expected. In the White British population for example, including the sequence variation within repeat regions of 26 autosomal STRs made the average combined random match probability (RMP) over 700 times lower than with length-based alleles alone. Including the sequence variation within the flanking regions only resulted in a combined RMP that was a further 4 times lower.
Collapse
Affiliation(s)
- Laurence Devesse
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street London, UK.
| | - Lucinda Davenport
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street London, UK
| | - Lisa Borsuk
- U.S. National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Drive, Gaithersburg, MD 20899, USA
| | - Katherine Gettings
- U.S. National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Drive, Gaithersburg, MD 20899, USA
| | - Gabriella Mason-Buck
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street London, UK
| | - Peter M Vallone
- U.S. National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Drive, Gaithersburg, MD 20899, USA
| | - Denise Syndercombe Court
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street London, UK
| | - David Ballard
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street London, UK
| |
Collapse
|