1
|
Antão-Sousa S, Gusmão L, Modesti NM, Feliziani S, Faustino M, Marcucci V, Sarapura C, Ribeiro J, Carvalho E, Pereira V, Tomas C, de Pancorbo MM, Baeta M, Alghafri R, Almheiri R, Builes JJ, Gouveia N, Burgos G, Pontes MDL, Ibarra A, da Silva CV, Parveen R, Benitez M, Amorim A, Pinto N. Microsatellites' mutation modeling through the analysis of the Y-chromosomal transmission: Results of a GHEP-ISFG collaborative study. Forensic Sci Int Genet 2024; 69:102999. [PMID: 38181588 DOI: 10.1016/j.fsigen.2023.102999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 10/25/2023] [Accepted: 12/10/2023] [Indexed: 01/07/2024]
Abstract
The Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) organized a collaborative study on mutations of Y-chromosomal short tandem repeats (Y-STRs). New data from 2225 father-son duos and data from 44 previously published reports, corresponding to 25,729 duos, were collected and analyzed. Marker-specific mutation rates were estimated for 33 Y-STRs. Although highly dependent on the analyzed marker, mutations compatible with the gain or loss of a single repeat were 23.2 times more likely than those involving a greater number of repeats. Longer alleles (relatively to the modal one) showed to be nearly twice more mutable than the shorter ones. Within the subset of longer alleles, the loss of repeats showed to be nearly twice more likely than the gain. Conversely, shorter alleles showed a symmetrical trend, with repeat gains being twofold more frequent than reductions. A positive correlation between the paternal age and the mutation rate was observed, strengthening previous findings. The results of a machine learning approach, via logistic regression analyses, allowed the establishment of algebraic formulas for estimating the probability of mutation depending on paternal age and allele length for DYS389I, DYS393 and DYS627. Algebraic formulas could also be established considering only the allele length as predictor for DYS19, DYS389I, DYS389II-I, DYS390, DYS391, DYS393, DYS437, DYS439, DYS449, DYS456, DYS458, DYS460, DYS481, DYS518, DYS533, DYS576, DYS626 and DYS627 loci. For the remaining Y-STRs, a lack of statistical significance was observed, probably as a consequence of the small effective size of the subsets available, a common difficulty in the modeling of rare events as is the case of mutations. The amount of data used in the different analyses varied widely, depending on how the data were reported in the publications analyzed. This shows a regrettable waste of produced data, due to inadequate communication of the results, supporting an urgent need of publication guidelines for mutation studies.
Collapse
Affiliation(s)
- Sofia Antão-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal; DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Nidia M Modesti
- Centro de Genética Forense, Poder Judicial de Córdoba, Argentina
| | - Sofía Feliziani
- Centro de Genética Forense, Poder Judicial de Córdoba, Argentina
| | - Marisa Faustino
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal
| | - Valeria Marcucci
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Claudia Sarapura
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Julyana Ribeiro
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Elizeu Carvalho
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Vania Pereira
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Carmen Tomas
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Marian M de Pancorbo
- BIOMICs Research Group, Lascaray Research Center, Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Vitoria-Gasteiz, Spain
| | - Miriam Baeta
- BIOMICs Research Group, Lascaray Research Center, Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Vitoria-Gasteiz, Spain
| | - Rashed Alghafri
- International Center for Forensic Sciences, Dubai Police G.H.Q., Dubai, United Arab Emirates
| | - Reem Almheiri
- International Center for Forensic Sciences, Dubai Police G.H.Q., Dubai, United Arab Emirates
| | - Juan José Builes
- GENES SAS Laboratory, Medellín, Colombia; Institute of Biology, University of Antioquia, Medellín, Colombia
| | - Nair Gouveia
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Centro, Portugal
| | - German Burgos
- One Health Global Research Group, Facultad de Medicina, Universidad de Las Américas (UDLA), Quito, Ecuador; Grupo de Medicina Xenómica, Universidad de Santiago de Compostela, Santiago de Compostela, Spain
| | - Maria de Lurdes Pontes
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Norte, Portugal
| | - Adriana Ibarra
- Laboratorio IDENTIGEN, Universidad de Antioquia, Colombia
| | - Claudia Vieira da Silva
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Sul, Portugal
| | - Rukhsana Parveen
- Forensic Services Laboratory, Centre for Applied Molecular Biology, University of the Punjab, Lahore, Pakistan
| | - Marc Benitez
- Policia de la Generalitat de Catalunya - Mossos d'Esquadra. Unitat Central del Laboratori Biològic, Barcelona, Spain
| | - António Amorim
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal
| | - Nadia Pinto
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Centre of Mathematics of the University of Porto, Porto, Portugal.
| |
Collapse
|
2
|
Mitchell MR, Chaseling J, Jones L, White T, Bernie A, Haupt LM, Griffiths LR, Wright KM. Improving the strategy to identify historical military remains: a literature review and Y-STR meta-analysis. Forensic Sci Res 2024; 9:owad050. [PMID: 38562552 PMCID: PMC10982847 DOI: 10.1093/fsr/owad050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 09/26/2023] [Indexed: 04/04/2024] Open
Abstract
The identification of historical military remains by Unrecovered War Casualties-Army (UWC-A) currently relies on Y-chromosome Short Tandem Repeat (Y-STR) testing when maternal relatives are not available, or when a mitochondrial DNA match does not provide sufficient certainty of identification. However, common Y-STR profiles (using Yfiler™) between sets of remains or families often prevent identification. To resolve these cases, an investigation of additional Y-DNA markers is needed for their potential inclusion into the DNA identification strategy. The number of genetic transmissions between missing soldiers and their living relatives needs to be considered to avoid false exclusions between paternal relatives. Analysis of 236 World War I/II (WWI/II) era pairs of relatives identified up to seven genetic transmissions between WWII soldiers and their living relatives, and nine for WWI. Previous Y-STR meta-analyses were published approximately 10 years ago when rapidly mutating markers were relatively new. This paper reports a contemporary literature review and meta-analysis of 35 studies (which includes 23 studies not previously used in meta-analysis) and 23 commonly used Y-STR's mutation rates to inform the inclusion of additional loci to UWC-A's DNA identification strategy. Meta-analysis found mutation data for a given Y-STR locus could be pooled between studies and that the mutation rates were significantly different between some loci (at P < 0.05). Based on this meta-analysis, we have identified two additional markers from PowerPlex® Y23 for potential inclusion in UWC-A's identification strategy. Further avenues for potential experimental exploration are discussed. Key points From 236 UWC-A pairs of relatives, we observed up to nine genetic transmissions between WWI soldiers and their living relatives, and seven for WWII.MedCalc® software for meta-analysis utilizing the Freeman-Tukey transformation was run, which analysed 35 published studies and 23 commonly used loci. Previous Y-STR mutation rate meta-analyses are now 10 years old; this paper includes 23 studies that were not included in previous meta-analyses.Through meta-analysis, we identify two markers from PowerPlex® Y23 for potential inclusion in UWC-A's historical remains identification strategy (alongside Yfiler™). We discuss potential next steps for experimental exploration of additional Y-DNA markers.
Collapse
Affiliation(s)
- Melinda R Mitchell
- Queensland University of Technology (QUT), Genomics Research Centre, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Janet Chaseling
- Queensland University of Technology (QUT), Genomics Research Centre, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Lee Jones
- Queensland University of Technology (QUT), Research Methods Group, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Toni White
- Queensland University of Technology (QUT), Defence Innovation Hub, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Andrew Bernie
- Unrecovered War Casualties-Army, Australian Defence Force, Russell Offices, Russell, Australian Capital Territory, Australia
| | - Larisa M Haupt
- Queensland University of Technology (QUT), Genomics Research Centre, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Lyn R Griffiths
- Queensland University of Technology (QUT), Genomics Research Centre, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
| | - Kirsty M Wright
- Queensland University of Technology (QUT), Genomics Research Centre, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Kelvin Grove, Queensland, Australia
- Unrecovered War Casualties-Army, Australian Defence Force, Russell Offices, Russell, Australian Capital Territory, Australia
- Royal Australian Air Force (RAAF), No 2 Expeditionary Health Squadron, RAAF Base Williamtown, Williamtown, New South Wales, Australia
| |
Collapse
|
3
|
Antão-Sousa S, Pinto N, Rende P, Amorim A, Gusmão L. The sequence of the repetitive motif influences the frequency of multistep mutations in Short Tandem Repeats. Sci Rep 2023; 13:10251. [PMID: 37355683 PMCID: PMC10290632 DOI: 10.1038/s41598-023-32137-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 03/23/2023] [Indexed: 06/26/2023] Open
Abstract
Microsatellites, or Short Tandem Repeats (STRs), are subject to frequent length mutations that involve the loss or gain of an integer number of repeats. This work aimed to investigate the correlation between STRs' specific repetitive motif composition and mutational dynamics, specifically the occurrence of single- or multistep mutations. Allelic transmission data, comprising 323,818 allele transfers and 1,297 mutations, were gathered for 35 Y-chromosomal STRs with simple structure. Six structure groups were established: ATT, CTT, TCTA/GATA, GAAA/CTTT, CTTTT, and AGAGAT, according to the repetitive motif present in the DNA leading strand of the markers. Results show that the occurrence of multistep mutations varies significantly among groups of markers defined by the repetitive motif. The group of markers with the highest frequency of multistep mutations was the one with repetitive motif CTTTT (25% of the detected mutations) and the lowest frequency corresponding to the group with repetitive motifs TCTA/GATA (0.93%). Statistically significant differences (α = 0.05) were found between groups with repetitive motifs with different lengths, as is the case of TCTA/GATA and ATT (p = 0.0168), CTT (p < 0.0001) and CTTTT (p < 0.0001), as well as between GAAA/CTTT and CTTTT (p = 0.0102). The same occurred between the two tetrameric groups GAAA/CTTT and TCTA/GATA (p < 0.0001) - the first showing 5.7 times more multistep mutations than the second. When considering the number of repeats of the mutated paternal alleles, statistically significant differences were found for alleles with 10 or 12 repeats, between GATA and ATT structure groups. These results, which demonstrate the heterogeneity of mutational dynamics across repeat motifs, have implications in the fields of population genetics, epidemiology, or phylogeography, and whenever STR mutation models are used in evolutionary studies in general.
Collapse
Affiliation(s)
- Sofia Antão-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal.
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal.
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil.
| | - Nádia Pinto
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
- Center of Mathematics of University of Porto (CMUP), Porto, Portugal
| | - Pablo Rende
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal
| | - António Amorim
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| |
Collapse
|
4
|
Santana C, Noris G, Meraz-Ríos MA, Magaña JJ, Calderon-Aranda ES, Muñoz MDL, Gómez R. Genetic analysis of 17 Y-STRs in a Mestizo population from the Central Valley of Mexico. Hum Biol 2016; 86:289-312. [PMID: 25959695 DOI: 10.13110/humanbiology.86.4.0289] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This study aims to portray the complex diversity of the Mexican Mestizo population, which represents 98.8% of the entire population of Mexico. We compiled extended haplotype data of the Y chromosome from populations in the Central Valley of Mexico (CVM), which we compared with other Mestizo and parental (Amerindian, European, and African) populations. A complex ancestral relationship was found in the CVM population, suggesting cosmopolitan origins. Nevertheless, the most preeminent lineages point toward a European ancestry, where the R1b lineage was most frequent. In addition, important frequencies of Amerindian lineages were also found in the Mestizo sample studied. Interestingly, the Amerindian ancestry showed a remarkable substructure, which was represented by the two main founding lineages: QL54 (× M3) and M3. However, even within each lineage a high diversity was found despite the small number of sample bearers of these lineages. Further, we detected important genetic differences between the CVM populations and the Mexican Mestizo populations from the north and south. This result points to the fact that Mestizo populations present different ancestral proportions, which are related to the demographic events that gave origin to each population. Finally, we provide additional forensic statistical parameters that are useful in the interpretation of genetic analysis where autosomal loci are limited. Our findings illustrate the complex genetic background of the Mexican Mestizo population and reinforce the need to encompass more geographic regions to generate more robust data for forensic applications.
Collapse
Affiliation(s)
- Carla Santana
- 1 Laboratorio Biología Molecular Diagnóstica, Querétaro, Qro, México
| | | | | | | | | | | | | |
Collapse
|
5
|
Empirical evaluation reveals best fit of a logistic mutation model for human Y-chromosomal microsatellites. Genetics 2011; 189:1403-11. [PMID: 21968190 DOI: 10.1534/genetics.111.132308] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The rate of microsatellite mutation is dependent upon both the allele length and the repeat motif, but the exact nature of this relationship is still unknown. We analyzed data on the inheritance of human Y-chromosomal microsatellites in father-son duos, taken from 24 published reports and comprising 15,285 directly observable meioses. At the six microsatellites analyzed (DYS19, DYS389I, DYS390, DYS391, DYS392, and DYS393), a total of 162 mutations were observed. For each locus, we employed a maximum-likelihood approach to evaluate one of several single-step mutation models on the basis of the data. For five of the six loci considered, a novel logistic mutation model was found to provide the best fit according to Akaike's information criterion. This implies that the mutation probability at the loci increases (nonlinearly) with allele length at a rate that differs between upward and downward mutations. For DYS392, the best fit was provided by a linear model in which upward and downward mutation probabilities increase equally with allele length. This is the first study to empirically compare different microsatellite mutation models in a locus-specific fashion.
Collapse
|
6
|
Shi W, Ayub Q, Vermeulen M, Shao RG, Zuniga S, van der Gaag K, de Knijff P, Kayser M, Xue Y, Tyler-Smith C. A worldwide survey of human male demographic history based on Y-SNP and Y-STR data from the HGDP-CEPH populations. Mol Biol Evol 2009; 27:385-93. [PMID: 19822636 PMCID: PMC2806244 DOI: 10.1093/molbev/msp243] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
We have investigated human male demographic history using 590 males from 51 populations in the Human Genome Diversity Project - Centre d’Étude du Polymorphisme Humain worldwide panel, typed with 37 Y-chromosomal Single Nucleotide Polymorphisms and 65 Y-chromosomal Short Tandem Repeats and analyzed with the program Bayesian Analysis of Trees With Internal Node Generation. The general patterns we observe show a gradient from the oldest population time to the most recent common ancestors (TMRCAs) and expansion times together with the largest effective population sizes in Africa, to the youngest times and smallest effective population sizes in the Americas. These parameters are significantly negatively correlated with distance from East Africa, and the patterns are consistent with most other studies of human variation and history. In contrast, growth rate showed a weaker correlation in the opposite direction. Y-lineage diversity and TMRCA also decrease with distance from East Africa, supporting a model of expansion with serial founder events starting from this source. A number of individual populations diverge from these general patterns, including previously documented examples such as recent expansions of the Yoruba in Africa, Basques in Europe, and Yakut in Northern Asia. However, some unexpected demographic histories were also found, including low growth rates in the Hazara and Kalash from Pakistan and recent expansion of the Mozabites in North Africa.
Collapse
Affiliation(s)
- Wentao Shi
- The Wellcome Trust Sanger Institute, Hinxton, Cambs., United Kingdom
| | | | | | | | | | | | | | | | | | | |
Collapse
|