1
|
Inokuchi S, Nakanishi H, Takada A, Saito K. Effect existence of aging on stutter ratio evaluated via Bayesian inference. Forensic Sci Int Genet 2023; 67:102933. [PMID: 37722181 DOI: 10.1016/j.fsigen.2023.102933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 07/31/2023] [Accepted: 09/06/2023] [Indexed: 09/20/2023]
Abstract
The stochastic behavior of the stutter ratio (SR) in capillary electrophoresis-based DNA typing is currently described and predicted using statistical models in forensic genetics. Clarifying this behavior can help obtain more objective and robust evidence to the court in terms of mixture interpretation. This study aimed to investigate the effect existence of aging on SR via a Bayesian framework. Nail scrapings and clippings were collected from 68 healthy individuals with informed consent. Samples were classified by age-class: young group (0-16 years; n = 36) and older-adult group (>61 years; n = 32). Then, they were compared in terms of their SRs for each simple repeat locus included in GlobalFiler Kit. Bayesian modeling was performed with lognormal distribution model, which implemented multiple linear regression, allele and age-class as explanatory variables. For all simple repeat loci, the median of the posterior distribution of the age-class parameter was a positive value. For CSF1PO and D7S820, the 95% credible interval of the posterior distribution did not include 0. Our data suggested that aging slightly increases the SR. These findings might help elucidate the stochastic behavior of SR.
Collapse
Affiliation(s)
- Shota Inokuchi
- Department of Forensic Medicine, Graduate School of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan; Forensic Science Laboratory, Tokyo Metropolitan Police Department, 3-35-21 Shakujiidai, Nerima-ku, Tokyo, Japan.
| | - Hiroaki Nakanishi
- Department of Forensic Medicine, Graduate School of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Aya Takada
- Department of Forensic Medicine, Saitama Medical University, 38 Moroyamamachimorohongo, Saitama, Japan
| | - Kazuyuki Saito
- Department of Forensic Medicine, Graduate School of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
2
|
Agudo MM, Aanes H, Roseth A, Albert M, Gill P, Bleka Ø. A comprehensive characterization of MPS-STR stutter artefacts. Forensic Sci Int Genet 2022; 60:102728. [DOI: 10.1016/j.fsigen.2022.102728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/03/2022] [Accepted: 05/24/2022] [Indexed: 11/04/2022]
|
3
|
Inokuchi S, Fujii K, Nakanishi H, Takada A, Saito K, Mizuno N. Modeling the minus two base pair stutter ratio of the D1S1656 locus: A sequence-based mixture distribution model. Forensic Sci Int Genet 2020; 51:102450. [PMID: 33387945 DOI: 10.1016/j.fsigen.2020.102450] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 12/01/2020] [Accepted: 12/01/2020] [Indexed: 11/15/2022]
Abstract
In this study, we propose a stutter ratio for a minus two base pair stutter (-2bpSR) model of the D1S1656 locus in capillary electrophoresis (CE)-based short tandem repeat (STR) typing. DNA from a total of 108 Japanese individuals was analyzed via massively parallel sequencing to investigate the length of the longest uninterrupted stretch of two base repeat motif (2bpLUS value) within repetitive structures involving the flanking region. Additionally, -2bpSR data was collected using the GlobalFiler Kit on a 3500xL Genetic Analyzer. As a result of sequencing analysis, all alleles were classified into two types by their 2bpLUS values. The -2bpSR differed significantly between the types. Then, we modeled the -2bpSR with a mixture log-normal distribution using the classification of alleles based on the 2bpLUS values. Furthermore, probabilities of the sequence type within each repeat number in the mixture log-normal distribution model were estimated using logistic regression for each of the five major detected populations. This study is expected to enable interpretation of STR typing while considering minus two base pair stutter at the D1S1656 locus.
Collapse
Affiliation(s)
- Shota Inokuchi
- Department of Forensic Medicine, Graduate School of Medicne, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan; National Research Institute of Police Science, 6-3-1 Kashiwanoha, Kashiwa, Chiba, Japan; Forensic Science Laboratory, Tokyo Metropolitan Police Department, 3-35-21 Shakujiidai, Nerima-ku, Tokyo, Japan.
| | - Koji Fujii
- National Research Institute of Police Science, 6-3-1 Kashiwanoha, Kashiwa, Chiba, Japan
| | - Hiroaki Nakanishi
- Department of Forensic Medicine, Graduate School of Medicne, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Aya Takada
- Department of Forensic Medicine, Saitama Medical University, 38 Moroyamamachimorohongo, Saitama, Japan
| | - Kazuyuki Saito
- Department of Forensic Medicine, Graduate School of Medicne, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan; Department of Forensic Medicine, Saitama Medical University, 38 Moroyamamachimorohongo, Saitama, Japan
| | - Natsuko Mizuno
- National Research Institute of Police Science, 6-3-1 Kashiwanoha, Kashiwa, Chiba, Japan
| |
Collapse
|
4
|
A reference allelic ladder for Western Capercaillie (Tetrao urogallus) and Black Grouse (Tetrao tetrix) enables linking grouse genetic data across studies. CONSERV GENET RESOUR 2020. [DOI: 10.1007/s12686-020-01180-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
AbstractRapid anthropogenic climate change and progressing habitat degradation are considered top threats to biodiversity. The employment of demanding umbrella species as indicators for ecosystem health is a popular and cost-effective strategy that facilitates continuous monitoring and evaluation within a long-term conservation management scheme. The Western Capercaillie (Tetrao urogallus) and the Black Grouse (Tetrao tetrix) are both considered viable candidates due to their extensive habitat requirements, the possibility for conservative, non-invasive sampling, and their broad popular appeal. Regional population surveys based on genetic data from Short Sequence Repeat (SSR) analysis are being conducted throughout the Palearctic. However, to ensure reliable comparability among laboratories, standardization is required. Here, we report a catalogue of fifty fully characterized reference alleles from twelve SSR loci and the construction of a customizable allelic ladder for genotyping and individualization in Western Capercaillie and Black Grouse. This methodological improvement will help to cost-efficiently generate and collate supraregional data from different grouse surveys and thereby contribute to conservation management. Reference alleles and ladders can be obtained on demand.
Collapse
|
5
|
Silvery J, Ganschow S, Wiegand P, Tiemann C. Developmental validation of the monSTR identity panel, a forensic STR multiplex assay for massively parallel sequencing. Forensic Sci Int Genet 2020; 46:102236. [PMID: 31986344 DOI: 10.1016/j.fsigen.2020.102236] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 01/07/2020] [Accepted: 01/12/2020] [Indexed: 12/11/2022]
Abstract
The 21-plex STR panel monSTR was designed for high-fidelity forensic genotyping on the Illumina MiSeq platform. In this study, the panel's performance was validated according to the recommended validation guidelines of the Scientific Working Group for DNA Analysis Methods (SWGDAM). Concordance, repeatability and reproducibility, sensitivity of detection, mixture analysis, species-specificity, and the ability to analyze mock samples were assessed. Sequence data was analyzed using the genotyping software toaSTR. The assay performance was evaluated by measuring the read on-target ratio, the genotype accuracy, the inter-locus balance, the heterozygosity balance, and the signal-to-noise ratio. Results showed that profiles of NIST reference DNA samples as well as GEDNAP proficiency samples were fully concordant with CE-based methods. In addition, inter-run and intra-run variation experiments indicated high precision. Furthermore, full profiles could be obtained using 62.5 pg of DNA input amount with proper inter-locus balance and read on-target ratio; 76.4% of alleles were correctly called with 7.8 pg DNA input amount. It was demonstrated that 94.4% of minor contributor alleles were resolved accurately in a 1:49 mixture. Results suggested that the minor contribution could be precisely calculated based on the minor component allele frequency. Validation results described here demonstrate that the monSTR forensic identity panel is a valid tool for forensic STR genotyping using massively parallel sequencing.
Collapse
Affiliation(s)
- Janine Silvery
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstraße 40, 32105 Bad Salzuflen, Germany; Institute of Legal Medicine, University Hospital of Ulm, Albert-Einstein-Allee 23, 89081 Ulm, Germany.
| | - Sebastian Ganschow
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstraße 40, 32105 Bad Salzuflen, Germany
| | - Peter Wiegand
- Institute of Legal Medicine, University Hospital of Ulm, Albert-Einstein-Allee 23, 89081 Ulm, Germany
| | - Carsten Tiemann
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstraße 40, 32105 Bad Salzuflen, Germany; Faculty of Engineering and Mathematics, Bielefeld University of Applied Science, Interaktion 1, 33619 Bielefeld, Germany
| |
Collapse
|
6
|
Understanding the characteristics of sequence-based single-source DNA profiles. Forensic Sci Int Genet 2020; 44:102192. [DOI: 10.1016/j.fsigen.2019.102192] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 10/17/2019] [Accepted: 10/20/2019] [Indexed: 01/18/2023]
|
7
|
Kuffel A, Gray A, Nic Daeid N. Human Leukocyte Antigen alleles as an aid to STR in complex forensic DNA samples. Sci Justice 2019; 60:1-8. [PMID: 31924284 DOI: 10.1016/j.scijus.2019.09.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 09/09/2019] [Accepted: 09/15/2019] [Indexed: 10/25/2022]
Abstract
Human biological samples with multiple contributors remain one of the most challenging aspects of DNA typing within a forensic science context. With the increasing sensitivity of commercially available kits allowing detection of low template DNA, complex mixtures are now a standard component of forensic DNA evidence. Over the years, various methods and techniques have been developed to try to resolve the issue of mixed profiles. However, forensic DNA analysis has relied on the same markers to generate DNA profiles for the past 30 years causing considerable challenges in the deconvolution of complex mixed samples. The future of resolving complicated DNA mixtures may rely on utilising markers that have been previously applied to gene typing of non-forensic relevance. With Massively Parallel Sequencing (MPS), techniques becoming more popular and accessible even epigenetic markers have become a source of interest for forensic scientists. The aim of this review is to consider the potential of alleles from the Human Leukocyte Antigen (HLA) complex as effective forensic markers. While Massively Parallel Sequencing of HLA is routinely used in clinical laboratories in fields such as transplantation, pharmacology or population studies, there have not been any studies testing its suitability for forensic casework samples.
Collapse
Affiliation(s)
- Agnieszka Kuffel
- Leverhulme Research Centre for Forensic Science, Ewing Building, University of Dundee, Small's Lane, Dundee DD1 4HR, United Kingdom.
| | - Alexander Gray
- Leverhulme Research Centre for Forensic Science, Ewing Building, University of Dundee, Small's Lane, Dundee DD1 4HR, United Kingdom.
| | - Niamh Nic Daeid
- Leverhulme Research Centre for Forensic Science, Ewing Building, University of Dundee, Small's Lane, Dundee DD1 4HR, United Kingdom.
| |
Collapse
|
8
|
Holzhausen I, Lendner M, Göhring F, Steinhöfel I, Daugschies A. Distribution of Cryptosporidium parvum gp60 subtypes in calf herds of Saxony, Germany. Parasitol Res 2019; 118:1549-1558. [PMID: 30790038 DOI: 10.1007/s00436-019-06266-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 02/12/2019] [Indexed: 02/01/2023]
Abstract
Cryptosporidiosis is a common protozoan parasitic infection that causes diarrhoea in neonatal calves. The high shedding of environmentally resistant oocysts facilitates outbreaks of cryptosporidiosis in humans. In total, 58 farms (512 calves) in Germany (Saxony and Brandenburg) were visited three times each. Faecal samples of pre-weaned calves were microscopically examined for oocysts of Cryptosporidium spp. using Heine staining and were scored with regard to their consistency. Overall, 88.9% of calves tested microscopically positive for Cryptosporidium spp. in at least one sample, and the excretion of oocysts was significantly (P < 0.01) associated with a higher faecal score (diarrhoea). After DNA extraction from pooled farm isolates, 47 samples were successfully subtyped by sequence analysis of the 60 kDa glycoprotein gene (gp60). All isolates belonged to subtype family IIa. IIaA15G2R1 was the most common subtype (present on 66% of the farms), followed by IIaA16G3R1 (13%). Subtypes IIaA14G1R1, IIaA14G2R1, IIaA1612R1, IIaA16G2R1, IIaA17G1R1, IIaA17G2R1, IIaA17G4R1 and IIaA19G2R1 were found sporadically. This is the first description of gp60 subtype IIaA17G4R1 in cattle in Germany.
Collapse
Affiliation(s)
- I Holzhausen
- Institute of Parasitology, Centre for Infectious Diseases, University of Leipzig, An den Tierkliniken 35, D - 04103, Leipzig, Germany. .,Albrecht - Daniel - Thaer - Institute for Agricultural Sciences e.V., University of Leipzig, An den Tierkliniken 29, D - 04103, Leipzig, Germany.
| | - M Lendner
- Institute of Parasitology, Centre for Infectious Diseases, University of Leipzig, An den Tierkliniken 35, D - 04103, Leipzig, Germany
| | - F Göhring
- Institute of Parasitology, Centre for Infectious Diseases, University of Leipzig, An den Tierkliniken 35, D - 04103, Leipzig, Germany.,Albrecht - Daniel - Thaer - Institute for Agricultural Sciences e.V., University of Leipzig, An den Tierkliniken 29, D - 04103, Leipzig, Germany
| | - I Steinhöfel
- Saxon State Office for Environment, Agriculture and Geology, Am Park 3, D - 04886, Köllitsch, Germany
| | - A Daugschies
- Institute of Parasitology, Centre for Infectious Diseases, University of Leipzig, An den Tierkliniken 35, D - 04103, Leipzig, Germany.,Albrecht - Daniel - Thaer - Institute for Agricultural Sciences e.V., University of Leipzig, An den Tierkliniken 29, D - 04103, Leipzig, Germany
| |
Collapse
|
9
|
Ganschow S, Silvery J, Kalinowski J, Tiemann C. toaSTR: A web application for forensic STR genotyping by massively parallel sequencing. Forensic Sci Int Genet 2018; 37:21-28. [PMID: 30071493 DOI: 10.1016/j.fsigen.2018.07.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 07/04/2018] [Accepted: 07/05/2018] [Indexed: 01/29/2023]
Abstract
Massively parallel sequencing (MPS) is emerging within the forensic community as a promising technique for high-resolution short tandem repeat (STR) genotyping, discovering both length and sequence polymorphisms. Conversely, the application of MPS to routine casework poses new challenges to the DNA analyst in view of the complex sequence data that is generated with this technology. We developed the web application toaSTR to help forensic experts work with MPS data simply and efficiently. An intuitive graphical user interface guides through the STR genotyping workflow. This versatile software handles data from various popular MPS platforms and supports both commercial and in-house multiplex PCR kits. Users can define locus-specific stutter thresholds and create custom sets of STR markers to be analyzed. toaSTR's innovative sequence-based stutter model predicts and identifies common stutter variants. The algorithm automatically differentiates biological (iso-)alleles from stutter and other artefacts to assist the interpretation of mixed samples. toaSTR features a comprehensive data visualization with interactive diagrams and a dynamic tabular overview of sequence observations. The software provides an interface to biostatistics tools and enables PDF result export in compliance with the sequence notation recommended by the International Society for Forensic Genetics (ISFG). An initial compatibility and concordance study confirmed the software's independent functionality and precise allele calling with data of different MPS platforms, STR amplification kits, and library preparation chemistries. Discussion of genotyping results for single source and mixed samples demonstrates toaSTR's advantages and includes suggestions for future MPS software development. The beta version of toaSTR is freely accessible at www.toastr.online.
Collapse
Affiliation(s)
- Sebastian Ganschow
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany.
| | - Janine Silvery
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany
| | - Jörn Kalinowski
- Center for Biotechnology (CeBiTec), Bielefeld University, Sequenz 1, 33615 Bielefeld, Germany
| | - Carsten Tiemann
- LABCON-OWL Analytik, Forschung und Consulting GmbH, Siemensstr. 40, 32105 Bad Salzuflen, Germany
| |
Collapse
|
10
|
Vilsen SB, Tvedebrink T, Eriksen PS, Bøsting C, Hussing C, Mogensen HS, Morling N. Stutter analysis of complex STR MPS data. Forensic Sci Int Genet 2018; 35:107-112. [DOI: 10.1016/j.fsigen.2018.04.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 03/13/2018] [Accepted: 04/06/2018] [Indexed: 11/29/2022]
|
11
|
Just RS, Irwin JA. Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results. Forensic Sci Int Genet 2018. [DOI: 10.1016/j.fsigen.2018.02.016] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
12
|
Ganschow S, Wiegand P, Tiemann C. toaSTR: A web-based forensic tool for the analysis of short tandem repeats in massively parallel sequencing data. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2017. [DOI: 10.1016/j.fsigss.2017.09.034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
13
|
Flanking Variation Influences Rates of Stutter in Simple Repeats. Genes (Basel) 2017; 8:genes8110329. [PMID: 29149052 PMCID: PMC5704242 DOI: 10.3390/genes8110329] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 11/07/2017] [Accepted: 11/07/2017] [Indexed: 12/03/2022] Open
Abstract
It has been posited that the longest uninterrupted stretch (LUS) of tandem repeats, as defined by the number of exactly matching repeating motif units, is a better predictor of rates of stutter than the parental allele length (PAL). While there are cases where this hypothesis is likely correct, such as the 9.3 allele in the TH01 locus, there can be situations where it may not apply as well. For example, the PAL may capture flanking indel variations while remaining insensitive to polymorphisms in the repeat, and these haplotypic changes may impact the stutter rate. To address this, rates of stutter were contrasted against the LUS as well as the PAL on different flanking haplotypic backgrounds. This study shows that rates of stutter can vary substantially depending on the flanking haplotype, and while there are cases where the LUS is a better predictor of stutter than the PAL, examples to the contrary are apparent in commonly assayed forensic markers. Further, flanking variation that is 7 bp from the repeat region can impact rates of stutter. These findings suggest that non-proximal effects, such as DNA secondary structure, may be impacting the rates of stutter in common forensic short tandem repeat markers.
Collapse
|
14
|
Next Generation Sequencing uncovers within-host differences in the genetic diversity of Cryptosporidium gp60 subtypes. Int J Parasitol 2017; 47:601-607. [DOI: 10.1016/j.ijpara.2017.03.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Revised: 03/23/2017] [Accepted: 03/31/2017] [Indexed: 12/21/2022]
|
15
|
Statistical modelling of Ion PGM HID STR 10-plex MPS data. Forensic Sci Int Genet 2017; 28:82-89. [DOI: 10.1016/j.fsigen.2017.01.017] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Revised: 01/18/2017] [Accepted: 01/30/2017] [Indexed: 11/18/2022]
|
16
|
de la Puente M, Phillips C, Fondevila M, Gelabert-Besada M, Carracedo Á, Lareu MV. A forensic multiplex of nine novel pentameric-repeat STRs. Forensic Sci Int Genet 2017; 29:154-164. [PMID: 28445836 DOI: 10.1016/j.fsigen.2017.04.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Revised: 03/29/2017] [Accepted: 04/11/2017] [Indexed: 12/22/2022]
Abstract
Pentameric-repeat short tandem repeats (STRs), consisting of loci with repeat units of five base-pairs, have the advantage of reduced stutter products compared to their tetrameric-repeat STR counterparts. This characteristic potentially helps the interpretation of mixed DNA profiles when minor component alleles may coincide with stutter peaks from the major components. To develop a simple but informative forensic multiplex with the capability to aid mixture interpretation, we designed an 11-plex assay of nine pentameric STRs new to forensic analysis plus two male- specific markers: DYS391 and the Y-Indel rs2032678 used in GlobalFiler™ (Life Technologies). East Asian-specific variation in the recently adopted Y-Indel rs2032678 is reported in this study for the first time in its forensic use as a sex marker. We estimated the levels of variation observed in the nine pentameric STRs in three of the major population groups sampled in the HGDP-CEPH human genome diversity panel: African, European and East Asian (combining individual populations as their sample sizes were too small for STR allele frequency estimations); and we include genotype data from a population sample of Northwest Spain. From this data, forensic informativeness metrics were estimated when applying the nine novel STRs in identification or kinship analyses. The assay was assessed for forensic sensitivity and ability to successfully genotype highly degraded DNA. In the profiles from the 11-plex assay we observed an average 2.15% stutter ratio in all the pentameric loci compared to 7.32% across equivalently-sized tetrameric STRs in the Promega Powerplex® ESX-17 kit.
Collapse
Affiliation(s)
- M de la Puente
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - C Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain.
| | - M Fondevila
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - M Gelabert-Besada
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - Á Carracedo
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain; Grupo de Medicina Xenómica (GMX), Faculty of Medicine, University of Santiago de Compostela, Spain; Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah, Saudi Arabia
| | - M V Lareu
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| |
Collapse
|
17
|
FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise. Forensic Sci Int Genet 2017; 27:27-40. [DOI: 10.1016/j.fsigen.2016.11.007] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2016] [Revised: 10/31/2016] [Accepted: 11/23/2016] [Indexed: 11/20/2022]
|
18
|
|
19
|
Gill P, Haned H, Bleka O, Hansson O, Dørum G, Egeland T. Genotyping and interpretation of STR-DNA: Low-template, mixtures and database matches-Twenty years of research and development. Forensic Sci Int Genet 2015; 18:100-17. [PMID: 25866376 DOI: 10.1016/j.fsigen.2015.03.014] [Citation(s) in RCA: 92] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 03/19/2015] [Accepted: 03/24/2015] [Indexed: 12/17/2022]
Abstract
The introduction of Short Tandem Repeat (STR) DNA was a revolution within a revolution that transformed forensic DNA profiling into a tool that could be used, for the first time, to create National DNA databases. This transformation would not have been possible without the concurrent development of fluorescent automated sequencers, combined with the ability to multiplex several loci together. Use of the polymerase chain reaction (PCR) increased the sensitivity of the method to enable the analysis of a handful of cells. The first multiplexes were simple: 'the quad', introduced by the defunct UK Forensic Science Service (FSS) in 1994, rapidly followed by a more discriminating 'six-plex' (Second Generation Multiplex) in 1995 that was used to create the world's first national DNA database. The success of the database rapidly outgrew the functionality of the original system - by the year 2000 a new multiplex of ten-loci was introduced to reduce the chance of adventitious matches. The technology was adopted world-wide, albeit with different loci. The political requirement to introduce pan-European databases encouraged standardisation - the development of European Standard Set (ESS) of markers comprising twelve-loci is the latest iteration. Although development has been impressive, the methods used to interpret evidence have lagged behind. For example, the theory to interpret complex DNA profiles (low-level mixtures), had been developed fifteen years ago, but only in the past year or so, are the concepts starting to be widely adopted. A plethora of different models (some commercial and others non-commercial) have appeared. This has led to a confusing 'debate' about the 'best' to use. The different models available are described along with their advantages and disadvantages. A section discusses the development of national DNA databases, along with details of an associated controversy to estimate the strength of evidence of matches. Current methodology is limited to searches of complete profiles - another example where the interpretation of matches has not kept pace with development of theory. STRs have also transformed the area of Disaster Victim Identification (DVI) which frequently requires kinship analysis. However, genotyping efficiency is complicated by complex, degraded DNA profiles. Finally, there is now a detailed understanding of the causes of stochastic effects that cause DNA profiles to exhibit the phenomena of drop-out and drop-in, along with artefacts such as stutters. The phenomena discussed include: heterozygote balance; stutter; degradation; the effect of decreasing quantities of DNA; the dilution effect.
Collapse
Affiliation(s)
- Peter Gill
- Norwegian Institute of Public Health, Department of Forensic Biology, PO Box 4404 Nydalen, 0403 Oslo, Norway; Department of Forensic Medicine, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway.
| | - Hinda Haned
- Netherlands Forensic Institute, Department of Human Biological Traces, The Hague, The Netherlands
| | - Oyvind Bleka
- Norwegian Institute of Public Health, Department of Forensic Biology, PO Box 4404 Nydalen, 0403 Oslo, Norway
| | - Oskar Hansson
- Norwegian Institute of Public Health, Department of Forensic Biology, PO Box 4404 Nydalen, 0403 Oslo, Norway
| | - Guro Dørum
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, P.O. Box 5003, NO-1432 Aas, Norway
| | - Thore Egeland
- Norwegian Institute of Public Health, Department of Forensic Biology, PO Box 4404 Nydalen, 0403 Oslo, Norway; Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, P.O. Box 5003, NO-1432 Aas, Norway
| |
Collapse
|
20
|
Rockah-Shmuel L, Tóth-Petróczy Á, Sela A, Wurtzel O, Sorek R, Tawfik DS. Correlated occurrence and bypass of frame-shifting insertion-deletions (InDels) to give functional proteins. PLoS Genet 2013; 9:e1003882. [PMID: 24204297 PMCID: PMC3812077 DOI: 10.1371/journal.pgen.1003882] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2013] [Accepted: 09/02/2013] [Indexed: 11/19/2022] Open
Abstract
Short insertions and deletions (InDels) comprise an important part of the natural mutational repertoire. InDels are, however, highly deleterious, primarily because two-thirds result in frame-shifts. Bypass through slippage over homonucleotide repeats by transcriptional and/or translational infidelity is known to occur sporadically. However, the overall frequency of bypass and its relation to sequence composition remain unclear. Intriguingly, the occurrence of InDels and the bypass of frame-shifts are mechanistically related - occurring through slippage over repeats by DNA or RNA polymerases, or by the ribosome, respectively. Here, we show that the frequency of frame-shifting InDels, and the frequency by which they are bypassed to give full-length, functional proteins, are indeed highly correlated. Using a laboratory genetic drift, we have exhaustively mapped all InDels that occurred within a single gene. We thus compared the naive InDel repertoire that results from DNA polymerase slippage to the frame-shifting InDels tolerated following selection to maintain protein function. We found that InDels repeatedly occurred, and were bypassed, within homonucleotide repeats of 3–8 bases. The longer the repeat, the higher was the frequency of InDels formation, and the more frequent was their bypass. Besides an expected 8A repeat, other types of repeats, including short ones, and G and C repeats, were bypassed. Although obtained in vitro, our results indicate a direct link between the genetic occurrence of InDels and their phenotypic rescue, thus suggesting a potential role for frame-shifting InDels as bridging evolutionary intermediates. Homonucleotide repeats are exceptionally prone to insertions and/or deletions of bases (InDels). However, unless they occur in a multiplicity of 3 bases, InDels disrupt the reading frame and are thus expected to be purged from coding regions. Homonucleotide repeats, however, are also vulnerable to slippage by RNA polymerases and the ribosome. Using laboratory evolution techniques, we systematically mapped the occurrence of InDels within a given gene, before and after selection. Our data indicate that frame-shifting InDels were frequently bypassed to give functional proteins at surprisingly high frequencies. Further, we found a strict correlation between the repeat length, the frequency of occurrence of InDels at the DNA level, and the likelihood of bypass by transcriptional/translational slippage. Our results suggest that frame-shifting InDels might comprise functional evolutionary intermediates, and an effective mean of sequence divergence (e.g. when an adjacent InDel restores the frame, resulting in altered sequence and, potentially, in an altered protein structure).
Collapse
Affiliation(s)
- Liat Rockah-Shmuel
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Asaf Sela
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Omri Wurtzel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Rotem Sorek
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
21
|
Kelly H, Bright JA, Buckleton JS, Curran JM. Identifying and modelling the drivers of stutter in forensic DNA profiles. AUST J FORENSIC SCI 2013. [DOI: 10.1080/00450618.2013.808697] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Affiliation(s)
- Hannah Kelly
- University of Auckland, Auckland, New Zealand
- ESR, Auckland, New Zealand
| | | | | | | |
Collapse
|
22
|
Schumm JW, Gutierrez-Mateo C, Tan E, Selden R. A 27-locus STR assay to meet all United States and European law enforcement agency standards. J Forensic Sci 2013; 58:1584-92. [PMID: 23822765 DOI: 10.1111/1556-4029.12214] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2012] [Revised: 08/15/2012] [Accepted: 08/19/2012] [Indexed: 11/27/2022]
Abstract
Different national and international agencies have selected specific STR sets for forensic database use. To enhance database comparison across national and international borders, a 27-locus multiplex system was developed comprising all 15 STR loci of the European standard set, the current 13 STR loci of the CODIS core, the proposed 22 STR loci of the expanded CODIS core, 4 additional commonly used STR loci, and the amelogenin locus. Development required iterative primer design to resolve primer-related artifacts, amplicon sizing, and locus-to-locus balance issues. The 19.5-min assay incorporated newly developed six-dye chemistry analyzed using a novel microfluidic electrophoresis instrument capable of simultaneous detection and discrimination of 8 or more fluorescent dyes. The 27-locus multiplex offers the potential for a new international STR standard permitting laboratories in any jurisdiction to use a single reaction to determine profiles for loci they typically generate plus an expanded common STR profiling set of global interest.
Collapse
|
23
|
Comparison of southern Chinese Han and Brazilian Caucasian mutation rates at autosomal short tandem repeat loci used in human forensic genetics. Int J Legal Med 2013; 128:1-9. [PMID: 23549598 DOI: 10.1007/s00414-013-0847-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Accepted: 03/05/2013] [Indexed: 10/27/2022]
Abstract
The short tandem repeat (STR) loci used in human genetic studies are characterized by having relatively high mutation rates. In particular, to ensure an appropriate evaluation of genetic evidence in parentage and forensic analyses, it is essential to have accurate estimates of the mutation rates associated with the commonly used autosomal and sex chromosome STR loci. Differences in STR mutation rates between different ethnic groups should also be determined. Mutation data from two laboratories working with different ethnic groups were extracted from many meiotic transmissions ascertained for 15 autosomal STR loci currently used in forensic routine. Forty-five thousand and eighty-five trios were checked for the biological consistency of maternity and paternity through the analysis of a minimum of 15 loci. Mutations were scored as paternal, maternal, or ambiguous according to the most parsimonious explanation for the inconsistency, using always the least requiring hypothesis in terms of number of repeat differences. The main findings are: (a) the overall mutation rate across the 15 loci was 9.78 × 10(-4) per gamete per generation (95% CI = 9.30 × 10(-4)-1.03 × 10(-3)), and with just 48 (out of 1,587) exceptions, all of the mutations were single-step; (b) repeat gains were more frequent than losses; (c) longer alleles were found to be more mutable; and (d) the mutation rates differ at some loci between the two ethnic groups. Large worldwide meiotic transmission datasets are still needed to measure allele-specific mutation rates at the STR loci consensually used in forensic genetics.
Collapse
|
24
|
Grinberg A, Biggs P, Dukkipati V, George T. Extensive intra-host genetic diversity uncovered in Cryptosporidium parvum using Next Generation Sequencing. INFECTION GENETICS AND EVOLUTION 2013; 15:18-24. [DOI: 10.1016/j.meegid.2012.08.017] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Revised: 08/28/2012] [Accepted: 08/28/2012] [Indexed: 11/28/2022]
|
25
|
Westen AA, Grol LJ, Harteveld J, Matai AS, de Knijff P, Sijen T. Assessment of the stochastic threshold, back- and forward stutter filters and low template techniques for NGM. Forensic Sci Int Genet 2012; 6:708-15. [DOI: 10.1016/j.fsigen.2012.05.001] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2011] [Revised: 03/28/2012] [Accepted: 05/03/2012] [Indexed: 11/16/2022]
|
26
|
Characterising stutter in forensic STR multiplexes. Forensic Sci Int Genet 2011; 6:58-63. [PMID: 21388903 DOI: 10.1016/j.fsigen.2011.02.001] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2010] [Revised: 12/23/2010] [Accepted: 02/01/2011] [Indexed: 11/20/2022]
Abstract
Stutter is an artefact seen when amplifying short tandem repeats and typically occurs at one repeat unit shorter in length than the parent allele. In forensic analysis, stutter complicates the analysis of DNA profiles from multiple contributors, known as mixed profiles, a common profile type. Consequently it is important to both understand and predict stutter behaviour in order to improve our understanding of the resolution and interpretation of these profiles. Whilst stutter is well recognised and documented, little information is available that identifies and quantifies what influences the formation of stutter. In this work we use a novel approach to examine this. We have used synthetic oligonucleotides comprising multiple repeat units to test; the influence of repeat number, the influence of repeat sequence and the impact of interruptions to the repeat sequence length. Using multiple replicates allows detailed statistical analysis. We have confirmed a linear relationship between stutter ratio and repeat number. We have shown that increased A-T content increases stutter ratio and that interruptions in repeating sequences decreased stutter ratios to levels similar to the longest uninterrupted repeat stretch. We also found that there was no relationship between stutter ratio and repeat number for a repeat unit with an A-T content of 1/4 and that half of the interrupted repeat sequences stuttered significantly less than their longest uninterrupted repeat stretches. We have applied the knowledge gained to examine specific features of the loci present in the AmpFlSTR(®) SGM Plus(®) multiplex kit used in our laboratory.
Collapse
|
27
|
Andersen JF, Pham VM, Meng Z, Champagne DE, Ribeiro JMC. Insight into the sialome of the Black Fly, Simulium vittatum. J Proteome Res 2009; 8:1474-88. [PMID: 19166301 DOI: 10.1021/pr8008429] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Adaptation to vertebrate blood feeding includes development of a salivary "magic potion" that can disarm host hemostasis and inflammatory reactions. Within the lower Diptera, a vertebrate blood-sucking mode evolved in the Psychodidae (sand flies), Culicidae (mosquitoes), Ceratopogonidae (biting midges), Simuliidae (black flies), and the frog-feeding Corethrellidae. Sialotranscriptome analyses from several species of mosquitoes and sand flies and from one biting midge indicate divergence in the evolution of the blood-sucking salivary potion, manifested in the finding of many unique proteins within each insect family, and even genus. Gene duplication and divergence events are highly prevalent, possibly driven by vertebrate host immune pressure. Within this framework, we describe the sialome (from Greek sialo, saliva) of the black fly Simulium vittatum and discuss the findings within the context of the protein families found in other blood-sucking Diptera. Sequences and results of Blast searches against several protein family databases are given in Supplemental Tables S1 and S2, which can be obtained from http://exon.niaid.nih.gov/transcriptome/S_vittatum/T1/SV-tb1.zip and http://exon.niaid.nih.gov/transcriptome/S_vittatum/T2/SV-tb2.zip .
Collapse
Affiliation(s)
- John F Andersen
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Maryland 20852, USA
| | | | | | | | | |
Collapse
|
28
|
Chen JQ, Wu Y, Yang H, Bergelson J, Kreitman M, Tian D. Variation in the ratio of nucleotide substitution and indel rates across genomes in mammals and bacteria. Mol Biol Evol 2009; 26:1523-31. [PMID: 19329651 DOI: 10.1093/molbev/msp063] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Rates of nucleotide substitution and insertion/deletion (indel) are known to vary across the functional components of a genome. Little attention has been paid, however, to the quantitative relationship between the two. Here we investigate the ratio of nucleotide substitutions to indels (S/I) in different regions of 4 primates, 70 bacteria, and 8 other genomes. We find that the ratio differs at 5.4-times between coding and noncoding, 3.3-times between conserved and less conserved coding sequences, and 1.46-times between nonrepeat and repeat regions. The S/I ratio is also positively correlated with the level of divergence between the genomes compared. Our results suggest that the S/I ratio may reflect differences in the efficacy of selection against indels. Due to the sensitivity of indel density in different regions, this ratio varies over a much larger range. With the recent discovery suggesting that indels act as local enhancers of mutation in surrounding sequences, nucleotide substitution rates are expected to be accelerated in regions of low constraint, where indels tend to accumulate, but will otherwise be modulated in proportion to the level of a sequence's functional constraint. Indels, therefore, may play a nontrivial role in controlling differences in genetic variation and divergence across functional regions of a genome.
Collapse
Affiliation(s)
- Jian-Qun Chen
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, China
| | | | | | | | | | | |
Collapse
|
29
|
Olejniczak M, Krzyzosiak WJ. Genotyping of simple sequence repeats--factors implicated in shadow band generation revisited. Electrophoresis 2007; 27:3724-34. [PMID: 16960838 DOI: 10.1002/elps.200600136] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
PCR amplification of microsatellite sequences generates, besides the main product corresponding to allele size, also additional, undesired products usually shorter by multiples of the repeated unit. These extra products known as shadow bands or stutter products may complicate genotyping. The mechanism by which these artifacts are formed is not well understood and so no effective remedy has been found to cope with these spurious products. In this study, using the DNA templates containing the CAG/CTG repeats flanked by gene-specific sequences and universal priming sites, we analyzed the effects of many PCR variables on the shadow band generation. The most important result was that at the decreased temperature of the denaturation step during PCR cycling the shadow bands were either not formed or were strongly suppressed. Several possible sources of this effect are discussed.
Collapse
Affiliation(s)
- Marta Olejniczak
- Laboratory of Cancer Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | | |
Collapse
|
30
|
Reboul MP, Tandonnet O, Biteau N, Belet-de Putter C, Rebouissoux L, Moradkhani K, Vu PY, Saura R, Arveiler B, Lacombe D, Taine L, Iron A. Mosaic maternal uniparental isodisomy for chromosome 7q21-qter. Clin Genet 2006; 70:207-13. [PMID: 16922723 DOI: 10.1111/j.1399-0004.2006.00664.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Uniparental disomy (UPD) for several human chromosomes is associated with clinical abnormalities. We report the case of a 2-year-old boy with severe intrauterine and post-natal growth retardation (IUGR/PNGR) and highly variable sweat chloride concentrations. The patient was identified as heterozygous for the F508del mutation of the CFTR (cystic fibrosis transmembrane conductance regulator) gene. Unexpectedly, the signal corresponding to the maternally inherited F508del allele appeared much more intense than the paternally derived wild allele. Molecular analysis including polymorphic marker studies, microsatellites and single-nucleotide polymorphisms subsequently showed that the boy was a carrier of a de novo mosaic maternal isodisomy of a chromosome 7 segment while there was a biparental inheritance of the rest of the chromosome. This is the first report of a mosaic partial UPD7. The matUPD7 segment at 7q21-qter extends for 72.7 Mb. The karyotype (550 bands) of our patient was normal, and fluorescence in situ hybridization with probes mapping around the CFTR gene allowed us to rule out a partial duplication. The detection of this chromosomal rearrangement confirms the hypothesis that the 7q31-qter segment is a candidate for the localization of human imprinted genes involved in the control of IUGR and PNGR. It also emphasizes the importance of searching for UPD7 in severe, isolated and unexplained IUGR and PNGR.
Collapse
Affiliation(s)
- M-P Reboul
- Service de Génétique Médicale, Hopital Pellegrin, Laboratoire de Génétique Humaine, Développement et Cancer, Université Victor Segalen Bordeaux 2, France.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Karaca M, Bilgen M, Onus AN, Ince AG, Elmasulu SY. Exact tandem repeats analyzer (E-TRA): a new program for DNA sequence mining. J Genet 2005; 84:49-54. [PMID: 15876583 DOI: 10.1007/bf02715889] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Exact Tandem Repeats Analyzer 1.0 (E-TRA) combines sequence motif searches with keywords such as 'organs', 'tissues', 'cell lines' and 'development stages' for finding simple exact tandem repeats as well as non-simple repeats. E-TRA has several advanced repeat search parameters/options compared to other repeat finder programs as it not only accepts GenBank, FASTA and expressed sequence tags (EST) sequence files, but also does analysis of multiple files with multiple sequences. The minimum and maximum tandem repeat motif lengths that E-TRA finds vary from one to one thousand. Advanced user defined parameters/options let the researchers use different minimum motif repeats search criteria for varying motif lengths simultaneously. One of the most interesting features of genomes is the presence of relatively short tandem repeats (TRs). These repeated DNA sequences are found in both prokaryotes and eukaryotes, distributed almost at random throughout the genome. Some of the tandem repeats play important roles in the regulation of gene expression whereas others do not have any known biological function as yet. Nevertheless, they have proven to be very beneficial in DNA profiling and genetic linkage analysis studies. To demonstrate the use of E-TRA, we used 5,465,605 human EST sequences derived from 18,814,550 GenBank EST sequences. Our results indicated that 12.44% (679,800) of the human EST sequences contained simple and non-simple repeat string patterns varying from one to 126 nucleotides in length. The results also revealed that human organs, tissues, cell lines and different developmental stages differed in number of repeats as well as repeat composition, indicating that the distribution of expressed tandem repeats among tissues or organs are not random, thus differing from the un-transcribed repeats found in genomes.
Collapse
Affiliation(s)
- Mehmet Karaca
- Faculty of Agriculture, Akdeniz University, 07059 Antalya, Turkey.
| | | | | | | | | |
Collapse
|