1
Li W, Almirantis Y, Provata A. Range-limited Heaps' law for functional DNA words in the human genome. J Theor Biol 2024; 592:111878. [PMID: 38901778 DOI: 10.1016/j.jtbi.2024.111878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/12/2023] [Revised: 05/31/2024] [Accepted: 06/10/2024] [Indexed: 06/22/2024]
Abstract
Heaps' or Herdan-Heaps' law is a linguistic law describing the relationship between the vocabulary/dictionary size (type) and word counts (token) to be a power-law function. Its existence in genomes with certain definition of DNA words is unclear partly because the dictionary size in genome could be much smaller than that in a human language. We define a DNA word as a coding region in a genome that codes for a protein domain. Using human chromosomes and chromosome arms as individual samples, we establish the existence of Heaps' law in the human genome within limited range. Our definition of words in a genomic or proteomic context is different from other definitions such as over-represented k-mers which are much shorter in length. Although an approximate power-law distribution of protein domain sizes due to gene duplication and the related Zipf's law is well known, their translation to the Heaps' law in DNA words is not automatic. Several other animal genomes are shown herein also to exhibit range-limited Heaps' law with our definition of DNA words, though with various exponents. When tokens were randomly sampled and sample sizes reach to the maximum level, a deviation from the Heaps' law was observed, but a quadratic regression in log-log type-token plot fits the data perfectly. Investigation of type-token plot and its regression coefficients could provide an alternative narrative of reusage and redundancy of protein domains as well as creation of new protein domains from a linguistic perspective.
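The type-token relationship described in this abstract, roughly V ≈ K·N^β for V distinct types among N tokens, can be checked with a least-squares fit in log-log space. The following is a minimal Python sketch and not the authors' code: the function names `type_token_curve` and `fit_heaps` are hypothetical, and the tokens are assumed to be any hashable labels standing in for protein-domain occurrences. It fits only the linear (power-law) part; the quadratic log-log term mentioned in the abstract is not included.

```python
import math
import random


def type_token_curve(tokens):
    """Return (n_tokens, n_types) pairs recorded after each token is read."""
    seen = set()
    curve = []
    for i, tok in enumerate(tokens, start=1):
        seen.add(tok)
        curve.append((i, len(seen)))
    return curve


def fit_heaps(curve):
    """Ordinary least-squares fit of log V = log K + beta * log N.

    `curve` is a sequence of (N, V) pairs with N > 0 and V > 0 and at
    least two distinct N values. Returns (K, beta).
    """
    xs = [math.log(n) for n, _ in curve]
    ys = [math.log(v) for _, v in curve]
    m = len(xs)
    mean_x = sum(xs) / m
    mean_y = sum(ys) / m
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    K = math.exp(mean_y - beta * mean_x)
    return K, beta


if __name__ == "__main__":
    # Illustrative synthetic data only: tokens drawn from a Zipf-like
    # distribution over 500 hypothetical "domain" labels.
    random.seed(0)
    labels = list(range(1, 501))
    weights = [1.0 / r for r in labels]
    tokens = random.choices(labels, weights=weights, k=5000)
    K, beta = fit_heaps(type_token_curve(tokens))
    print(f"K = {K:.3f}, beta = {beta:.3f}")
```

Restricting `curve` to a sub-range of N before calling `fit_heaps` is one simple way to examine a range-limited fit of the kind the abstract describes.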
Affiliation(s)
- Wentian Li
  - Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA
  - The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
- Yannis Almirantis
  - Theoretical Biology and Computational Genomics Laboratory, Institute of Bioscience and Applications, National Center for Scientific Research "Demokritos", 15341 Athens, Greece
- Astero Provata
  - Statistical Mechanics and Dynamical Systems Laboratory, Institute of Nanoscience and Nanotechnology, National Center for Scientific Research "Demokritos", 15341 Athens, Greece
2
Ren H, Li Y, Huang T. Anomaly Detection Models for SARS-CoV-2 Surveillance Based on Genome k-mers. Microorganisms 2023; 11:2773. [PMID: 38004784 PMCID: PMC10673111 DOI: 10.3390/microorganisms11112773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2023] [Revised: 11/06/2023] [Accepted: 11/10/2023] [Indexed: 11/26/2023] [Open Access]
Abstract
Since COVID-19 has brought great challenges to global public health governance, developing methods that track the evolution of the virus over the course of an epidemic or pandemic is useful for public health. This paper uses anomaly detection models to analyze SARS-CoV-2 virus genome k-mers to predict possible new critical variants in the collected samples. We used the sample data from Argentina, China and Portugal obtained from the Global Initiative on Sharing All Influenza Data (GISAID) to conduct multiple rounds of evaluation on several anomaly detection models, to verify the feasibility of this virus early warning and surveillance idea and find appropriate anomaly detection models for actual epidemic surveillance. Through multiple rounds of model testing, we found that the LUNAR (learnable unified neighborhood-based anomaly ranking) and LUNAR+LUNAR stacking model performed well in new critical variants detection. The results of simulated dynamic detection validate the feasibility of this approach, which can help efficiently monitor samples in local areas.
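To make the idea of k-mer features for anomaly detection concrete, the sketch below turns a genome string into a normalized k-mer frequency vector and scores a query by its mean distance to its nearest reference vectors. This is an illustration under stated assumptions, not the paper's pipeline: the function names are hypothetical, the value of k is arbitrary, and the k-nearest-neighbour distance score is a simple stand-in for the LUNAR models the abstract names.

```python
from collections import Counter
from itertools import product
import math


def kmer_vector(seq, k=3):
    """Normalized frequencies of all 4**k DNA k-mers in `seq`.

    Windows containing characters other than A, C, G, T (for example N)
    are not in the vocabulary and are therefore ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    vocab = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[w] for w in vocab)
    return [counts[w] / total if total else 0.0 for w in vocab]


def knn_score(query, references, n_neighbors=2):
    """Mean Euclidean distance from `query` to its nearest references.

    A larger score means the query is further from the known samples.
    Stand-in scoring rule for illustration only (not LUNAR).
    """
    if not references:
        raise ValueError("at least one reference vector is required")
    dists = sorted(math.dist(query, ref) for ref in references)
    nearest = dists[:n_neighbors]
    return sum(nearest) / len(nearest)


if __name__ == "__main__":
    # Toy sequences, not real SARS-CoV-2 genomes.
    known = [kmer_vector(s, k=2) for s in ("ACGTACGTAC", "ACGTACGTAG")]
    candidate = kmer_vector("TTTTTTTTTT", k=2)
    print(knn_score(candidate, known, n_neighbors=1))
```

In practice a threshold on the score would have to be chosen from labelled data; none is suggested here.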
Affiliation(s)
- Haotian Ren
  - Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Yixue Li
  - Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
  - Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
  - Guangzhou Laboratory, Guangzhou 510005, China
  - School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
  - Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai 200433, China
- Tao Huang
  - Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
3
Xue L, Moreira JD, Smith KK, Fetterman JL. The Mighty NUMT: Mitochondrial DNA Flexing Its Code in the Nuclear Genome. Biomolecules 2023; 13:753. [PMID: 37238623 PMCID: PMC10216076 DOI: 10.3390/biom13050753] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 03/15/2023] [Revised: 04/07/2023] [Accepted: 04/24/2023] [Indexed: 05/28/2023] [Open Access]
Abstract
Nuclear-mitochondrial DNA segments (NUMTs) are mitochondrial DNA (mtDNA) fragments that have been inserted into the nuclear genome. Some NUMTs are common within the human population but most NUMTs are rare and specific to individuals. NUMTs range in size from 24 base pairs to encompassing nearly the entire mtDNA and are found throughout the nuclear genome. Emerging evidence suggests that the formation of NUMTs is an ongoing process in humans. NUMTs contaminate sequencing results of the mtDNA by introducing false positive variants, particularly heteroplasmic variants present at a low variant allele frequency (VAF). In our review, we discuss the prevalence of NUMTs in the human population, the potential mechanisms of de novo NUMT insertion via DNA repair mechanisms, and provide an overview of the existing approaches for minimizing NUMT contamination. Apart from filtering known NUMTs, both wet lab-based and computational methods can be used to minimize the contamination of NUMTs in analyses of human mtDNA. Current approaches include: (1) isolating mitochondria to enrich for mtDNA; (2) applying basic local alignment to identify NUMTs for subsequent filtering; (3) bioinformatic pipelines for NUMT detection; (4) k-mer-based NUMT detection; and (5) filtering candidate false positive variants by mtDNA copy number, VAF, or sequence quality score. Multiple approaches must be applied in order to effectively identify NUMTs in samples. Although next-generation sequencing is revolutionizing our understanding of heteroplasmic mtDNA, it also raises new challenges with the high prevalence and individual-specific NUMTs that need to be handled with care in studies of mitochondrial genetics.
Affiliation(s)
- Liying Xue
  - Evans Department of Medicine, Boston University Chobanian & Avedisian School of Medicine, Boston, MA 02118, USA
- Jesse D. Moreira
  - Department of Health Sciences, Programs in Human Physiology, Boston University Sargent College, Boston, MA 02215, USA
- Karan K. Smith
  - Evans Department of Medicine, Boston University Chobanian & Avedisian School of Medicine, Boston, MA 02118, USA
- Jessica L. Fetterman
  - Evans Department of Medicine, Boston University Chobanian & Avedisian School of Medicine, Boston, MA 02118, USA
4
Becher H, Nichols RA. Assembly-free quantification of vagrant DNA inserts. Mol Ecol Resour 2023. [PMID: 36740932 DOI: 10.1111/1755-0998.13764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2022] [Revised: 01/27/2023] [Accepted: 01/30/2023] [Indexed: 02/07/2023]
Abstract
Inserts of DNA from extranuclear sources, such as organelles and microbes, are common in eukaryote nuclear genomes. However, sequence similarity between the nuclear and extranuclear DNA, and a history of multiple insertions, make the assembly of these regions challenging. Consequently, the number, sequence and location of these vagrant DNAs cannot be reliably inferred from the genome assemblies of most organisms. We introduce two statistical methods to estimate the abundance of nuclear inserts even in the absence of a nuclear genome assembly. The first (intercept method) only requires low-coverage (<1×) sequencing data, as commonly generated for population studies of organellar and ribosomal DNAs. The second method additionally requires that a subset of the individuals carry extranuclear DNA with diverged genotypes. We validated our intercept method using simulations and by re-estimating the frequency of human NUMTs (nuclear mitochondrial inserts). We then applied it to the grasshopper Podisma pedestris, exceptional for both its large genome size and reports of numerous NUMT inserts, estimating that NUMTs make up 0.056% of the nuclear genome, equivalent to >500 times the mitochondrial genome size. We also re-analysed a museomics data set of the parrot Psephotellus varius, obtaining an estimate of only 0.0043%, in line with reports from other species of bird. Our study demonstrates the utility of low-coverage high-throughput sequencing data for the quantification of nuclear vagrant DNAs. Beyond quantifying organellar inserts, these methods could also be used on endosymbiont-derived sequences. We provide an R implementation of our methods called "vagrantDNA" and code to simulate test data sets.
Affiliation(s)
- Hannes Becher
  - Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
- Richard A Nichols
  - School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
5
Liu Z, Simayijiang H, Wang Q, Yang J, Sun H, Wu R, Yan J. DNA and protein analyses of hair in forensic genetics. Int J Legal Med 2023; 137:613-633. [PMID: 36732435 DOI: 10.1007/s00414-023-02955-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2022] [Accepted: 01/20/2023] [Indexed: 02/04/2023]
Abstract
Hair is one of the most common pieces of biological evidence found at a crime scene and plays an essential role in forensic investigation. Hairs, especially non-follicular hairs, are usually found at various crime scenes, either by natural shedding or by forcible shedding. However, the genetic material in hairs is usually highly degraded, which makes forensic analysis difficult. As a result, the value of hair has not been fully exploited in forensic investigations and trials. In recent years, with advances in molecular biology, forensic analysis of hair has achieved remarkable strides and provided crucial clues in numerous cases. This article reviews recent developments in DNA and protein analysis of hair and attempts to provide a comprehensive solution to improve forensic hair analysis.
Affiliation(s)
- Zhiyong Liu
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
- Halimureti Simayijiang
  - School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China
- Qiangwei Wang
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
- Jingyi Yang
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
- Hongyu Sun
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
  - Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
- Riga Wu
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
  - Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
- Jiangwei Yan
  - School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China
6
Zverinova S, Guryev V. Variant calling: Considerations, practices, and developments. Hum Mutat 2021; 43:976-985. [PMID: 34882898 PMCID: PMC9545713 DOI: 10.1002/humu.24311] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 05/25/2021] [Revised: 11/02/2021] [Accepted: 12/03/2021] [Indexed: 11/10/2022]
Abstract
The success of many clinical, association, or population genetics studies critically relies on properly performed variant calling step. The variety of modern genomics protocols, techniques, and platforms makes our choices of methods and algorithms difficult and there is no "one size fits all" solution for study design and data analysis. In this review, we discuss considerations that need to be taken into account while designing the study and preparing for the experiments. We outline the variety of variant types that can be detected using sequencing approaches and highlight some specific requirements and basic principles of their detection. Finally, we cover interesting developments that enable variant calling for a broad range of applications in the genomics field. We conclude by discussing technological and algorithmic advances that have the potential to change the ways of calling DNA variants in the nearest future.
Affiliation(s)
- Stepanka Zverinova
  - European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
- Victor Guryev
  - European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
7
ULGEN A, ÇETİN Ş, BALCI P, ŞIVGIN H, ŞIVGIN S, ÇETİN M, Lİ W. COVID-19 outpatients and surviving inpatients exhibit comparable blood test results that are distinct from non-surviving inpatients. Journal of Health Sciences and Medicine 2021. [DOI: 10.32322/jhsm.900462] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 01/22/2023] [Open Access]
8
Distinguishing mitochondrial DNA and NUMT sequences amplified with the precision ID mtDNA whole genome panel. Mitochondrion 2020; 55:122-133. [PMID: 32949792 DOI: 10.1016/j.mito.2020.09.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 03/11/2020] [Revised: 08/31/2020] [Accepted: 09/04/2020] [Indexed: 11/22/2022]
Abstract
Nuclear mitochondrial DNA segments (NUMTs) are generated via transfer of portions of the mitochondrial genome into the nuclear genome. Given their common origin, there is the possibility that both the mitochondrial and NUMT segments may co-amplify using the same set of primers. Thus, analysis of the variation of the mitochondrial genome must take into account this co-amplification of mitochondrial and NUMT sequences. The study herein builds on data from the study by Strobl et al. (Strobl et al., 2019), in which multiple point heteroplasmies were called with an "N" to prevent labeling NUMT sequences mimicking mitochondrial heteroplasmy and being interpreted as true mitochondrial in origin sequence variants. Each of these point heteroplasmies was studied in greater detail, both molecularly and bioinformatically, to determine whether NUMT or true mitochondrial DNA variation was present. The bioinformatic and molecular tools available to help distinguish between NUMT and mitochondrial DNA and the effect of NUMT sequences on interpretation were discussed.
9
Li Q, Ren Y, Xiang D, Shi X, Zhao J, Peng L, Zhao G. Comparative mitogenome analysis of two ectomycorrhizal fungi (Paxillus) reveals gene rearrangement, intron dynamics, and phylogeny of basidiomycetes. IMA Fungus 2020; 11:12. [PMID: 32670777 PMCID: PMC7333402 DOI: 10.1186/s43008-020-00038-8] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Received: 10/22/2019] [Accepted: 06/23/2020] [Indexed: 02/07/2023] [Open Access]
Abstract
In this study, the mitogenomes of two Paxillus species were assembled, annotated and compared. The two mitogenomes of Paxillus involutus and P. rubicundulus comprised circular DNA molecules, with the size of 39,109 bp and 41,061 bp, respectively. Evolutionary analysis revealed that the nad4L gene had undergone strong positive selection in the two Paxillus species. In addition, 10.64 and 36.50% of the repetitive sequences were detected in the mitogenomes of P. involutus and P. rubicundulus, respectively, which might transfer between mitochondrial and nuclear genomes. Large-scale gene rearrangements and frequent intron gain/loss events were detected in 61 basidiomycete species, which revealed large variations in mitochondrial organization and size in Basidiomycota. In addition, the insertion sites of the basidiomycete introns were found to have a base preference. Phylogenetic analysis of the combined mitochondrial gene set gave identical and well-supported tree topologies, indicating that mitochondrial genes were reliable molecular markers for analyzing the phylogenetic relationships of Basidiomycota. This study is the first report on the mitogenomes of Paxillus, which will promote a better understanding of their contrasted ecological strategies, molecular evolution and phylogeny of these important ectomycorrhizal fungi and related basidiomycete species.
Affiliation(s)
- Qiang Li
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
- Yuanhang Ren
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
- Dabing Xiang
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
- Xiaodong Shi
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
- Jianglin Zhao
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
- Lianxin Peng
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
  - Present address: Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, 2025 # Chengluo Avenue, Chengdu, 610106 Sichuan China
- Gang Zhao
  - Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106 Sichuan China
10
Phylogenetic Analysis of HIV-1 Genomes Based on the Position-Weighted K-mers Method. Entropy 2020; 22:e22020255. [PMID: 33286029 PMCID: PMC7516702 DOI: 10.3390/e22020255] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 01/17/2020] [Revised: 02/07/2020] [Accepted: 02/20/2020] [Indexed: 12/31/2022]
Abstract
HIV-1 viruses, which are predominant in the family of HIV viruses, have strong pathogenicity and infectivity. They can evolve into many different variants in a very short time. In this study, we propose a new and effective alignment-free method for the phylogenetic analysis of HIV-1 viruses using complete genome sequences. Our method combines the position distribution information and the counts of the k-mers together. We also propose a metric to determine the optimal k value. We name our method the Position-Weighted k-mers (PWkmer) method. Validation and comparison with the Robinson-Foulds distance method and the modified bootstrap method on a benchmark dataset show that our method is reliable for the phylogenetic analysis of HIV-1 viruses. PWkmer can resolve within-group variations for different known subtypes of Group M of HIV-1 viruses. This method is simple and computationally fast for whole genome phylogenetic analysis.
11
Talotta R, Sarzi-Puttini P, Laska MJ, Atzeni F. Retrotransposons shuttling genetic and epigenetic information from the nuclear to the mitochondrial compartment: Do they play a pathogenetic role in scleroderma? Cytokine Growth Factor Rev 2019; 49:42-58. [PMID: 31677967 DOI: 10.1016/j.cytogfr.2019.10.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2019] [Revised: 10/09/2019] [Accepted: 10/10/2019] [Indexed: 10/25/2022]
Abstract
Endogenous retroelements are a class of ancient defective viral insertions contained in the genome of host cells, where they account for up to 40% of all DNA. Centuries of co-existence in host genome have led to the development of immunotolerance to endogenous retroelements, most of which are defective and unable to replicate or transcribe functional proteins. However, given their capacity to move across the nuclear and mitochondrial genome and recombine, they could mix phenotypes and give rise to infections that may trigger innate and adaptive immune responses by sensing receptors capable of recognising foreign nucleic acids and proteins. It has recently been suggested that they play a role in the pathogenesis of autoimmune diseases on the grounds of their partial reactivation or the epigenetic control of host gene transcription. A number of studies have confirmed their contribution to the development of rheumatoid arthritis, multiple sclerosis and systemic lupus erythematosus, but there is still a lack of data concerning systemic sclerosis (SSc). Their role in the pathogenesis of SSc can be hypothesised on the basis of mitochondrial and nuclear chromatinic damage, and hyper-activation of the immune pathway involved in antiviral defense. SSc is characterised by genetic and immunological evidence of a viral infection but, as no viral agent has yet been isolated from SSc patients, the hypothesis that partial reactivation of endogenous retroviruses may trigger the disease cannot be excluded and deserves further investigation.
Affiliation(s)
- Rossella Talotta
  - Rheumatology Unit, University of Messina, Azienda Ospedaliera Gaetano Martino, Via Consolare Valeria 1, 98100 Messina, Italy
- Piercarlo Sarzi-Puttini
  - Rheumatology Unit, University Hospital ASST-Fatebenefratelli-Sacco, Via G.B Grassi 74, 20157 Milan, Italy
- Fabiola Atzeni
  - Rheumatology Unit, University of Messina, Azienda Ospedaliera Gaetano Martino, Via Consolare Valeria 1, 98100 Messina, Italy
12
Smart U, Budowle B, Ambers A, Soares Moura-Neto R, Silva R, Woerner AE. A novel phylogenetic approach for de novo discovery of putative nuclear mitochondrial (pNumt) haplotypes. Forensic Sci Int Genet 2019; 43:102146. [DOI: 10.1016/j.fsigen.2019.102146] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 06/07/2019] [Revised: 08/09/2019] [Accepted: 08/13/2019] [Indexed: 10/26/2022]
13
Strobl C, Churchill Cihlar J, Lagacé R, Wootton S, Roth C, Huber N, Schnaller L, Zimmermann B, Huber G, Lay Hong S, Moura-Neto R, Silva R, Alshamali F, Souto L, Anslinger K, Egyed B, Jankova-Ajanovska R, Casas-Vargas A, Usaquén W, Silva D, Barletta-Carrillo C, Tineo DH, Vullo C, Würzner R, Xavier C, Gusmão L, Niederstätter H, Bodner M, Budowle B, Parson W. Evaluation of mitogenome sequence concordance, heteroplasmy detection, and haplogrouping in a worldwide lineage study using the Precision ID mtDNA Whole Genome Panel. Forensic Sci Int Genet 2019; 42:244-251. [PMID: 31382159 DOI: 10.1016/j.fsigen.2019.07.013] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Received: 06/03/2019] [Revised: 07/09/2019] [Accepted: 07/21/2019] [Indexed: 12/24/2022]
Abstract
The emergence of Massively Parallel Sequencing technologies enabled the analysis of full mitochondrial (mt)DNA sequences from forensically relevant samples that have, so far, only been typed in the control region or its hypervariable segments. In this study, we evaluated the performance of a commercially available multiplex-PCR-based assay, the Precision ID mtDNA Whole Genome Panel (Thermo Fisher Scientific), for the amplification and sequencing of the entire mitochondrial genome (mitogenome) from even degraded forensic specimens. For this purpose, more than 500 samples from 24 different populations were selected to cover the vast majority of established superhaplogroups. These are known to harbor different signature sequence motifs corresponding to their phylogenetic background that could have an effect on primer binding and, thus, could limit a broad application of this molecular genetic tool. The selected samples derived from various forensically relevant tissue sources and were DNA extracted using different methods. We evaluated sequence concordance and heteroplasmy detection and compared the findings to conventional Sanger sequencing as well as an orthogonal MPS platform. We discuss advantages and limitations of this approach with respect to forensic genetic workflow and analytical requirements.
Affiliation(s)
- Christina Strobl
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Robert Lagacé
  - Human Identification Group, ThermoFisher Scientific, San Francisco, CA, USA
- Sharon Wootton
  - Human Identification Group, ThermoFisher Scientific, San Francisco, CA, USA
- Chantal Roth
  - Human Identification Group, ThermoFisher Scientific, San Francisco, CA, USA
- Nicole Huber
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Lisa Schnaller
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Bettina Zimmermann
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Gabriela Huber
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Seah Lay Hong
  - School of Health Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
- Rodrigo Moura-Neto
  - Laboratório de Biologia Molecular Forense, Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
- Rosane Silva
  - Instituto de Biofisica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
- Farida Alshamali
  - Dubai Police, Gen. Dept. Forensic Science & Criminology, Dubai, United Arab Emirates
- Luis Souto
  - Laboratorio de Genética Aplicada, Departamento de Biologia, Universidade de Aveiro, Portugal
- Balazs Egyed
  - Department of Genetics, ELTE Eötvös Loránd University, Budapest, Hungary
- Renata Jankova-Ajanovska
  - Institute of Forensic Medicine, Criminalistic and Medical Deontology, Medical Faculty, University "St. Cyril and Methodius", Skopje, Macedonia
- Andrea Casas-Vargas
  - Group of Population Genetics and Identification, Genetics Institute, National University of Colombia, Bogotá, Colombia
- Wiliam Usaquén
  - Group of Population Genetics and Identification, Genetics Institute, National University of Colombia, Bogotá, Colombia
- Dayse Silva
  - DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
- Dean Herman Tineo
  - Universidad Nacional Mayor de San Marcos, Instituto de Medicina Legal del Perú, Lima, Peru
- Carlos Vullo
  - DNA Forensic Laboratory, Argentinean Forensic Anthropology team (EAAF), Córdoba, Argentina
- Reinhard Würzner
  - Division of Hygiene & Med. Microbiology, Medical University of Innsbruck, Austria
- Catarina Xavier
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Leonor Gusmão
  - DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
- Harald Niederstätter
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Martin Bodner
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Bruce Budowle
  - Center for Human Identification, University of North Texas Health Science Center, TX, USA
- Walther Parson
  - Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
  - Forensic Science Program, The Pennsylvania State University, University Park, PA, USA