1
|
Sun W, Wang Z, Wen S, Huang A, Li H, Jiang L, Feng Q, Fan D, Tian Q, Han D, Liu X. Technical strategy for monozygotic twin discrimination by single-nucleotide variants. Int J Legal Med 2024; 138:767-779. [PMID: 38197923 DOI: 10.1007/s00414-023-03150-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 12/11/2023] [Indexed: 01/11/2024]
Abstract
Monozygotic (MZ) twins are theoretically genetically identical. Although they are revealed to accumulate mutations after the zygote splits, discriminating between twin genomes remains a formidable challenge in the field of forensic genetics. Single-nucleotide variants (SNVs) are responsible for a substantial portion of genetic variation, thus potentially serving as promising biomarkers for the identification of MZ twins. In this study, we sequenced the whole genome of a pair of female MZ twins when they were 27 and 33 years old to approximately 30 × coverage using peripheral blood on an Illumina NovaSeq 6000 Sequencing System. Potentially discordant SNVs supported by whole-genome sequencing were validated extensively by amplicon-based targeted deep sequencing and Sanger sequencing. In total, we found nine bona fide post-twinning SNVs, all of which were identified in the younger genomes and found in the older genomes. None of the SNVs occurred within coding exons, three of which were observed in introns, supported by whole-exome sequencing results. A double-blind test was employed, and the reliability of MZ twin discrimination by discordant SNVs was endorsed. All SNVs were successfully detected when input DNA amounts decreased to 0.25 ng, and reliable detection was limited to seven SNVs below 0.075 ng input. This comprehensive analysis confirms that SNVs could serve as cost-effective biomarkers for MZ twin discrimination.
Collapse
Affiliation(s)
- Weifen Sun
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Ziwei Wang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
- Department of Forensic Science, Medical School of Soochow University, Suzhou, 215123, China
| | - Shubo Wen
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
- Department of Forensic Science, Medical School of Soochow University, Suzhou, 215123, China
| | - Ao Huang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
- Department of Forensic Science, Medical School of Soochow University, Suzhou, 215123, China
| | - Hui Li
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
| | - Lei Jiang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China
| | - Qi Feng
- National Center for Gene Research, State Key Laboratory of Plant Molecular Genetics, Center of Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200233, China
| | - Danlin Fan
- National Center for Gene Research, State Key Laboratory of Plant Molecular Genetics, Center of Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200233, China
| | - Qilin Tian
- National Center for Gene Research, State Key Laboratory of Plant Molecular Genetics, Center of Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200233, China
| | - Dingding Han
- Department of Clinical Laboratory, Shanghai Children's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200062, China
| | - Xiling Liu
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, Shanghai, 200063, China.
| |
Collapse
|
2
|
Conti A, Casagrande Pierantoni D, Robert V, Corte L, Cardinali G. MinION Sequencing of Yeast Mock Communities To Assess the Effect of Databases and ITS-LSU Markers on the Reliability of Metabarcoding Analysis. Microbiol Spectr 2023; 11:e0105222. [PMID: 36519933 PMCID: PMC9927109 DOI: 10.1128/spectrum.01052-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Microbial communities play key roles both for humans and the environment. They are involved in ecosystem functions, maintaining their stability, and provide important services, such as carbon cycle and nitrogen cycle. Acting both as symbionts and as pathogens, description of the structure and composition of these communities is important. Metabarcoding uses ribosomal DNA (rDNA) (eukaryotic) or rRNA gene (prokaryotic) sequences for identification of species present in a site and measuring their abundance. This procedure requires several technical steps that could be source of bias producing a distorted view of the real community composition. In this work, we took advantage of an innovative "long-read" next-generation sequencing (NGS) technology (MinION) amplifying the DNA spanning from the internal transcribed spacer (ITS) to large subunit (LSU) that can be read simultaneously in this platform, providing more information than "short-read" systems. The experimental system consisted of six fungal mock communities composed of species present at various relative amounts to mimic natural situations characterized by predominant and low-frequency species. The influence of the sequencing platform (MinION and Illumina MiSeq) and the effect of different reference databases and marker sequences on metagenomic identification of species were evaluated. The results showed that the ITS-based database provided more accurate species identification than LSU. Furthermore, a procedure based on a preliminary identification with standard reference databases followed by the production of custom databases, including only the best outputs of the first step, is proposed. This additional step improved the estimate of species proportion of the mock communities and reduced the number of ghost species not really present in the simulated communities. IMPORTANCE Metagenomic analyses are fundamental in many research areas; therefore, improvement of methods and protocols for the description of microbial communities becomes more and more necessary. Long-read sequencing could be used for reducing biases due to the multicopy nature of rDNA sequences and short-read limitations. However, these novel technologies need to be assessed and standardized with controlled experiments, such as mock communities. The interest behind this work was to evaluate how long reads performed identification and quantification of species mixed in precise proportions and how the choice of database affects such analyses. Development of a pipeline that mitigates the effect of the barcoding sequences and the impact of the reference database on metagenomic analyses can help microbiome studies go one step further.
Collapse
Affiliation(s)
- Angela Conti
- Department of Pharmaceutical Sciences, University of Perugia, Perugia, Italy
| | | | - Vincent Robert
- Westerdjik Institute for Biodiversity, Utrecht, Netherlands
| | - Laura Corte
- Department of Pharmaceutical Sciences, University of Perugia, Perugia, Italy
- CEMIN Excellence Research Centre, Perugia, Italy
| | - Gianluigi Cardinali
- Department of Pharmaceutical Sciences, University of Perugia, Perugia, Italy
- CEMIN Excellence Research Centre, Perugia, Italy
| |
Collapse
|
3
|
Pruski P, Lewis HV, Lee YS, Marchesi JR, Bennett PR, Takats Z, MacIntyre DA. Assessment of microbiota:host interactions at the vaginal mucosa interface. Methods 2018; 149:74-84. [PMID: 29705211 DOI: 10.1016/j.ymeth.2018.04.022] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Revised: 03/10/2018] [Accepted: 04/22/2018] [Indexed: 12/12/2022] Open
Abstract
There is increasing appreciation of the role that vaginal microbiota play in health and disease throughout a woman's lifespan. This has been driven partly by molecular techniques that enable detailed identification and characterisation of microbial community structures. However, these methods do not enable assessment of the biochemical and immunological interactions between host and vaginal microbiota involved in pathophysiology. This review examines our current knowledge of the relationships that exist between vaginal microbiota and the host at the level of the vaginal mucosal interface. We also consider methodological approaches to microbiomic, immunologic and metabolic profiling that permit assessment of these interactions. Integration of information derived from these platforms brings the potential for biomarker discovery, disease risk stratification and improved understanding of the mechanisms regulating vaginal microbial community dynamics in health and disease.
Collapse
Affiliation(s)
- Pamela Pruski
- Division of Computational and Systems Medicine, Department of Surgery and Cancer, Imperial College London, London SW7 2AZ, UK
| | - Holly V Lewis
- Imperial College Parturition Research Group, Institute of Reproductive and Developmental Biology, Department of Surgery and Cancer, Imperial College London, London W12 0NN, UK; Queen Charlotte's Hospital, Imperial College Healthcare National Health Service (NHS) Trust, London W12 0HS, UK
| | - Yun S Lee
- Imperial College Parturition Research Group, Institute of Reproductive and Developmental Biology, Department of Surgery and Cancer, Imperial College London, London W12 0NN, UK
| | - Julian R Marchesi
- Department of Biosciences, Cardiff University, Cardiff CF10 3AX, UK; Centre for Digestive and Gut Health, Surgery and Cancer, Imperial College London, London W2 1NY, UK
| | - Phillip R Bennett
- Division of Computational and Systems Medicine, Department of Surgery and Cancer, Imperial College London, London SW7 2AZ, UK; Imperial College Parturition Research Group, Institute of Reproductive and Developmental Biology, Department of Surgery and Cancer, Imperial College London, London W12 0NN, UK
| | - Zoltan Takats
- Division of Computational and Systems Medicine, Department of Surgery and Cancer, Imperial College London, London SW7 2AZ, UK
| | - David A MacIntyre
- Imperial College Parturition Research Group, Institute of Reproductive and Developmental Biology, Department of Surgery and Cancer, Imperial College London, London W12 0NN, UK.
| |
Collapse
|
4
|
Mohrbeck I, Raupach MJ, Martínez Arbizu P, Knebelsberger T, Laakmann S. High-Throughput Sequencing-The Key to Rapid Biodiversity Assessment of Marine Metazoa? PLoS One 2015; 10:e0140342. [PMID: 26479071 PMCID: PMC4610693 DOI: 10.1371/journal.pone.0140342] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2014] [Accepted: 09/24/2015] [Indexed: 02/03/2023] Open
Abstract
The applications of traditional morphological and molecular methods for species identification are greatly restricted by processing speed and on a regional or greater scale are generally considered unfeasible. In this context, high-throughput sequencing, or metagenetics, has been proposed as an efficient tool to document biodiversity. Here we evaluated the effectiveness of 454 pyrosequencing in marine metazoan community analysis using the 18S rDNA: V1-V2 region. Multiplex pyrosequencing of the V1-V2 region was used to analyze two pooled samples of DNA, one comprising 118 and the other 37 morphologically identified species, and one natural sample taken directly from a North Sea zooplankton community. A DNA reference library comprising all species represented in the pooled samples was created by Sanger sequencing, and this was then used to determine the optimal similarity threshold for species delineation. The optimal threshold was found at 99% species similarity, with 85% identification success. Pyrosequencing was able to identify between fewer species: 67% and 78% of the species in the two pooled samples. Also, a large number of sequences for three species that were not included in the pooled samples were amplified by pyrosequencing, suggesting preferential amplification of some genotypes and the sensitivity of this approach to even low levels of contamination. Conversely, metagenetic analysis of the natural zooplankton sample identified many more species (particularly gelatinous zooplankton and meroplankton) than morphological analysis of a formalin-fixed sample from the same sampling site, suggesting an increased level of taxonomic resolution with pyrosequencing. The study demonstrated that, based on the V1-V2 region, 454 sequencing does not provide accurate species differentiation and reliable taxonomic classification, as it is required in most biodiversity monitoring. The analysis of artificially prepared samples indicated that species detection in pyrosequencing datasets is complicated by potential PCR-based biases and that the V1-V2 marker is poorly resolved for some taxa.
Collapse
Affiliation(s)
- Inga Mohrbeck
- Department German Center for Marine Biodiversity Research, Senckenberg am Meer, Wilhelmshaven, Germany
| | - Michael J Raupach
- Department German Center for Marine Biodiversity Research, Senckenberg am Meer, Wilhelmshaven, Germany
| | - Pedro Martínez Arbizu
- Department German Center for Marine Biodiversity Research, Senckenberg am Meer, Wilhelmshaven, Germany
| | - Thomas Knebelsberger
- Department German Center for Marine Biodiversity Research, Senckenberg am Meer, Wilhelmshaven, Germany
| | - Silke Laakmann
- Department German Center for Marine Biodiversity Research, Senckenberg am Meer, Wilhelmshaven, Germany
| |
Collapse
|
5
|
Brooks JP, Edwards DJ, Harwich MD, Rivera MC, Fettweis JM, Serrano MG, Reris RA, Sheth NU, Huang B, Girerd P, Strauss JF, Jefferson KK, Buck GA. The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies. BMC Microbiol 2015; 15:66. [PMID: 25880246 PMCID: PMC4433096 DOI: 10.1186/s12866-015-0351-6] [Citation(s) in RCA: 316] [Impact Index Per Article: 35.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Accepted: 01/16/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the "true" composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits. RESULTS We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions. CONCLUSIONS Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples.
Collapse
Affiliation(s)
- J Paul Brooks
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, 23284-3083, Richmond, VA, USA. .,Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - David J Edwards
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, 23284-3083, Richmond, VA, USA.
| | - Michael D Harwich
- Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Maria C Rivera
- Department of Biology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Jennifer M Fettweis
- Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Myrna G Serrano
- Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA. .,Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Robert A Reris
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, 23284-3083, Richmond, VA, USA.
| | - Nihar U Sheth
- Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Bernice Huang
- Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Philippe Girerd
- Department of Obstetrics and Gynecology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | | | - Jerome F Strauss
- Department of Obstetrics and Gynecology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Kimberly K Jefferson
- Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA. .,Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| | - Gregory A Buck
- Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA. .,Department of Microbiology and Immunology, Virginia Commonwealth University, 23284, Richmond, VA, USA.
| |
Collapse
|
6
|
Pinto AJ, Raskin L. PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets. PLoS One 2012; 7:e43093. [PMID: 22905208 PMCID: PMC3419673 DOI: 10.1371/journal.pone.0043093] [Citation(s) in RCA: 272] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2012] [Accepted: 07/17/2012] [Indexed: 11/19/2022] Open
Abstract
As 16S rRNA gene targeted massively parallel sequencing has become a common tool for microbial diversity investigations, numerous advances have been made to minimize the influence of sequencing and chimeric PCR artifacts through rigorous quality control measures. However, there has been little effort towards understanding the effect of multi-template PCR biases on microbial community structure. In this study, we used three bacterial and three archaeal mock communities consisting of, respectively, 33 bacterial and 24 archaeal 16S rRNA gene sequences combined in different proportions to compare the influences of (1) sequencing depth, (2) sequencing artifacts (sequencing errors and chimeric PCR artifacts), and (3) biases in multi-template PCR, towards the interpretation of community structure in pyrosequencing datasets. We also assessed the influence of each of these three variables on α- and β-diversity metrics that rely on the number of OTUs alone (richness) and those that include both membership and the relative abundance of detected OTUs (diversity). As part of this study, we redesigned bacterial and archaeal primer sets that target the V3-V5 region of the 16S rRNA gene, along with multiplexing barcodes, to permit simultaneous sequencing of PCR products from the two domains. We conclude that the benefits of deeper sequencing efforts extend beyond greater OTU detection and result in higher precision in β-diversity analyses by reducing the variability between replicate libraries, despite the presence of more sequencing artifacts. Additionally, spurious OTUs resulting from sequencing errors have a significant impact on richness or shared-richness based α- and β-diversity metrics, whereas metrics that utilize community structure (including both richness and relative abundance of OTUs) are minimally affected by spurious OTUs. However, the greatest obstacle towards accurately evaluating community structure are the errors in estimated mean relative abundance of each detected OTU due to biases associated with multi-template PCR reactions.
Collapse
Affiliation(s)
- Ameet J. Pinto
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Lutgarde Raskin
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
7
|
Farell EM, Alexandre G. Bovine serum albumin further enhances the effects of organic solvents on increased yield of polymerase chain reaction of GC-rich templates. BMC Res Notes 2012; 5:257. [PMID: 22624992 PMCID: PMC3466135 DOI: 10.1186/1756-0500-5-257] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2012] [Accepted: 05/15/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND While being a standard powerful molecular biology technique, applications of the PCR to the amplification of high GC-rich DNA samples still present challenges which include limited yield and poor specificity of the reaction. Organic solvents, including DMSO and formamide, have been often employed as additives to increase the efficiency of amplification of high GC content (GC > 60%) DNA sequences. Bovine serum albumin (BSA) has been used as an additive in several applications, including restriction enzyme digestions as well as in PCR amplification of templates from environmental samples that contain potential inhibitors such as phenolic compounds. FINDINGS Significant increase in PCR amplification yields of GC-rich DNA targets ranging in sizes from 0.4 kb to 7.1 kb were achieved by using BSA as a co-additive along with DMSO and formamide. Notably, enhancing effects of BSA occurs in the initial PCR cycles with BSA additions having no detrimental impact on PCR yield or specificity. When a PCR was set up such that the cycling parameters paused after every ten cycles to allow for supplementation of BSA, combining BSA and organic solvent produced significantly higher yields relative to conditions using the solvent alone. The co-enhancing effects of BSA in presence of organic solvents were also obtained in other PCR applications, including site-directed mutagenesis and overlap extension PCR. CONCLUSIONS BSA significantly enhances PCR amplification yield when used in combination with organic solvents, DMSO or formamide. BSA enhancing effects were obtained in several PCR applications, with DNA templates of high GC content and spanning a broad size range. When added to the reaction buffer, promoting effects of BSA were seen in the first cycles of the PCR, regardless of the size of the DNA to amplify. The strategy outlined here provides a cost-effective alternative for increasing the efficiency of PCR amplification of GC-rich DNA targets over a broad size range.
Collapse
Affiliation(s)
- Eric M Farell
- Department of Biochemistry, Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | | |
Collapse
|