1
|
Liu Y, Wang Y, Shi W, Wu N, Liu W, Francis F, Wang X. Enterobacter-infecting phages in nitrogen-deficient paddy soil impact nitrogen-fixation capacity and rice growth by shaping the soil microbiome. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 956:177382. [PMID: 39505046 DOI: 10.1016/j.scitotenv.2024.177382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 09/03/2024] [Accepted: 11/02/2024] [Indexed: 11/08/2024]
Abstract
Bacteriophages ("phage") play important roles in nutrient cycling and ecology in environments by regulating soil microbial community structure. Here, metagenomic sequencing showed that a low relative abundance of nitrogen-fixing bacteria but high abundance of Enterobacter-infecting phages in paddy soil where rice plants showed nitrogen deficiency. From soil in the same field, we also isolated and identified a novel virulent phage (named here as Apdecimavirus NJ2) that infects several species of Enterobacter and characterized its impact on nitrogen fixation in the soil and in plants. It has the morphology of the Autographiviridae family, with a dsDNA genome of 39,605 bp, 47 predicted open reading frames and 52.64 % GC content. Based on genomic characteristics, comparative genomics and phylogenetic analysis, Apdecimavirus NJ2 should be a novel species in the genus Apdecimavirus, subfamily Studiervirinae. After natural or sterilized field soil was potted and inoculated with the phage, soil nitrogen-fixation capacity and rice growth were impaired, the abundance of Enterobacter decreased, along with the bacterial community composition and biodiversity changed compared with that of the unadded control paddy soil. Our work provides strong evidence that phages can affect the soil nitrogen cycle by changing the bacterial community. Controlling phages in the soil could be a useful strategy for improving soil nitrogen fixation.
Collapse
Affiliation(s)
- Yu Liu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China; Functional & Evolutionary Entomology, University of Liège, Gembloux Agro-BioTech, Passage des Déportés, 2, 5030 Gembloux, Belgium
| | - Yajiao Wang
- Institute of Plant Protection, Hebei Academy of Agricultural and Forestry Sciences, Baoding 071000, China
| | - Wenchong Shi
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Tai'an 271018, China
| | - Nan Wu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Wenwen Liu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Frederic Francis
- Functional & Evolutionary Entomology, University of Liège, Gembloux Agro-BioTech, Passage des Déportés, 2, 5030 Gembloux, Belgium.
| | - Xifeng Wang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China.
| |
Collapse
|
2
|
Pierotti S, Welz B, Osuna-López M, Fitzgerald T, Wittbrodt J, Birney E. Genotype imputation in F2 crosses of inbred lines. BIOINFORMATICS ADVANCES 2024; 4:vbae107. [PMID: 39077633 PMCID: PMC11286293 DOI: 10.1093/bioadv/vbae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 06/04/2024] [Accepted: 07/22/2024] [Indexed: 07/31/2024]
Abstract
Motivation Crosses among inbred lines are a fundamental tool for the discovery of genetic loci associated with phenotypes of interest. In organisms for which large reference panels or SNP chips are not available, imputation from low-pass whole-genome sequencing is an effective method for obtaining genotype data from a large number of individuals. To date, a structured analysis of the conditions required for optimal genotype imputation has not been performed. Results We report a systematic exploration of the effect of several design variables on imputation performance in F2 crosses of inbred medaka lines using the imputation software STITCH. We determined that, depending on the number of samples, imputation performance reaches a plateau when increasing the per-sample sequencing coverage. We also systematically explored the trade-offs between cost, imputation accuracy, and sample numbers. We developed a computational pipeline to streamline the process, enabling other researchers to perform a similar cost-benefit analysis on their population of interest. Availability and implementation The source code for the pipeline is available at https://github.com/birneylab/stitchimpute. While our pipeline has been developed and tested for an F2 population, the software can also be used to analyse populations with a different structure.
Collapse
Affiliation(s)
- Saul Pierotti
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom
| | - Bettina Welz
- Centre for Organismal Studies (COS), Heidelberg University, Heidelberg 69120, Germany
| | - Mireia Osuna-López
- Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg 69117, Germany
| | - Tomas Fitzgerald
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom
| | - Joachim Wittbrodt
- Centre for Organismal Studies (COS), Heidelberg University, Heidelberg 69120, Germany
| | - Ewan Birney
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom
| |
Collapse
|
3
|
Ott O, Tolppi S, Figueroa-Cruz J, Myagmar K, Unurbuyan K, Tripathi A. Leveraging the fundamentals of heat transfer and fluid mechanics in microscale geometries for automated next-generation sequencing library preparation. Sci Rep 2024; 14:12564. [PMID: 38822053 PMCID: PMC11637099 DOI: 10.1038/s41598-024-63014-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 05/23/2024] [Indexed: 06/02/2024] Open
Abstract
Next-generation sequencing (NGS) is emerging as a powerful tool for molecular diagnostics but remains limited by cumbersome and inefficient sample preparation. We present an innovative automated NGS library preparation system with a simplified mechanical design that exploits both macro- and microfluidic properties for optimizing heat transfer, reaction kinetics, mass transfer, fluid mechanics, adsorption-desorption rates, and molecular thermodynamics. Our approach introduces a unique two-cannula cylindrical capillary system connected to a programmable syringe pump and a Peltier heating element able to execute all steps with high efficiency. Automatic reagent movement, mixing, and magnetic bead-based washing with capillary-based thermal cycling (capillary-PCR) are completely integrated into a single platform. The manual 3-h library preparation process is reduced to less than 15 min of hands-on time via optimally pre-plated reagent plates, followed by less than 6 h of instrument run time during which no user interaction is required. We applied this method to two library preparation assays with different DNA fragmentation requirements (mechanical vs. enzymatic fragmentation), sufficiently limiting consumable use to one cartridge and one 384 well-plate per run. Our platform successfully prepared eight libraries in parallel, generating sequencing data for both human and Escherichia coli DNA libraries with negligible coverage bias compared to positive controls. All sequencing data from our libraries attained Phred (Q) scores > 30, mapping to reference genomes at 99% confidence. The method achieved final library concentrations and size distributions comparable with the conventional manual approach, demonstrating compatibility with downstream sequencing and subsequent data analysis. Our engineering design offers repeatability and consistency in the quality of sequence-able libraries, asserting the importance of mechanical design considerations that employ and optimize fundamental fluid mechanics and heat transfer properties. Furthermore in this work, we provide unique insights into the mechanisms of sample loss within NGS library preparation assays compared with automated adaptations and pinpoint areas in which the principles of thermodynamics, fluid mechanics, and heat transfer can improve future mechanical design iterations.
Collapse
Affiliation(s)
- Olivia Ott
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA
| | - Sabrina Tolppi
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA
| | - Jennifer Figueroa-Cruz
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA
| | - Khaliun Myagmar
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA
| | - Khulan Unurbuyan
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA
| | - Anubhav Tripathi
- Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI, USA.
| |
Collapse
|
4
|
Chen H, Zhang Y, Wang B, Liao R, Duan X, Yang C, Chen J, Hao Y, Shu Y, Cai L, Leng X, Qian NS, Sun D, Niu B, Zhou Q. Characterization and mitigation of artifacts derived from NGS library preparation due to structure-specific sequences in the human genome. BMC Genomics 2024; 25:227. [PMID: 38429743 PMCID: PMC10908179 DOI: 10.1186/s12864-024-10157-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 02/23/2024] [Indexed: 03/03/2024] Open
Abstract
BACKGROUND Hybridization capture-based targeted next generation sequencing (NGS) is gaining importance in routine cancer clinical practice. DNA library preparation is a fundamental step to produce high-quality sequencing data. Numerous unexpected, low variant allele frequency calls were observed in libraries using sonication fragmentation and enzymatic fragmentation. In this study, we investigated the characteristics of the artifact reads induced by sonication and enzymatic fragmentation. We also developed a bioinformatic algorithm to filter these sequencing errors. RESULTS We used pairwise comparisons of somatic single nucleotide variants (SNVs) and insertions and deletions (indels) of the same tumor DNA samples prepared using both ultrasonic and enzymatic fragmentation protocols. Our analysis revealed that the number of artifact variants was significantly greater in the samples generated using enzymatic fragmentation than using sonication. Most of the artifacts derived from the sonication-treated libraries were chimeric artifact reads containing both cis- and trans-inverted repeat sequences of the genomic DNA. In contrast, chimeric artifact reads of endonuclease-treated libraries contained palindromic sequences with mismatched bases. Based on these distinctive features, we proposed a mechanistic hypothesis model, PDSM (pairing of partial single strands derived from a similar molecule), by which these sequencing errors derive from ultrasonication and enzymatic fragmentation library preparation. We developed a bioinformatic algorithm to generate a custom mutation "blacklist" in the BED region to reduce errors in downstream analyses. CONCLUSIONS We first proposed a mechanistic hypothesis model (PDSM) of sequencing errors caused by specific structures of inverted repeat sequences and palindromic sequences in the natural genome. This new hypothesis predicts the existence of chimeric reads that could not be explained by previous models, and provides a new direction for further improving NGS analysis accuracy. A bioinformatic algorithm, ArtifactsFinder, was developed and used to reduce the sequencing errors in libraries produced using sonication and enzymatic fragmentation.
Collapse
Affiliation(s)
- HuiJuan Chen
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
- Computer Network Information Center, Chinese Academy of Sciences,, University of Chinese Academy of Sciences, Beijing, 100190, China
- WillingMed Technology Beijing Co., Ltd., Beijing, 100176, China
| | - YiRan Zhang
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - Bing Wang
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - Rui Liao
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - XiaoHong Duan
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
- ChosenMed Technology (Zhejiang) Co. Ltd., Zhejiang, 311103, China
| | - ChunYan Yang
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - Jing Chen
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - YanTong Hao
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - YingShuang Shu
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - LiLi Cai
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - Xue Leng
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China
| | - Nian-Song Qian
- Department of Oncology,Senior Department of Respiratory and Critical Care Medicine, The Eighth Medical Center of Chinese, PLA General Hospital, No.17A Heishanhu Road, Haidian District, Beijing, 100853, China.
| | - DaWei Sun
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China.
- ChosenMed Technology (Zhejiang) Co. Ltd., Zhejiang, 311103, China.
| | - Beifang Niu
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China.
- Computer Network Information Center, Chinese Academy of Sciences,, University of Chinese Academy of Sciences, Beijing, 100190, China.
- ChosenMed Technology (Zhejiang) Co. Ltd., Zhejiang, 311103, China.
| | - Qiming Zhou
- Beijing ChosenMed Clinical Laboratory Company Limited, Jinghai Industrial Park, Economic and Technological Development Area, Beijing, 100176, China.
- ChosenMed Technology (Zhejiang) Co. Ltd., Zhejiang, 311103, China.
| |
Collapse
|
5
|
Rádai Z, Váradi A, Takács P, Nagy NA, Schmitt N, Prépost E, Kardos G, Laczkó L. An overlooked phenomenon: complex interactions of potential error sources on the quality of bacterial de novo genome assemblies. BMC Genomics 2024; 25:45. [PMID: 38195441 PMCID: PMC10777565 DOI: 10.1186/s12864-023-09910-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 12/15/2023] [Indexed: 01/11/2024] Open
Abstract
BACKGROUND Parameters adversely affecting the contiguity and accuracy of the assemblies from Illumina next-generation sequencing (NGS) are well described. However, past studies generally focused on their additive effects, overlooking their potential interactions possibly exacerbating one another's effects in a multiplicative manner. To investigate whether or not they act interactively on de novo genome assembly quality, we simulated sequencing data for 13 bacterial reference genomes, with varying levels of error rate, sequencing depth, PCR and optical duplicate ratios. RESULTS We assessed the quality of assemblies from the simulated sequencing data with a number of contiguity and accuracy metrics, which we used to quantify both additive and multiplicative effects of the four parameters. We found that the tested parameters are engaged in complex interactions, exerting multiplicative, rather than additive, effects on assembly quality. Also, the ratio of non-repeated regions and GC% of the original genomes can shape how the four parameters affect assembly quality. CONCLUSIONS We provide a framework for consideration in future studies using de novo genome assembly of bacterial genomes, e.g. in choosing the optimal sequencing depth, balancing between its positive effect on contiguity and negative effect on accuracy due to its interaction with error rate. Furthermore, the properties of the genomes to be sequenced also should be taken into account, as they might influence the effects of error sources themselves.
Collapse
Affiliation(s)
- Zoltán Rádai
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary.
- Department of Dermatology, University Hospital Düsseldorf, Heinrich-Heine-University, Düsseldorf, Germany.
| | - Alex Váradi
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
- Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary
| | - Péter Takács
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
- Department of Health Informatics, Institute of Health Sciences, Faculty of Health, University of Debrecen, Debrecen, Hungary
| | - Nikoletta Andrea Nagy
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
- Department of Evolutionary Zoology, ELKH-DE Behavioural Ecology Research Group, University of Debrecen, Debrecen, Hungary
- Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
| | - Nicholas Schmitt
- Department of Dermatology, University Hospital Düsseldorf, Heinrich-Heine-University, Düsseldorf, Germany
| | - Eszter Prépost
- Department of Health Industry, University of Debrecen, Debrecen, Hungary
| | - Gábor Kardos
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
- Department of Gerontology, Faculty of Health Sciences, University of Debrecen, Debrecen, Hungary
| | - Levente Laczkó
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
- ELKH-DE Conservation Biology Research Group, Debrecen, Hungary
| |
Collapse
|