1
|
Kumar KR, Cowley MJ, Davis RL. Next-Generation Sequencing and Emerging Technologies. Semin Thromb Hemost 2024; 50:1026-1038. [PMID: 38692283 DOI: 10.1055/s-0044-1786397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
Genetic sequencing technologies are evolving at a rapid pace with major implications for research and clinical practice. In this review, the authors provide an updated overview of next-generation sequencing (NGS) and emerging methodologies. NGS has tremendously improved sequencing output while being more time and cost-efficient in comparison to Sanger sequencing. The authors describe short-read sequencing approaches, such as sequencing by synthesis, ion semiconductor sequencing, and nanoball sequencing. Third-generation long-read sequencing now promises to overcome many of the limitations of short-read sequencing, such as the ability to reliably resolve repeat sequences and large genomic rearrangements. By combining complementary methods with massively parallel DNA sequencing, a greater insight into the biological context of disease mechanisms is now possible. Emerging methodologies, such as advances in nanopore technology, in situ nucleic acid sequencing, and microscopy-based sequencing, will continue the rapid evolution of this area. These new technologies hold many potential applications for hematological disorders, with the promise of precision and personalized medical care in the future.
Collapse
Affiliation(s)
- Kishore R Kumar
- Translational Genomics Group, Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
- Department of Neurogenetics, Kolling Institute, University of Sydney and Royal North Shore Hospital, St Leonards, New South Wales, Australia
- Molecular Medicine Laboratory, Concord Hospital, Sydney, Australia
| | - Mark J Cowley
- Translational Genomics Group, Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
- Computational Biology Group, Children's Cancer Institute, University of New South Wales, Randwick, New South Wales, Australia
| | - Ryan L Davis
- Translational Genomics Group, Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
- Department of Neurogenetics, Kolling Institute, University of Sydney and Royal North Shore Hospital, St Leonards, New South Wales, Australia
| |
Collapse
|
2
|
Shi Z, Zhang Y, Chen W, Yu Z. Crosstalk between 6-methyladenine and 4-methylcytosine in Geobacter sulfurreducens exposed to extremely low-frequency electromagnetic field. iScience 2024; 27:110607. [PMID: 39262814 PMCID: PMC11388800 DOI: 10.1016/j.isci.2024.110607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 04/05/2024] [Accepted: 07/25/2024] [Indexed: 09/13/2024] Open
Abstract
4-Methylcytosine (4mC) and 6-methyladenine (6mA) are the most prevalent types of DNA modifications in prokaryotes. However, whether there is crosstalk between 4mC and 6mA remain unknown. Here, methylomes and transcriptomes of Geobacter sulfurreducens exposed to different intensities of extremely low frequency electromagnetic fields (ELF-EMF) were investigated. Results showed that the second adenine of all the 5'-GTACAG-3' motif was modified to 6mA (M-6mA). For the other 6mA (O-6mA), the variation in their distance from the neighboring M-6mA increased with the intensity of ELF-EMF. Moreover, cytosine adjacent to O-6mA has a much higher probability of being modified to 4mC than cytosine adjacent to M-6mA, and the closer an unmodified cytosine is to 4mC, the higher the probability that the cytosine will be modified to 4mC. Furthermore, there was no significant correlation between DNA methylation and gene expression regulation. These results suggest a reference signal that goes from M-6mA to O-6mA to 4mC.
Collapse
Affiliation(s)
- Zhenhua Shi
- College of Resources and Environment, Fujian Agriculture and Forestry University, 15 Shang Xia Dian Road, Cang Shan District, Fuzhou, Fujian 350002, China
| | - Yingrong Zhang
- College of Resources and Environment, Fujian Agriculture and Forestry University, 15 Shang Xia Dian Road, Cang Shan District, Fuzhou, Fujian 350002, China
| | - Wanqiu Chen
- College of Resources and Environment, Fujian Agriculture and Forestry University, 15 Shang Xia Dian Road, Cang Shan District, Fuzhou, Fujian 350002, China
| | - Zhen Yu
- Fujian Provincial Key Laboratory of Medical Analysis, Fujian Academy of Medical Sciences, 7 Wu Si Road, Gu Lou District, Fuzhou, Fujian 350001, China
| |
Collapse
|
3
|
Garg S, Nain P, Kumar A, Joshi S, Punetha H, Sharma PK, Siddiqui S, Alshaharni MO, Algopishi UB, Mittal A. Next generation plant biostimulants & genome sequencing strategies for sustainable agriculture development. Front Microbiol 2024; 15:1439561. [PMID: 39104588 PMCID: PMC11299335 DOI: 10.3389/fmicb.2024.1439561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 06/25/2024] [Indexed: 08/07/2024] Open
Abstract
The best environment for plant growth and development contains certain essential metabolites. A broad category of metabolites known as "plant biostimulants" (PBs) includes biomolecules such as proteins, carbohydrates, lipids, and other secondary metabolites related to groups of terpenes, specific nitrogen-containing compounds, and benzene ring-conjugated compounds. The formation of biomolecules depends on both biotic and abiotic factors, such as the release of PB by plants, animals, and microorganisms, or it can result from the control of temperature, humidity, and pressure in the atmosphere, in the case of humic substances (HSs). Understanding the genomic outputs of the concerned organism (may be plants or others than them) becomes crucial for identifying the underlying behaviors that lead to the synthesis of these complex compounds. For the purposes of achieving the objectives of sustainable agriculture, detailed research on PBs is essential because they aid in increasing yield and other growth patterns of agro-economic crops. The regulation of homeostasis in the plant-soil-microbe system for the survival of humans and other animals is mediated by the action of plant biostimulants, as considered essential for the growth of plants. The genomic size and gene operons for functional and regulation control have so far been revealed through technological implementations, but important gene annotations are still lacking, causing a delay in revealing the information. Next-generation sequencing techniques, such as nanopore, nanoball, and Illumina, are essential in troubleshooting the information gaps. These technical advancements have greatly expanded the candidate gene openings. The secondary metabolites being important precursors need to be studied in a much wider scale for accurate calculations of biochemical reactions, taking place inside and outside the synthesized living cell. The present review highlights the sequencing techniques to provide a foundation of opportunity generation for agricultural sustainability.
Collapse
Affiliation(s)
- Shivanshu Garg
- Department of Biochemistry, CBSH-GBPUA&T, Pantnagar, India
| | - Pooja Nain
- Department of Soil Science, College of Agriculture, GBPUA&T, Pantnagar, India
| | - Ashish Kumar
- Department of Microbiology, CBSH-GBPUA&T, Pantnagar, India
| | - Samiksha Joshi
- School of Agriculture, Graphic Era Hill University, Bhimtal, India
| | | | - Pradeep Kumar Sharma
- Department of Environment Science, Graphic Era Deemed to be University, Dehradun, India
| | - Sazada Siddiqui
- Department of Biology, College of Science, King Khalid University, Abha, Saudi Arabia
| | | | | | - Amit Mittal
- School of Allied Sciences, Graphic Era Hill University, Bhimtal, India
| |
Collapse
|
4
|
Nolan DJ, DaRoza J, Brody R, Ganta K, Luzuriaga K, Huston C, Rosenthal S, Lamers SL, Rose R. Comparing Gold-Standard Sanger Sequencing with Two Next-Generation Sequencing Platforms of HIV-1 gp160 Single Genome Amplicons. AIDS Res Hum Retroviruses 2024. [PMID: 38940749 DOI: 10.1089/aid.2024.0012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
Our goal was to assess the accuracy of next generation sequencing (NGS) compared with Sanger. We performed single genome amplification (SGA) of HIV-1 gp160 on extracted tissue DNA from two HIV+ individuals. Amplicons (n = 30) were sequenced with Sanger or reamplified with barcoded primers and pooled before sequencing using Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB). For each amplicon, a consensus sequence for NGS reads was obtained by (1) mapping reads to the Sanger sequence when available ("reference-based") or (2) mapping reads to a "pseudo-reference" sequence, i.e., a consensus sequence of a subset of NGS reads ("reference-free"). PB reads were clustered based on genetic similarity. A Sanger consensus sequence was obtained for 23/30 amplicons, for which all NGS consensus sequences were identical (n = 9) or nearly identical (n = 14) compared with Sanger. For the nine mismatches between Sanger/NGS, the nucleotide in the NGS sequence matched all other sequences from that patient. Of the 7/30 amplicons without a Sanger sequence, NGS sequences had ≥35 ambiguous calls in five amplicons and 0 ambiguities in two amplicons. Analysis of the electropherograms showed failure of a single sequencing primer for the latter two amplicons (consistent with a single template) and overlapping peaks for the other five (consistent with multiple templates). Clustering results closely followed the Sanger/NGS consensus results, where amplicons derived from a single template also had a single cluster and vice versa (with one exception, which could be the result of barcode misidentification). Representative sequences from the clusters contained 2-13 differences compared with Sanger/NGS. In summary, we show that both ONT and PB can produce amplicon consensus sequences with similar or higher accuracy compared with Sanger and, importantly, without the need for a known reference sequence. Clustering could be useful in some circumstances to predict or confirm the presence of multiple starting templates.
Collapse
Affiliation(s)
| | | | - Robin Brody
- Molecular Medicine, UMass Chan Medical School, Worcester, Massachusetts, USA
| | - Krishna Ganta
- Molecular Medicine, UMass Chan Medical School, Worcester, Massachusetts, USA
| | - Katherine Luzuriaga
- Molecular Medicine, UMass Chan Medical School, Worcester, Massachusetts, USA
| | | | | | | | | |
Collapse
|
5
|
Jia H, Tan S, Zhang YE. Chasing Sequencing Perfection: Marching Toward Higher Accuracy and Lower Costs. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae024. [PMID: 38991976 PMCID: PMC11423848 DOI: 10.1093/gpbjnl/qzae024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 01/25/2024] [Accepted: 01/29/2024] [Indexed: 07/13/2024]
Abstract
Next-generation sequencing (NGS), represented by Illumina platforms, has been an essential cornerstone of basic and applied research. However, the sequencing error rate of 1 per 1000 bp (10-3) represents a serious hurdle for research areas focusing on rare mutations, such as somatic mosaicism or microbe heterogeneity. By examining the high-fidelity sequencing methods developed in the past decade, we summarized three major factors underlying errors and the corresponding 12 strategies mitigating these errors. We then proposed a novel framework to classify 11 preexisting representative methods according to the corresponding combinatory strategies and identified three trends that emerged during methodological developments. We further extended this analysis to eight long-read sequencing methods, emphasizing error reduction strategies. Finally, we suggest two promising future directions that could achieve comparable or even higher accuracy with lower costs in both NGS and long-read sequencing.
Collapse
Affiliation(s)
- Hangxing Jia
- CAS Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Shengjun Tan
- CAS Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Yong E Zhang
- CAS Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
6
|
Fu Y, Aganezov S, Mahmoud M, Beaulaurier J, Juul S, Treangen TJ, Sedlazeck FJ. MethPhaser: methylation-based long-read haplotype phasing of human genomes. Nat Commun 2024; 15:5327. [PMID: 38909018 PMCID: PMC11193733 DOI: 10.1038/s41467-024-49588-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 06/11/2024] [Indexed: 06/24/2024] Open
Abstract
The assignment of variants across haplotypes, phasing, is crucial for predicting the consequences, interaction, and inheritance of mutations and is a key step in improving our understanding of phenotype and disease. However, phasing is limited by read length and stretches of homozygosity along the genome. To overcome this limitation, we designed MethPhaser, a method that utilizes methylation signals from Oxford Nanopore Technologies to extend Single Nucleotide Variation (SNV)-based phasing. We demonstrate that haplotype-specific methylations extensively exist in Human genomes and the advent of long-read technologies enabled direct report of methylation signals. For ONT R9 and R10 cell line data, we increase the phase length N50 by 78%-151% at a phasing accuracy of 83.4-98.7% To assess the impact of tissue purity and random methylation signals due to inactivation, we also applied MethPhaser on blood samples from 4 patients, still showing improvements over SNV-only phasing. MethPhaser further improves phasing across HLA and multiple other medically relevant genes, improving our understanding of how mutations interact across multiple phenotypes. The concept of MethPhaser can also be extended to non-human diploid genomes. MethPhaser is available at https://github.com/treangenlab/methphaser .
Collapse
Affiliation(s)
- Yilei Fu
- Department of Computer Science, Rice University, Houston, TX, USA
| | | | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | - Sissel Juul
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA.
- Department of Bioengineering, Rice University, Houston, TX, USA.
| | - Fritz J Sedlazeck
- Department of Computer Science, Rice University, Houston, TX, USA.
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.
| |
Collapse
|
7
|
Corradi Z, Dhaenens CM, Grunewald O, Kocabaş IS, Meunier I, Banfi S, Karali M, Cremers FPM, Hitti-Malin RJ. Novel and Recurrent Copy Number Variants in ABCA4-Associated Retinopathy. Int J Mol Sci 2024; 25:5940. [PMID: 38892127 PMCID: PMC11173210 DOI: 10.3390/ijms25115940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 05/22/2024] [Accepted: 05/24/2024] [Indexed: 06/21/2024] Open
Abstract
ABCA4 is the most frequently mutated gene leading to inherited retinal disease (IRD) with over 2200 pathogenic variants reported to date. Of these, ~1% are copy number variants (CNVs) involving the deletion or duplication of genomic regions, typically >50 nucleotides in length. An in-depth assessment of the current literature based on the public database LOVD, regarding the presence of known CNVs and structural variants in ABCA4, and additional sequencing analysis of ABCA4 using single-molecule Molecular Inversion Probes (smMIPs) for 148 probands highlighted recurrent and novel CNVs associated with ABCA4-associated retinopathies. An analysis of the coverage depth in the sequencing data led to the identification of eleven deletions (six novel and five recurrent), three duplications (one novel and two recurrent) and one complex CNV. Of particular interest was the identification of a complex defect, i.e., a 15.3 kb duplicated segment encompassing exon 31 through intron 41 that was inserted at the junction of a downstream 2.7 kb deletion encompassing intron 44 through intron 47. In addition, we identified a 7.0 kb tandem duplication of intron 1 in three cases. The identification of CNVs in ABCA4 can provide patients and their families with a genetic diagnosis whilst expanding our understanding of the complexity of diseases caused by ABCA4 variants.
Collapse
Affiliation(s)
- Zelia Corradi
- Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
| | - Claire-Marie Dhaenens
- Université de Lille, Inserm, CHU Lille, U1172-LilNCog-Lille Neuroscience & Cognition, F-59000 Lille, France
| | - Olivier Grunewald
- Université de Lille, Inserm, CHU Lille, U1172-LilNCog-Lille Neuroscience & Cognition, F-59000 Lille, France
| | - Ipek Selen Kocabaş
- Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
| | - Isabelle Meunier
- Institute des Neurosciences de Montpellier, INSERM, Université de Montpellier, F-34295 Montpellier, France
| | - Sandro Banfi
- Department of Precision Medicine, University of Campania “Luigi Vanvitelli”, 81031 Naples, Italy
- Telethon Institute of Genetics and Medicine (TIGEM), 80078 Pozzuoli, Italy
| | - Marianthi Karali
- Department of Precision Medicine, University of Campania “Luigi Vanvitelli”, 81031 Naples, Italy
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, University of Campania “Luigi Vanvitelli”, 81031 Naples, Italy
| | - Frans P. M. Cremers
- Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
| | - Rebekkah J. Hitti-Malin
- Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
8
|
Purahong W, Ji L, Wu YT. Community Assembly Processes of Deadwood Mycobiome in a Tropical Forest Revealed by Long-Read Third-Generation Sequencing. MICROBIAL ECOLOGY 2024; 87:66. [PMID: 38700528 PMCID: PMC11068674 DOI: 10.1007/s00248-024-02372-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 03/31/2024] [Indexed: 05/06/2024]
Abstract
Despite the importance of wood-inhabiting fungi on nutrient cycling and ecosystem functions, their ecology, especially related to their community assembly, is still highly unexplored. In this study, we analyzed the wood-inhabiting fungal richness, community composition, and phylogenetics using PacBio sequencing. Opposite to what has been expected that deterministic processes especially environmental filtering through wood-physicochemical properties controls the community assembly of wood-inhabiting fungal communities, here we showed that both deterministic and stochastic processes can highly contribute to the community assembly processes of wood-inhabiting fungi in this tropical forest. We demonstrated that the dynamics of stochastic and deterministic processes varied with wood decomposition stages. The initial stage was mainly governed by a deterministic process (homogenous selection), whereas the early and later decomposition stages were governed by the stochastic processes (ecological drift). Deterministic processes were highly contributed by wood physicochemical properties (especially macronutrients and hemicellulose) rather than soil physicochemical factors. We elucidated that fine-scale fungal-fungal interactions, especially the network topology, modularity, and keystone taxa of wood-inhabiting fungal communities, strongly differed in an initial and decomposing deadwood. This current study contributes to a better understanding of the ecological processes of wood-inhabiting fungi in tropical regions where the knowledge of wood-inhabiting fungi is highly limited.
Collapse
Affiliation(s)
- Witoon Purahong
- Department of Soil Ecology, UFZ-Helmholtz Centre for Environmental Research, Theodor-Lieser-Str. 4, 06120, Halle (Saale), Germany
| | - Li Ji
- Department of Soil Ecology, UFZ-Helmholtz Centre for Environmental Research, Theodor-Lieser-Str. 4, 06120, Halle (Saale), Germany
- School of Forestry, Central South University of Forestry and Technology, Changsha, 410004, China
| | - Yu-Ting Wu
- Department of Forestry, National Pingtung University of Science and Technology, Pingtung, 91201, Taiwan.
- Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, 80708, Taiwan.
| |
Collapse
|
9
|
Li M, Zhang H, Xiao H, Zhu K, Shi W, Zhang D, Wang Y, Yang L, Wu Q, Xie J, Chen Y, Qiu D, Guo G, Lu P, Li B, Dong L, Li W, Cui X, Li L, Tian X, Yuan C, Li Y, Yu D, Nevo E, Fahima T, Li H, Dong L, Zhao Y, Liu Z. A membrane associated tandem kinase from wild emmer wheat confers broad-spectrum resistance to powdery mildew. Nat Commun 2024; 15:3124. [PMID: 38600164 PMCID: PMC11006675 DOI: 10.1038/s41467-024-47497-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 04/04/2024] [Indexed: 04/12/2024] Open
Abstract
Crop wild relatives offer natural variations of disease resistance for crop improvement. Here, we report the isolation of broad-spectrum powdery mildew resistance gene Pm36, originated from wild emmer wheat, that encodes a tandem kinase with a transmembrane domain (WTK7-TM) through the combination of map-based cloning, PacBio SMRT long-read genome sequencing, mutagenesis, and transformation. Mutagenesis assay reveals that the two kinase domains and the transmembrane domain of WTK7-TM are critical for the powdery mildew resistance function. Consistently, in vitro phosphorylation assay shows that two kinase domains are indispensable for the kinase activity of WTK7-TM. Haplotype analysis uncovers that Pm36 is an orphan gene only present in a few wild emmer wheat, indicating its single ancient origin and potential contribution to the current wheat gene pool. Overall, our findings not only provide a powdery mildew resistance gene with great potential in wheat breeding but also sheds light into the mechanism underlying broad-spectrum resistance.
Collapse
Affiliation(s)
- Miaomiao Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.
| | - Huaizhi Zhang
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Huixin Xiao
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Keyu Zhu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Wenqi Shi
- Institute of Plant Protection and Soil Science, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Dong Zhang
- Beijing PlantTech Biotechnology Co., Ltd., Beijing, China
| | - Yong Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lijun Yang
- Institute of Plant Protection and Soil Science, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Qiuhong Wu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Jingzhong Xie
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Yongxing Chen
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Dan Qiu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Guanghao Guo
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Ping Lu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Beibei Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Lei Dong
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Wenling Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Xuejia Cui
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Lingchuan Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Xiubin Tian
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | | | - Yiwen Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Dazhao Yu
- Institute of Plant Protection and Soil Science, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Eviatar Nevo
- Institute of Evolution, University of Haifa, Mt. Carmel, Haifa, Israel
| | - Tzion Fahima
- Institute of Evolution, University of Haifa, Mt. Carmel, Haifa, Israel
| | - Hongjie Li
- National Engineering Laboratory for Crop Molecular Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingli Dong
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.
| | - Yusheng Zhao
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyong Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China.
- Hainan Seed Industry Laboratory, Sanya City, Hainan Province, China.
| |
Collapse
|
10
|
Westfall DH, Deng W, Pankow A, Murrell H, Chen L, Zhao H, Williamson C, Rolland M, Murrell B, Mullins JI. Optimized SMRT-UMI protocol produces highly accurate sequence datasets from diverse populations-Application to HIV-1 quasispecies. Virus Evol 2024; 10:veae019. [PMID: 38765465 PMCID: PMC11099545 DOI: 10.1093/ve/veae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 12/19/2023] [Accepted: 02/20/2024] [Indexed: 05/22/2024] Open
Abstract
Pathogen diversity resulting in quasispecies can enable persistence and adaptation to host defenses and therapies. However, accurate quasispecies characterization can be impeded by errors introduced during sample handling and sequencing, which can require extensive optimizations to overcome. We present complete laboratory and bioinformatics workflows to overcome many of these hurdles. The Pacific Biosciences single molecule real-time platform was used to sequence polymerase-chain reaction (PCR) amplicons derived from cDNA templates tagged with unique molecular identifiers (SMRT-UMI). Optimized laboratory protocols were developed through extensive testing of different sample preparation conditions to minimize between-template recombination during PCR. The use of UMI allowed accurate template quantitation as well as removal of point mutations introduced during PCR and sequencing to produce a highly accurate consensus sequence from each template. Production of highly accurate sequences from the large datasets produced from SMRT-UMI sequencing is facilitated by a novel bioinformatic pipeline, Probabilistic Offspring Resolver for Primer IDs (PORPIDpipeline). PORPIDpipeline automatically filters and parses circular consensus reads by sample, identifies and discards reads with UMIs likely created from PCR and sequencing errors, generates consensus sequences, checks for contamination within the dataset, and removes any sequence with evidence of PCR recombination, heteroduplex formation, or early cycle PCR errors. The optimized SMRT-UMI sequencing and PORPIDpipeline methods presented here represent a highly adaptable and established starting point for accurate sequencing of diverse pathogens. These methods are illustrated through characterization of human immunodeficiency virus quasispecies in a virus transmitter-recipient pair of individuals.
Collapse
Affiliation(s)
- Dylan H Westfall
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
| | - Wenjie Deng
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
| | - Alec Pankow
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
| | - Hugh Murrell
- Department of Pathology, Division of Medical Virology, University of Cape Town and National Health Laboratory Services, Observatory, Cape Town 7925, South Africa
| | - Lennie Chen
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
| | - Hong Zhao
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
| | - Carolyn Williamson
- Department of Pathology, Division of Medical Virology, University of Cape Town and National Health Laboratory Services, Observatory, Cape Town 7925, South Africa
| | - Morgane Rolland
- US Military HIV Research Program, Walter Reed Army Institute of Research, 503 Robert Grant Avenue, Silver Spring, MD 20910, USA
- The Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., 6720A Rockledge Drive, Bethesda, MD 20817, USA
| | - Ben Murrell
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Solnavägen 9, Stockholm 171 65, Sweden
| | - James I Mullins
- Department of Microbiology, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
- Department of Medicine, University of Washington School of Medicine, 960 Republican Street, Seattle, WA 98195-8070, USA
- Department of Global Health, University of Washington Schools of Medicine and Public Health, 960 Republican Street, Seattle, WA 98195-8070, USA
| |
Collapse
|
11
|
Sena J, Karwal L, Bell C, Devitt N, Schilkey F, Huang C, Livengood J, Das S, Dean HJ. Identification and quantitation of multiple variants in RNA virus genomes. Biol Methods Protoc 2024; 9:bpae004. [PMID: 38414646 PMCID: PMC10898329 DOI: 10.1093/biomethods/bpae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 01/17/2024] [Accepted: 01/30/2024] [Indexed: 02/29/2024] Open
Abstract
The goal of the study was to identify and characterize RNA virus variants containing mutations spread over genomic distances >5 kb. As proof of concept, high-quality viral RNA of the Dengue 2 component of Takeda's tetravalent dengue vaccine candidate (TDV-2) was used to develop a reverse transcription-polymerase chain reaction protocol to amplify a ∼5.3 kb cDNA segment that contains the three genetic determinants of TDV-2 attenuation. Unique molecular identifiers were incorporated into each viral cDNA molecule for PacBio library preparation to improve the quantitative precision of the observed variants at the attenuation loci. Following assay optimization, PacBio long-read sequencing was validated with multiple clone-derived TDV-2 revertant variants and four complex revertant mixtures containing various compositions of TDV-2 and revertant viruses. PacBio sequencing analysis correctly identified and quantified variant composition in all tested samples, demonstrating that TDV-2 revertants could be identified and characterized and supporting the use of this method in the differentiation and quantification of complex variants of other RNA viruses. Long-read sequencing can identify complex RNA virus variants containing multiple mutations on a single-genome molecule, which is useful for in-depth genetic stability and revertant detection of live-attenuated viral vaccines, as well as research in virus evolution to reveal mechanisms of immune evasion and host cell adaption.
Collapse
Affiliation(s)
- Johnny Sena
- National Center for Genome Resources, Santa Fe, NM, United States
| | - Lovkesh Karwal
- Vaccine Business Unit, Takeda Pharmaceuticals, Cambridge, MA, United States
| | - Callum Bell
- National Center for Genome Resources, Santa Fe, NM, United States
| | - Nicholas Devitt
- National Center for Genome Resources, Santa Fe, NM, United States
| | - Faye Schilkey
- National Center for Genome Resources, Santa Fe, NM, United States
| | - Claire Huang
- Arboviral Diseases Branch, Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, Fort Collins, CO, United States
| | - Jill Livengood
- Vaccine Business Unit, Takeda Pharmaceuticals, Cambridge, MA, United States
| | - Subash Das
- Vaccine Business Unit, Takeda Pharmaceuticals, Cambridge, MA, United States
| | - Hansi J Dean
- Vaccine Business Unit, Takeda Pharmaceuticals, Cambridge, MA, United States
| |
Collapse
|
12
|
Shi Y, Cao C, Zeng Y, Ding Y, Chen L, Zheng F, Chen X, Zhou F, Yang X, Li J, Xu L, Xu G, Lin M, Ishiura H, Tsuji S, Wang N, Wang Z, Chen WJ, Yang K. CGG repeat expansion in LOC642361/NUTM2B-AS1 typically presents as oculopharyngodistal myopathy. J Genet Genomics 2024; 51:184-196. [PMID: 38159879 DOI: 10.1016/j.jgg.2023.12.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 12/25/2023] [Accepted: 12/25/2023] [Indexed: 01/03/2024]
Abstract
CGG repeat expansions in LOC642361/NUTM2B-AS1 have recently been identified as a cause of oculopharyngeal myopathy with leukoencephalopathy. However, since only three patients from a single family were reported, it remains unknown whether their clinicopathological features are typical for CGG repeat expansions in LOC642361/NUTM2B-AS1. Here, using repeat-primed-polymerase chain reaction and long-read sequencing, we identify 12 individuals from 3 unrelated families with CGG repeat expansions in LOC642361/NUTM2B-AS1, typically presenting with oculopharyngodistal myopathy. The CGG repeat expansions range from 161 to 669 repeat units. Most of the patients present with ptosis, restricted eye movements, dysphagia, dysarthria, and diffuse limb muscle weakness. Only one patient shows T2-weighted hyperintensity in the cerebellar white matter surrounding the deep cerebellar nuclei on brain magnetic resonance imaging. Muscle biopsies from three patients show a myopathic pattern and rimmed vacuoles. Analyses of muscle biopsies suggest that CGG repeat expansions in LOC642361/NUTM2B-AS1 may deleteriously affect aggrephagic capacity, suggesting that RNA toxicity and mitochondrial dysfunction may contribute to pathogenesis. Our study thus expands the phenotypic spectrum for the CGG repeat expansion of LOC642361/NUTM2B-AS1 and indicates that this genetic variant typically manifests as oculopharyngodistal myopathy with chronic myopathic changes with rimmed vacuoles and filamentous intranuclear inclusions in muscle fibers.
Collapse
Affiliation(s)
- Yan Shi
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Chunyan Cao
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; The First Affiliated Hospital, College of Clinical Medicine of Henan University of Science and Technology, Luoyang, Henan 471000, China
| | - Yiheng Zeng
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Yuanliang Ding
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Long Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Fuze Zheng
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Xuejiao Chen
- Department of Neurology, Zhangzhou Municipal Hospital of Fujian Province and Zhangzhou Affiliated Hospital of Fujian Medical University, Zhangzhou, Fujian 363000, China
| | - Fanggui Zhou
- Department of Neurology, Jian'ou Municipal Hospital of Fujian Province, Jian'ou, Fujian 353100, China
| | - Xiefeng Yang
- Department of Radiology, First Affiliated Hospital of Fujian Medical University, Fuzhou, Fujian 350005, China
| | - Jinjing Li
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Liuqing Xu
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Guorong Xu
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Minting Lin
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Hiroyuki Ishiura
- Department of Neurology, Graduate School of Medicine, The University of Tokyo, Tokyo 113-8655, Japan; Department of Neurology, Okayama University Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama 700-8558, Japan
| | - Shoji Tsuji
- Department of Neurology, Graduate School of Medicine, The University of Tokyo, Tokyo 113-8655, Japan; Institute of Medical Genomics, International University of Health and Welfare, Chiba 286-0048, Japan
| | - Ning Wang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China
| | - Zhiqiang Wang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China.
| | - Wan-Jin Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China.
| | - Kang Yang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, Fujian 350005, China; Department of Neurology, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350212, China.
| |
Collapse
|
13
|
Jagtap SS, Liu JJ, Walukiewicz HE, Riley R, Ahrendt S, Koriabine M, Cobaugh K, Salamov A, Yoshinaga Y, Ng V, Daum C, Grigoriev IV, Slininger PJ, Dien BS, Jin YS, Rao CV. Draft genome sequence of Yarrowia lipolytica NRRL Y-64008, an oleaginous yeast capable of growing on lignocellulosic hydrolysates. Microbiol Resour Announc 2023; 12:e0043523. [PMID: 37982613 PMCID: PMC10720525 DOI: 10.1128/mra.00435-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 10/10/2023] [Indexed: 11/21/2023] Open
Abstract
Yarrowia lipolytica is an oleaginous yeast that produces high titers of fatty acid-derived biofuels and biochemicals. It can grow on hydrophobic carbon sources and lignocellulosic hydrolysates. The genome sequence of Y. lipolytica NRRL Y-64008 is reported to aid in its development as a biotechnological chassis for producing biofuels and bioproducts.
Collapse
Affiliation(s)
- Sujit Sadashiv Jagtap
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Jing-Jing Liu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Hanna E. Walukiewicz
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Robert Riley
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Steven Ahrendt
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Maxim Koriabine
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Kelly Cobaugh
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Asaf Salamov
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Yuko Yoshinaga
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Vivian Ng
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Chris Daum
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Igor V. Grigoriev
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| | - Patricia J. Slininger
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Bruce S. Dien
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Yong-Su Jin
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- Department of Food Science and Nutrition, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Christopher V. Rao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| |
Collapse
|
14
|
Van Hofwegen DJ, Hovde CJ, Minnich SA. Comparison of Yersinia enterocolitica DNA Methylation at Ambient and Host Temperatures. EPIGENOMES 2023; 7:30. [PMID: 38131902 PMCID: PMC10742451 DOI: 10.3390/epigenomes7040030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 11/13/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023] Open
Abstract
Pathogenic bacteria recognize environmental cues to vary gene expression for host adaptation. Moving from ambient to host temperature, Yersinia enterocolitica responds by immediately repressing flagella synthesis and inducing the virulence plasmid (pYV)-encoded type III secretion system. In contrast, shifting from host to ambient temperature requires 2.5 generations to restore motility, suggesting a link to the cell cycle. We hypothesized that differential DNA methylation contributes to temperature-regulated gene expression. We tested this hypothesis by comparing single-molecule real-time (SMRT) sequencing of Y. enterocolitica DNA from cells growing exponentially at 22 °C and 37 °C. The inter-pulse duration ratio rather than the traditional QV scoring was the kinetic metric to compare DNA from cells grown at each temperature. All 565 YenI restriction sites were fully methylated at both temperatures. Among the 27,118 DNA adenine methylase (Dam) sites, 42 had differential methylation patterns, while 17 remained unmethylated regardless of the temperature. A subset of the differentially methylated Dam sites localized to promoter regions of predicted regulatory genes including LysR-type and PadR-like transcriptional regulators and a cyclic-di-GMP phosphodiesterase. The unmethylated Dam sites localized with a bias to the replication terminus, suggesting they were protected from Dam methylase. No cytosine methylation was detected at Dcm sites.
Collapse
Affiliation(s)
| | | | - Scott A. Minnich
- Department of Animal Veterinary and Food Science, University of Idaho, Moscow, ID 83843, USA; (D.J.V.H.); (C.J.H.)
| |
Collapse
|
15
|
Barthélémy D, Belmonte E, Pilla LD, Bardel C, Duport E, Gautier V, Payen L. Direct Comparative Analysis of a Pharmacogenomics Panel with PacBio Hifi ® Long-Read and Illumina Short-Read Sequencing. J Pers Med 2023; 13:1655. [PMID: 38138882 PMCID: PMC10744512 DOI: 10.3390/jpm13121655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 11/10/2023] [Accepted: 11/23/2023] [Indexed: 12/24/2023] Open
Abstract
BACKGROUND Pharmacogenetics (PGx) aims to determine genetic signatures that can be used in clinical settings to individualize treatment for each patient, including anti-cancer drugs, anti-psychotics, and painkillers. Taken together, a better understanding of the impacts of genetic variants on the corresponding protein function or expression permits the prediction of the pharmacological response: responders, non-responders, and those with adverse drug reactions (ADRs). OBJECTIVE This work provides a comparison between innovative long-read sequencing (LRS) and short-read sequencing (SRS) techniques. METHODS AND MATERIALS The gene panel captured using PacBio HiFi® sequencing was tested on thirteen clinical samples on GENTYANE's platform. SRS, using a comprehensive pharmacogenetics panel, was performed in routine settings at the Civil Hospitals of Lyon. We focused on complex regions analysis, including copy number variations (CNVs), structural variants, repeated regions, and phasing-haplotyping for three key pharmacogenes: CYP2D6, UGT1A1, and NAT2. RESULTS Variants and the corresponding expected star (*) alleles were reported. Although only 38.4% concordance was found for haplotype determination and 61.5% for diplotype, this did not affect the metabolism scoring. A better accuracy of LRS was obtained for the detection of the CYP2D6*5 haplotype in the presence of the duplicated wild-type CYP2D6*2 form. A total concordance was performed for UGT1A1 TA repeat detection. Direct phasing using the LRS approach allowed us to correct certain NAT2 profiles. CONCLUSIONS Combining an optimized variant-calling pipeline and with direct phasing analysis, LRS is a robust technique for PGx analysis that can minimize the risk of mis-haplotyping.
Collapse
Affiliation(s)
- David Barthélémy
- Institut of Pharmaceutical and Biological Sciences of Lyon, Claude Bernard Lyon I, 69373 Lyon, France; (D.B.); (C.B.)
- Department of Biochemistry and Molecular Biology, Lyon-Sud Hospital, Hospices Civils de Lyon, Réseau Francophone de Pharmacogénétique (RNPGx), 69495 Pierre-Bénite, France; (L.D.P.); (E.D.)
- Center for Innovation in Cancerology of Lyon (CICLY) EA 3738, Faculty of Medicine and Maieutic Lyon Sud, Claude Bernard University Lyon I, 69921 Oullins, France
| | - Elodie Belmonte
- Plateforme Génotypage et Séquençage en Auvergne (GENTYANE) UMR 1095 Génétique, Diversité Ecophysiologie des Céréales INRAE, Université Clermont Auvergne, 63100 Clermont Ferrand, France; (E.B.); (V.G.)
| | - Laurie Di Pilla
- Department of Biochemistry and Molecular Biology, Lyon-Sud Hospital, Hospices Civils de Lyon, Réseau Francophone de Pharmacogénétique (RNPGx), 69495 Pierre-Bénite, France; (L.D.P.); (E.D.)
| | - Claire Bardel
- Institut of Pharmaceutical and Biological Sciences of Lyon, Claude Bernard Lyon I, 69373 Lyon, France; (D.B.); (C.B.)
- Department of Bioinformatics, Hospices Civils de Lyon, 69008 Lyon, France
| | - Eve Duport
- Department of Biochemistry and Molecular Biology, Lyon-Sud Hospital, Hospices Civils de Lyon, Réseau Francophone de Pharmacogénétique (RNPGx), 69495 Pierre-Bénite, France; (L.D.P.); (E.D.)
| | - Veronique Gautier
- Plateforme Génotypage et Séquençage en Auvergne (GENTYANE) UMR 1095 Génétique, Diversité Ecophysiologie des Céréales INRAE, Université Clermont Auvergne, 63100 Clermont Ferrand, France; (E.B.); (V.G.)
| | - Léa Payen
- Institut of Pharmaceutical and Biological Sciences of Lyon, Claude Bernard Lyon I, 69373 Lyon, France; (D.B.); (C.B.)
- Department of Biochemistry and Molecular Biology, Lyon-Sud Hospital, Hospices Civils de Lyon, Réseau Francophone de Pharmacogénétique (RNPGx), 69495 Pierre-Bénite, France; (L.D.P.); (E.D.)
- Center for Innovation in Cancerology of Lyon (CICLY) EA 3738, Faculty of Medicine and Maieutic Lyon Sud, Claude Bernard University Lyon I, 69921 Oullins, France
| |
Collapse
|
16
|
Maes S, Deploey N, Peelman F, Eyckerman S. Deep mutational scanning of proteins in mammalian cells. CELL REPORTS METHODS 2023; 3:100641. [PMID: 37963462 PMCID: PMC10694495 DOI: 10.1016/j.crmeth.2023.100641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 07/06/2023] [Accepted: 10/20/2023] [Indexed: 11/16/2023]
Abstract
Protein mutagenesis is essential for unveiling the molecular mechanisms underlying protein function in health, disease, and evolution. In the past decade, deep mutational scanning methods have evolved to support the functional analysis of nearly all possible single-amino acid changes in a protein of interest. While historically these methods were developed in lower organisms such as E. coli and yeast, recent technological advancements have resulted in the increased use of mammalian cells, particularly for studying proteins involved in human disease. These advancements will aid significantly in the classification and interpretation of variants of unknown significance, which are being discovered at large scale due to the current surge in the use of whole-genome sequencing in clinical contexts. Here, we explore the experimental aspects of deep mutational scanning studies in mammalian cells and report the different methods used in each step of the workflow, ultimately providing a useful guide toward the design of such studies.
Collapse
Affiliation(s)
- Stefanie Maes
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biochemistry and Microbiology, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Nick Deploey
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Frank Peelman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Sven Eyckerman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium.
| |
Collapse
|
17
|
Jagtap SS, Liu JJ, Walukiewicz HE, Pangilinan J, Lipzen A, Ahrendt S, Koriabine M, Cobaugh K, Salamov A, Yoshinaga Y, Ng V, Daum C, Grigoriev IV, Slininger PJ, Dien BS, Jin YS, Rao CV. Near-complete genome sequence of Lipomyces tetrasporous NRRL Y-64009, an oleaginous yeast capable of growing on lignocellulosic hydrolysates. Microbiol Resour Announc 2023; 12:e0042623. [PMID: 37906027 PMCID: PMC10652991 DOI: 10.1128/mra.00426-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 09/11/2023] [Indexed: 11/02/2023] Open
Abstract
Lipomyces tetrasporous is an oleaginous yeast that can utilize a variety of plant-based sugars. It accumulates lipids during growth on lignocellulosic biomass hydrolysates. We present the annotated genome sequence of L. tetrasporous NRRL Y-64009 to aid in its development as a platform organism for producing lipids and lipid-based bioproducts.
Collapse
Affiliation(s)
- Sujit Sadashiv Jagtap
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jing-Jing Liu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Hanna E. Walukiewicz
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jasmyn Pangilinan
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Anna Lipzen
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Steven Ahrendt
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Maxim Koriabine
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Kelly Cobaugh
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Asaf Salamov
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Yuko Yoshinaga
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Vivian Ng
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Chris Daum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Igor V. Grigoriev
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, USA
| | - Patricia J. Slininger
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Bruce S. Dien
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Yong-Su Jin
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- Department of Food Science and Nutrition, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Christopher V. Rao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
18
|
Chen J, Xu F. Application of Nanopore Sequencing in the Diagnosis and Treatment of Pulmonary Infections. Mol Diagn Ther 2023; 27:685-701. [PMID: 37563539 PMCID: PMC10590290 DOI: 10.1007/s40291-023-00669-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/18/2023] [Indexed: 08/12/2023]
Abstract
This review provides an in-depth discussion of the development, principles and utility of nanopore sequencing technology and its diverse applications in the identification of various pulmonary pathogens. We examined the emergence and advancements of nanopore sequencing as a significant player in this field. We illustrate the challenges faced in diagnosing mixed infections and further scrutinize the use of nanopore sequencing in the identification of single pathogens, including viruses (with a focus on its use in epidemiology, outbreak investigation, and viral resistance), bacteria (emphasizing 16S targeted sequencing, rare bacterial lung infections, and antimicrobial resistance studies), fungi (employing internal transcribed spacer sequencing), tuberculosis, and atypical pathogens. Furthermore, we discuss the role of nanopore sequencing in metagenomics and its potential for unbiased detection of all pathogens in a clinical setting, emphasizing its advantages in sequencing genome repeat areas and structural variant regions. We discuss the limitations in dealing with host DNA removal, the inherent high error rate of nanopore sequencing technology, along with the complexity of operation and processing, while acknowledging the possibilities provided by recent technological improvements. We compared nanopore sequencing with the BioFire system, a rapid molecular diagnostic system based on polymerase chain reaction. Although the BioFire system serves well for the rapid screening of known and common pathogens, it falls short in the identification of unknown or rare pathogens and in providing comprehensive genome analysis. As technological advancements continue, it is anticipated that the role of nanopore sequencing technology in diagnosing and treating lung infections will become increasingly significant.
Collapse
Affiliation(s)
- Jie Chen
- Department of Infectious Diseases, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310009, Zhejiang, China
| | - Feng Xu
- Department of Infectious Diseases, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310009, Zhejiang, China.
| |
Collapse
|
19
|
van Dijk EL, Naquin D, Gorrichon K, Jaszczyszyn Y, Ouazahrou R, Thermes C, Hernandez C. Genomics in the long-read sequencing era. Trends Genet 2023; 39:649-671. [PMID: 37230864 DOI: 10.1016/j.tig.2023.04.006] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/21/2023] [Accepted: 04/25/2023] [Indexed: 05/27/2023]
Abstract
Long-read sequencing (LRS) technologies have provided extremely powerful tools to explore genomes. While in the early years these methods suffered technical limitations, they have recently made significant progress in terms of read length, throughput, and accuracy and bioinformatics tools have strongly improved. Here, we aim to review the current status of LRS technologies, the development of novel methods, and the impact on genomics research. We will explore the most impactful recent findings made possible by these technologies focusing on high-resolution sequencing of genomes and transcriptomes and the direct detection of DNA and RNA modifications. We will also discuss how LRS methods promise a more comprehensive understanding of human genetic variation, transcriptomics, and epigenetics for the coming years.
Collapse
Affiliation(s)
- Erwin L van Dijk
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| | - Delphine Naquin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Kévin Gorrichon
- National Center of Human Genomics Research (CNRGH), 91000 Évry-Courcouronnes, France
| | - Yan Jaszczyszyn
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Rania Ouazahrou
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Claude Thermes
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Céline Hernandez
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| |
Collapse
|
20
|
Menon V, Brash DE. Next-generation sequencing methodologies to detect low-frequency mutations: "Catch me if you can". MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 792:108471. [PMID: 37716438 PMCID: PMC10843083 DOI: 10.1016/j.mrrev.2023.108471] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 09/06/2023] [Accepted: 09/07/2023] [Indexed: 09/18/2023]
Abstract
Mutations, the irreversible changes in an organism's DNA sequence, are present in tissues at a variant allele frequency (VAF) ranging from ∼10-8 per bp for a founder mutation to ∼10-3 for a histologically normal tissue sample containing several independent clones - compared to 1%- 50% for a heterozygous tumor mutation or a polymorphism. The rarity of these events poses a challenge for accurate clinical diagnosis and prognosis, toxicology, and discovering new disease etiologies. Standard Next-Generation Sequencing (NGS) technologies report VAFs as low as 0.5% per nt, but reliably observing rarer precursor events requires additional sophistication to measure ultralow-frequency mutations. We detail the challenge; define terms used to characterize the results, which vary between laboratories and sometimes conflict between biologists and bioinformaticists; and describe recent innovations to improve standard NGS methodologies including: single-strand consensus sequence methods such as Safe-SeqS and SiMSen-Seq; tandem-strand consensus sequence methods such as o2n-Seq and SMM-Seq; and ultrasensitive parent-strand consensus sequence methods such as DuplexSeq, PacBio HiFi, SinoDuplex, OPUSeq, EcoSeq, BotSeqS, Hawk-Seq, NanoSeq, SaferSeq, and CODEC. Practical applications are also noted. Several methods quantify VAF down to 10-5 at a nt and mutation frequency (MF) in a target region down to 10-7 per nt. By expanding to > 1 Mb of sites never observed twice, thus forgoing VAF, other methods quantify MF < 10-9 per nt or < 15 errors per haploid genome. Clonal expansion cannot be directly distinguished from independent mutations by sequencing, so it is essential for a paper to report whether its MF counted only different mutations - the minimum independent-mutation frequency MFminI - or all mutations observed including recurrences - the larger maximum independent-mutation frequency MFmaxI which may reflect clonal expansion. Ultrasensitive methods reveal that, without their use, even mutations with VAF 0.5-1% are usually spurious.
Collapse
Affiliation(s)
- Vijay Menon
- Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT 06520-8040, USA.
| | - Douglas E Brash
- Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT 06520-8040, USA; Department of Dermatology, Yale School of Medicine, New Haven, CT 06520-8059, USA; Yale Cancer Center, Yale School of Medicine, New Haven, CT 06520-8028, USA.
| |
Collapse
|
21
|
Kersten S, Rabanal FA, Herrmann J, Hess M, Kronenberg ZN, Schmid K, Weigel D. Deep haplotype analyses of target-site resistance locus ACCase in blackgrass enabled by pool-based amplicon sequencing. PLANT BIOTECHNOLOGY JOURNAL 2023; 21:1240-1253. [PMID: 36807472 PMCID: PMC10214753 DOI: 10.1111/pbi.14033] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 12/27/2022] [Accepted: 02/06/2023] [Indexed: 05/27/2023]
Abstract
Rapid adaptation of weeds to herbicide applications in agriculture through resistance development is a widespread phenomenon. In particular, the grass Alopecurus myosuroides is an extremely problematic weed in cereal crops with the potential to manifest resistance in only a few generations. Target-site resistances (TSRs), with their strong phenotypic response, play an important role in this rapid adaptive response. Recently, using PacBio's long-read amplicon sequencing technology in hundreds of individuals, we were able to decipher the genomic context in which TSR mutations occur. However, sequencing individual amplicons are costly and time-consuming, thus impractical to implement for other resistance loci or applications. Alternatively, pool-based approaches overcome these limitations and provide reliable allele frequencies, although at the expense of not preserving haplotype information. In this proof-of-concept study, we sequenced with PacBio High Fidelity (HiFi) reads long-range amplicons (13.2 kb), encompassing the entire ACCase gene in pools of over 100 individuals, and resolved them into haplotypes using the clustering algorithm PacBio amplicon analysis (pbaa), a new application for pools in plants and other organisms. From these amplicon pools, we were able to recover most haplotypes from previously sequenced individuals of the same population. In addition, we analysed new pools from a Germany-wide collection of A. myosuroides populations and found that TSR mutations originating from soft sweeps of independent origin were common. Forward-in-time simulations indicate that TSR haplotypes will persist for decades even at relatively low frequencies and without selection, highlighting the importance of accurate measurement of TSR haplotype prevalence for weed management.
Collapse
Affiliation(s)
- Sonja Kersten
- Institute of Plant Breeding, Seed Science and Population GeneticsUniversity of HohenheimStuttgartGermany
- Department of Molecular BiologyMax Planck Institute for Biology TübingenTübingenGermany
| | - Fernando A. Rabanal
- Department of Molecular BiologyMax Planck Institute for Biology TübingenTübingenGermany
| | | | | | | | - Karl Schmid
- Institute of Plant Breeding, Seed Science and Population GeneticsUniversity of HohenheimStuttgartGermany
| | - Detlef Weigel
- Department of Molecular BiologyMax Planck Institute for Biology TübingenTübingenGermany
| |
Collapse
|
22
|
Ciobanu CG, Nucă I, Popescu R, Antoci LM, Caba L, Ivanov AV, Cojocaru KA, Rusu C, Mihai CT, Pânzaru MC. Narrative Review: Update on the Molecular Diagnosis of Fragile X Syndrome. Int J Mol Sci 2023; 24:9206. [PMID: 37298158 PMCID: PMC10252420 DOI: 10.3390/ijms24119206] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 04/30/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023] Open
Abstract
The diagnosis and management of fragile X syndrome (FXS) have significantly improved in the last three decades, although the current diagnostic techniques are not yet able to precisely identify the number of repeats, methylation status, level of mosaicism, and/or the presence of AGG interruptions. A high number of repeats (>200) in the fragile X messenger ribonucleoprotein 1 gene (FMR1) results in hypermethylation of promoter and gene silencing. The actual molecular diagnosis is performed using a Southern blot, TP-PCR (Triplet-Repeat PCR), MS-PCR (Methylation-Specific PCR), and MS-MLPA (Methylation-Specific MLPA) with some limitations, with multiple assays being necessary to completely characterise a patient with FXS. The actual gold standard diagnosis uses Southern blot; however, it cannot accurately characterise all cases. Optical genome mapping is a new technology that has also been developed to approach the diagnosis of fragile X syndrome. Long-range sequencing represented by PacBio and Oxford Nanopore has the potential to replace the actual diagnosis and offers a complete characterization of molecular profiles in a single test. The new technologies have improved the diagnosis of fragile X syndrome and revealed unknown aberrations, but they are a long way from being used routinely in clinical practice.
Collapse
Affiliation(s)
- Cristian-Gabriel Ciobanu
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
| | - Irina Nucă
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
- Investigatii Medicale Praxis, St. Moara de Vant No 35, 700376 Iasi, Romania
| | - Roxana Popescu
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
- Medical Genetics Department, “Saint Mary” Emergency Children’s Hospital, St. Vasile Lupu No 62, 700309 Iasi, Romania
| | - Lucian-Mihai Antoci
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
| | - Lavinia Caba
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
| | - Anca Viorica Ivanov
- Pediatrics Department, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania
| | - Karina-Alexandra Cojocaru
- Department of Biochemistry, Faculty of Dental Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania
| | - Cristina Rusu
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
- Medical Genetics Department, “Saint Mary” Emergency Children’s Hospital, St. Vasile Lupu No 62, 700309 Iasi, Romania
| | | | - Monica-Cristina Pânzaru
- Medical Genetics Department, Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy, University Street No 16, 700115 Iasi, Romania; (C.-G.C.)
- Medical Genetics Department, “Saint Mary” Emergency Children’s Hospital, St. Vasile Lupu No 62, 700309 Iasi, Romania
| |
Collapse
|
23
|
Zhu W, Liao X. LCAT: an isoform-sensitive error correction for transcriptome sequencing long reads. Front Genet 2023; 14:1166975. [PMID: 37292144 PMCID: PMC10245045 DOI: 10.3389/fgene.2023.1166975] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 05/04/2023] [Indexed: 06/10/2023] Open
Abstract
As the carrier of genetic information, RNA carries the information from genes to proteins. Transcriptome sequencing technology is an important way to obtain transcriptome sequences, and it is also the basis for transcriptome research. With the development of third-generation sequencing, long reads can cover full-length transcripts and reflect the composition of different isoforms. However, the high error rate of third-generation sequencing affects the accuracy of long reads and downstream analysis. The current error correction methods seldom consider the existence of different isoforms in RNA, which makes the diversity of isoforms a serious loss. Here, we introduce LCAT (long-read error correction algorithm for transcriptome sequencing data), a wrapper algorithm of MECAT, to reduce the loss of isoform diversity while keeping MECAT's error correction performance. The experimental results show that LCAT can not only improve the quality of transcriptome sequencing long reads but also retain the diversity of isoforms.
Collapse
Affiliation(s)
- Wufei Zhu
- Department of Endocrinology, Yichang Central People’s Hospital, The First College of Clinical Medical Science, China Three Gorges University, Yichang, China
| | - Xingyu Liao
- Computer, Electrical and Mathematical Sciences, and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| |
Collapse
|
24
|
Chang YH, Lin YC, Liu HH. One-pot making of sequence-restricted DNA dumbbells. Heliyon 2023; 9:e16035. [PMID: 37215900 PMCID: PMC10196794 DOI: 10.1016/j.heliyon.2023.e16035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Revised: 04/27/2023] [Accepted: 05/03/2023] [Indexed: 05/24/2023] Open
Abstract
A method to build sequence-restricted DNA dumbbells was developed. 5'-exonuclease converts end sequences of DNA targets into sticky ends. Self-looping oligonucleotides with complementary 3'-overhangs are ligated to form dumbbells by DNA polymerase and ligase in a sequence-restricted manner. These reactions proceed in one pot at one temperature. We demonstrated one use of this method to 'tunnel' sequencing libraries into dumbbells for the Pacific Biosciences (PacBio) platform. Readouts of an Illumina P5/P7-ended 16S library from a standard microbial community confirmed successful tunneling. Twelve fecal samples additionally showed significant correlations between standard and tunneled 16S sequence variants on the PacBio platform. We further extended the method at a genomic scale to build a ∼0.45 Mbp giant dumbbell on chromosome 6. Sequences inside the dumbbells were successfully protected from a cocktail of exonucleases. Roughly 11-fold enrichment was achieved for the dumbbell-guarded region compared to the vicinity.
Collapse
Affiliation(s)
- Ya-Hui Chang
- Institute of Molecular and Genomic Medicine, National Health Research Institutes, Zhunan Town, Miaoli County, 35053, Taiwan
| | - Yu-Cheng Lin
- Pediatrics, Far Eastern Memorial Hospital, Pan-Chiao District, New Taipei City, 220216, Taiwan
- Electronic Engineering, Oriental Institute of Technology, Pan-Chiao District, New Taipei City, 220216, Taiwan
| | - Hong-Hsing Liu
- Institute of Molecular and Genomic Medicine, National Health Research Institutes, Zhunan Town, Miaoli County, 35053, Taiwan
- Pediatrics, En Chu Kong Hospital, Sanxia District, New Taipei City, 237414, Taiwan
| |
Collapse
|
25
|
Fu X, Du B, Meng Y, Li Y, Zhu X, Ou Z, Zhang M, Wen H, Ma'pol A, Hashim JH, Hashim Z, Wieslander G, Chen Q, Jiang J, Wang J, Norbäck D, Xia Y, Chen Q, Sun Y. Associations between environmental characteristics, high-resolution indoor microbiome, metabolome and allergic and non-allergic rhinitis symptoms for junior high school students. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2023; 25:791-804. [PMID: 36883483 DOI: 10.1039/d2em00480a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Rhinitis is one of the most prevalent chronic diseases globally. Microbiome exposure affects the occurrence of rhinitis. However, previous studies did not differentiate allergic rhinitis (AR) and non-allergic rhinitis (NAR) in the microbial association analysis. In this study, we investigate 347 students in 8 junior high schools, Terengganu, Malaysia, who were categorized as healthy (70.9%), AR (13.8%) and NAR (15.3%) based on a self-administered questionnaire and skin prick tests of pollen, pet, mould and house dust mite allergens. Classroom microbial and metabolite exposure in vacuumed dust was characterized by PacBio long-read amplicon sequencing, quantitative PCR and LC-MS-based untargeted metabolomics. Our findings indicate a similar microbial association pattern between AR and NAR. The richness in Gammaproteobacteria was negatively associated with AR and NAR symptoms, whereas total fungal richness was positively associated with AR and NAR symptoms (p < 0.05). Brasilonema bromeliae and Aeromonas enteropelogenes were negatively associated with AR and NAR, and Deinococcus was positively associated with AR and NAR (p < 0.01). Pipecolic acid was protectively associated with AR and NAR symptoms (OR = 0.06 and 0.13, p = 0.009 and 0.045). A neural network analysis showed that B. bromeliae was co-occurring with pipecolic acid, suggesting that the protective role of this species may be mediated by releasing pipecolic acid. Indoor relative humidity and the weight of vacuum dust were associated with AR and NAR, respectively (p < 0.05), but the health effects were mediated by two protective bacterial species, Aliinostoc morphoplasticum and Ilumatobacter fluminis. Overall, our study reported a similar microbial association pattern between AR and NAR and also revealed the complex interactions between microbial species, environmental characteristics, and rhinitis symptoms.
Collapse
Affiliation(s)
- Xi Fu
- Guangdong Provincial Engineering Research Center of Public Health Detection and Assessment, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, 510006, PR China.
| | - Bingqian Du
- Guangdong Provincial Engineering Research Center of Public Health Detection and Assessment, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, 510006, PR China.
| | - Yi Meng
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Yanling Li
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Xunhua Zhu
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Zheyuan Ou
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Mei Zhang
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Huarong Wen
- Baling Health Center, Dangyang, Hubei, 444100, PR China
| | - Aminnuddin Ma'pol
- Gombak District Health Office, Ministry of Health, Batu Caves, Selangor Darul Ehsan, Malaysia
| | | | - Zailina Hashim
- Department of Environmental and Occupational Health, Faculty of Medicine and Health Sciences, Universiti Putra Malaysia, UPM, Serdang, Selangor, Malaysia
| | - Gunilla Wieslander
- Occupational and Environmental Medicine, Department of Medical Science, University Hospital, Uppsala University, 75237 Uppsala, Sweden
| | - Qingmei Chen
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Jun Jiang
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| | - Juan Wang
- Occupational and Environmental Medicine, Department of Medical Science, University Hospital, Uppsala University, 75237 Uppsala, Sweden
| | - Dan Norbäck
- Occupational and Environmental Medicine, Department of Medical Science, University Hospital, Uppsala University, 75237 Uppsala, Sweden
| | - Yun Xia
- Guangdong Provincial Engineering Research Center of Public Health Detection and Assessment, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, 510006, PR China.
| | - Qingsong Chen
- Guangdong Provincial Engineering Research Center of Public Health Detection and Assessment, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, 510006, PR China.
| | - Yu Sun
- Guangdong Provincial Key Laboratory of Protein Function and Regulation in Agricultural Organisms, College of Life Sciences, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China.
- Key Laboratory of Zoonosis of Ministry of Agriculture and Rural Affairs, South China Agricultural University, Guangzhou, Guangdong, 510642, PR China
- Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, Guangdong, 510642, China
| |
Collapse
|
26
|
Jeon MS, Jeong DM, Doh H, Kang HA, Jung H, Eyun SI. A practical comparison of the next-generation sequencing platform and assemblers using yeast genome. Life Sci Alliance 2023; 6:e202201744. [PMID: 36746534 PMCID: PMC9902641 DOI: 10.26508/lsa.202201744] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 01/25/2023] [Accepted: 01/25/2023] [Indexed: 02/08/2023] Open
Abstract
Assembling fragmented whole-genomic information from the sequencing data is an inevitable process for further genome-wide research. However, it is intricate to select the appropriate assembly pipeline for unknown species because of the species-specific genomic properties. Therefore, our study focused on relatively more static proclivities of sequencing platforms and assembly algorithms than the fickle genome sequences. A total of 212 draft and polished de novo assemblies were constructed under the different sequencing platforms and assembly algorithms with the repetitive yeast genome. Our comprehensive data indicated that sequencing reads from Oxford Nanopore with R7.3 flow cells generated more continuous assemblies than those derived from the PacBio Sequel, although the homopolymer-based assembly errors and chimeric contigs exist. In addition, the comparison between two second-generation sequencing platforms showed that Illumina NovaSeq 6000 provides more accurate and continuous assembly in the second-generation-sequencing-first pipeline, but MGI DNBSEQ-T7 provides a cheap and accurate read in the polishing process. Furthermore, our insight into the relationship among the computational time, read length, and coverage depth provided clues to the optimal pipelines of yeast assembly.
Collapse
Affiliation(s)
- Min-Seung Jeon
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Da Min Jeong
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Huijeong Doh
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Hyun Ah Kang
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Hyungtaek Jung
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Australia
| | - Seong-Il Eyun
- Department of Life Science, Chung-Ang University, Seoul, Korea
| |
Collapse
|
27
|
Searle B, Müller M, Carell T, Kellett A. Third-Generation Sequencing of Epigenetic DNA. Angew Chem Int Ed Engl 2023; 62:e202215704. [PMID: 36524852 DOI: 10.1002/anie.202215704] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 12/14/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]
Abstract
The discovery of epigenetic bases has revolutionised the understanding of disease and development. Among the most studied epigenetic marks are cytosines covalently modified at the 5 position. In order to gain insight into their biological significance, the ability to determine their spatiotemporal distribution within the genome is essential. Techniques for sequencing on "next-generation" platforms often involve harsh chemical treatments leading to sample degradation. Third-generation sequencing promises to further revolutionise the field by providing long reads, enabling coverage of highly repetitive regions of the genome or structural variants considered unmappable by next generation sequencing technology. While the ability of third-generation platforms to directly detect epigenetic modifications is continuously improving, at present chemical or enzymatic derivatisation presents the most convenient means of enhancing reliability. This Review presents techniques available for the detection of cytosine modifications on third-generation platforms.
Collapse
Affiliation(s)
- Bethany Searle
- SSPC, the SFI Research Centre for Pharmaceuticals, School of Chemical Sciences, Dublin City University, Glasnevin, Dublin 9, Dublin, Ireland
| | - Markus Müller
- Department of Chemistry, Ludwig-Maximilians Universität München, Butenandtstr. 5-13, 81377, Munich, Germany
| | - Thomas Carell
- Department of Chemistry, Ludwig-Maximilians Universität München, Butenandtstr. 5-13, 81377, Munich, Germany
| | - Andrew Kellett
- SSPC, the SFI Research Centre for Pharmaceuticals, School of Chemical Sciences, Dublin City University, Glasnevin, Dublin 9, Dublin, Ireland
| |
Collapse
|
28
|
Gao L, Xu W, Xin T, Song J. Application of third-generation sequencing to herbal genomics. FRONTIERS IN PLANT SCIENCE 2023; 14:1124536. [PMID: 36959935 PMCID: PMC10027759 DOI: 10.3389/fpls.2023.1124536] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/02/2023] [Indexed: 06/18/2023]
Abstract
There is a long history of traditional medicine use. However, little genetic information is available for the plants used in traditional medicine, which limits the exploitation of these natural resources. Third-generation sequencing (TGS) techniques have made it possible to gather invaluable genetic information and develop herbal genomics. In this review, we introduce two main TGS techniques, PacBio SMRT technology and Oxford Nanopore technology, and compare the two techniques against Illumina, the predominant next-generation sequencing technique. In addition, we summarize the nuclear and organelle genome assemblies of commonly used medicinal plants, choose several examples from genomics, transcriptomics, and molecular identification studies to dissect the specific processes and summarize the advantages and disadvantages of the two TGS techniques when applied to medicinal organisms. Finally, we describe how we expect that TGS techniques will be widely utilized to assemble telomere-to-telomere (T2T) genomes and in epigenomics research involving medicinal plants.
Collapse
|
29
|
Westfall DH, Deng W, Pankow A, Murrell H, Chen L, Zhao H, Williamson C, Rolland M, Murrell B, Mullins JI. Optimized SMRT-UMI protocol produces highly accurate sequence datasets from diverse populations - application to HIV-1 quasispecies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.23.529831. [PMID: 36865215 PMCID: PMC9980183 DOI: 10.1101/2023.02.23.529831] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/03/2023]
Abstract
Pathogen diversity resulting in quasispecies can enable persistence and adaptation to host defenses and therapies. However, accurate quasispecies characterization can be impeded by errors introduced during sample handling and sequencing which can require extensive optimizations to overcome. We present complete laboratory and bioinformatics workflows to overcome many of these hurdles. The Pacific Biosciences single molecule real-time platform was used to sequence PCR amplicons derived from cDNA templates tagged with universal molecular identifiers (SMRT-UMI). Optimized laboratory protocols were developed through extensive testing of different sample preparation conditions to minimize between-template recombination during PCR and the use of UMI allowed accurate template quantitation as well as removal of point mutations introduced during PCR and sequencing to produce a highly accurate consensus sequence from each template. Handling of the large datasets produced from SMRT-UMI sequencing was facilitated by a novel bioinformatic pipeline, Probabilistic Offspring Resolver for Primer IDs (PORPIDpipeline), that automatically filters and parses reads by sample, identifies and discards reads with UMIs likely created from PCR and sequencing errors, generates consensus sequences, checks for contamination within the dataset, and removes any sequence with evidence of PCR recombination or early cycle PCR errors, resulting in highly accurate sequence datasets. The optimized SMRT-UMI sequencing method presented here represents a highly adaptable and established starting point for accurate sequencing of diverse pathogens. These methods are illustrated through characterization of human immunodeficiency virus (HIV) quasispecies.
Collapse
Affiliation(s)
- Dylan H. Westfall
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
| | - Wenjie Deng
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
| | - Alec Pankow
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Hugh Murrell
- Division of Medical Virology, Department of Pathology, University of Cape Town and National Health Laboratory Services, Cape Town, South Africa
| | - Lennie Chen
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
| | - Hong Zhao
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
| | - Carolyn Williamson
- Division of Medical Virology, Department of Pathology, University of Cape Town and National Health Laboratory Services, Cape Town, South Africa
| | - Morgane Rolland
- US Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, Maryland, 20910, USA
- The Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., Bethesda, Maryland, 20817, USA
| | - Ben Murrell
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, 17717 Stockholm, SE
| | - James I. Mullins
- Department of Microbiology, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
- Department of Medicine, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
- Department of Global Health, University of Washington Schools of Medicine and Public Health, Seattle, WA 98195-8070 US
| |
Collapse
|
30
|
DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer. Nat Biotechnol 2023; 41:232-238. [PMID: 36050551 DOI: 10.1038/s41587-022-01435-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 07/15/2022] [Indexed: 11/08/2022]
Abstract
Circular consensus sequencing with Pacific Biosciences (PacBio) technology generates long (10-25 kilobases), accurate 'HiFi' reads by combining serial observations of a DNA molecule into a consensus sequence. The standard approach to consensus generation, pbccs, uses a hidden Markov model. We introduce DeepConsensus, which uses an alignment-based loss to train a gap-aware transformer-encoder for sequence correction. Compared to pbccs, DeepConsensus reduces read errors by 42%. This increases the yield of PacBio HiFi reads at Q20 by 9%, at Q30 by 27% and at Q40 by 90%. With two SMRT Cells of HG003, reads from DeepConsensus improve hifiasm assembly contiguity (NG50 4.9 megabases (Mb) to 17.2 Mb), increase gene completeness (94% to 97%), reduce the false gene duplication rate (1.1% to 0.5%), improve assembly base accuracy (Q43 to Q45) and reduce variant-calling errors by 24%. DeepConsensus models could be trained to the general problem of analyzing the alignment of other types of sequences, such as unique molecular identifiers or genome assemblies.
Collapse
|
31
|
Lipsh-Sokolik R, Khersonsky O, Schröder SP, de Boer C, Hoch SY, Davies GJ, Overkleeft HS, Fleishman SJ. Combinatorial assembly and design of enzymes. Science 2023; 379:195-201. [PMID: 36634164 DOI: 10.1126/science.ade9434] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The design of structurally diverse enzymes is constrained by long-range interactions that are necessary for accurate folding. We introduce an atomistic and machine learning strategy for the combinatorial assembly and design of enzymes (CADENZ) to design fragments that combine with one another to generate diverse, low-energy structures with stable catalytic constellations. We applied CADENZ to endoxylanases and used activity-based protein profiling to recover thousands of structurally diverse enzymes. Functional designs exhibit high active-site preorganization and more stable and compact packing outside the active site. Implementing these lessons into CADENZ led to a 10-fold improved hit rate and more than 10,000 recovered enzymes. This design-test-learn loop can be applied, in principle, to any modular protein family, yielding huge diversity and general lessons on protein design principles.
Collapse
Affiliation(s)
- R Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - O Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - S P Schröder
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - C de Boer
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - S-Y Hoch
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - G J Davies
- York Structural Biology Laboratory, Department of Chemistry, The University of York, Heslington, York YO10 5DD, UK
| | - H S Overkleeft
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - S J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| |
Collapse
|
32
|
Chai AH. Whole Genome Sequencing for Detection of Structural Variants in Patients with Retinitis Pigmentosa. Methods Mol Biol 2022; 2560:73-79. [PMID: 36481884 DOI: 10.1007/978-1-0716-2651-1_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Retinitis pigmentosa (RP) is a group of inherited retinal diseases characterized by the progressive degeneration of rod then cone photoreceptors. Most of the known mutations that cause RP reside in the protein-coding portions of DNA; however, a growing number of pathogenic mutations have been identified within the non-coding portions. This chapter details a brief method for the detection of structural variants throughout the genome for the identification of novel mutations and to ultimately provide patients with a precise molecular diagnosis.
Collapse
|
33
|
Miller AR, Wijeratne S, McGrath SD, Schieffer KM, Miller KE, Lee K, Mathew M, LaHaye S, Fitch JR, Kelly BJ, White P, Mardis ER, Wilson RK, Cottrell CE, Magrini V. Pacific Biosciences Fusion and Long Isoform Pipeline for Cancer Transcriptome-Based Resolution of Isoform Complexity. J Mol Diagn 2022; 24:1292-1306. [PMID: 36191838 DOI: 10.1016/j.jmoldx.2022.09.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 08/05/2022] [Accepted: 09/13/2022] [Indexed: 01/13/2023] Open
Abstract
Genomic profiling using short-read sequencing has utility in detecting disease-associated variation in both DNA and RNA. However, given the frequent occurrence of structural variation in cancer, molecular profiling using long-read sequencing improves the resolution of such events. For example, the Pacific Biosciences long-read RNA-sequencing (Iso-Seq) transcriptome protocol provides full-length isoform characterization, discernment of allelic phasing, and isoform discovery, and identifies expressed fusion partners. The Pacific Biosciences Fusion and Long Isoform Pipeline (PB_FLIP) incorporates a suite of RNA-sequencing software analysis tools and scripts to identify expressed fusion partners and isoforms. In addition, sequencing of a commercial reference (Spike-In RNA Variants) with known isoform complexity was performed and demonstrated high recall of the Iso-Seq and PB_FLIP workflow to benchmark our protocol and analysis performance. This study describes the utility of Iso-Seq and PB_FLIP analysis in improving deconvolution of complex structural variants and isoform detection within an institutional pediatric and adolescent/young adult translational cancer research cohort. The exemplar case studies demonstrate that Iso-Seq and PB_FLIP discover novel expressed fusion partners, resolve complex intragenic alterations, and discriminate between allele-specific expression profiles.
Collapse
Affiliation(s)
- Anthony R Miller
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Saranga Wijeratne
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Sean D McGrath
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Kathleen M Schieffer
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Katherine E Miller
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio
| | - Kristy Lee
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio; Department of Pathology, The Ohio State University College of Medicine, Columbus, Ohio
| | - Mariam Mathew
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio; Department of Pathology, The Ohio State University College of Medicine, Columbus, Ohio
| | - Stephanie LaHaye
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - James R Fitch
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Benjamin J Kelly
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio
| | - Peter White
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio
| | - Elaine R Mardis
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio; Department of Neurosurgery, The Ohio State University College of Medicine, Columbus, Ohio
| | - Richard K Wilson
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio
| | - Catherine E Cottrell
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio; Department of Pathology, The Ohio State University College of Medicine, Columbus, Ohio.
| | - Vincent Magrini
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio
| |
Collapse
|
34
|
Hamim I, Sekine KT, Komatsu K. How do emerging long-read sequencing technologies function in transforming the plant pathology research landscape? PLANT MOLECULAR BIOLOGY 2022; 110:469-484. [PMID: 35962900 DOI: 10.1007/s11103-022-01305-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 07/26/2022] [Indexed: 06/15/2023]
Abstract
Long-read sequencing technologies are revolutionizing the sequencing and analysis of plant and pathogen genomes and transcriptomes, as well as contributing to emerging areas of interest in plant-pathogen interactions, disease management techniques, and the introduction of new plant varieties or cultivars. Long-read sequencing (LRS) technologies are progressively being implemented to study plants and pathogens of agricultural importance, which have substantial economic effects. The variability and complexity of the genome and transcriptome affect plant growth, development and pathogen responses. Overcoming the limitations of second-generation sequencing, LRS technology has significantly increased the length of a single contiguous read from a few hundred to millions of base pairs. Because of the longer read lengths, new analysis methods and tools have been developed for plant and pathogen genomics and transcriptomics. LRS technologies enable faster, more efficient, and high-throughput ultralong reads, allowing direct sequencing of genomes that would be impossible or difficult to investigate using short-read sequencing approaches. These benefits include genome assembly in repetitive areas, creating more comprehensive and exact genome determinations, assembling full-length transcripts, and detecting DNA and RNA alterations. Furthermore, these technologies allow for the identification of transcriptome diversity, significant structural variation analysis, and direct epigenetic mark detection in plant and pathogen genomic regions. LRS in plant pathology is found efficient for identifying and characterization of effectors in plants as well as known and unknown plant pathogens. In this review, we investigate how these technologies are transforming the landscape of determination and characterization of plant and pathogen genomes and transcriptomes efficiently and accurately. Moreover, we highlight potential areas of interest offered by LRS technologies for future study into plant-pathogen interactions, disease control strategies, and the development of new plant varieties or cultivars.
Collapse
Affiliation(s)
- Islam Hamim
- Laboratory of Plant Pathology, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan
- International Research Fellow of Japan Society for the Promotion of Science, Tokyo, Japan
- Department of Plant Pathology, Bangladesh Agricultural University, Mymensingh, Bangladesh
| | - Ken-Taro Sekine
- Faculty of Agriculture, University of the Ryukyus, Okinawa, Japan
| | - Ken Komatsu
- Laboratory of Plant Pathology, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan.
| |
Collapse
|
35
|
Zacharopoulou P, Ansari MA, Frater J. A calculated risk: Evaluating HIV resistance to the broadly neutralising antibodies10-1074 and 3BNC117. Curr Opin HIV AIDS 2022; 17:352-358. [PMID: 36178770 PMCID: PMC9594129 DOI: 10.1097/coh.0000000000000764] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE OF THIS REVIEW Broadly neutralising antibodies (bNAbs) are a promising new therapy for the treatment of HIV infection. However, the effective use of bNAbs is impacted by the presence of preexisting virological resistance and the potential to develop new resistance during treatment. With several bNAb clinical trials underway, sensitive and scalable assays are needed to screen for resistance. This review summarises the data on resistance from published clinical trials using the bNAbs 10-1074 and 3BNC117 and evaluates current approaches for detecting bNAb sensitivity as well as their limitations. RECENT FINDINGS Analyses of samples from clinical trials of 10-1074 and 3BNC117 reveal viral mutations that emerge on therapy which may result in bNAb resistance. These mutations are also found in some potential study participants prior to bNAb exposure. These clinical data are further informed by ex-vivo neutralisation assays which offer an alternative measure of resistance and allow more detailed interrogation of specific viral mutations. However, the limited amount of publicly available data and the need for better understanding of other viral features that may affect bNAb binding mean there is no widely accepted approach to measuring bNAb resistance. SUMMARY Resistance to the bNAbs 10-1074 and 3BNC117 may significantly impact clinical outcome following their therapeutic administration. Predicting bNAb resistance may help to lower the risk of treatment failure and therefore a robust methodology to screen for bNAb sensitivity is needed.
Collapse
Affiliation(s)
- Panagiota Zacharopoulou
- Peter Medawar Building for Pathogen Research, Nuffield Department of Medicine, University of Oxford
| | - M. Azim Ansari
- Peter Medawar Building for Pathogen Research, Nuffield Department of Medicine, University of Oxford
| | - John Frater
- Peter Medawar Building for Pathogen Research, Nuffield Department of Medicine, University of Oxford
- NIHR Oxford Biomedical Research Centre, Oxford, UK
| |
Collapse
|
36
|
Distinct HIV-1 Population Structure across Meningeal and Peripheral T Cells and Macrophage Lineage Cells. Microbiol Spectr 2022; 10:e0250822. [PMID: 36173332 PMCID: PMC9602438 DOI: 10.1128/spectrum.02508-22] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
HIV-1 sequence population structure among brain and nonbrain cellular compartments is incompletely understood. Here, we compared proviral pol and env high-quality consensus single-molecule real-time (SMRT) sequences derived from CD3+ T cells and CD14+ macrophage lineage cells from meningeal or peripheral (spleen, blood) tissues obtained at autopsy from two individuals with viral suppression on antiretroviral therapy (ART). Phylogenetic analyses showed strong evidence of population structure between CD3+ and CD14+ virus populations. Distinct env variable-region characteristics were also found between CD3+ and CD14+ viruses. Furthermore, shared macrophage-tropic amino acid residues (env) and drug resistance mutations (pol) between meningeal and peripheral virus populations were consistent with the meninges playing a role in viral gene flow across the blood-brain barrier. Overall, our results point toward potential functional differences among meningeal and peripheral CD3+ and CD14+ virus populations and a complex evolutionary history driven by distinct selection pressures and/or viral gene flow. IMPORTANCE Different cell types and/or tissues may serve as a reservoir for HIV-1 during ART-induced viral suppression. We compared proviral pol and env sequences from CD3+ T cells and CD14+ macrophage lineage cells from brain and nonbrain tissues from two virally suppressed individuals. We found strong evidence of viral population structure among cells/tissues, which may result from distinct selective pressures across cell types and anatomic sites.
Collapse
|
37
|
Yang W, Fomenkov A, Heiter D, Xu SY, Ettwiller L. High-throughput sequencing of EcoWI restriction fragments maps the genome-wide landscape of phosphorothioate modification at base resolution. PLoS Genet 2022; 18:e1010389. [PMID: 36121836 PMCID: PMC9521924 DOI: 10.1371/journal.pgen.1010389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/29/2022] [Accepted: 08/18/2022] [Indexed: 11/26/2022] Open
Abstract
Phosphorothioation (PT), in which a non-bridging oxygen is replaced by a sulfur, is one of the rare modifications discovered in bacteria and archaea that occurs on the sugar-phosphate backbone as opposed to the nucleobase moiety of DNA. While PT modification is widespread in the prokaryotic kingdom, how PT modifications are distributed in the genomes and their exact roles in the cell remain to be defined. In this study, we developed a simple and convenient technique called EcoWI-seq based on a modification-dependent restriction endonuclease to identify genomic positions of PT modifications. EcoWI-seq shows similar performance than other PT modification detection techniques and additionally, is easily scalable while requiring little starting material. As a proof of principle, we applied EcoWI-seq to map the PT modifications at base resolution in the genomes of both the Salmonella enterica cerro 87 and E. coli expressing the dnd+ gene cluster. Specifically, we address whether the partial establishment of modified PT positions is a stochastic or deterministic process. EcoWI-seq reveals a systematic usage of the same subset of target sites in clones for which the PT modification has been independently established. Large number of bacteria have modified their DNA mainly as part of a strategy to resist virus infection. Most of the modifications are chemical variations on the canonical bases A, T, C or G with phosphorothioate (PT) being a rare exception of a modification that happens on the backbone of the DNA. Interestingly, this PT modification was first chemically synthesized for specific biotechnological processes before scientists discovered that bacteria and archaea naturally perform this modification using their enzymes. The exact roles of phosphorothioation in bacteria and archaea is still under investigation. To enable further investigation of PT modifications, we designed EcoWI-seq, a method to identify the exact positions of these modifications in bacterial genomes. Notably, we applied the EcoWI-seq to several strains of E. coli for which PT modification has been induced by cloning into these strains, the necessary genes for making such modification. We found that these strains, despite being independently made, followed a precise pattern of PT modification with always the same sites being modified. This result indicates a deterministic process for the establishment of PT modification.
Collapse
Affiliation(s)
- Weiwei Yang
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Alexey Fomenkov
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Dan Heiter
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Shuang-yong Xu
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
- * E-mail: (SYX); (LE)
| | - Laurence Ettwiller
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
- * E-mail: (SYX); (LE)
| |
Collapse
|
38
|
Dong Q, Zou QC, Mao LH, Tian DQ, Hu W, Cao XR, Ding HQ. The Chromosome-Scale Assembly of the Curcuma alismatifolia Genome Provides Insight Into Anthocyanin and Terpenoid Biosynthesis. FRONTIERS IN PLANT SCIENCE 2022; 13:899588. [PMID: 35783929 PMCID: PMC9241516 DOI: 10.3389/fpls.2022.899588] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 05/23/2022] [Indexed: 06/01/2023]
Abstract
Curcuma alismatifolia, a bulbous flower known for its showy bracts, is widely used around the world as a cut flower, potted, and garden plant. Besides its ornamental value, this species is rich in terpenoid metabolites and could serve as a resource for essential oils. Here, we report a chromosome-level genome assembly of C. alismatifolia and describe its biosynthetic pathways for anthocyanins and terpenoids. This high-quality, assembled genome size is 991.3 Mb with a scaffold N50 value of 56.7 Mb. Evolutionary analysis of the genome suggests that C. alismatifolia diverged from Zingiber officinale about 9.7 million years ago, after it underwent a whole-genome duplication. Transcriptome analysis was performed on bracts at five developmental stages. Nine highly expressed genes were identified, encoding for six enzymes downstream of the anthocyanin biosynthetic pathway. Of these, one gene encoding F3'5'H might be a key node in the regulation of bract color formation. Co-expression network analysis showed that MYB, bHLH, NAC, and ERF transcription factors collectively regulated color formation in the bracts. Characterization of terpenoid biosynthesis genes revealed their dispersal and tandem duplications, both of which contributed greatly to the increase in the number of terpene synthase genes in C. alismatifolia, especially to species-specific expansion of sesquiterpene synthase genes. This work facilitates understanding of genetic basis of anthocyanin and terpenoid biosynthesis and could accelerate the selective breeding of C. alismatifolia varieties with higher ornamental and medicinal value.
Collapse
|
39
|
Ren J, Jiang Z, Li W, Kang X, Bai S, Yang L, Li S, Zhang D. Characterization of Glutenin Genes in Bread Wheat by Third-Generation RNA Sequencing and the Development of a Glu-1Dx5 Marker Specific for the Extra Cysteine Residue. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:7211-7219. [PMID: 35666675 DOI: 10.1021/acs.jafc.2c02050] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
High-molecular-weight glutenin subunits (HMW-GS) and low-molecular-weight glutenin subunits (LMW-GS) in a mature grain play important roles in the formation of a glutenin macropolymer and gluten quality. To characterize the expressed glutenin genes of the bread wheat variety Xinmai 26 during seed development, a total of 18 full-length transcripts were obtained by the newly emerged third-generation RNA sequencing of the PacBio Sequel II platform, including 5 transcripts of HMW-GS genes and 13 transcripts of LMW-GS genes (8 intact genes and 5 pseudogenes). Combined with the patterns of sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), allelic types of the obtained glutenin genes were, respectively, determined, wherein molecular characterization deduced by transcript1528 (1Dx5) and transcript907 (Glu-A3c) indicated their great influence on dough quality. In addition, a specific functional marker dCAPS5 was developed for the single-nucleotide substitution at position 353 of the 1Dx5 subunit, which was further intensively compared with the other proposed markers to efficiently utilize the 1Dx5 subunit with the extra cysteine residue. This study provides an efficient method to accurately identify and utilize glutenin genes in bread wheat, which is helpful in understanding the contributions of glutenin genes to wheat quality.
Collapse
Affiliation(s)
- Jiajia Ren
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| | - Zhikai Jiang
- Xinxiang Academy of Agricultural Sciences, Xinxiang 453003, China
| | - Wenjie Li
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| | - Xusen Kang
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| | - Shenglong Bai
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| | - Lijuan Yang
- Xinxiang Academy of Agricultural Sciences, Xinxiang 453003, China
| | - Suoping Li
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| | - Dale Zhang
- Key Laboratory of Plant Stress Biology, State Key Laboratory of Cotton Biology, College of Agriculture, Henan University, Kaifeng 475001, China
| |
Collapse
|
40
|
Goussarov G, Mysara M, Vandamme P, Van Houdt R. Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data. Microbiologyopen 2022; 11:e1298. [PMID: 35765182 PMCID: PMC9179125 DOI: 10.1002/mbo3.1298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 11/18/2022] Open
Abstract
The rise of metagenomics offers a leap forward for understanding the genetic diversity of microorganisms in many different complex environments by providing a platform that can identify potentially unlimited numbers of known and novel microorganisms. As such, it is impossible to imagine new major initiatives without metagenomics. Nevertheless, it represents a relatively new discipline with various levels of complexity and demands on bioinformatics. The underlying principles and methods used in metagenomics are often seen as common knowledge and often not detailed or fragmented. Therefore, we reviewed these to guide microbiologists in taking the first steps into metagenomics. We specifically focus on a workflow aimed at reconstructing individual genomes, that is, metagenome-assembled genomes, integrating DNA sequencing, assembly, binning, identification and annotation.
Collapse
Affiliation(s)
- Gleb Goussarov
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Mohamed Mysara
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| | - Peter Vandamme
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Rob Van Houdt
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| |
Collapse
|
41
|
Near-Complete Genome Sequence of Zygosaccharomyces rouxii NRRL Y-64007, a Yeast Capable of Growing on Lignocellulosic Hydrolysates. Microbiol Resour Announc 2022; 11:e0005022. [PMID: 35442079 PMCID: PMC9119105 DOI: 10.1128/mra.00050-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The halotolerant and osmotolerant yeast Zygosaccharomyces rouxii can produce multiple volatile compounds and has the ability to grow on lignocellulosic hydrolysates. We report the annotated genome sequence of Z. rouxii NRRL Y-64007 to support its development as a platform organism for biofuel and bioproduct production.
Collapse
|
42
|
Mutational spectrum of hepatitis C virus in patients with chronic hepatitis C determined by single molecule real-time sequencing. Sci Rep 2022; 12:7083. [PMID: 35490163 PMCID: PMC9056513 DOI: 10.1038/s41598-022-11151-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 04/18/2022] [Indexed: 11/08/2022] Open
Abstract
The emergence of hepatitis C virus (HCV) with resistance-associated substitution (RAS), produced by mutations in the HCV genome, is a major problem in direct acting antivirals (DAA) treatment. This study aimed to clarify the mutational spectrum in HCV-RNA and the substitution pattern for the emergence of RASs in patients with chronic HCV infection. HCV-RNA from two HCV replicon cell lines and the serum HCV-RNA of four non-liver transplant and four post-liver transplant patients with unsuccessful DAA treatment were analyzed using high-accuracy single-molecule real-time long-read sequencing. Transition substitutions, especially A>G and U>C, occurred prominently under DAAs in both non-transplant and post-transplant patients, with a mutational bias identical to that occurring in HCV replicon cell lines during 10-year culturing. These mutational biases were reproduced in natural courses after DAA treatment. RASs emerged via both transition and transversion substitutions. NS3-D168 and NS5A-L31 RASs resulted from transversion mutations, while NS5A-Y93 RASs was caused by transition substitutions. The fidelity of the RNA-dependent RNA polymerase, HCV-NS5B, produces mutational bias in the HCV genome, characterized by dominant transition mutations, notably A>G and U>C substitutions. However, RASs are acquired by both transition and transversion substitutions, and the RASs-positive HCV clones are selected and proliferated under DAA treatment pressure.
Collapse
|
43
|
Neupane S, Bonilla SI, Manalo AM, Pelz-Stelinski KS. Complete de novo assembly of Wolbachia endosymbiont of Diaphorina citri Kuwayama (Hemiptera: Liviidae) using long-read genome sequencing. Sci Rep 2022; 12:125. [PMID: 34996906 PMCID: PMC8741817 DOI: 10.1038/s41598-021-03184-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 11/26/2021] [Indexed: 01/23/2023] Open
Abstract
Wolbachia, a gram-negative \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{\alpha }$$\end{document}α-proteobacterium, is an endosymbiont found in some arthropods and nematodes. Diaphorina citri Kuwayama, the vector of ‘Candidatus Liberibacter asiaticus’ (CLas), are naturally infected with a strain of Wolbachia (wDi), which has been shown to colocalize with the bacteria pathogens CLas, the pathogen associated with huanglongbing (HLB) disease of citrus. The relationship between wDi and CLas is poorly understood in part because the complete genome of wDi has not been available. Using high-quality long-read PacBio circular consensus sequences, we present the largest complete circular wDi genome among supergroup-B members. The assembled circular chromosome is 1.52 megabases with 95.7% genome completeness with contamination of 1.45%, as assessed by checkM. We identified Insertion Sequences (ISs) and prophage genes scattered throughout the genomes. The proteins were annotated using Pfam, eggNOG, and COG that assigned unique domains and functions. The wDi genome was compared with previously sequenced Wolbachia genomes using pangenome and phylogenetic analyses. The availability of a complete circular chromosome of wDi will facilitate understanding of its role within the insect vector, which may assist in developing tools for disease management. This information also provides a baseline for understanding phylogenetic relationships among Wolbachia of other insect vectors.
Collapse
Affiliation(s)
- Surendra Neupane
- Entomology and Nematology Department, Citrus Research and Education Center/IFAS, University of Florida, Lake Alfred, Florida, 33850, USA
| | - Sylvia I Bonilla
- Entomology and Nematology Department, Citrus Research and Education Center/IFAS, University of Florida, Lake Alfred, Florida, 33850, USA
| | - Andrew M Manalo
- Entomology and Nematology Department, Citrus Research and Education Center/IFAS, University of Florida, Lake Alfred, Florida, 33850, USA
| | - Kirsten S Pelz-Stelinski
- Entomology and Nematology Department, Citrus Research and Education Center/IFAS, University of Florida, Lake Alfred, Florida, 33850, USA.
| |
Collapse
|
44
|
Chen Z, Wang Z, Xu Y, Zhang X, Tian B, Bai J. Controlled movement of ssDNA conjugated peptide through Mycobacterium smegmatis porin A (MspA) nanopore by a helicase motor for peptide sequencing application. Chem Sci 2021; 12:15750-15756. [PMID: 35003607 PMCID: PMC8654031 DOI: 10.1039/d1sc04342k] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 11/02/2021] [Indexed: 12/16/2022] Open
Abstract
The lack of an efficient, low-cost sequencing method has long been a significant bottleneck in protein research and applications. In recent years, the nanopore platform has emerged as a fast and inexpensive method for single-molecule nucleic acid sequencing, but attempts to apply it to protein/peptide sequencing have resulted in limited success. Here we report a strategy to control peptide translocation through the MspA nanopore, which could serve as the first step toward strand peptide sequencing. By conjugating the target peptide to a helicase-regulated handle-ssDNA, we achieved a read length of up to 17 amino acids (aa) and demonstrated the feasibility of distinguishing between amino acid residues of different charges or between different phosphorylation sites. Further improvement of resolution may require engineering MspA-M2 to reduce its constriction zone's size and stretch the target peptide inside the nanopore to minimize random thermal motion. We believe that our method in this study can significantly accelerate the development and commercialization of nanopore-based peptide sequencing technologies.
Collapse
Affiliation(s)
- Zhijie Chen
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| | - Zhenqin Wang
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| | - Yang Xu
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| | - Xiaochun Zhang
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| | - Boxue Tian
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| | - Jingwei Bai
- School of Pharmaceutical Sciences, Tsinghua University 100084 Beijing China
| |
Collapse
|
45
|
Ultradeep HIV-1 Proviral Envelope Sequencing Reveals Complex Population Structure within and between Brain and Splenic Tissues. J Virol 2021; 95:e0120221. [PMID: 34495695 DOI: 10.1128/jvi.01202-21] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Understanding tissue-based HIV-1 proviral population structure is important for improving treatment strategies for individuals with HIV-associated neurological disorders (HAND). Previous analyses have revealed HIV-1 envelope (env) population structure between brain and peripheral tissues as well as Env functional differences, especially in individuals with HAND. Furthermore, population structure has been detected among different anatomical locations in the brain itself, although such patterns are inconsistent across individuals and less strongly associated with the presence/absence of HAND. Here, we utilized the Pacific Biosciences single-molecule real-time (SMRT) high-throughput technology to generate thousands of sequences for each tissue, along with phylogenetic and distance-based analyses, to investigate env sequences from paired brain and spleen samples from eight individuals with/without HAND. To account for the high error rate associated with SMRT sequencing, we used a clustering approach to identify high-quality consensus sequences representative of ≥10 reads ("HQCS10"). In parallel, we characterized variable regions from nonclustered sequences to identify potential functional differences. We found evidence for significant population structure between brain and spleen tissues, as well as among brain tissues and within the same brain tissue, in individuals both with and without HAND. Variable region analysis showed differences in length and charge among brain and nonbrain tissues as well as within the brain, suggesting possible functional differences. Our results demonstrate the complexity of HIV-1 env structure/gene flow among tissues and support the concept that selective pressures in different tissue microenvironments drive viral evolution and adaptation. IMPORTANCE Understanding the evolution of HIV-1 in the brain compared to other tissues is important for improving treatment strategies for individuals with HIV-associated neurological disorders (HAND). We utilized high-throughput sequencing technology to generate thousands of full-length env sequences from paired brain and spleen samples from eight individuals with/without HAND. We found significant viral population structure for participants both with and without HAND, providing robust evidence for the brain as a compartmentalized tissue and potentially a viral reservoir. We also found striking genetic differences between virus populations, even from the same tissue, suggesting the potential for functional differences and the possibility for multiple evolutionary pathways that result in similar tropisms and/or other tissue-adapted characteristics. Our results demonstrate the complexity of viral population structure within the brain and suggest that analysis of peripheral blood samples alone may not be fully informative with respect to improving strategies to treat or eradicate HIV-1.
Collapse
|
46
|
Ren H, He Y, Qi X, Zheng X, Zhang S, Yu Z, Hu F. The bayberry database: a multiomic database for Myrica rubra, an important fruit tree with medicinal value. BMC PLANT BIOLOGY 2021; 21:452. [PMID: 34615485 PMCID: PMC8493685 DOI: 10.1186/s12870-021-03232-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 09/28/2021] [Indexed: 05/16/2023]
Abstract
BACKGROUND Chinese bayberry (Myrica rubra Sieb. & Zucc.) is an important fruit tree in China, and has high medicinal value. At present, the genome, transcriptome and germplasm resources of bayberry have been reported. In order to make more convenient use of these data, the Bayberry Database was established. RESULTS The Bayberry Database is a comprehensive and intuitive data platform for examining the diverse annotated genome and germplasm resources of this species. This database contains nine central functional domains to interact with multiomic data: home, genome, germplasm, markers, tools, map, expression, reference, and contact. All domains provide pathways to a variety of data types composed of a reference genome sequence, transcriptomic data, gene patterns, phenotypic data, fruit images of Myrica rubra varieties, gSSR data, gene maps with annotation and evolutionary analyses. The tools module includes BLAST search, keyword search, sequence fetch and enrichment analysis functions. CONCLUSIONS The web address of the database is as follows http://www.bayberrybase.cn/ . The Myrica rubra database is an intelligent, interactive, and user-friendly system that enables researchers, breeders and horticultural personnel to browse, search and retrieve relevant and useful information and thus facilitate genomic research and breeding efforts concerning Myrica rubra. This database will be of great help to bayberry research and breeding in the future.
Collapse
Affiliation(s)
- Haiying Ren
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Yuanhao He
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
- College of Landscape Architecture, Nanjing Forestry University, Nanjing, 210037 China
| | - Xingjiang Qi
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Xiliang Zheng
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Shuwen Zhang
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Zheping Yu
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Fengrong Hu
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| |
Collapse
|
47
|
Mielinis P, Sukackaitė R, Serapinaitė A, Samoilovas F, Alzbutas G, Matjošaitis K, Lubys A. MuA-based Molecular Indexing for Rare Mutation Detection by Next-Generation Sequencing. J Mol Biol 2021; 433:167209. [PMID: 34419430 DOI: 10.1016/j.jmb.2021.167209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 08/12/2021] [Accepted: 08/12/2021] [Indexed: 12/12/2022]
Abstract
Detection of low-frequency mutations in cancer genomes or other heterogeneous cell populations requires high-fidelity sequencing. Molecular barcoding is one of the key technologies that enables the differentiation of true mutations from errors, which can be caused by sequencing or library preparation processes. However, current approaches where barcodes are introduced via primer extension or adaptor ligation do not utilize the full power of barcoding, due to complicated library preparation workflows and biases. Here we demonstrate the remarkable tolerance of MuA transposase to the presence of multiple replacements in transposon sequence, and explore this unique feature to engineer the MuA transposome complex with randomised nucleotides in 12 transposon positions, which can be introduced as a barcode into the target molecule after transposition event. We applied the approach of Unique MuA-based Molecular Indexing (UMAMI) to assess the power of rare mutation detection by shortgun sequencing on the Illumina platform. Our results show that UMAMI allows detection of rare mutations readily and reliably, and in this paper we report error rate values for the number of thermophilic DNA polymerases measured by using UMAMI.
Collapse
Affiliation(s)
- Paulius Mielinis
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| | - Rasa Sukackaitė
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania.
| | - Aistė Serapinaitė
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| | - Faustas Samoilovas
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| | - Gediminas Alzbutas
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| | - Karolis Matjošaitis
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| | - Arvydas Lubys
- Thermo Fisher Scientific Baltics UAB, V. A. Graičiūno 8, Vilnius LT-02241, Lithuania
| |
Collapse
|
48
|
PacBio sequencing output increased through uniform and directional fivefold concatenation. Sci Rep 2021; 11:18065. [PMID: 34508117 PMCID: PMC8433307 DOI: 10.1038/s41598-021-96829-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 08/17/2021] [Indexed: 12/20/2022] Open
Abstract
Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600-5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000-300,000 reads with an accuracy per base pair of 90-99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.
Collapse
|
49
|
High-Level Diversity of Basal Fungal Lineages and the Control of Fungal Community Assembly by Stochastic Processes in Mangrove Sediments. Appl Environ Microbiol 2021; 87:e0092821. [PMID: 34190611 DOI: 10.1128/aem.00928-21] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Fungi are key components of microbial communities in mangrove wetlands, with important roles in the transformation of nutrients and energy. However, existing studies typically focus on cultivable fungi and seldom on the structure and driving factors of entire fungal communities. The compositions, community assembly, and interaction patterns of mangrove fungal communities on a large scale remain elusive. Here, biogeography, assembly, and co-occurrence patterns of fungal communities in mangroves across eastern to southern China were systematically analyzed by targeting the entire internal transcribed spacer (ITS) region with high-throughput Pacific Biosciences single-molecule real-time sequencing. The analysis revealed a high level of fungal diversity, including a number of basal fungal lineages not previously reported in mangroves, such as Rozellomycota and Chytridiomycota. Beta nearest-taxon index analyses suggested a determinant role of dispersal limitation on fungal community in overall and most individual mangroves, with support from the strong distance-decay patterns of community similarity. Further, nonmetric multidimensional scaling analyses revealed similar biogeographies of dominant and rare fungal communities. A minor role of environmental selection on the fungal community was noted, with geographical location and sediment depth as crucial factors driving the distribution of both, the dominant and rare taxa. Finally, network analysis revealed high modularized co-occurrence patterns of fungal community in mangrove sediments, and the keystone taxa might play important roles in microbial interactions and ecological functions. The investigation expands our understanding of biogeography, assembly patterns, driving factors, and co-occurrence relationships of mangrove fungi and will spur the further functional exploration and protection of fungal resources in mangroves. IMPORTANCE As key components of microbial community in mangroves, fungi have important ecological functions. However, the fungal community in mangroves on a large scale is generally elusive, and mangroves are declining rapidly due to climate change and anthropogenic activities. This work provides an overview of fungal community structure and biogeography in mangrove wetlands along a >9,000-km coastline across eastern to southern China. Our study observed a high number of basal fungal lineages, such as Rozellomycota and Chytridiomycota, in mangrove sediments. In addition, our results highlight a crucial role of dispersal limitation and a minor role of environmental selections on fungal communities in mangrove sediments. These novel findings add important knowledge about the structure, assembly processes, and driving factors of fungal communities in mangrove sediments.
Collapse
|
50
|
Stephens Z, Milosevic D, Kipp B, Grebe S, Iyer RK, Kocher JPA. PB-Motif-A Method for Identifying Gene/Pseudogene Rearrangements With Long Reads: An Application to CYP21A2 Genotyping. Front Genet 2021; 12:716586. [PMID: 34394200 PMCID: PMC8355628 DOI: 10.3389/fgene.2021.716586] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/05/2021] [Indexed: 12/30/2022] Open
Abstract
Long read sequencing technologies have the potential to accurately detect and phase variation in genomic regions that are difficult to fully characterize with conventional short read methods. These difficult to sequence regions include several clinically relevant genes with highly homologous pseudogenes, many of which are prone to gene conversions or other types of complex structural rearrangements. We present PB-Motif, a new method for identifying rearrangements between two highly homologous genomic regions using PacBio long reads. PB-Motif leverages clustering and filtering techniques to efficiently report rearrangements in the presence of sequencing errors and other systematic artifacts. Supporting reads for each high-confidence rearrangement can then be used for copy number estimation and phased variant calling. First, we demonstrate PB-Motif's accuracy with simulated sequence rearrangements of PMS2 and its pseudogene PMS2CL using simulated reads sweeping over a range of sequencing error rates. We then apply PB-Motif to 26 clinical samples, characterizing CYP21A2 and its pseudogene CYP21A1P as part of a diagnostic assay for congenital adrenal hyperplasia. We successfully identify damaging variation and patient carrier status concordant with clinical diagnosis obtained from multiplex ligation-dependent amplification (MLPA) and Sanger sequencing. The source code is available at: github.com/zstephens/pb-motif.
Collapse
Affiliation(s)
- Zachary Stephens
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | | | | | | - Ravishankar K Iyer
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | |
Collapse
|