301
|
Next-Generation Sequencing in High-Sensitive Detection of Mutations in Tumors: Challenges, Advances, and Applications. J Mol Diagn 2020; 22:994-1007. [PMID: 32480002 DOI: 10.1016/j.jmoldx.2020.04.213] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 03/17/2020] [Accepted: 04/23/2020] [Indexed: 02/06/2023] Open
Abstract
Next-generation sequencing (NGS) technologies have come of age as preferred technologies for screening of genomic variants of pathologic and therapeutic potential. Because of their capability for high-throughput and massively parallel sequencing, they can screen for a variety of genomic changes in multiple samples simultaneously. This has made them platforms of choice for clinical testing of solid tumors and hematological malignancies. Consequently, they are increasingly replacing conventional technologies, such as Sanger sequencing and pyrosequencing, expression arrays, real-time PCR, and fluorescence in situ hybridization methods, for routine molecular testing of tumors. However, one limitation of routinely used NGS technologies is the inability to detect low-level genomic variants with high accuracy. This can be attributed to the frequent occurrence of low-level sequencing errors and artifacts in NGS workflow that need specialized approaches to be identified and eliminated. This review focuses on the origins and nature of these artifacts and recent improvements in the NGS technologies to overcome them to facilitate accurate high-sensitive detection of low-level mutations. Potential applications of high-sensitive NGS in oncology and comparisons with non-NGS technologies of similar capabilities are also summarized.
Collapse
|
302
|
Pearman WS, Freed NE, Silander OK. Testing the advantages and disadvantages of short- and long- read eukaryotic metagenomics using simulated reads. BMC Bioinformatics 2020; 21:220. [PMID: 32471343 PMCID: PMC7257156 DOI: 10.1186/s12859-020-3528-4] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 04/30/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The first step in understanding ecological community diversity and dynamics is quantifying community membership. An increasingly common method for doing so is through metagenomics. Because of the rapidly increasing popularity of this approach, a large number of computational tools and pipelines are available for analysing metagenomic data. However, the majority of these tools have been designed and benchmarked using highly accurate short read data (i.e. Illumina), with few studies benchmarking classification accuracy for long error-prone reads (PacBio or Oxford Nanopore). In addition, few tools have been benchmarked for non-microbial communities. RESULTS Here we compare simulated long reads from Oxford Nanopore and Pacific Biosciences (PacBio) with high accuracy Illumina read sets to systematically investigate the effects of sequence length and taxon type on classification accuracy for metagenomic data from both microbial and non-microbial communities. We show that very generally, classification accuracy is far lower for non-microbial communities, even at low taxonomic resolution (e.g. family rather than genus). We then show that for two popular taxonomic classifiers, long reads can significantly increase classification accuracy, and this is most pronounced for non-microbial communities. CONCLUSIONS This work provides insight on the expected accuracy for metagenomic analyses for different taxonomic groups, and establishes the point at which read length becomes more important than error rate for assigning the correct taxon.
Collapse
Affiliation(s)
- William S Pearman
- School of Natural and Computational Sciences, Massey University, Private Bag 102904, North Shore, Auckland, 0745, New Zealand.
| | - Nikki E Freed
- School of Natural and Computational Sciences, Massey University, Private Bag 102904, North Shore, Auckland, 0745, New Zealand
| | - Olin K Silander
- School of Natural and Computational Sciences, Massey University, Private Bag 102904, North Shore, Auckland, 0745, New Zealand.
| |
Collapse
|
303
|
Tan L, Gómez-Betancur I, Guo S, Ge Y, Zhao J, Chen C, Wang N. Complete Genome of Vibrio neocaledonicus CGJ02-2, An active Compounds Producing Bacterium Isolated from South China Sea. Curr Microbiol 2020; 77:2665-2673. [PMID: 32458061 DOI: 10.1007/s00284-020-02047-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2020] [Accepted: 05/20/2020] [Indexed: 11/30/2022]
Abstract
Strain CGJ02-2 was isolated from the coral reefs in South China sea and deposited in South China Sea Institute of Oceanology, Chinese Academy of Sciences. Active compounds including indole, ρ-hydroxybenzaldehyde were isolated from this strain. To explore the biosynthetic way of these compounds and search gene clusters, the complete genome of this strain was sequenced by Single Molecule, Real-Time (SMRT) technology. It was de novo assembled to two circular chromosomes of 3,400,283 bp with GC% 44.77 and 1,845,572 bp with GC% 44.59 respectively and classified as Vibrio alginolyticus. In silico phenotype features of Vibrio alginolyticus CGJ02-2 were also analyzed. The biosynthetic pathway of ρ-hydroxybenzaldehyde and indole in this strain were postulated. Gene clusters of four secondary metabolites including bacteriocin, ectoine, siderophore, arylpolyene were identified. This study provides helpful information for further utilizing Vibrio alginolyticus CGJ02-2 as a source of valuable bioactive compounds.
Collapse
Affiliation(s)
- Lin Tan
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences (CATAS), Haikou, 571101, Hainan, China.
| | - Isabel Gómez-Betancur
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences (CATAS), Haikou, 571101, Hainan, China.,Programa de Ofidismo/Escorpionismo, Facultad de Ciencias Farmacéuticas Y Alimentarias, Universidad de Antioquia, 1226, Medellín, Colombia
| | - Suxia Guo
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences (CATAS), Haikou, 571101, Hainan, China
| | - Yu Ge
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences (CATAS), Haikou, 571101, Hainan, China
| | - Jianping Zhao
- School of Pharmacy, National Center for Natural Products Research, Thad Cochran Research Center, University of Mississippi, University, MS, 38677, USA
| | - Chang Chen
- Key Laboratory of Tropical Marine Bio-Resources and Ecology, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, Guangdong, China.
| | - Nan Wang
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences (CATAS), Haikou, 571101, Hainan, China
| |
Collapse
|
304
|
Yeetong P, Pongpanich M, Srichomthong C, Assawapitaksakul A, Shotelersuk V, Tantirukdham N, Chunharas C, Suphapeetiporn K, Shotelersuk V. TTTCA repeat insertions in an intron of YEATS2 in benign adult familial myoclonic epilepsy type 4. Brain 2020; 142:3360-3366. [PMID: 31539032 DOI: 10.1093/brain/awz267] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Revised: 06/02/2019] [Accepted: 07/04/2019] [Indexed: 11/13/2022] Open
Abstract
Epilepsy is a common neurological disorder and identification of its causes is important for a better understanding of its pathogenesis. We previously studied a Thai family with a type of epilepsy, benign adult familial myoclonic epilepsy type 4 (BAFME4), and localized its gene to chromosome 3q26.32-q28. Here, we used single-molecule real-time sequencing and found expansions of TTTTA and insertions of TTTCA repeats in intron 1 of YEATS2 in one affected member of the family. Of all the available members in the family-comprising 13 affected and eight unaffected-repeat-primed PCR and long-range PCR revealed the co-segregation of the TTTCA repeat insertions with the TTTTA repeat expansions and the disease status. For 1116 Thai control subjects, none were found to harbour the TTTCA repeats while four had the TTTTA repeat expansions. Therefore, our findings suggest that BAFME4 is caused by the insertions of the intronic TTTCA repeats in YEATS2. Interestingly, all four types of BAFMEs for which underlying genes have been found (BAFMEs 1, 4, 6 and 7) are caused by the same molecular pathology, suggesting that the insertions of non-coding TTTCA repeats are involved in their pathogenesis.
Collapse
Affiliation(s)
- Patra Yeetong
- Division of Human Genetics, Department of Botany, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
| | - Monnat Pongpanich
- Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand.,Omics Sciences and Bioinformatics Center, Faculty of Science, Chulalongkorn University, Bangkok,, Thailand
| | - Chalurmpon Srichomthong
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.,Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Adjima Assawapitaksakul
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.,Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Varote Shotelersuk
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.,Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Nithiphut Tantirukdham
- Division of Human Genetics, Department of Botany, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
| | - Chaipat Chunharas
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Kanya Suphapeetiporn
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.,Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Vorasuk Shotelersuk
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.,Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| |
Collapse
|
305
|
Batista FM, Stapleton T, Lowther JA, Fonseca VG, Shaw R, Pond C, Walker DI, van Aerle R, Martinez-Urtaza J. Whole Genome Sequencing of Hepatitis A Virus Using a PCR-Free Single-Molecule Nanopore Sequencing Approach. Front Microbiol 2020; 11:874. [PMID: 32523561 PMCID: PMC7261825 DOI: 10.3389/fmicb.2020.00874] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 04/14/2020] [Indexed: 12/18/2022] Open
Abstract
Hepatitis A virus (HAV) is one of the most common causes of acute viral hepatitis in humans. Although HAV has a relatively small genome, there are several factors limiting whole genome sequencing such as PCR amplification artefacts and ambiguities in de novo assembly. The recently developed Oxford Nanopore technologies (ONT) allows single-molecule sequencing of long-size fragments of DNA or RNA using PCR-free strategies. We have sequenced the whole genome of HAV using a PCR-free approach by direct reverse-transcribed sequencing. We were able to sequence HAV cDNA and obtain reads over 7 kilobases in length containing almost the whole genome of the virus. The comparison of these raw long nanopore reads with the HAV reference wild type revealed a nucleotide sequence identity between 81.1 and 96.6%. By de novo assembly of all HAV reads we obtained a consensus sequence of 7362 bases, with a nucleotide sequence identity of 99.0% with the genome of the HAV strain pHM175/18f. When the assembly was performed using as reference the HAV strain pHM175/18f a consensus with a sequence similarity of 99.8 % was obtained. We have also used an ONT amplicon-based assay to sequence two fragments of the VP3 and VP1 regions which showed a sequence similarity of 100% with matching regions of the consensus sequence obtained using the direct cDNA sequencing approach. This study showed the applicability of ONT sequencing technologies to obtain the whole genome of HAV by direct cDNA nanopore sequencing, highlighting the utility of this PCR-free approach for HAV characterization and potentially other viruses of the Picornaviridae family.
Collapse
Affiliation(s)
- Frederico M Batista
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Tina Stapleton
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - James A Lowther
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Vera G Fonseca
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Rebecca Shaw
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Christopher Pond
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - David I Walker
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Ronny van Aerle
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom
| | - Jaime Martinez-Urtaza
- International Centre of Excellence for Aquatic Animal Health, Centre for Environment Fisheries and Aquaculture Science (CEFAS), Weymouth, Dorset, United Kingdom.,Department of Genetics and Microbiology, Facultat de Biociències - Edifici C, Campus Universitat Autònoma de Barcelona (UAB), Barcelona, Spain
| |
Collapse
|
306
|
Chen L, Wang J, Tan L, Lu C, Fu G, Fu L, Zhang X, Wang Q, Ma C, Cong B, Li S. Highly accurate mtGenome haplotypes from long-read SMRT sequencing can distinguish between monozygotic twins. Forensic Sci Int Genet 2020; 47:102306. [PMID: 32438082 DOI: 10.1016/j.fsigen.2020.102306] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 04/16/2020] [Accepted: 04/29/2020] [Indexed: 12/13/2022]
Abstract
Discriminating between monozygotic twins (MZ) remains a challenge in the field of forensics globally. It is very difficult to find sequence variants within MZ twins, despite using ultra-deep next generation sequencing (NGS) for nuclear DNA. However, mitochondrial DNA might be a potential marker owing to its higher mutation rate and easier sequencing via NGS. Here, we aimed to introduce a long-read single molecule real-time sequencing (SMRT) strategy, with better continuity and fewer alignment errors, to obtain more accurate mitochondrial genome (mtGenome) sequence on the Sequel platform. Compared to Ion Torrent Personal Genome Machine (PGM), the long-read SMRT sequencing strategy generated highly accurate and mapped circular consensus sequence (CCS) reads and exhibited robust performance in terms of reliable repeatability, consistent coverage pattern, and balanced strands in mtGenome recovery. Moreover, the long-read SMRT strategy exhibited superior ability to not only identify accurate haplotypes but also discover a total of 785 low-level variants within 16 MZ twin pairs with threshold of 2% and 20 CCS reads with Q30 quality. Taken together, our findings suggested the long-read SMRT technology as an appreciable strategy for obtaining accurate mitotypes and providing a promising solution for distinguishing between MZ twin pairs in forensic genetics.
Collapse
Affiliation(s)
- Lu Chen
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Jiaqian Wang
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Lu Tan
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Chaolong Lu
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Guangping Fu
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Lihong Fu
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Xiaojing Zhang
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Qian Wang
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Chunling Ma
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Bin Cong
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China
| | - Shujin Li
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, Hebei Province, Shijiazhuang, 050017, PR China.
| |
Collapse
|
307
|
Hu X, Zeng C, Hou Z, Wang Y, Xu Q, Isobe K, Senoo K, Zhu L. The complete genome sequence of the archaeal isolate Halomicrobium sp. ZPS1 reveals the nitrogen metabolism characteristics under hypersaline conditions. ANN MICROBIOL 2020. [DOI: 10.1186/s13213-020-01575-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Abstract
Purpose
As a potential tool for the biodegradation of nitrogen contaminants, including nitrate, nitrite, and ammonium, in pickled foods with high salinity, the halophilic and denitrifying archaeal strain Halomicrobium sp. ZPS1 was isolated from edible salt particles.
Methods
Under anaerobic and static culture conditions, Halomicrobium sp. ZPS1 could simultaneously degrade nitrate, nitrite, and ammonium in liquid medium with 18% salinity and generate N2O. To gain insight into these physiological characteristics, the complete genome of Halomicrobium sp. ZPS1 was sequenced to reveal the mechanism of nitrogen metabolism associated with salt-tolerance.
Result
The complete genome sequencing revealed a genome size of 3,094,203 bp with a circular chromosome and a GC content of 65.64%. Based on gene annotation, 3191 CDSs, 6 rRNA genes, and 76 tRNA genes were identified. Moreover, 28 genes were annotated as related to salt tolerance, ammonium assimilation, and a truncated denitrification pathway.
Conclusion
The annotated functional genes indicate that Halomicrobium sp. ZPS1 could be a candidate strain for the simultaneous removal of nitrate, nitrite, and ammonia in extremely high salt environments.
Collapse
|
308
|
Fang X, Qin K, Li S, Han S, Zhu T, Fang X, Qin K. Whole genome sequence of Diaporthe capsici, a new pathogen of walnut blight. Genomics 2020; 112:3751-3761. [PMID: 32371024 DOI: 10.1016/j.ygeno.2020.04.018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/15/2020] [Accepted: 04/24/2020] [Indexed: 11/15/2022]
Abstract
Many fungi in the Diaporthe genus across the world are pathogenic. Diaporthe capsici. is a pathogenic fungus that can infect peppers and walnuts, causing their death. The aim of this study was to develop a genomic resource to provide substantial data and a theoretical basis for research on molecular pathogenesis, transcriptome, proteome, and metabonome of D. capsici. The whole genome of D. capsici was sequenced using the PacBio RSII sequencing platform, and functional annotation was performed using different public databases. The genome was found to be 57.56 Mb in size, with an N50 contig size of 5,171,887 bp, and encodes 14,425 putative genes. This is the first genome-scale assembly and annotation for D. capsici, which is the eighth species in Diaporthe to be sequenced.
Collapse
Affiliation(s)
- Xinmei Fang
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| | - Kai Qin
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| | - Shujiang Li
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| | - Shan Han
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| | - Tianhui Zhu
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China.
| | - Xinmei Fang
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| | - Kai Qin
- College of Forestry, Sichuan Agricultural University, Chengdu 611130, Sichuan Province, China
| |
Collapse
|
309
|
Comparative Genomic Analysis Provides Insights into the Phylogeny, Resistome, Virulome, and Host Adaptation in the Genus Ewingella. Pathogens 2020; 9:pathogens9050330. [PMID: 32354059 PMCID: PMC7281767 DOI: 10.3390/pathogens9050330] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/20/2020] [Accepted: 04/22/2020] [Indexed: 12/18/2022] Open
Abstract
Ewingella americana is a cosmopolitan bacterial pathogen that has been isolated from many hosts. Here, we sequenced a high-quality genome of E. americana B6-1 isolated from Flammulina filiformis, an important cultivated mushroom, performed a comparative genomic analysis with four other E. americana strains from various origins, and tested the susceptibility of B6-1 to antibiotics. The genome size, predicted genes, and GC (guanine-cytosine) content of B6-1 was 4.67 Mb, 4301, and 53.80%, respectively. The origin of the strains did not significantly affect the phylogeny, but mobile genetic elements shaped the evolution of the genus Ewingella. The strains encoded a set of common genes for type secretion, virulence effectors, CAZymes, and toxins required for pathogenicity in all hosts. They also had antibiotic resistance, pigments to suppress or evade host defense responses, as well as genes for adaptation to different environmental conditions, including temperature, oxidation, and nutrients. These findings provide a better understanding of the virulence, antibiotic resistance, and host adaptation strategies of Ewingella, and they also contribute to the development of effective control strategies.
Collapse
|
310
|
Zhou J, Sun T, Kang W, Tang D, Feng Q. Pathogenic and antimicrobial resistance genes in Streptococcus oralis strains revealed by comparative genome analysis. Genomics 2020; 112:3783-3793. [PMID: 32334114 DOI: 10.1016/j.ygeno.2020.04.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 04/03/2020] [Accepted: 04/20/2020] [Indexed: 02/05/2023]
Abstract
Streptococcus oralis is an early colonizer bacterium in dental plaques and is considered a potential pathogen of infective endocarditis (IE) disease. In this study, we built a complete genome map of Streptococcus oralis strain SOT, Streptococcus oralis strain SOD and Streptococcus infantis strain SO and performed comparative genomic analysis among these three strains. The results showed that there are five genomic islands (GIs) in strain SOT and one CRISPR in strain SOD. Each genome harbors various pathogenic genes related to diseases and drug resistance, while the antibiotic resistance genes in strains SOT and SOD were quite similar but different from those in strain SO. In addition, we identified 17 main virulence factors and capsule-related genes in three strains. These results suggest the pathogenic potential of Streptococcus strains, which lay a foundation for the prevention and treatment of a Streptococcus oralis infection.
Collapse
Affiliation(s)
- Jiannan Zhou
- Department of Human Microbiome, School and Hospital of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, No. 44-1 Wenhua Road West, 250012 Jinan, Shandong, China
| | - Tianyong Sun
- Department of Human Microbiome, School and Hospital of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, No. 44-1 Wenhua Road West, 250012 Jinan, Shandong, China
| | - Wenyan Kang
- Department of Human Microbiome, School and Hospital of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, No. 44-1 Wenhua Road West, 250012 Jinan, Shandong, China
| | - Di Tang
- Department of Human Microbiome, School and Hospital of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, No. 44-1 Wenhua Road West, 250012 Jinan, Shandong, China
| | - Qiang Feng
- Department of Human Microbiome, School and Hospital of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, No. 44-1 Wenhua Road West, 250012 Jinan, Shandong, China; State Key Laboratory of Microbial Technology, Shandong University,266237 Qingdao, Shandong, China; NHC Key Laboratory of Otorhinolaryngology (Shandong University), Jinan, Shandong, China.
| |
Collapse
|
311
|
Heyer EE, Blackburn J. Sequencing Strategies for Fusion Gene Detection. Bioessays 2020; 42:e2000016. [DOI: 10.1002/bies.202000016] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 03/11/2020] [Indexed: 02/06/2023]
Affiliation(s)
- Erin E. Heyer
- The Kinghorn Cancer CentreGarvan Institute of Medical Research 384 Victoria Street Darlinghurst NSW 2010 Australia
| | - James Blackburn
- The Kinghorn Cancer CentreGarvan Institute of Medical Research 384 Victoria Street Darlinghurst NSW 2010 Australia
- Faculty of Medicine, St. Vincent's Clinical SchoolUNSW, St Vincent's Hospital Victoria Street Darlinghurst NSW 2010 Australia
| |
Collapse
|
312
|
Sequencing barcode construction and identification methods based on block error-correction codes. SCIENCE CHINA-LIFE SCIENCES 2020; 63:1580-1592. [PMID: 32303959 DOI: 10.1007/s11427-019-1651-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 02/11/2020] [Indexed: 02/07/2023]
Abstract
Multiplexed sequencing relies on specific sample labels, the barcodes, to tag DNA fragments belonging to different samples and to separate the output of the sequencers. However, the barcodes are often corrupted by insertion, deletion and substitution errors introduced during sequencing, which may lead to sample misassignment. In this paper, we propose a barcode construction method, which combines a block error-correction code with a predetermined pseudorandom sequence to generate a base sequence for labeling different samples. Furthermore, to identify the corrupted barcodes for assigning reads to their respective samples, we present a soft decision identification method that consists of inner decoding and outer decoding. The inner decoder establishes the hidden Markov model (HMM) for base insertion/deletion estimation with the pseudorandom sequence, and adapts the forward-backward (FB) algorithm to output the soft information of each bit in the block code. The outer decoder performs soft decision decoding using the soft information to effectively correct multiple errors in the barcodes. Simulation results show that the proposed methods are highly robust to high error rates of insertions, deletions and substitutions in the barcodes. In addition, compared with the inner decoding algorithm of the barcodes based on watermarks, the proposed inner decoding algorithm can greatly reduce the decoding complexity.
Collapse
|
313
|
Rice ES, Koren S, Rhie A, Heaton MP, Kalbfleisch TS, Hardy T, Hackett PH, Bickhart DM, Rosen BD, Ley BV, Maurer NW, Green RE, Phillippy AM, Petersen JL, Smith TPL. Continuous chromosome-scale haplotypes assembled from a single interspecies F1 hybrid of yak and cattle. Gigascience 2020; 9:giaa029. [PMID: 32242610 PMCID: PMC7118895 DOI: 10.1093/gigascience/giaa029] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 01/08/2020] [Accepted: 03/10/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND The development of trio binning as an approach for assembling diploid genomes has enabled the creation of fully haplotype-resolved reference genomes. Unlike other methods of assembly for diploid genomes, this approach is enhanced, rather than hindered, by the heterozygosity of the individual sequenced. To maximize heterozygosity and simultaneously assemble reference genomes for 2 species, we applied trio binning to an interspecies F1 hybrid of yak (Bos grunniens) and cattle (Bos taurus), 2 species that diverged nearly 5 million years ago. The genomes of both of these species are composed of acrocentric autosomes. RESULTS We produced the most continuous haplotype-resolved assemblies for a diploid animal yet reported. Both the maternal (yak) and paternal (cattle) assemblies have the largest 2 chromosomes in single haplotigs, and more than one-third of the autosomes similarly lack gaps. The maximum length haplotig produced was 153 Mb without any scaffolding or gap-filling steps and represents the longest haplotig reported for any species. The assemblies are also more complete and accurate than those reported for most other vertebrates, with 97% of mammalian universal single-copy orthologs present. CONCLUSIONS The high heterozygosity inherent to interspecies crosses maximizes the effectiveness of the trio binning method. The interspecies trio binning approach we describe is likely to provide the highest-quality assemblies for any pair of species that can interbreed to produce hybrid offspring that develop to sufficient cell numbers for DNA extraction.
Collapse
Affiliation(s)
- Edward S Rice
- Department of Animal Science, University of Nebraska–Lincoln, C203 ANSC, Lincoln, NE 68583, USA
- Bond Life Sciences Center, University of Missouri, 1201 Rollins Street, Columbia, MO 65201, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Michael P Heaton
- US Meat Animal Research Center, US Department of Agriculture, State Spur 18D, Clay Center, NE 68933, USA
| | - Theodore S Kalbfleisch
- Gluck Equine Research Center, University of Kentucky, 1400 Nicholasville Rd., Lexington, KY 40546, USA
| | | | | | - Derek M Bickhart
- Dairy Forage Research Center, 1925 Linden Drive, ARS USDA, Madison, WI 53706, USA
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, 10300 Baltimore Ave., ARS USDA, Beltsville, MD 20705, USA
| | - Brian Vander Ley
- Great Plains Veterinary Educational Center, School of Veterinary Medicine and Biomedical Sciences, University of Nebraska–Lincoln, 820 Road 313, Clay Center, NE 68933, USA
| | - Nicholas W Maurer
- Department of Biomolecular Engineering, University of California, 1156 High St., Santa Cruz, CA 95064, USA
| | - Richard E Green
- Department of Biomolecular Engineering, University of California, 1156 High St., Santa Cruz, CA 95064, USA
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Jessica L Petersen
- Department of Animal Science, University of Nebraska–Lincoln, C203 ANSC, Lincoln, NE 68583, USA
| | - Timothy P L Smith
- US Meat Animal Research Center, US Department of Agriculture, State Spur 18D, Clay Center, NE 68933, USA
| |
Collapse
|
314
|
Xiang S, Lin R, Shang H, Xu Y, Zhang Z, Wu X, Zong F. Efficient Degradation of Phenoxyalkanoic Acid Herbicides by the Alkali-Tolerant Cupriavidus oxalaticus Strain X32. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2020; 68:3786-3795. [PMID: 32133852 DOI: 10.1021/acs.jafc.9b05061] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Phenoxyalkanoic acid (PAA) herbicides are mainly metabolized by microorganisms in soils, but the degraders that perform well under alkaline environments are rarely considered. Herein, we report Cupriavidus oxalaticus strain X32, which showed encouraging PAA-degradation abilities, PAA tolerance, and alkali tolerance. In liquid media, without the addition of exogenous carbon sources, X32 could completely remove 500 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D) or 4-chloro-2-methylphenoxyacetic acid within 3 days, faster than that with the model degrader Cupriavidus necator JMP134. Particularly, X32 still functioned at pH 10.5. Of note, with X32 inoculation, we observed 2,4-D degradation in soils and diminished phytotoxicity to maize (Zea mays). Furthermore, potential mechanisms underlying PAA biodegradation and alkali tolerance were then analyzed by whole-genome sequencing. Three modules of tfd gene clusters involved in 2,4-D catabolism and genes encoding monovalent cation/proton antiporters involved in alkali tolerance were putatively identified. Thus, X32 could be a promising candidate for the bioremediation of PAA-contaminated sites, especially in alkaline surroundings.
Collapse
Affiliation(s)
- Sheng Xiang
- Department of Applied Chemistry, College of Science, China Agricultural University, Beijing 100193, China
| | - Ronghua Lin
- Institute for the Control of Agrochemicals, Ministry of Agriculture and Rural Affairs, Beijing 100125, China
| | - Hongyi Shang
- Department of Applied Chemistry, College of Science, China Agricultural University, Beijing 100193, China
| | - Yong Xu
- Department of Applied Chemistry, College of Science, China Agricultural University, Beijing 100193, China
| | - Zhenhua Zhang
- Department of Applied Chemistry, College of Science, China Agricultural University, Beijing 100193, China
| | - Xuemin Wu
- Department of Applied Chemistry, College of Science, China Agricultural University, Beijing 100193, China
| | - Fulin Zong
- Institute for the Control of Agrochemicals, Ministry of Agriculture and Rural Affairs, Beijing 100125, China
| |
Collapse
|
315
|
Chen LQ, Zhao WS, Luo GZ. Mapping and editing of nucleic acid modifications. Comput Struct Biotechnol J 2020; 18:661-667. [PMID: 32257049 PMCID: PMC7113611 DOI: 10.1016/j.csbj.2020.03.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 02/29/2020] [Accepted: 03/13/2020] [Indexed: 12/18/2022] Open
Abstract
Modification on nucleic acid plays a pivotal role in controlling gene expression. Various kinds of modifications greatly increase the information-encoding capacity of DNA and RNA by introducing extra chemical group to existing bases instead of altering the genetic sequences. As a marker on DNA or RNA, nucleic acid modification can be recognized by specific proteins, leading to versatile regulation of gene expression. However, modified and regular bases are often indistinguishable by most conventional molecular methods, impeding detailed functional studies that require the information of genomic location. Recently, new technologies are emerging to resolve the positions of varied modifications on both DNA and RNA. Intriguingly, by integrating regional targeting tools and effector proteins, researchers begin to actively control the modification status of desired gene in vivo. In this review, we summarize the characteristics of DNA and RNA modifications, the available mapping and editing tools, and the potential application as well as deficiency of these technologies in basic and translational researches.
Collapse
Affiliation(s)
| | | | - Guan-Zheng Luo
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| |
Collapse
|
316
|
Yu J, Xiang X, Huang J, Liang X, Pan X, Dong Z, Petersen TS, Qu K, Yang L, Zhao X, Li S, Zheng T, Xu Z, Liu C, Han P, Xu F, Yang H, Liu X, Zhang X, Bolund L, Luo Y, Lin L. Haplotyping by CRISPR-mediated DNA circularization (CRISPR-hapC) broadens allele-specific gene editing. Nucleic Acids Res 2020; 48:e25. [PMID: 31943080 PMCID: PMC7049710 DOI: 10.1093/nar/gkz1233] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 12/19/2019] [Accepted: 12/24/2019] [Indexed: 12/15/2022] Open
Abstract
Allele-specific protospacer adjacent motif (asPAM)-positioning SNPs and CRISPRs are valuable resources for gene therapy of dominant disorders. However, one technical hurdle is to identify the haplotype comprising the disease-causing allele and the distal asPAM SNPs. Here, we describe a novel CRISPR-based method (CRISPR-hapC) for haplotyping. Based on the generation (with a pair of CRISPRs) of extrachromosomal circular DNA in cells, the CRISPR-hapC can map haplotypes from a few hundred bases to over 200 Mb. To streamline and demonstrate the applicability of the CRISPR-hapC and asPAM CRISPR for allele-specific gene editing, we reanalyzed the 1000 human pan-genome and generated a high frequency asPAM SNP and CRISPR database (www.crispratlas.com/knockout) for four CRISPR systems (SaCas9, SpCas9, xCas9 and Cas12a). Using the huntingtin (HTT) CAG expansion and transthyretin (TTR) exon 2 mutation as examples, we showed that the asPAM CRISPRs can specifically discriminate active and dead PAMs for all 23 loci tested. Combination of the CRISPR-hapC and asPAM CRISPRs further demonstrated the capability for achieving highly accurate and haplotype-specific deletion of the HTT CAG expansion allele and TTR exon 2 mutation in human cells. Taken together, our study provides a new approach and an important resource for genome research and allele-specific (haplotype-specific) gene therapy.
Collapse
Affiliation(s)
- Jiaying Yu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Xi Xiang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
- Department of Biomedicine, Aarhus University, Aarhus 8000, Denmark
| | - Jinrong Huang
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
- Department of Biomedicine, Aarhus University, Aarhus 8000, Denmark
- Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| | - Xue Liang
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Xiaoguang Pan
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Zhanying Dong
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | | | - Kunli Qu
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Ling Yang
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
| | - Xiaoying Zhao
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Siyuan Li
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Tianyu Zheng
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Zhe Xu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Chengxun Liu
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Peng Han
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
| | - Fengping Xu
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
| | | | - Xin Liu
- BGI-Shenzhen, Shenzhen 518083, China
| | | | - Lars Bolund
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
- Department of Biomedicine, Aarhus University, Aarhus 8000, Denmark
| | - Yonglun Luo
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, Qingdao 266555, China
- BGI-Shenzhen, Shenzhen 518083, China
- Department of Biomedicine, Aarhus University, Aarhus 8000, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus 8200, Denmark
| | - Lin Lin
- Department of Biomedicine, Aarhus University, Aarhus 8000, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus 8200, Denmark
| |
Collapse
|
317
|
Single-molecule analysis of nucleic acid biomarkers - A review. Anal Chim Acta 2020; 1115:61-85. [PMID: 32370870 DOI: 10.1016/j.aca.2020.03.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 02/29/2020] [Accepted: 03/02/2020] [Indexed: 12/11/2022]
Abstract
Nucleic acids are important biomarkers for disease detection, monitoring, and treatment. Advances in technologies for nucleic acid analysis have enabled discovery and clinical implementation of nucleic acid biomarkers. However, challenges remain with technologies for nucleic acid analysis, thereby limiting the use of nucleic acid biomarkers in certain contexts. Here, we review single-molecule technologies for nucleic acid analysis that can be used to overcome these challenges. We first discuss the various types of nucleic acid biomarkers important for clinical applications and conventional technologies for nucleic acid analysis. We then discuss technologies for single-molecule in vitro and in situ analysis of nucleic acid biomarkers. Finally, we discuss other ultra-sensitive techniques for nucleic acid biomarker detection.
Collapse
|
318
|
Liu Y, Cheng J, Siejka-Zielińska P, Weldon C, Roberts H, Lopopolo M, Magri A, D'Arienzo V, Harris JM, McKeating JA, Song CX. Accurate targeted long-read DNA methylation and hydroxymethylation sequencing with TAPS. Genome Biol 2020; 21:54. [PMID: 32127008 PMCID: PMC7053107 DOI: 10.1186/s13059-020-01969-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 02/21/2020] [Indexed: 12/17/2022] Open
Abstract
We present long-read Tet-assisted pyridine borane sequencing (lrTAPS) for targeted base-resolution sequencing of DNA methylation and hydroxymethylation in regions up to 10 kb from nanogram-level input. Compatible with both Oxford Nanopore and PacBio Single-Molecule Real-Time (SMRT) sequencing, lrTAPS detects methylation with accuracy comparable to short-read Illumina sequencing but with long-range epigenetic phasing. We applied lrTAPS to sequence difficult-to-map regions in mouse embryonic stem cells and to identify distinct methylation events in the integrated hepatitis B virus genome.
Collapse
Affiliation(s)
- Yibin Liu
- Nuffield Department of Medicine, Ludwig Institute for Cancer Research, University of Oxford, Oxford, OX3 7FZ, UK
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Jingfei Cheng
- Nuffield Department of Medicine, Ludwig Institute for Cancer Research, University of Oxford, Oxford, OX3 7FZ, UK
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Paulina Siejka-Zielińska
- Nuffield Department of Medicine, Ludwig Institute for Cancer Research, University of Oxford, Oxford, OX3 7FZ, UK
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Carika Weldon
- Oxford Genomics Centre, Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Hannah Roberts
- Oxford Genomics Centre, Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Maria Lopopolo
- Oxford Genomics Centre, Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Andrea Magri
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Valentina D'Arienzo
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - James M Harris
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Jane A McKeating
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK
| | - Chun-Xiao Song
- Nuffield Department of Medicine, Ludwig Institute for Cancer Research, University of Oxford, Oxford, OX3 7FZ, UK.
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, OX3 7FZ, UK.
| |
Collapse
|
319
|
Ming L, Wang Z, Yi L, Batmunkh M, Liu T, Siren D, He J, Juramt N, Jambl T, Li Y, Jirimutu. Chromosome-level assembly of wild Bactrian camel genome reveals organization of immune gene loci. Mol Ecol Resour 2020; 20. [PMID: 32012460 DOI: 10.1111/1755-0998.13141] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 01/09/2020] [Accepted: 01/30/2020] [Indexed: 01/22/2023]
Abstract
Camelids are characterized by their unique adaptive immune system that exhibits the generation of homodimeric heavy-chain immunoglobulins, somatic hypermutation of T-cell receptors, and low genetic diversity of major histocompatibility complex (MHC) genes. However, short-read assemblies are typically highly fragmented in these gene loci owing to their repetitive and polymorphic nature. Here, we constructed a chromosome-level assembly of wild Bactrian camel genome based on high-coverage long-read sequencing and chromatin interaction mapping. The assembly with a contig N50 of 5.37 Mb and a scaffold N50 of 76.03 Mb, represents the most contiguous camelid genome to date. The genomic organization of immunoglobulin heavy-chain locus was similar between the wild Bactrian camel and alpaca, and genes encoding for conventional and heavy-chain antibodies were intermixed. The organizations of two immunoglobulin light-chain loci and four T cell receptor loci were also fully deciphered using the new assembly. Additionally, the complete classical MHC region was resolved into a single contig. The high-quality assembly presented here provides an essential reference for future investigations examining the camelid immune system.
Collapse
Affiliation(s)
- Liang Ming
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, College of Food Science and Engineering, Inner Mongolia Agricultural University, Huhhot, China.,Inner Mongolia Institute of Camel Research, West Alax, Inner Mongolia, China
| | - Zhen Wang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Li Yi
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, College of Food Science and Engineering, Inner Mongolia Agricultural University, Huhhot, China
| | - Mijiddorj Batmunkh
- Mongolian Wild Camel Protection Area, Ministry of Nature and Environment, Ulaanbaatar, Mongolia
| | - Tao Liu
- Annoroad Gene Technology (Beijing) Co. Ltd, Beijing, China
| | - Dalai Siren
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, College of Food Science and Engineering, Inner Mongolia Agricultural University, Huhhot, China.,Inner Mongolia Institute of Camel Research, West Alax, Inner Mongolia, China
| | - Jing He
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, College of Food Science and Engineering, Inner Mongolia Agricultural University, Huhhot, China
| | | | - Tuyatsetseg Jambl
- College of Industrial Technology, Mongolian University of Science and Technology, Ulaanbaater, Mongolia
| | - Yixue Li
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.,Bio-Med Big Data Center, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.,Shanghai Center for Bioinformation Technology, Shanghai Industrial Technology Institute, Shanghai, China.,Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, China
| | - Jirimutu
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, College of Food Science and Engineering, Inner Mongolia Agricultural University, Huhhot, China.,Inner Mongolia Institute of Camel Research, West Alax, Inner Mongolia, China
| |
Collapse
|
320
|
Complete Genome Sequence of Bacillus velezensis Strain AL7, a Biocontrol Agent for Suppression of Cotton Verticillium Wilt. Microbiol Resour Announc 2020; 9:9/8/e01595-19. [PMID: 32079637 PMCID: PMC7033274 DOI: 10.1128/mra.01595-19] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Bacillus velezensis AL7, isolated from cotton soil, had strong antagonistic activity to Verticillium dahlia Kleb. The AL7 genome consisted of one chromosome with 3,894,709 bp (46.64% G+C content). Genome annotation predicted 3,706 protein-coding genes, 86 tRNAs, and 27 rRNAs. We sequenced and annotated the complete AL7 genome to help us better understand use of this strain. Bacillus velezensis AL7, isolated from cotton soil, had strong antagonistic activity to Verticillium dahlia Kleb. The AL7 genome consisted of one chromosome with 3,894,709 bp (46.64% G+C content). Genome annotation predicted 3,706 protein-coding genes, 86 tRNAs, and 27 rRNAs. We sequenced and annotated the complete AL7 genome to help us better understand use of this strain.
Collapse
|
321
|
Advances in monitoring soil microbial community dynamic and function. J Appl Genet 2020; 61:249-263. [PMID: 32062778 DOI: 10.1007/s13353-020-00549-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 01/17/2020] [Accepted: 02/06/2020] [Indexed: 12/22/2022]
Abstract
Microorganisms are vital to the overall ecosystem functioning, stability, and sustainability. Soil fertility and health depend on chemical composition and also on the qualitative and quantitative nature of microorganisms inhabiting it. Historically, denaturing gradient gel electrophoresis (DGGE) and temperature gradient gel electrophoresis (TGGE), single-strand conformation polymorphism, DNA amplification fingerprinting, amplified ribosomal DNA restriction analysis, terminal restriction fragment length polymorphism, length heterogeneity PCR, and ribosomal intergenic spacer analysis were used to assess soil microbial community structure (SMCS), abundance, and diversity. However, these methods had significant shortcomings and limitations for application in land reclamation monitoring. SMCS has been primarily determined by phospholipid fatty acid (PLFA) analysis. This method provides a direct measure of viable biomass in addition to a biochemical profile of the microbial community. PLFA has limitations such as overlap in the composition of microorganisms and the specificity of PLFAs signature. In recent years, high-throughput next-generation sequencing has dramatically increased the resolution and detectable spectrum of diverse microbial phylotypes from environmental samples and it plays a significant role in microbial ecology studies. Next-generation sequencings using 454, Illumina, SOLiD, and Ion Torrent platforms are rapid and flexible. The two methods, PLFA and next-generation sequencing, are useful in detecting changes in microbial community diversity and structure in different ecosystems. Single-molecule real-time (SMRT) and nanopore sequencing technologies represent third-generation sequencing (TGS) platforms that have been developed to address the shortcomings of second-generation sequencing (SGS). Enzymatic and soil respiration analyses are performed to further determine soil quality and microbial activities. Other valuable methods that are being recently applied to microbial function and structures include NanoSIM, GeoChip, and DNA stable staple isotope probing (DNA-SIP) technologies. They are powerful metagenomics tool for analyzing microbial communities, including their structure, metabolic potential, diversity, and their impact on ecosystem functions. This review is a critical analysis of current methods used in monitoring soil microbial community dynamic and functions.
Collapse
|
322
|
Lee N, Kim W, Hwang S, Lee Y, Cho S, Palsson B, Cho BK. Thirty complete Streptomyces genome sequences for mining novel secondary metabolite biosynthetic gene clusters. Sci Data 2020; 7:55. [PMID: 32054853 PMCID: PMC7018776 DOI: 10.1038/s41597-020-0395-9] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 01/24/2020] [Indexed: 01/04/2023] Open
Abstract
Streptomyces are Gram-positive bacteria of significant industrial importance due to their ability to produce a wide range of antibiotics and bioactive secondary metabolites. Recent advances in genome mining have revealed that Streptomyces genomes possess a large number of unexplored silent secondary metabolite biosynthetic gene clusters (smBGCs). This indicates that Streptomyces genomes continue to be an invaluable source for new drug discovery. Here, we present high-quality genome sequences of 22 Streptomyces species and eight different Streptomyces venezuelae strains assembled by a hybrid strategy exploiting both long-read and short-read genome sequencing methods. The assembled genomes have more than 97.4% gene space completeness and total lengths ranging from 6.7 to 10.1 Mbp. Their annotation identified 7,000 protein coding genes, 20 rRNAs, and 68 tRNAs on average. In silico prediction of smBGCs identified a total of 922 clusters, including many clusters whose products are unknown. We anticipate that the availability of these genomes will accelerate discovery of novel secondary metabolites from Streptomyces and elucidate complex smBGC regulation.
Collapse
Affiliation(s)
- Namil Lee
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Woori Kim
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Soonkyu Hwang
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Yongjae Lee
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Suhyung Cho
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Bernhard Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA, 92093, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, 2800, Denmark
| | - Byung-Kwan Cho
- Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea.
- Intelligent Synthetic Biology Center, Daejeon, 34141, Republic of Korea.
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, 2800, Denmark.
| |
Collapse
|
323
|
Amarasinghe SL, Su S, Dong X, Zappia L, Ritchie ME, Gouil Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol 2020; 21:30. [PMID: 32033565 PMCID: PMC7006217 DOI: 10.1186/s13059-020-1935-5] [Citation(s) in RCA: 744] [Impact Index Per Article: 186.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 01/15/2020] [Indexed: 12/11/2022] Open
Abstract
Long-read technologies are overcoming early limitations in accuracy and throughput, broadening their application domains in genomics. Dedicated analysis tools that take into account the characteristics of long-read data are thus required, but the fast pace of development of such tools can be overwhelming. To assist in the design and analysis of long-read sequencing projects, we review the current landscape of available tools and present an online interactive database, long-read-tools.org, to facilitate their browsing. We further focus on the principles of error correction, base modification detection, and long-read transcriptomics analysis and highlight the challenges that remain.
Collapse
Affiliation(s)
- Shanika L. Amarasinghe
- Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, 3052 Australia
- Department of Medical Biology, The University of Melbourne, Parkville, 3010 Australia
| | - Shian Su
- Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, 3052 Australia
- Department of Medical Biology, The University of Melbourne, Parkville, 3010 Australia
| | - Xueyi Dong
- Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, 3052 Australia
- Department of Medical Biology, The University of Melbourne, Parkville, 3010 Australia
| | - Luke Zappia
- Bioinformatics, Murdoch Children’s Research Institute, Parkville, 3052 Australia
- School of Biosciences, Faculty of Science, The University of Melbourne, Parkville, 3010 Australia
| | - Matthew E. Ritchie
- Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, 3052 Australia
- Department of Medical Biology, The University of Melbourne, Parkville, 3010 Australia
- School of Mathematics and StatisticsThe University of Melbourne, Parkville, 3010 Australia
| | - Quentin Gouil
- Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, 3052 Australia
- Department of Medical Biology, The University of Melbourne, Parkville, 3010 Australia
| |
Collapse
|
324
|
Zhu Y, Lu J, Zhao J, Zhang X, Yu HH, Velkov T, Li J. Complete genome sequence and genome-scale metabolic modelling of Acinetobacter baumannii type strain ATCC 19606. Int J Med Microbiol 2020; 310:151412. [PMID: 32081464 DOI: 10.1016/j.ijmm.2020.151412] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 12/29/2019] [Accepted: 02/03/2020] [Indexed: 12/11/2022] Open
Abstract
Multidrug-resistant (MDR) Acinetobacter baumannii is a critical threat to global health. The type strain ATCC 19606 has been widely used in studying the virulence, pathogenesis and mechanisms of antimicrobial resistance in A. baumannii. However, the lack of a complete genome sequence is a hindrance towards detailed bioinformatic studies. Here we report the generation of a complete genome for ATCC 19606 using PacBio sequencing. ATCC 19606 genome consists of a 3,980,848-bp chromosome and a 9,450-bp plasmid pMAC, and harbours a chromosomal dihydropteroate synthase gene sul2 conferring resistance to sulphonamides and a plasmid-borne ohr gene conferring resistance to peroxides. The genome also contains 69 virulence genes involved in surface adherence, biofilm formation, extracellular phospholipase, iron uptake, immune evasion and quorum sensing. Insertion sequences ISCR2 and ISAba11 are embedded in a 36.1-Kb genomic island, suggesting an IS-mediated large-scale DNA recombination. Furthermore, a genome-scale metabolic model (GSMM) iATCC19606v2 was constructed using the complete genome annotation. The model iATCC19606v2 incorporated a periplasmic compartment, 1,422 metabolites, 2,114 reactions and 1,009 genes, and a set of protein crowding constraints taking into account enzyme abundance limitation. The prediction of bacterial growth on 190 carbon and 95 nitrogen sources achieved a high accuracy of 85.6% compared to Biolog experiment results. Based upon two transposon mutant libraries of AB5075 and ATCC 17978, the predictions of essential genes reached the accuracy of 87.6% and 82.1%, respectively. Together, the complete genome sequence and high-quality GSMM iATCC19606v2 provide valuable tools for antimicrobial systems pharmacological investigations on A. baumannii.
Collapse
Affiliation(s)
- Yan Zhu
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| | - Jing Lu
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| | - Jinxin Zhao
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| | - Xinru Zhang
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| | - Heidi H Yu
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| | - Tony Velkov
- Department of Pharmacology and Therapeutics, University of Melbourne, Melbourne, VIC, 3010, Australia.
| | - Jian Li
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, VIC, 3800, Australia.
| |
Collapse
|
325
|
conLSH: Context based Locality Sensitive Hashing for mapping of noisy SMRT reads. Comput Biol Chem 2020; 85:107206. [PMID: 32000034 DOI: 10.1016/j.compbiolchem.2020.107206] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 01/14/2020] [Indexed: 11/20/2022]
Abstract
Single Molecule Real-Time (SMRT) sequencing is a recent advancement of Next Gen technology developed by Pacific Bio (PacBio). It comes with an explosion of long and noisy reads demanding cutting edge research to get most out of it. To deal with the high error probability of SMRT data, a novel contextual Locality Sensitive Hashing (conLSH) based algorithm is proposed in this article, which can effectively align the noisy SMRT reads to the reference genome. Here, sequences are hashed together based not only on their closeness, but also on similarity of context. The algorithm has O(nρ+1) space requirement, where n is the number of sequences in the corpus and ρ is a constant. The indexing time and querying time are bounded by Onρ+1·lnnln1P2 and O(nρ) respectively, where P2 > 0, is a probability value. This algorithm is particularly useful for retrieving similar sequences, a widely used task in biology. The proposed conLSH based aligner is compared with rHAT, popularly used for aligning SMRT reads, and is found to comprehensively beat it in speed as well as in memory requirements. In particular, it takes approximately 24.2% less processing time, while saving about 70.3% in peak memory requirement for H.sapiens PacBio dataset.
Collapse
|
326
|
Osei Sekyere J, Maningi NE, Fourie PB. Mycobacterium tuberculosis, antimicrobials, immunity, and lung-gut microbiota crosstalk: current updates and emerging advances. Ann N Y Acad Sci 2020; 1467:21-47. [PMID: 31989644 DOI: 10.1111/nyas.14300] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 12/10/2019] [Accepted: 12/20/2019] [Indexed: 12/16/2022]
Abstract
Increasingly, gut microbiota distortions are being implicated in the pathogenesis of several infectious and noninfectious diseases. Specifically, in the absence of an eubiotic microbiota, mice are more prone to colonization and infection by Mycobacterium tuberculosis (Mtb). In this qualitative analysis, the following were observed: (1) antimicrobials cause long-term gut microbiota perturbations; (2) Mtb causes limited and transient disturbances to the lung-gut microbiota; (3) pathogens (e.g., Helicobacter hepaticus) affect microbiota integrity and reduce resistance to Mtb; (4) dysbiosis depletes bacterial species regulating proper immune functioning, reducing resistance to Mtb; (5) dysregulated immune cells fail to express important pathogen-recognition receptors (e.g., macrophage-inducible C-type lectin; MINCLE) and Mtb-killing cytokines (e.g., IFN-γ, TNF-α, and IL-17), with hampered phagocytic capability; (6) autophagy is central to the immune system's clearance of Mtb, control of inflammation, and immunity-microbiome balance; (7) microbiota-produced short-chain fatty acids, which are reduced by dysbiosis, affect immune cells and increase Mtb proliferation; (8) commensal species (e.g., Lactobacillus plantarum) and microbiota metabolites (e.g., indole propionic acid) reduce tuberculosis progression; and (9) fecal transplants mostly restored eubiosis, increased immune resistance to Mtb, restricted dissemination of Mtb, and reduced tuberculosis-associated organ pathologies. Overuse of antimicrobials, as shown in mice, is a risk factor for reactivating latent or treated tuberculosis.
Collapse
Affiliation(s)
- John Osei Sekyere
- Molecular Mycobacteriology Laboratory, Department of Medical Microbiology, School of Medicine, Faculty of Health Sciences, University of Pretoria, Pretoria, South Africa
| | - Nontuthuko E Maningi
- Molecular Mycobacteriology Laboratory, Department of Medical Microbiology, School of Medicine, Faculty of Health Sciences, University of Pretoria, Pretoria, South Africa
| | - Petrus B Fourie
- Molecular Mycobacteriology Laboratory, Department of Medical Microbiology, School of Medicine, Faculty of Health Sciences, University of Pretoria, Pretoria, South Africa
| |
Collapse
|
327
|
Ramaswamy P, Yadav R, Pal PK, Christopher R. Clinical Application of Circulating MicroRNAs in Parkinson's Disease: The Challenges and Opportunities as Diagnostic Biomarker. Ann Indian Acad Neurol 2020; 23:84-97. [PMID: 32055127 PMCID: PMC7001448 DOI: 10.4103/aian.aian_440_19] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 09/03/2019] [Accepted: 09/16/2019] [Indexed: 12/20/2022] Open
Abstract
Discovery of evolutionarily conserved, nonprotein-coding, endogenous microRNAs has induced a paradigm shift in the overall understanding of gene regulation. Now, microRNAs are considered and classified as master regulators of gene expression as they regulate a wide range of processes – gene regulation, splicing, translation and posttranscriptional modifications. Besides, dysregulated microRNAs have been related to many diseases, including Parkinson's and related disorders. Several studies proposed that differentially expressed microRNAs as a potential biomarker. So far, there is no accepted clinical diagnostic test for Parkinson's disease based on biochemical analysis of biological fluids. However, circulating microRNAs possess many vital features typical of reliable biomarkers and discriminates Parkinson's patients from healthy control with much higher sensitivity and specificity. Though they show tremendous promise as a putative biomarker, translating these research findings to clinical application is often met with many obstacles. Most of the candidate microRNAs reported as a diagnostic biomarker is not organ-specific, and their overlap is low between studies. Therefore this review aimed to highlight the challenges in the application of microRNA in guiding disease discrimination decisions and its future prospects as a diagnostic biomarker in Parkinson's Disease.
Collapse
Affiliation(s)
- Palaniswamy Ramaswamy
- Department of Neurology, National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, Karnataka, India
| | - Ravi Yadav
- Department of Neurology, National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, Karnataka, India
| | - Pramod Kumar Pal
- Department of Neurology, National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, Karnataka, India
| | - Rita Christopher
- Department of Neurochemistry, National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, Karnataka, India
| |
Collapse
|
328
|
Wang A, Au KF. Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads. Genome Biol 2020; 21:14. [PMID: 31952552 PMCID: PMC6966875 DOI: 10.1186/s13059-019-1885-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Accepted: 11/10/2019] [Indexed: 11/10/2022] Open
Abstract
The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We also demonstrate that the original error rate of 19% is the limit for perfect correction, beyond which long reads are too error-prone to be corrected by these methods.
Collapse
Affiliation(s)
- Anqi Wang
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
- Department of Internal Medicine, University of Iowa, Iowa City, IA, 52242, USA
| | - Kin Fai Au
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA.
- Department of Internal Medicine, University of Iowa, Iowa City, IA, 52242, USA.
- Department of Biostatistics, University of Iowa, Iowa City, IA, 52242, USA.
| |
Collapse
|
329
|
Complete Genome Sequence of a Plant-Derived Phenylpropanoid-Degrading Bacterium, Pseudomonas putida JYR-1. Microbiol Resour Announc 2020; 9:9/1/e01152-19. [PMID: 31896631 PMCID: PMC6940283 DOI: 10.1128/mra.01152-19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Pseudomonas putida JYR-1 was isolated from soil contaminated with industrial oil because of its ability to utilize trans-anethole as a carbon and energy source. The complete genome is 5.41 Mb with 4,834 protein-coding genes. Study of this isolate will provide insight into biotransformation pathways for phenylpropanoids.
Collapse
|
330
|
Marla SS, Mishra P, Maurya R, Singh M, Wankhede DP, Kumar A, Yadav MC, Subbarao N, Singh SK, Kumar R. Refinement of Draft Genome Assemblies of Pigeonpea ( Cajanus cajan). Front Genet 2020. [PMID: 33384719 DOI: 10.1101/2020.08.10.243949] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2023] Open
Abstract
Genome assembly of short reads from large plant genomes remains a challenge in computational biology despite major developments in next generation sequencing. Of late several draft assemblies have been reported in sequenced plant genomes. The reported draft genome assemblies of Cajanus cajan have different levels of genome completeness, a large number of repeats, gaps, and segmental duplications. Draft assemblies with portions of genome missing are shorter than the referenced original genome. These assemblies come with low map accuracy affecting further functional annotation and the prediction of gene components as desired by crop researchers. Genome coverage, i.e., the number of sequenced raw reads mapped onto a certain location of the genome is an important quality indicator of completeness and assembly quality in draft assemblies. The present work aimed to improve the coverage in reported de novo sequenced draft genomes (GCA_000340665.1 and GCA_000230855.2) of pigeonpea, a legume widely cultivated in India. The two recently sequenced assemblies, A1 and A2 comprised 72% and 75% of the estimated coverage of the genome, respectively. We employed an assembly reconciliation approach to compare the draft assemblies and merge them, filling the gaps by employing an algorithm size sorting mate-pair library to generate a high quality and near complete assembly with enhanced contiguity. The majority of gaps present within scaffolds were filled with right-sized mate-pair reads. The improved assembly reduced the number of gaps than those reported in draft assemblies resulting in an improved genome coverage of 82.4%. Map accuracy of the improved assembly was evaluated using various quality metrics and for the presence of specific trait-related functional genes. Employed pair-end and mate-pair local libraries helped us to reduce gaps, repeats, and other sequence errors resulting in lengthier scaffolds compared to the two draft assemblies. We reported the prediction of putative host resistance genes against Fusarium wilt disease by their performance and evaluated them both in wet laboratory and field phenotypic conditions.
Collapse
Affiliation(s)
- Soma S Marla
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Pallavi Mishra
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Ranjeet Maurya
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Mohar Singh
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | | | - Anil Kumar
- Directorate of Education, Rani Lakshmi Bai Central Agricultural University, Jhansi, India
| | - Mahesh C Yadav
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | - N Subbarao
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Sanjeev K Singh
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Rajesh Kumar
- Indian Council for Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
| |
Collapse
|
331
|
Bromfield ESP, Cloutier S, Nguyen HDT. Description and complete genome sequences of Bradyrhizobium symbiodeficiens sp. nov., a non-symbiotic bacterium associated with legumes native to Canada. Int J Syst Evol Microbiol 2020; 70:442-449. [PMID: 31626586 PMCID: PMC7395627 DOI: 10.1099/ijsem.0.003772] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 09/09/2019] [Accepted: 09/17/2019] [Indexed: 12/22/2022] Open
Abstract
Four bacterial strains isolated from root nodules of soybean plants that had been inoculated with root-zone soil of either Amphicarpaea bracteata (Hog Peanut) or Desmodium canadense (Showy Tick Trefoil) growing in Canada, were previously characterized and placed in a novel lineage within the genus Bradyrhizobium. The taxonomic status of the novel strains was verified by genomic and phenotypic analyses. Phylogenetic analyses of individual and concatenated housekeeping gene sequences (atp D, gln II, rec A, gyr B and rpo B) placed all novel strains in a highly supported lineage distinct from named Bradyrhizobium species. Data for sequence similarities of concatenated housekeeping genes of novel strains relative to type strains of named species were consistent with the phylogenetic data. Average nucleotide identity values of genome sequences (84.5-93.7 %) were below the threshold value of 95-96 % for bacterial species circumscription. Close relatives to the novel strains are Bradyrhizobium amphicarpaeae, Bradyrhizobium ottawaense and Bradyrhizobium shewense. The complete genomes of strains 85S1MBT and 65S1MB consist of single chromosomes of size 7.04 and 7.13 Mbp, respectively. The genomes of both strains have a G+C content of 64.3 mol%. These strains lack a symbiosis island as well as key nodulation, nitrogen-fixation and photosystem genes. Data from various phenotypic tests including growth characteristics and carbon source utilization supported the sequence-based analyses. Based on the data presented here, the four strains represent a novel species for which the name B radyrhizobium symbiodeficiens sp. nov., is proposed, with 85S1MBT (=LMG 29937T=HAMBI 3684T) as the type strain.
Collapse
Affiliation(s)
- Eden S. P. Bromfield
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario K1A OC6, Canada
| | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario K1A OC6, Canada
| | - Hai D. T. Nguyen
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario K1A OC6, Canada
| |
Collapse
|
332
|
Xu L, Seki M. Recent advances in the detection of base modifications using the Nanopore sequencer. J Hum Genet 2020; 65:25-33. [PMID: 31602005 PMCID: PMC7087776 DOI: 10.1038/s10038-019-0679-0] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 09/22/2019] [Accepted: 09/26/2019] [Indexed: 12/29/2022]
Abstract
DNA and RNA modifications have important functions, including the regulation of gene expression. Existing methods based on short-read sequencing for the detection of modifications show difficulty in determining the modification patterns of single chromosomes or an entire transcript sequence. Furthermore, the kinds of modifications for which detection methods are available are very limited. The Nanopore sequencer is a single-molecule, long-read sequencer that can directly sequence RNA as well as DNA. Moreover, the Nanopore sequencer detects modifications on long DNA and RNA molecules. In this review, we mainly focus on base modification detection in the DNA and RNA of mammals using the Nanopore sequencer. We summarize current studies of modifications using the Nanopore sequencer, detection tools using statistical tests or machine learning, and applications of this technology, such as analyses of open chromatin, DNA replication, and RNA metabolism.
Collapse
Affiliation(s)
- Liu Xu
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
| | - Masahide Seki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan.
| |
Collapse
|
333
|
Yang K, Yu W, Huang G, Zhou J, Yang X, Fu W. Highly sensitive detection of Staphylococcus aureus by a THz metamaterial biosensor based on gold nanoparticles and rolling circle amplification. RSC Adv 2020; 10:26824-26833. [PMID: 35515811 PMCID: PMC9055468 DOI: 10.1039/d0ra03116j] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 07/09/2020] [Indexed: 01/31/2023] Open
Abstract
A highly sensitive method for detecting Staphylococcus aureus (S. aureus) is urgently needed to reduce the impact and spread of hospital-acquired infections and food-borne illness. For this purpose, this paper presents a THz metamaterial biosensor based on gold nanoparticles (AuNPs) and rolling circle amplification (RCA). The RCA process amplified the S. aureus DNA fragments and generated copious yields of long single-strand DNA molecules. These molecules were then conjugated with the AuNPs to form complexes that delivered exceptional increases in the refractive indices of the samples, and resulted in corresponding improvements in the THz response of the metamaterial. Under optimal conditions, the shifts in the metamaterial's resonance frequency displayed a linear relationship with concentrations of synthetic S. aureus DNA in the range from 10 fM to 10 pM, with a limit of detection of 2.77 fM. We also tested the practical application of this biosensor in measurements of genomic DNA in clinical bacterial strains, where the sensor showed a detection limit of 0.08 pg μL−1 and a linear range from 0.1 to 5 pg μL−1. It also exhibited reasonable specificity, resisting interference from three other pathogenic bacteria. These findings indicate that the proposed approach offers a cost-effective THz biosensing strategy that can be easily fabricated and conveniently operated to aid the diagnosis of infectious disease and food safety control. A highly sensitive method for detecting Staphylococcus aureus (S. aureus) is urgently needed to reduce the impact and spread of hospital-acquired infections and food-borne illness.![]()
Collapse
Affiliation(s)
- Ke Yang
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| | - Wenjing Yu
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| | - Guorong Huang
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| | - Jie Zhou
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| | - Xiang Yang
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| | - Weiling Fu
- Department of Laboratory Medicine
- Southwest Hospital
- Third Military Medical University (Army Medical University)
- Chongqing 400038
- China
| |
Collapse
|
334
|
Vasudevan K, Devanga Ragupathi NK, Jacob JJ, Veeraraghavan B. Highly accurate-single chromosomal complete genomes using IonTorrent and MinION sequencing of clinical pathogens. Genomics 2020; 112:545-551. [DOI: 10.1016/j.ygeno.2019.04.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 03/13/2019] [Accepted: 04/09/2019] [Indexed: 10/27/2022]
|
335
|
Muhammad II, Kong SL, Akmar Abdullah SN, Munusamy U. RNA-seq and ChIP-seq as Complementary Approaches for Comprehension of Plant Transcriptional Regulatory Mechanism. Int J Mol Sci 2019; 21:E167. [PMID: 31881735 PMCID: PMC6981605 DOI: 10.3390/ijms21010167] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Revised: 12/19/2019] [Accepted: 12/23/2019] [Indexed: 02/07/2023] Open
Abstract
The availability of data produced from various sequencing platforms offer the possibility to answer complex questions in plant research. However, drawbacks can arise when there are gaps in the information generated, and complementary platforms are essential to obtain more comprehensive data sets relating to specific biological process, such as responses to environmental perturbations in plant systems. The investigation of transcriptional regulation raises different challenges, particularly in associating differentially expressed transcription factors with their downstream responsive genes. In this paper, we discuss the integration of transcriptional factor studies through RNA sequencing (RNA-seq) and Chromatin Immunoprecipitation sequencing (ChIP-seq). We show how the data from ChIP-seq can strengthen information generated from RNA-seq in elucidating gene regulatory mechanisms. In particular, we discuss how integration of ChIP-seq and RNA-seq data can help to unravel transcriptional regulatory networks. This review discusses recent advances in methods for studying transcriptional regulation using these two methods. It also provides guidelines for making choices in selecting specific protocols in RNA-seq pipelines for genome-wide analysis to achieve more detailed characterization of specific transcription regulatory pathways via ChIP-seq.
Collapse
Affiliation(s)
- Isiaka Ibrahim Muhammad
- Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, Selangor 43400, Malaysia; (I.I.M.); (S.L.K.); (U.M.)
| | - Sze Ling Kong
- Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, Selangor 43400, Malaysia; (I.I.M.); (S.L.K.); (U.M.)
| | - Siti Nor Akmar Abdullah
- Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, Selangor 43400, Malaysia; (I.I.M.); (S.L.K.); (U.M.)
- Department of Agriculture Technology, Faculty of Agriculture, Universiti Putra Malaysia, Selangor 43400, Malaysia
| | - Umaiyal Munusamy
- Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, Selangor 43400, Malaysia; (I.I.M.); (S.L.K.); (U.M.)
| |
Collapse
|
336
|
MinION sequencing of Streptococcus suis allows for functional characterization of bacteria by multilocus sequence typing and antimicrobial resistance profiling. J Microbiol Methods 2019; 169:105817. [PMID: 31881288 DOI: 10.1016/j.mimet.2019.105817] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Revised: 12/20/2019] [Accepted: 12/23/2019] [Indexed: 01/31/2023]
Abstract
In recent years, high-throughput sequencing has revolutionized disease diagnosis by its powerful ability to provide high resolution genomic information. The Oxford Nanopore MinION sequencer has unparalleled potential as a rapid disease diagnostic tool due to its high mobility, accessibility, and short turnaround time. However, there is a lack of rigorous quality assessment and control processes standardizing the testing on the MinION, which is necessary for incorporation into a diagnostic workflow. Thus, our study examined the use of the MinION sequencer for bacterial whole genome generation and characterization. Using Streptococcus suis as a model, we optimized DNA isolation and treatments to be used for MinION sequencing and standardized de novo assembly to quickly generate a full-length consensus sequence achieving a 99.4% average accuracy. The consensus genomes from MinION sequencing were able to accurately predict the multilocus sequence type in 8 out of 10 samples and identified antimicrobial resistance profiles for 100% of the samples, despite the concern of a high error rate. The inability to unequivocally predict sequence types was due to difficulty in differentiating high identity alleles, which was overcome by applying additional error correction methods to increase consensus accuracy. This manuscript provides methods for the use of MinION sequencing for identification of S. suis genome sequence, sequence type, and antibiotic resistance profile that can be used as a framework for identification and classification of other pathogens.
Collapse
|
337
|
Affiliation(s)
- Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Centre for Clinical Research, St. Ann’s Hospital, 602 00 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Centre for Clinical Research, St. Ann’s Hospital, 602 00 Brno, Czech Republic
| |
Collapse
|
338
|
Cruz-Garcia L, O'Brien G, Sipos B, Mayes S, Love MI, Turner DJ, Badie C. Generation of a Transcriptional Radiation Exposure Signature in Human Blood Using Long-Read Nanopore Sequencing. Radiat Res 2019; 193:143-154. [PMID: 31829904 DOI: 10.1667/rr15476.1] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
In the event of a large-scale event leading to acute ionizing radiation exposure, high-throughput methods would be required to assess individual dose estimates for triage purposes. Blood-based gene expression is a broad source of biomarkers of radiation exposure which have great potential for providing rapid dose estimates for a large population. Time is a crucial component in radiological emergencies and the shipment of blood samples to relevant laboratories presents a concern. In this study, we performed nanopore sequencing analysis to determine if the technology can be used to detect radiation-inducible genes in human peripheral blood mononuclear cells (PBMCs). The technology offers not only long-read sequencing but also a portable device which can overcome issues involving sample shipment, and provide faster results. For this goal, blood from nine healthy volunteers was 2 Gy ex vivo X irradiated. After PBMC isolation, irradiated samples were incubated along with the controls for 24 h at 37°C. RNA was extracted, poly(A)+ enriched and reverse-transcribed before sequencing. The data generated was analyzed using a Snakemake pipeline modified to handle paired samples. The sequencing analysis identified a radiation signature consisting of 46 differentially expressed genes (DEGs) which included 41 protein-coding genes, a long non-coding RNA and four pseudogenes, five of which have been identified as radiation-responsive transcripts for the first time. The genes in which transcriptional expression is most significantly modified after radiation exposure were APOBEC3H and FDXR, presenting a 25- and 28-fold change on average, respectively. These levels of transcriptional response were comparable to results we obtained by quantitative polymerase chain reaction (qPCR) analysis. In vivo exposure analyses showed a transcriptional radioresponse at 24 h postirradiation for both genes together with a strong dose-dependent response in blood irradiated ex vivo. Finally, extrapolating from the data we obtained, the minimum sequencing time required to detect an irradiated sample using APOBEC3H transcripts would be less than 3 min for a total of 50,000 reads. Future improvements, in sample processing and bioinformatic pipeline for specific radiation-responsive transcript identification, will allow the provision of a portable, rapid, real-time biodosimetry platform based on this new sequencing technology. In summary, our data show that nanopore sequencing can identify radiation-responsive genes and can also be used for identification of new transcripts.
Collapse
Affiliation(s)
- Lourdes Cruz-Garcia
- Cancer Mechanisms and Biomarkers Group, Radiation Effects Department, Centre for Radiation, Chemical and Environmental Hazards Public Health England Chilton, Didcot, OX11 ORQ United Kingdom
| | - Grainne O'Brien
- Cancer Mechanisms and Biomarkers Group, Radiation Effects Department, Centre for Radiation, Chemical and Environmental Hazards Public Health England Chilton, Didcot, OX11 ORQ United Kingdom
| | - Botond Sipos
- Oxford Nanopore Technologies, OX4 4DQ, Oxford, United Kingdom
| | - Simon Mayes
- Oxford Nanopore Technologies, OX4 4DQ, Oxford, United Kingdom
| | - Michael I Love
- Departments of Biostatistics.,Departments of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27516
| | - Daniel J Turner
- Oxford Nanopore Technologies, OX4 4DQ, Oxford, United Kingdom
| | - Christophe Badie
- Cancer Mechanisms and Biomarkers Group, Radiation Effects Department, Centre for Radiation, Chemical and Environmental Hazards Public Health England Chilton, Didcot, OX11 ORQ United Kingdom
| |
Collapse
|
339
|
Fichou Y, Berlivet I, Richard G, Tournamille C, Castilho L, Férec C. Defining Blood Group Gene Reference Alleles by Long-Read Sequencing: Proof of Concept in the ACKR1 Gene Encoding the Duffy Antigens. Transfus Med Hemother 2019; 47:23-32. [PMID: 32110191 DOI: 10.1159/000504584] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 11/01/2019] [Indexed: 01/31/2023] Open
Abstract
Background In the novel era of blood group genomics, (re-)defining reference gene/allele sequences of blood group genes has become an important goal to achieve, both for diagnostic and research purposes. As novel potent sequencing technologies are available, we thought to investigate the variability encountered in the three most common alleles of ACKR1, the gene encoding the clinically relevant Duffy antigens, at the haplotype level by a long-read sequencing approach. Materials and Methods After long-range PCR amplification spanning the whole ACKR1 gene locus (∼2.5 kilobases), amplicons generated from 81 samples with known genotypes were sequenced in a single read by using the Pacific Biosciences (PacBio) single molecule, real-time (SMRT) sequencing technology. Results High-quality sequencing reads were obtained for the 162 alleles (accuracy >0.999). Twenty-two nucleotide variations reported in databases were identified, defining 19 haplotypes: four, eight, and seven haplotypes in 46 ACKR1*01, 63 ACKR1*02, and 53 ACKR1*02N.01 alleles, respectively. Discussion Overall, we have defined a subset of reference alleles by third-generation (long-read) sequencing. This technology, which provides a "longitudinal" overview of the loci of interest (several thousand base pairs) and is complementary to the second-generation (short-read) next-generation sequencing technology, is of critical interest for resolving novel, rare, and null alleles.
Collapse
Affiliation(s)
- Yann Fichou
- EFS, Inserm, Univ Brest, UMR 1078, GGB, Brest, France.,Laboratoire d'Excellence GR-Ex, Paris, France
| | | | | | - Christophe Tournamille
- Laboratoire d'Excellence GR-Ex, Paris, France.,IMRB-Inserm U955 Equipe 2 Transfusion et Maladies du Globule Rouge, EFS Ile-de-France, Créteil, France
| | | | - Claude Férec
- EFS, Inserm, Univ Brest, UMR 1078, GGB, Brest, France.,Laboratoire de Génétique Moléculaire et d'Histocompatibilité, CHU Morvan, Brest, France
| |
Collapse
|
340
|
Fürst D, Tsamadou C, Neuchel C, Schrezenmeier H, Mytilineos J, Weinstock C. Next-Generation Sequencing Technologies in Blood Group Typing. Transfus Med Hemother 2019; 47:4-13. [PMID: 32110189 DOI: 10.1159/000504765] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Accepted: 11/07/2019] [Indexed: 12/14/2022] Open
Abstract
Sequencing of the human genome has led to the definition of the genes for most of the relevant blood group systems, and the polymorphisms responsible for most of the clinically relevant blood group antigens are characterized. Molecular blood group typing is used in situations where erythrocytes are not available or where serological testing was inconclusive or not possible due to the lack of antisera. Also, molecular testing may be more cost-effective in certain situations. Molecular typing approaches are mostly based on either PCR with specific primers, DNA hybridization, or DNA sequencing. Particularly the transition of sequencing techniques from Sanger-based sequencing to next-generation sequencing (NGS) technologies has led to exciting new possibilities in blood group genotyping. We describe briefly the currently available NGS platforms and their specifications, depict the genetic background of blood group polymorphisms, and discuss applications for NGS approaches in immunohematology. As an example, we delineate a protocol for large-scale donor blood group screening established and in use at our institution. Furthermore, we discuss technical challenges and limitations as well as the prospect for future developments, including long-read sequencing technologies.
Collapse
Affiliation(s)
- Daniel Fürst
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| | - Chrysanthi Tsamadou
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| | - Christine Neuchel
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| | - Hubert Schrezenmeier
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| | - Joannis Mytilineos
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| | - Christof Weinstock
- Institute for Clinical Transfusion Medicine and Immunogenetics Ulm, German Red Cross Blood Transfusion Service, Baden Wuerttemberg/Hessen, and University Hospital Ulm, Ulm, Germany.,Institute of Transfusion Medicine, University of Ulm, Ulm, Germany
| |
Collapse
|
341
|
Schwartz DC. Biophysics and the Genomic Sciences. Biophys J 2019; 117:2047-2053. [PMID: 31409480 DOI: 10.1016/j.bpj.2019.07.038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 06/27/2019] [Accepted: 07/09/2019] [Indexed: 12/20/2022] Open
Abstract
It is now rare to find biological, or genetic investigations that do not rely on the tools, data, and thinking drawn from the genomic sciences. Much of this revolution is powered by contemporary sequencing approaches that readily deliver large, genome-wide data sets that not only provide genetic insights but also uniquely report molecular outcomes from experiments that biophysicists are increasingly using for potentiating structural and mechanistic investigations. In this perspective, I describe a path of how biophysical thinking greatly contributed to this revolution in ways that parallel advancements in computer science through discussion of several key inventions, described as "foundational devices." These discussions also point at the future of how biophysics and the genomic sciences may become more finely integrated for empowering new measurement paradigms for biological investigations.
Collapse
Affiliation(s)
- David C Schwartz
- Department of Chemistry, Laboratory of Genetics, Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, Madison Wisconsin.
| |
Collapse
|
342
|
Chiara M, Zambelli F, Picardi E, Horner DS, Pesole G. Critical assessment of bioinformatics methods for the characterization of pathological repeat expansions with single-molecule sequencing data. Brief Bioinform 2019; 21:1971-1986. [DOI: 10.1093/bib/bbz099] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 06/22/2019] [Accepted: 07/09/2019] [Indexed: 01/19/2023] Open
Abstract
Abstract
A number of studies have reported the successful application of single-molecule sequencing technologies to the determination of the size and sequence of pathological expanded microsatellite repeats over the last 5 years. However, different custom bioinformatics pipelines were employed in each study, preventing meaningful comparisons and somewhat limiting the reproducibility of the results. In this review, we provide a brief summary of state-of-the-art methods for the characterization of expanded repeats alleles, along with a detailed comparison of bioinformatics tools for the determination of repeat length and sequence, using both real and simulated data. Our reanalysis of publicly available human genome sequencing data suggests a modest, but statistically significant, increase of the error rate of single-molecule sequencing technologies at genomic regions containing short tandem repeats. However, we observe that all the methods herein tested, irrespective of the strategy used for the analysis of the data (either based on the alignment or assembly of the reads), show high levels of sensitivity in both the detection of expanded tandem repeats and the estimation of the expansion size, suggesting that approaches based on single-molecule sequencing technologies are highly effective for the detection and quantification of tandem repeat expansions and contractions.
Collapse
Affiliation(s)
- Matteo Chiara
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Federico Zambelli
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Ernesto Picardi
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Via Orabona 4, 70126 Bari, Italy
| | - David S Horner
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Graziano Pesole
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Via Orabona 4, 70126 Bari, Italy
| |
Collapse
|
343
|
Wilmott P, Lisowski L, Alexander IE, Logan GJ. A User's Guide to the Inverted Terminal Repeats of Adeno-Associated Virus. Hum Gene Ther Methods 2019; 30:206-213. [DOI: 10.1089/hgtb.2019.276] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Patrick Wilmott
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, Australia
| | - Leszek Lisowski
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, Australia
- Military Institute of Hygiene and Epidemiology, The Biological Threats Identification and Countermeasure Centre, Puławy, Poland
- Vector and Genome Engineering Facility; Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, Australia
| | - Ian E. Alexander
- Discipline of Child and Adolescent Health, University of Sydney, Westmead, Australia
- Gene Therapy Research Unit, Children's Medical Research Institute and Sydney Children's Hospitals Network, University of Sydney, Westmead, Australia
| | - Grant J. Logan
- Gene Therapy Research Unit, Children's Medical Research Institute and Sydney Children's Hospitals Network, University of Sydney, Westmead, Australia
| |
Collapse
|
344
|
Vilperte V, Lucaciu CR, Halbwirth H, Boehm R, Rattei T, Debener T. Hybrid de novo transcriptome assembly of poinsettia (Euphorbia pulcherrima Willd. Ex Klotsch) bracts. BMC Genomics 2019; 20:900. [PMID: 31775622 PMCID: PMC6882326 DOI: 10.1186/s12864-019-6247-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 10/30/2019] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Poinsettia is a popular and important ornamental crop, mostly during the Christmas season. Its bract coloration ranges from pink/red to creamy/white shades. Despite its ornamental value, there is a lack of knowledge about the genetics and molecular biology of poinsettia, especially on the mechanisms of color formation. We performed an RNA-Seq analysis in order to shed light on the transcriptome of poinsettia bracts. Moreover, we analyzed the transcriptome differences of red- and white-bracted poinsettia varieties during bract development and coloration. For the assembly of a bract transcriptome, two paired-end cDNA libraries from a red and white poinsettia pair were sequenced with the Illumina technology, and one library from a red-bracted variety was used for PacBio sequencing. Both short and long reads were assembled using a hybrid de novo strategy. Samples of red- and white-bracted poinsettias were sequenced and comparatively analyzed in three color developmental stages in order to understand the mechanisms of color formation and accumulation in the species. RESULTS The final transcriptome contains 288,524 contigs, with 33% showing confident protein annotation against the TAIR10 database. The BUSCO pipeline, which is based on near-universal orthologous gene groups, was applied to assess the transcriptome completeness. From a total of 1440 BUSCO groups searched, 77% were categorized as complete (41% as single-copy and 36% as duplicated), 10% as fragmented and 13% as missing BUSCOs. The gene expression comparison between red and white varieties of poinsettia showed a differential regulation of the flavonoid biosynthesis pathway only at particular stages of bract development. An initial impairment of the flavonoid pathway early in the color accumulation process for the white poinsettia variety was observed, but these differences were no longer present in the subsequent stages of bract development. Nonetheless, GSTF11 and UGT79B10 showed a lower expression in the last stage of bract development for the white variety and, therefore, are potential candidates for further studies on poinsettia coloration. CONCLUSIONS In summary, this transcriptome analysis provides a valuable foundation for further studies on poinsettia, such as plant breeding and genetics, and highlights crucial information on the molecular mechanism of color formation.
Collapse
Affiliation(s)
- Vinicius Vilperte
- Institute of Plant Genetics, Leibniz Universität Hannover, 30419, Hannover, Germany.,Klemm + Sohn GmbH & Co., 70379, Stuttgart, KG, Germany
| | - Calin Rares Lucaciu
- Department of Microbiology and Ecosystem Science, University of Vienna, 1090, Vienna, Austria
| | - Heidi Halbwirth
- Institute of Chemical, Environmental and Bioscience Engineering, Technische Universität Wien, 1060, Vienna, Austria
| | - Robert Boehm
- Klemm + Sohn GmbH & Co., 70379, Stuttgart, KG, Germany
| | - Thomas Rattei
- Department of Microbiology and Ecosystem Science, University of Vienna, 1090, Vienna, Austria.
| | - Thomas Debener
- Institute of Plant Genetics, Leibniz Universität Hannover, 30419, Hannover, Germany.
| |
Collapse
|
345
|
Sevim V, Lee J, Egan R, Clum A, Hundley H, Lee J, Everroad RC, Detweiler AM, Bebout BM, Pett-Ridge J, Göker M, Murray AE, Lindemann SR, Klenk HP, O'Malley R, Zane M, Cheng JF, Copeland A, Daum C, Singer E, Woyke T. Shotgun metagenome data of a defined mock community using Oxford Nanopore, PacBio and Illumina technologies. Sci Data 2019; 6:285. [PMID: 31772173 PMCID: PMC6879543 DOI: 10.1038/s41597-019-0287-z] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 10/31/2019] [Indexed: 11/17/2022] Open
Abstract
Metagenomic sequence data from defined mock communities is crucial for the assessment of sequencing platform performance and downstream analyses, including assembly, binning and taxonomic assignment. We report a comparison of shotgun metagenome sequencing and assembly metrics of a defined microbial mock community using the Oxford Nanopore Technologies (ONT) MinION, PacBio and Illumina sequencing platforms. Our synthetic microbial community BMock12 consists of 12 bacterial strains with genome sizes spanning 3.2–7.2 Mbp, 40–73% GC content, and 1.5–7.3% repeats. Size selection of both PacBio and ONT sequencing libraries prior to sequencing was essential to yield comparable relative abundances of organisms among all sequencing technologies. While the Illumina-based metagenome assembly yielded good coverage with few misassemblies, contiguity was greatly improved by both, Illumina + ONT and Illumina + PacBio hybrid assemblies but increased misassemblies, most notably in genomes with high sequence similarity to each other. Our resulting datasets allow evaluation and benchmarking of bioinformatics software on Illumina, PacBio and ONT platforms in parallel. Measurement(s) | metagenomic data • sequence_assembly | Technology Type(s) | ONT MinION • Illumina sequencing • PacBio RS II | Factor Type(s) | sequencing platform | Sample Characteristic - Organism | Bacteria | Sample Characteristic - Environment | mock community |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.10260740
Collapse
Affiliation(s)
- Volkan Sevim
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Juna Lee
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Robert Egan
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Alicia Clum
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Hope Hundley
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Janey Lee
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - R Craig Everroad
- NASA Ames Research Center, Exobiology Branch, Moffett Field, CA, 94035, USA
| | - Angela M Detweiler
- NASA Ames Research Center, Exobiology Branch, Moffett Field, CA, 94035, USA.,Bay Area Environmental Research Institute, Moffett Field, CA, 94035, USA
| | - Brad M Bebout
- NASA Ames Research Center, Exobiology Branch, Moffett Field, CA, 94035, USA
| | - Jennifer Pett-Ridge
- Lawrence Livermore National Laboratory, Nuclear and Chemical Science Division, 7000 East Ave, Livermore, CA, 94550-9234, USA
| | - Markus Göker
- Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Inhoffenstraße 7B, 38124, Braunschweig, Germany
| | - Alison E Murray
- Desert Research Institute, Division of Earth and Ecosystem Sciences, 2215 Raggio Pkwy, Reno, NV, 89512, USA
| | | | - Hans-Peter Klenk
- Newcastle University, School of Natural and Environmental Sciences, Ridley Building 2, Newcastle upon Tyne, NE1 7RU, UK
| | - Ronan O'Malley
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Matthew Zane
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Jan-Fang Cheng
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Alex Copeland
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Christopher Daum
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Esther Singer
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA. .,Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA.
| | - Tanja Woyke
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.
| |
Collapse
|
346
|
Analysis of the Complete Genome Sequence of the Widely Studied Strain Bradyrhizobium betae PL7HG1 T Reveals the Presence of Photosynthesis Genes and a Putative Plasmid. Microbiol Resour Announc 2019; 8:8/46/e01282-19. [PMID: 31727720 PMCID: PMC6856286 DOI: 10.1128/mra.01282-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Here, we present the complete genome sequence of the widely studied strain Bradyrhizobium betae PL7HG1T, isolated from a tumor on the roots of sugar beet. The genome consists of a 7.2-Mbp circular chromosome containing key photosynthesis genes but not genes for nodulation and nitrogen fixation. A putative plasmid was also detected. Here, we present the complete genome sequence of the widely studied strain Bradyrhizobium betae PL7HG1T, isolated from a tumor on the roots of sugar beet. The genome consists of a 7.2-Mbp circular chromosome containing key photosynthesis genes but not genes for nodulation and nitrogen fixation. A putative plasmid was also detected.
Collapse
|
347
|
Du GX, Qu LY, Shang K, Gao P, Ding DW, Sun CJ. Complete Genome Sequence of Strain YCSC6, a Marine Bacterium Isolated from Saturated Saltpan with Activity Against Uronema marinum. Curr Microbiol 2019; 77:129-135. [PMID: 31691022 DOI: 10.1007/s00284-019-01796-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/18/2019] [Indexed: 11/30/2022]
Abstract
Salinivibrio proteolyticus strain YCSC6 was isolated from a saturated saltpan and demonstrated to have strong insecticidal activity against turbot's pathogenic ciliate-Uronema marinum. In this study, we sequenced its complete genome. Results showed that it consists of two circular chromosomes: 2.49 Mbps and 0.74 Mbps, respectively. It encodes 3429 protein-coding sequences. Biosynthetic gene clusters predicted to synthesize bacteriocins and antimicrobial peptides were discovered, which might be the key factors to lyse and kill U. marinum. The complete genome sequence of strain YCSC6 provides insights into the fundamental genetic potential for elucidating its insecticidal mechanism against U. marinum.
Collapse
Affiliation(s)
- Guang Xun Du
- School of Marine Sciences, Ningbo University, Ningbo, 315823, People's Republic of China.,First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China
| | - Ling Yun Qu
- First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China. .,Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, People's Republic of China.
| | - Kun Shang
- First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China.,College of Fisheries and Life Science, Dalian Ocean University, Dalian, 116023, People's Republic of China
| | - Ping Gao
- First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China
| | - De Wen Ding
- School of Marine Sciences, Ningbo University, Ningbo, 315823, People's Republic of China.,First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China
| | - Cheng Jun Sun
- First Institute of Oceanography, MNR, Qingdao, 266061, People's Republic of China
| |
Collapse
|
348
|
Mei-Han, Qingshan-Wang, Baiyintala, Wuhanqimuge. The whole-genome sequence analysis of Morchella sextelata. Sci Rep 2019; 9:15376. [PMID: 31653932 PMCID: PMC6814724 DOI: 10.1038/s41598-019-51831-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 09/20/2019] [Indexed: 12/20/2022] Open
Abstract
Morchella are macrofungi and are also called morels, as they exhibit a morel-like upper cap structure. Morels contain abundant essential amino acids, vitamins and biologically active compounds, which provide substantial health benefits. Approximately 80 species of Morchella have been reported, and even more species have been isolated. However, the lack of wild Morchella resources and the difficulties associated with culturing Morchella have caused a shortage in the morels available for daily consumption. Additionally, in-depth genomic and morphological studies are still needed. In this study, to provide genomic data for further investigations of culturing techniques and the biological functions of Morchella sextelata (M. sextelata), de novo genome sequencing was carried out on the Illumina HiSeq. 4000 platform using both the Illumina 150 and PacBio systems. The final estimated genome size of M. sextelata was 52.93 Mb, containing 59 contigs and a GC content of 47.37%. A total of 9,550 protein-coding genes were annotated. In addition, the repeat sequences, gene components and gene functions were analyzed using various databases. Furthermore, the secondary metabolite gene clusters and the predicted structures of their products were analyzed. Finally, a genomic comparison of different species of Morchella was performed.
Collapse
Affiliation(s)
- Mei-Han
- Inner Mongolia ZhongXing Agriculture and Animal Husbandry Development Co., Ltd., Ulanqab, China
| | - Qingshan-Wang
- Inner Mongolia ZhongXing Agriculture and Animal Husbandry Development Co., Ltd., Ulanqab, China
| | - Baiyintala
- Inner Mongolia Mang Lai Food Co., Ltd., Hohhot, China
| | - Wuhanqimuge
- Innovative Mongolian Pharmaceutical Preparations Laboratory of Inner Mongolia, Inner Mongolia International Mongolian Hospital, Hohhot, China.
| |
Collapse
|
349
|
Kim M, Lee S, Lim S, Kim S. SpliceHetero: An information theoretic approach for measuring spliceomic intratumor heterogeneity from bulk tumor RNA-seq. PLoS One 2019; 14:e0223520. [PMID: 31644551 PMCID: PMC6808416 DOI: 10.1371/journal.pone.0223520] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Accepted: 09/23/2019] [Indexed: 01/19/2023] Open
Abstract
Motivation Intratumor heterogeneity (ITH) represents the diversity of cell populations that make up cancer tissue. The level of ITH in a tumor is usually measured by a genomic variation profile, such as copy number variation and somatic mutation. However, a recent study has identified ITH at the transcriptome level and suggested that ITH at gene expression levels is useful for predicting prognosis. Measuring ITH levels at the spliceome level is a natural extension. There are serious technical challenges in measuring spliceomic ITH (sITH) from bulk tumor RNA sequencing (RNA-seq) due to the complex splicing patterns. Results We propose an information-theoretic method to measure the sITH of bulk tumors to overcome the above challenges. This method has been extensively tested in experiments using synthetic data, xenograft tumor data, and TCGA pan-cancer data. As a result, we showed that sITH is closely related to cancer progression and clonal heterogeneity, along with clinically significant features such as cancer stage, survival outcome and PAM50 subtype. As far as we know, it is the first study to define ITH at the spliceome level. This method can greatly improve the understanding of cancer spliceome and has great potential as a diagnostic and prognostic tool.
Collapse
Affiliation(s)
- Minsu Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Korea
| | - Sangseon Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Korea
| | - Sangsoo Lim
- Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Korea
- Bioinformatics Institute, Seoul National University, Seoul, 08826, Korea
- * E-mail:
| |
Collapse
|
350
|
Abstract
Colonization of the intestine with Oxalobacter formigenes reduces urinary oxalate excretion and lowers the risk of forming calcium oxalate kidney stones. Here, we report the genome sequence of Oxalobacter formigenes SSYG-15, a strain isolated from a stool sample from a healthy Chinese boy. Colonization of the intestine with Oxalobacter formigenes reduces urinary oxalate excretion and lowers the risk of forming calcium oxalate kidney stones. Here, we report the genome sequence of Oxalobacter formigenes SSYG-15, a strain isolated from a stool sample from a healthy Chinese boy.
Collapse
|