1
|
Aplakidou E, Vergoulidis N, Chasapi M, Venetsianou NK, Kokoli M, Panagiotopoulou E, Iliopoulos I, Karatzas E, Pafilis E, Georgakopoulos-Soares I, Kyrpides NC, Pavlopoulos GA, Baltoumas FA. Visualizing metagenomic and metatranscriptomic data: A comprehensive review. Comput Struct Biotechnol J 2024; 23:2011-2033. [PMID: 38765606 PMCID: PMC11101950 DOI: 10.1016/j.csbj.2024.04.060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024] Open
Abstract
The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.
Collapse
Affiliation(s)
- Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nikolaos Vergoulidis
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Chasapi
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Kokoli
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Eleni Panagiotopoulou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Ioannis Iliopoulos
- Department of Basic Sciences, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikos C. Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Center of New Biotechnologies & Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Greece
- Hellenic Army Academy, 16673 Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| |
Collapse
|
2
|
Kim S, Heo J, Choi H, Lee D, Kwon SW, Kim Y. Sphingobacterium oryzagri sp. nov., isolated from rice paddy soil. Int J Syst Evol Microbiol 2024; 74. [PMID: 38713185 DOI: 10.1099/ijsem.0.006371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2024] Open
Abstract
An aerobic, Gram-stain-negative and short rod-shaped bacterial strain, designated M6-31T, was isolated from rice paddy soil sampled in Miryang, Republic of Korea. Growth was observed at 4-35 °C (optimum, 28 °C), pH 6.0-9.0 (optimum, pH 7.0-8.0) and in the presence of 0-4 % (w/v) NaCl (optimum, 0 % w/v). Phylogenetic analysis based on 16S rRNA gene sequences grouped strain M6-31T with Sphingobacterium bambusae IBFC2009T, Sphingobacterium griseoflavum SCU-B140T and Sphingobacterium solani MLS-26-JM13-11T in the same clade, with the 16S rRNA gene sequence similarities ranging from 95.8 to 96.6 %. A genome-based phylogenetic tree reconstructed by using all publicly available Sphingobacterium genomes placed strain M6-31T with S. bambusae KACC 22910T, 'Sphingobacterium deserti' ACCC 05744T, S. griseoflavum CGMCC 1.12966T and Sphingobacterium paludis CGMCC 1.12801T. Orthologous average nucleotide identity and digital DNA-DNA hybridization values between strain M6-31T and its closely related strains were lower than 74.6 and 22.0 %, respectively. The respiratory quinone was menaquinone-7, and the major polar lipid was phosphatidylethanolamine. The major fatty acids (>10 %) were C15 : 0 iso, C17 : 0 iso 3OH and summed feature 3. The phenotypic, chemotaxonomic and genotypic data obtained in this study showed that strain M6-31T represents a novel species of the genus Sphingobacterium, for which the name Sphingobacterium oryzagri sp. nov. (type strain M6-31T=KACC 22765T=JCM 35893T) is proposed.
Collapse
Affiliation(s)
- Seunghwan Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
| | - Jun Heo
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
| | - Hyorim Choi
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
- Department of Biotechnology, Jeonbuk National University, Iksan 55496, Republic of Korea
| | - Daseul Lee
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
| | - Soon-Wo Kwon
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
| | - Yiseul Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 55365, Republic of Korea
| |
Collapse
|
3
|
Kim C, Pongpanich M, Porntaveetus T. Unraveling metagenomics through long-read sequencing: a comprehensive review. J Transl Med 2024; 22:111. [PMID: 38282030 PMCID: PMC10823668 DOI: 10.1186/s12967-024-04917-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 01/21/2024] [Indexed: 01/30/2024] Open
Abstract
The study of microbial communities has undergone significant advancements, starting from the initial use of 16S rRNA sequencing to the adoption of shotgun metagenomics. However, a new era has emerged with the advent of long-read sequencing (LRS), which offers substantial improvements over its predecessor, short-read sequencing (SRS). LRS produces reads that are several kilobases long, enabling researchers to obtain more complete and contiguous genomic information, characterize structural variations, and study epigenetic modifications. The current leaders in LRS technologies are Pacific Biotechnologies (PacBio) and Oxford Nanopore Technologies (ONT), each offering a distinct set of advantages. This review covers the workflow of long-read metagenomics sequencing, including sample preparation (sample collection, sample extraction, and library preparation), sequencing, processing (quality control, assembly, and binning), and analysis (taxonomic annotation and functional annotation). Each section provides a concise outline of the key concept of the methodology, presenting the original concept as well as how it is challenged or modified in the context of LRS. Additionally, the section introduces a range of tools that are compatible with LRS and can be utilized to execute the LRS process. This review aims to present the workflow of metagenomics, highlight the transformative impact of LRS, and provide researchers with a selection of tools suitable for this task.
Collapse
Affiliation(s)
- Chankyung Kim
- Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand
- Graduate Program in Bioinformatics and Computational Biology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
| | - Monnat Pongpanich
- Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
- Center of Excellence for Cancer and Inflammation, Chulalongkorn University, Bangkok, Thailand
| | - Thantrira Porntaveetus
- Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand.
- Graduate Program in Geriatric and Special Patients Care, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand.
| |
Collapse
|
4
|
Kim S, Heo J, Kwon SW, Lee D, Han BH, Hong SB, Kim Y. Brevibacillus ruminantium sp. nov., isolated from cow faeces. Int J Syst Evol Microbiol 2023; 73. [PMID: 38097527 DOI: 10.1099/ijsem.0.006204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2023] Open
Abstract
An aerobic, Gram-stain-positive, rod-shaped, endospore-forming bacterial strain, designated BB3-R1T, was isolated from cow faeces sampled in Daejeon, Republic of Korea. Growth was observed at 25-45 °C (optimum, 35-40 °C) and pH 7.0-9.0 (optimum, pH 8.0), with up to 3 % (w/v) NaCl (optimum, 0 % NaCl). blast analysis of 16S rRNA gene sequences revealed the highest sequence similarity of strain BB3-R1T to Brevibacillus borstelensis NRRL NRS-818T (98.8 %) followed by Brevibacillus panacihumi JCM 15085T (97.5 %). According to 16S rRNA gene and whole-genome based phylogenetic trees, strain BB3-R1T clustered with Brevibacillus composti FJAT-54423T and B. borstelensis NRRL NRS-818T. OrthoANI and dDDH values of strain BB3-R1T with the closely related strains were lower than 77.5 and 26.8 %, respectively. The major menaquinones and polar lipids of the strain were MK-7 and phosphatidylmonomethylethanolamine, diphosphatidylglycerol, phosphatidylglycerol and phosphatidylethanolamine, respectively. The major fatty acids (>10 %) were C14 : 0 iso, C15 : 0 iso, C15 : 0 anteiso and C16 : 1 ω7c alcohol. The cell-wall peptidoglycan contained cross-linked meso-diaminopimelic acid (type A1 gamma). The phenotypic, chemotaxonomic and genotypic data obtained in this study showed that the strain represents a novel species of the genus Brevibacillus, for which the name Brevibacillus ruminantium sp. nov. (type strain BB3-R1T=KACC 22663T=NBRC 115962T) is proposed.
Collapse
Affiliation(s)
- Seunghwan Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Jun Heo
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Soon-Wo Kwon
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Daseul Lee
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Byeong-Hak Han
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Seung-Beom Hong
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| | - Yiseul Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, Jeollabuk-do 55365, Republic of Korea
| |
Collapse
|
5
|
Kim Y, Kim S, Kwon SW, Weon HY, Naito H, Asano T, Hamada M, Heo J. Halobacillus salinarum sp. nov., Halobacillus shinanisalinarum sp. nov. and Halobacillus amylolyticus sp. nov., isolated from saltern soil. Int J Syst Evol Microbiol 2023; 73. [PMID: 37882660 DOI: 10.1099/ijsem.0.006098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2023] Open
Abstract
Three bacterial strains, designated SSBR10-3T, SSTM10-2T and SSHM10-5T, were isolated from saltern soil sampled in Jeollanam-do, Republic of Korea. Cells were aerobic, Gram-stain-positive, flagellated and rod-shaped. The strains grew optimally at 28°C and at pH 7.0. Phylogenetic analysis based on 16S rRNA gene sequences indicated that strains SSBR10-3T, SSTM10-2T and SSHM10-5T were placed within the genus Halobacillus, showing the highest similarity to Halobacillus alkaliphilus FP5T (98.6 %), 'Halobacillus ihumii' Marseille-Q1234T (98.5 %) and Halobacillus locisalis MSS-155T (98.6 %), respectively. The genomic similarity values between strains SSBR10-3T, SSTM10-2T and SSHM10-5T and their related species were 17.6-22.6 % for digital DNA-DNA hybridization (dDDH) and 69.6-78.5 % for orthologous average nucleotide identity (OrthoANI), which were lower than the thresholds recommended for species delineation. The dDDH and OrthoANI values among the three strains were below 38.3 and 89.4 %, respectively. Besides the differences in genomic features, strains SSBR10-3T, SSTM10-2T and SSHM10-5T were distinct from each other and from members of the genus in terms of phenotypic traits related to substrate assimilation. The cell-wall peptidoglycan contained meso-diaminopimelic acid, the major fatty acids were anteiso-C15 : 0, iso-C16 : 0 and anteiso-C17 : 0, and the predominant menaquinone was MK-7 for all three strains. Diphosphatidylglycerol, phosphatidylglycerol and an unidentified phospholipid were present in their polar lipid profiles. Based on a polyphasic approach incorporating genomic data, strains SSBR10-3T, SSTM10-2T and SSHM10-5T represent novel species, for which the names Halobacillus salinarum sp. nov. (SSBR10-3T=DSM 114353T=KACC 21935T=NBRC 115504T), Halobacillus shinanisalinarum sp. nov. (SSTM10-2T=DSM 114354T=KACC 21936T=NBRC 115505T) and Halobacillus amylolyticus sp. nov. (SSHM10-5T=DSM 114355T= KACC 21937T=NBRC 115506T) are proposed.
Collapse
Affiliation(s)
- Yiseul Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeollabuk-do 55365, Republic of Korea
| | - Seunghwan Kim
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeollabuk-do 55365, Republic of Korea
| | - Soon-Wo Kwon
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeollabuk-do 55365, Republic of Korea
| | - Hang-Yeon Weon
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeollabuk-do 55365, Republic of Korea
| | - Hanako Naito
- NITE Biological Resource Center (NBRC), National Institute of Technology and Evaluation, 2-5-8, Kazusakamatari, Kisarazu, Chiba 292-0818, Japan
| | - Tomomi Asano
- NITE Biological Resource Center (NBRC), National Institute of Technology and Evaluation, 2-5-8, Kazusakamatari, Kisarazu, Chiba 292-0818, Japan
| | - Moriyuki Hamada
- NITE Biological Resource Center (NBRC), National Institute of Technology and Evaluation, 2-5-8, Kazusakamatari, Kisarazu, Chiba 292-0818, Japan
| | - Jun Heo
- Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeollabuk-do 55365, Republic of Korea
| |
Collapse
|
6
|
Srinivasan S, Austin MN, Fiedler TL, Strenk SM, Agnew KJ, Gowda GAN, Raftery D, Beamer MA, Achilles SL, Wiesenfeld HC, Fredricks DN, Hillier SL. Amygdalobacter indicium gen. nov., sp. nov., and Amygdalobacter nucleatus sp. nov., gen. nov.: novel bacteria from the family Oscillospiraceae isolated from the female genital tract. Int J Syst Evol Microbiol 2023; 73:006017. [PMID: 37787404 PMCID: PMC11318147 DOI: 10.1099/ijsem.0.006017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 07/17/2023] [Indexed: 10/04/2023] Open
Abstract
Four obligately anaerobic Gram-positive bacteria representing one novel genus and two novel species were isolated from the female genital tract. Both novel species, designated UPII 610-JT and KA00274T, and an additional isolate of each species were characterized utilizing biochemical, genotypic and phylogenetic analyses. All strains were non-motile and non-spore forming, asaccharolytic, non-cellulolytic and indole-negative coccobacilli. Fatty acid methyl ester analysis for UPII 610-JT and KA00274T and additional isolates revealed C16 : 0, C18 : 0, C18:1ω9c and C18:2ω6,9c to be the major fatty acids for both species. UPII 610-JT had a 16S rRNA gene sequence similarity of 99.4 % to an uncultured clone sequence (AY724740) designated as Bacterial Vaginosis Associated Bacterium 2 (BVAB2). KA00274T had a 16S rRNA gene sequence similarity of 96.5 % to UPII 610-JT. Whole genomic DNA mol% G+C content was 42.2 and 39.3 % for UPII 610-JT and KA00274T, respectively. Phylogenetic analyses indicate these isolates represent a novel genus and two novel species within the Oscillospiraceae family. We propose the names Amygdalobacter indicium gen. nov., sp. nov., for UPII 610-JT representing the type strain of this species (=DSM 112989T, =ATCC TSD-274T) and Amygdalobacter nucleatus gen. nov., sp. nov., for KA00274T representing the type strain of this species (=DSM 112988T, =ATCC TSD-275T).
Collapse
Affiliation(s)
- Sujatha Srinivasan
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | | | - Tina L. Fiedler
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Susan M. Strenk
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Kathy J. Agnew
- Division of Gynecologic Oncology, Department of Obstetrics & Gynecology, University of Washington Medical Center, Seattle, WA, USA
| | - G. A. Nagana Gowda
- Northwest Metabolomics Research Center and Mitochondrial and Metabolism Center, Anesthesiology and Pain Medicine, University of Washington Medical Center, Seattle, WA, USA
| | - Daniel Raftery
- Northwest Metabolomics Research Center and Mitochondrial and Metabolism Center, Anesthesiology and Pain Medicine, University of Washington Medical Center, Seattle, WA, USA
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - May A. Beamer
- Magee-Womens Research Institute, Pittsburgh, PA, USA
| | - Sharon L. Achilles
- Magee-Womens Research Institute, Pittsburgh, PA, USA
- University of Pittsburgh School of Medicine, Department of Obstetrics, Gynecology and Reproductive Sciences, Pittsburgh PA, USA
| | - Harold C. Wiesenfeld
- Magee-Womens Research Institute, Pittsburgh, PA, USA
- University of Pittsburgh School of Medicine, Department of Obstetrics, Gynecology and Reproductive Sciences, Pittsburgh PA, USA
| | - David N. Fredricks
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Sharon L. Hillier
- Magee-Womens Research Institute, Pittsburgh, PA, USA
- University of Pittsburgh School of Medicine, Department of Obstetrics, Gynecology and Reproductive Sciences, Pittsburgh PA, USA
| |
Collapse
|
7
|
Leung W, Torosin N, Cao W, Reed LK, Arrigo C, Elgin SCR, Ellison CE. Long-read genome assemblies for the study of chromosome expansion: Drosophila kikkawai, Drosophila takahashii, Drosophila bipectinata, and Drosophila ananassae. G3 (BETHESDA, MD.) 2023; 13:jkad191. [PMID: 37611223 PMCID: PMC10542312 DOI: 10.1093/g3journal/jkad191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 08/01/2023] [Accepted: 08/04/2023] [Indexed: 08/25/2023]
Abstract
Flow cytometry estimates of genome sizes among species of Drosophila show a 3-fold variation, ranging from ∼127 Mb in Drosophila mercatorum to ∼400 Mb in Drosophila cyrtoloma. However, the assembled portion of the Muller F element (orthologous to the fourth chromosome in Drosophila melanogaster) shows a nearly 14-fold variation in size, ranging from ∼1.3 Mb to >18 Mb. Here, we present chromosome-level long-read genome assemblies for 4 Drosophila species with expanded F elements ranging in size from 2.3 to 20.5 Mb. Each Muller element is present as a single scaffold in each assembly. These assemblies will enable new insights into the evolutionary causes and consequences of chromosome size expansion.
Collapse
Affiliation(s)
- Wilson Leung
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Nicole Torosin
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| | - Weihuan Cao
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| | - Laura K Reed
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL 35487, USA
| | - Cindy Arrigo
- Department of Biology, New Jersey City University, Jersey City, NJ 07305, USA
| | - Sarah C R Elgin
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Christopher E Ellison
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
8
|
Leung W, Torosin N, Cao W, Reed LK, Arrigo C, Elgin SCR, Ellison CE. Long-read genome assemblies for the study of chromosome expansion: Drosophila kikkawai , Drosophila takahashii , Drosophila bipectinata , and Drosophila ananassae. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.22.541758. [PMID: 37292993 PMCID: PMC10245892 DOI: 10.1101/2023.05.22.541758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Flow cytometry estimates of genome sizes among species of Drosophila show a 3-fold variation, ranging from ∼127 Mb in Drosophila mercatorum to ∼400 Mb in Drosophila cyrtoloma . However, the assembled portion of the Muller F Element (orthologous to the fourth chromosome in Drosophila melanogaster ) shows a nearly 14-fold variation in size, ranging from ∼1.3 Mb to > 18 Mb. Here, we present chromosome-level long read genome assemblies for four Drosophila species with expanded F Elements ranging in size from 2.3 Mb to 20.5 Mb. Each Muller Element is present as a single scaffold in each assembly. These assemblies will enable new insights into the evolutionary causes and consequences of chromosome size expansion.
Collapse
Affiliation(s)
- Wilson Leung
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Nicole Torosin
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| | - Weihuan Cao
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| | - Laura K Reed
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, Alabama, 35487, USA
| | - Cindy Arrigo
- Department of Biology, New Jersey City University, Jersey City, NJ 07305, USA
| | - Sarah C R Elgin
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Christopher E Ellison
- Department of Genetics and Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
9
|
Avershina E, Khezri A, Ahmad R. Clinical Diagnostics of Bacterial Infections and Their Resistance to Antibiotics-Current State and Whole Genome Sequencing Implementation Perspectives. Antibiotics (Basel) 2023; 12:antibiotics12040781. [PMID: 37107143 PMCID: PMC10135054 DOI: 10.3390/antibiotics12040781] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 03/19/2023] [Accepted: 04/13/2023] [Indexed: 04/29/2023] Open
Abstract
Antimicrobial resistance (AMR), defined as the ability of microorganisms to withstand antimicrobial treatment, is responsible for millions of deaths annually. The rapid spread of AMR across continents warrants systematic changes in healthcare routines and protocols. One of the fundamental issues with AMR spread is the lack of rapid diagnostic tools for pathogen identification and AMR detection. Resistance profile identification often depends on pathogen culturing and thus may last up to several days. This contributes to the misuse of antibiotics for viral infection, the use of inappropriate antibiotics, the overuse of broad-spectrum antibiotics, or delayed infection treatment. Current DNA sequencing technologies offer the potential to develop rapid infection and AMR diagnostic tools that can provide information in a few hours rather than days. However, these techniques commonly require advanced bioinformatics knowledge and, at present, are not suited for routine lab use. In this review, we give an overview of the AMR burden on healthcare, describe current pathogen identification and AMR screening methods, and provide perspectives on how DNA sequencing may be used for rapid diagnostics. Additionally, we discuss the common steps used for DNA data analysis, currently available pipelines, and tools for analysis. Direct, culture-independent sequencing has the potential to complement current culture-based methods in routine clinical settings. However, there is a need for a minimum set of standards in terms of evaluating the results generated. Additionally, we discuss the use of machine learning algorithms regarding pathogen phenotype detection (resistance/susceptibility to an antibiotic).
Collapse
Affiliation(s)
- Ekaterina Avershina
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata, 222317 Hamar, Norway
| | - Abdolrahman Khezri
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata, 222317 Hamar, Norway
| | - Rafi Ahmad
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata, 222317 Hamar, Norway
- Institute of Clinical Medicine, Faculty of Health Science, UiT The Arctic University of Norway, Hansine Hansens veg, 189019 Tromsø, Norway
| |
Collapse
|
10
|
Ibañez-Lligoña M, Colomer-Castell S, González-Sánchez A, Gregori J, Campos C, Garcia-Cehic D, Andrés C, Piñana M, Pumarola T, Rodríguez-Frias F, Antón A, Quer J. Bioinformatic Tools for NGS-Based Metagenomics to Improve the Clinical Diagnosis of Emerging, Re-Emerging and New Viruses. Viruses 2023; 15:v15020587. [PMID: 36851800 PMCID: PMC9965957 DOI: 10.3390/v15020587] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/16/2023] [Accepted: 02/17/2023] [Indexed: 02/24/2023] Open
Abstract
Epidemics and pandemics have occurred since the beginning of time, resulting in millions of deaths. Many such disease outbreaks are caused by viruses. Some viruses, particularly RNA viruses, are characterized by their high genetic variability, and this can affect certain phenotypic features: tropism, antigenicity, and susceptibility to antiviral drugs, vaccines, and the host immune response. The best strategy to face the emergence of new infectious genomes is prompt identification. However, currently available diagnostic tests are often limited for detecting new agents. High-throughput next-generation sequencing technologies based on metagenomics may be the solution to detect new infectious genomes and properly diagnose certain diseases. Metagenomic techniques enable the identification and characterization of disease-causing agents, but they require a large amount of genetic material and involve complex bioinformatic analyses. A wide variety of analytical tools can be used in the quality control and pre-processing of metagenomic data, filtering of untargeted sequences, assembly and quality control of reads, and taxonomic profiling of sequences to identify new viruses and ones that have been sequenced and uploaded to dedicated databases. Although there have been huge advances in the field of metagenomics, there is still a lack of consensus about which of the various approaches should be used for specific data analysis tasks. In this review, we provide some background on the study of viral infections, describe the contribution of metagenomics to this field, and place special emphasis on the bioinformatic tools (with their capabilities and limitations) available for use in metagenomic analyses of viral pathogens.
Collapse
Affiliation(s)
- Marta Ibañez-Lligoña
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
- Biochemistry and Molecular Biology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
| | - Sergi Colomer-Castell
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
- Biochemistry and Molecular Biology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
| | - Alejandra González-Sánchez
- Microbiology Department, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Josep Gregori
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Carolina Campos
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
- Biochemistry and Molecular Biology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
| | - Damir Garcia-Cehic
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
| | - Cristina Andrés
- Microbiology Department, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Maria Piñana
- Microbiology Department, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Tomàs Pumarola
- Microbiology Department, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Microbiology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
| | - Francisco Rodríguez-Frias
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
- Department of Basic Sciences, Universitat Internacional de Catalunya, Sant Cugat del Vallès, 08195 Barcelona, Spain
| | - Andrés Antón
- Microbiology Department, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Microbiology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
| | - Josep Quer
- Liver Diseases-Viral Hepatitis, Liver Unit, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Av. Monforte de Lemos, 3-5, 28029 Madrid, Spain
- Biochemistry and Molecular Biology Department, Universitat Autònoma de Barcelona (UAB), Campus de la UAB, Plaça Cívica, 08193 Bellaterra, Spain
- Correspondence:
| |
Collapse
|
11
|
López-Solano A, Nester TL, Perea S, Doadrio I. Complete mitochondrial genome of the Spanish toothcarp, Aphanius iberus (Valenciennes, 1846) (Actinopterygii, Aphaniidae) and its phylogenetic position within the Cyprinodontiformes order. Mol Biol Rep 2023; 50:2953-2962. [PMID: 36650373 PMCID: PMC10042920 DOI: 10.1007/s11033-022-08236-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 12/22/2022] [Indexed: 01/19/2023]
Abstract
BACKGROUND The Spanish toothcarp (Aphanius iberus Valenciennes, 1846) is a small fish endemic to the eastern coastline of the Iberian Peninsula and is currently listed as "Endangered" (category IUCN: EN). It mainly inhabits brackish waters which can exhibit large fluctuations in temperature and salinity throughout the year. The genetics of A. iberus are not well-known since most studies have only evaluated the genetic structure of the species under a conservation framework in order to identify its potential conservation units. Different phylogenetic relationships of Aphanius have been published based on some particular genes. In the present study, the entire mitochondrial genome of A. iberus was obtained for the first time in the context of an A. iberus reference genome and a hypothesis regarding its phylogenetic position was considered. METHODS AND RESULTS The mitogenome (a circular doble-stranded DNA sequence of 16,708 bp) was reconstructed and aligned against 83 Cyprinodontiformes and two outgroup taxa to identify the phylogenetic position of A. iberus. PartitionFinder was first used to test for the best evolutionary model and the phylogenetic analyses were performed using two methods: Maximun-Likelihood Approximation (IQ-Tree) and Bayesian inference (MrBayes). Our results show that A. iberus forms a sister group with Orestias ascotanensis, a cyprinodontiform species native to South America. CONCLUSIONS The results were congruent with the traditional morphometric reconstructed trees and with a geological vicariant hypothesis involving Cyprinodontiformes where Aphaniidae is shown as a monophyletic family separated from the family Cyprinodontidae. The information gathered from this study is not only valuable for improving our understanding of the evolutionary history of A. iberus, but for future genomic studies involving the species.
Collapse
Affiliation(s)
- Alfonso López-Solano
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain.
| | - Tessa Lynn Nester
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Silvia Perea
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain.,Departamento de Zoología, Instituto de Biología, Universidad Nacional Autónoma de México, Tercer Circuito Exterior S/N, C.P. 04510, Ciudad de Mexico, Mexico
| | - Ignacio Doadrio
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| |
Collapse
|
12
|
DNA Methyltransferase Regulates Nitric Oxide Homeostasis and Virulence in a Chronically Adapted Pseudomonas aeruginosa Strain. mSystems 2022; 7:e0043422. [PMID: 36106744 PMCID: PMC9600465 DOI: 10.1128/msystems.00434-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Opportunistic pathogens such as Pseudomonas aeruginosa adapt their genomes rapidly during chronic infections. Understanding their epigenetic regulation may provide biomarkers for diagnosis and reveal novel regulatory mechanisms. We performed single-molecule real-time sequencing (SMRT-seq) to characterize the methylome of a chronically adapted P. aeruginosa clinical strain, TBCF10839. Two N6-methyladenine (6mA) methylation recognition motifs (RCCANNNNNNNTGAR and TRGANNNNNNTGC [modification sites are in bold]) were identified and predicted as new type I methylation sites using REBASE analysis. We confirmed that the motif TRGANNNNNNTGC was methylated by the methyltransferase (MTase) M.PaeTBCFII, according to methylation sensitivity assays in vivo and vitro. Transcriptomic analysis showed that a ΔpaeTBCFIIM knockout mutant significantly downregulated nitric oxide reductase (NOR) regulation and expression of coding genes such as nosR and norB, which contain methylated motifs in their promoters or coding regions. The ΔpaeTBCFIIM strain exhibited reduced intercellular survival capacity in NO-producing RAW264.7 macrophages and attenuated virulence in a Galleria mellonella infection model; the complemented strain recovered these defective phenotypes. Further phylogenetic analysis demonstrated that homologs of M.PaeTBCFII occur frequently in P. aeruginosa as well as other bacterial species. Our work therefore provided new insights into the relationship between DNA methylation, NO detoxification, and bacterial virulence, laying a foundation for further exploring the molecular mechanism of DNA methyltransferase in regulating the pathogenicity of P. aeruginosa. IMPORTANCE Pseudomonas aeruginosa is an opportunistic pathogen which causes acute and chronic infections that are difficult to treat. Comparative genomic analysis has showed broad genome diversity among P. aeruginosa clinical strains and revealed their different regulatory traits compared to the laboratory strains. While current investigation of the epigenetics of P. aeruginosa is still lacking, understanding epigenetic regulation may provide biomarkers for diagnosis and facilitate development of novel therapies. Denitrification capability is critical for microbial versatility in response to different environmental stress conditions, including the bacterial infection process, where nitric oxide (NO) can be generated by phagocytic cells. The denitrification regulation mechanisms have been studied intensively at genetic and biochemical levels. However, there is very little evidence about the epigenetic regulation of bacterial denitrification mechanism. P. aeruginosa TBCF10839 is a chronically host-adapted strain isolated from a cystic fibrosis (CF) patient with special antiphagocytosis characteristics. Here, we investigated the regulatory effect of an orphan DNA MTase, M.PaeTBCFII, in P. aeruginosa TBCF10839. We demonstrated that the DNA MTase regulates the transcription of denitrification genes represented by NOR and affects antiphagocytic ability in bacteria. In silico analysis suggested that DNA methylation modification may enhance gene expression by affecting the binding of transacting factors such as DNR and RpoN. Our findings not only deepen the understanding of the role of DNA MTase in transcriptional regulation in P. aeruginosa but also provide a theoretical foundation for the in-depth study of the molecular mechanism of the epigenetic regulation on denitrification, virulence, and host-pathogen interaction.
Collapse
|
13
|
London EW, Roca AL, Novakofski JE, Mateus-Pinilla NE. A De Novo Chromosome-Level Genome Assembly of the White-Tailed Deer, Odocoileus Virginianus. J Hered 2022; 113:479-489. [PMID: 35511871 PMCID: PMC9308042 DOI: 10.1093/jhered/esac022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 05/05/2022] [Indexed: 11/12/2022] Open
Abstract
Cervids are distinguished by the shedding and regrowth of antlers. Furthermore, they provide insights into prion and other diseases. Genomic resources can facilitate studies of the genetic underpinnings of deer phenotypes, behavior, and disease resistance. Widely distributed in North America, the white-tailed deer (Odocoileus virginianus) has recreational, commercial, and food source value for many households. We present a genome generated using DNA from a single Illinois white-tailed sequenced on the PacBio Sequel II platform and assembled using Wtdbg2. Omni-C chromatin conformation capture sequencing was used to scaffold the genome contigs. The final assembly was 2.42 Gb, consisting of 508 scaffolds with a contig N50 of 21.7 Mb, a scaffold N50 of 52.4 Mb, and a BUSCO complete score of 93.1%. Thirty-six chromosome pseudomolecules comprised 93% of the entire sequenced genome length. A total of 20 651 predicted genes using the BRAKER pipeline were validated using InterProScan. Chromosome length assembly sequences were aligned to the genomes of related species to reveal corresponding chromosomes.
Collapse
Affiliation(s)
- Evan W London
- Illinois Natural History Survey-Prairie Research Institute, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Alfred L Roca
- Illinois Natural History Survey-Prairie Research Institute, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Jan E Novakofski
- Illinois Natural History Survey-Prairie Research Institute, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Nohra E Mateus-Pinilla
- Illinois Natural History Survey-Prairie Research Institute, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
14
|
Hu Y, Resende Jr. M. Maize Genome Assembly with PacBio Reads. Bio Protoc 2022. [DOI: 10.21769/bioprotoc.4456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
|
15
|
Comprehensive transcriptome characterization of Grus japonensis using PacBio SMRT and Illumina sequencing. Sci Rep 2021; 11:23927. [PMID: 34907275 PMCID: PMC8671462 DOI: 10.1038/s41598-021-03474-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Accepted: 12/03/2021] [Indexed: 12/13/2022] Open
Abstract
The red-crowned crane (Grus japonensis) is an endangered species distributed across southeast Russia, northeast China, Korea, and Japan. Here, we sequenced for the first time the full-length unreferenced transcriptome of red-crowned crane mixed samples using a PacBio Sequel platform. A total of 359,136 circular consensus sequences (CCS) were obtained via clustering to remove redundancy. A total of 303,544 full-length non-chimeric sequences were identified by judging whether CCS contained 5' and 3' adapters, and the poly(A) tail. Eight samples were sequenced using Illumina, and PacBio sequencing data were corrected according to the collected Illumina data to obtain more accurate full-length transcripts. A total of 4,100 long non-coding RNAs, 13,115 simple sequences repeat loci and 29 transcription factor families were identified. The expression of lncRNAs and TFs in pancreas was lowest comparing with other tissues. Many enriched immune-related transmission pathways (MHC and IL receptors) were identified in the spleen. This study will contribute to a better understanding of the gene structure and post-transcriptional regulatory network, and provide references for future studies on red-crowned cranes.
Collapse
|
16
|
Yang C, Chowdhury D, Zhang Z, Cheung WK, Lu A, Bian Z, Zhang L. A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data. Comput Struct Biotechnol J 2021; 19:6301-6314. [PMID: 34900140 PMCID: PMC8640167 DOI: 10.1016/j.csbj.2021.11.028] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/17/2021] [Accepted: 11/17/2021] [Indexed: 12/16/2022] Open
Abstract
Metagenomic sequencing provides a culture-independent avenue to investigate the complex microbial communities by constructing metagenome-assembled genomes (MAGs). A MAG represents a microbial genome by a group of sequences from genome assembly with similar characteristics. It enables us to identify novel species and understand their potential functions in a dynamic ecosystem. Many computational tools have been developed to construct and annotate MAGs from metagenomic sequencing, however, there is a prominent gap to comprehensively introduce their background and practical performance. In this paper, we have thoroughly investigated the computational tools designed for both upstream and downstream analyses, including metagenome assembly, metagenome binning, gene prediction, functional annotation, taxonomic classification, and profiling. We have categorized the commonly used tools into unique groups based on their functional background and introduced the underlying core algorithms and associated information to demonstrate a comparative outlook. Furthermore, we have emphasized the computational requisition and offered guidance to the users to select the most efficient tools. Finally, we have indicated current limitations, potential solutions, and future perspectives for further improving the tools of MAG construction and annotation. We believe that our work provides a consolidated resource for the current stage of MAG studies and shed light on the future development of more effective MAG analysis tools on metagenomic sequencing.
Collapse
Key Words
- CNN, convolutional neural network
- DBG, De Bruijn graph
- GTDB, Genome Taxonomy Database
- Gene functional annotation
- Gene prediction
- Genome assembly
- HMM, Hidden Markov Model
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- LCA, lowest common ancestor
- LPA, label propagation algorithm
- MAGs, metagenome-assembled genomes
- Metagenome binning
- Metagenome-assembled genomes
- Metagenomic sequencing
- Microbial abundance profiling
- OLC, overlap-layout consensus
- ONT, Oxford Nanopore Technologies
- ORFs, open reading frames
- PacBio, Pacific Biosciences
- QC, quality control
- SLR, synthetic long reads
- TNFs, tetranucleotide frequencies
- Taxonomic classification
Collapse
Affiliation(s)
- Chao Yang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Debajyoti Chowdhury
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Institute of Integrated Bioinformedicine and Translational Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Zhenmiao Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - William K. Cheung
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Aiping Lu
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Institute of Integrated Bioinformedicine and Translational Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Zhaoxiang Bian
- Institute of Brain and Gut Research, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Chinese Medicine Clinical Study Center, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
| |
Collapse
|
17
|
Stephens Z, Milosevic D, Kipp B, Grebe S, Iyer RK, Kocher JPA. PB-Motif-A Method for Identifying Gene/Pseudogene Rearrangements With Long Reads: An Application to CYP21A2 Genotyping. Front Genet 2021; 12:716586. [PMID: 34394200 PMCID: PMC8355628 DOI: 10.3389/fgene.2021.716586] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/05/2021] [Indexed: 12/30/2022] Open
Abstract
Long read sequencing technologies have the potential to accurately detect and phase variation in genomic regions that are difficult to fully characterize with conventional short read methods. These difficult to sequence regions include several clinically relevant genes with highly homologous pseudogenes, many of which are prone to gene conversions or other types of complex structural rearrangements. We present PB-Motif, a new method for identifying rearrangements between two highly homologous genomic regions using PacBio long reads. PB-Motif leverages clustering and filtering techniques to efficiently report rearrangements in the presence of sequencing errors and other systematic artifacts. Supporting reads for each high-confidence rearrangement can then be used for copy number estimation and phased variant calling. First, we demonstrate PB-Motif's accuracy with simulated sequence rearrangements of PMS2 and its pseudogene PMS2CL using simulated reads sweeping over a range of sequencing error rates. We then apply PB-Motif to 26 clinical samples, characterizing CYP21A2 and its pseudogene CYP21A1P as part of a diagnostic assay for congenital adrenal hyperplasia. We successfully identify damaging variation and patient carrier status concordant with clinical diagnosis obtained from multiplex ligation-dependent amplification (MLPA) and Sanger sequencing. The source code is available at: github.com/zstephens/pb-motif.
Collapse
Affiliation(s)
- Zachary Stephens
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | | | | | | - Ravishankar K Iyer
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | |
Collapse
|
18
|
Baiakhmetov E, Guyomar C, Shelest E, Nobis M, Gudkova PD. The first draft genome of feather grasses using SMRT sequencing and its implications in molecular studies of Stipa. Sci Rep 2021; 11:15345. [PMID: 34321531 PMCID: PMC8319324 DOI: 10.1038/s41598-021-94068-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 06/24/2021] [Indexed: 11/22/2022] Open
Abstract
The Eurasian plant Stipa capillata is the most widespread species within feather grasses. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae.
Collapse
Affiliation(s)
- Evgenii Baiakhmetov
- Institute of Botany, Faculty of Biology, Jagiellonian University, Gronostajowa 3, 30-387, Kraków, Poland. .,Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.
| | - Cervin Guyomar
- German Centre for Integrative Biodiversity Research (iDiv), Puschstrasse 4, 04103, Leipzig, Germany.,Institute for Genetics, Environment and Plant Protection (IGEPP), Agrocampus Ouest, INRAE, University of Rennes 1, 35650, Le Rheu, France
| | - Ekaterina Shelest
- German Centre for Integrative Biodiversity Research (iDiv), Puschstrasse 4, 04103, Leipzig, Germany.,Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, PO1 2UP, UK
| | - Marcin Nobis
- Institute of Botany, Faculty of Biology, Jagiellonian University, Gronostajowa 3, 30-387, Kraków, Poland. .,Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.
| | - Polina D Gudkova
- Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.,Department of Biology, Altai State University, Lenin 61 Ave., Barnaul, Russia, 656049
| |
Collapse
|