1
|
Gao Z, Lu Y, Chong Y, Li M, Hong J, Wu J, Wu D, Xi D, Deng W. Beef Cattle Genome Project: Advances in Genome Sequencing, Assembly, and Functional Genes Discovery. Int J Mol Sci 2024; 25:7147. [PMID: 39000250 PMCID: PMC11240973 DOI: 10.3390/ijms25137147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 06/23/2024] [Accepted: 06/26/2024] [Indexed: 07/16/2024] Open
Abstract
Beef is a major global source of protein, playing an essential role in the human diet. The worldwide production and consumption of beef continue to rise, reflecting a significant trend. However, despite the critical importance of beef cattle resources in agriculture, the diversity of cattle breeds faces severe challenges, with many breeds at risk of extinction. The initiation of the Beef Cattle Genome Project is crucial. By constructing a high-precision functional annotation map of their genome, it becomes possible to analyze the genetic mechanisms underlying important traits in beef cattle, laying a solid foundation for breeding more efficient and productive cattle breeds. This review details advances in genome sequencing and assembly technologies, iterative upgrades of the beef cattle reference genome, and its application in pan-genome research. Additionally, it summarizes relevant studies on the discovery of functional genes associated with key traits in beef cattle, such as growth, meat quality, reproduction, polled traits, disease resistance, and environmental adaptability. Finally, the review explores the potential of telomere-to-telomere (T2T) genome assembly, structural variations (SVs), and multi-omics techniques in future beef cattle genetic breeding. These advancements collectively offer promising avenues for enhancing beef cattle breeding and improving genetic traits.
Collapse
Affiliation(s)
- Zhendong Gao
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Ying Lu
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Yuqing Chong
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Mengfei Li
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Jieyun Hong
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Jiao Wu
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Dongwang Wu
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Dongmei Xi
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
| | - Weidong Deng
- Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Kunming 650201, China
| |
Collapse
|
2
|
Safar HA, Alatar F, Mustafa AS. Three Rounds of Read Correction Significantly Improve Eukaryotic Protein Detection in ONT Reads. Microorganisms 2024; 12:247. [PMID: 38399651 PMCID: PMC10893331 DOI: 10.3390/microorganisms12020247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Revised: 01/19/2024] [Accepted: 01/23/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND Eukaryotes' whole-genome sequencing is crucial for species identification, gene detection, and protein annotation. Oxford Nanopore Technology (ONT) is an affordable and rapid platform for sequencing eukaryotes; however, the relatively higher error rates require computational and bioinformatic efforts to produce more accurate genome assemblies. Here, we evaluated the effect of read correction tools on eukaryote genome completeness, gene detection and protein annotation. METHODS Reads generated by ONT of four eukaryotes, C. albicans, C. gattii, S. cerevisiae, and P. falciparum, were assembled using minimap2 and underwent three rounds of read correction using flye, medaka and racon. The generates consensus FASTA files were compared for total length (bp), genome completeness, gene detection, and protein-annotation by QUAST, BUSCO, BRAKER1 and InterProScan, respectively. RESULTS Genome completeness was dependent on the assembly method rather than on the read correction tool; however, medaka performed better than flye and racon. Racon significantly performed better than flye and medaka in gene detection, while both racon and medaka significantly performed better than flye in protein-annotation. CONCLUSION We show that three rounds of read correction significantly affect gene detection and protein annotation, which are dependent on assembly quality in preference to assembly completeness.
Collapse
Affiliation(s)
- Hussain A. Safar
- OMICS Research Unit, Health Science Centre, Kuwait University, Kuwait City 13110, Kuwait;
| | - Fatemah Alatar
- Serology and Molecular Microbiology Reference Laboratory, Mubarak Al-Kabeer Hospital, Ministry of Health, Kuwait City 13110, Kuwait;
| | - Abu Salim Mustafa
- Department of Microbiology, Faculty of Medicine, Kuwait University, Kuwait City 13110, Kuwait
| |
Collapse
|
3
|
Drews SJ, Kjemtrup AM, Krause PJ, Lambert G, Leiby DA, Lewin A, O'Brien SF, Renaud C, Tonnetti L, Bloch EM. Transfusion-transmitted Babesia spp.: a changing landscape of epidemiology, regulation, and risk mitigation. J Clin Microbiol 2023; 61:e0126822. [PMID: 37750699 PMCID: PMC10595070 DOI: 10.1128/jcm.01268-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023] Open
Abstract
Babesia spp. are tick-borne parasites with a global distribution and diversity of vertebrate hosts. Over the next several decades, climate change is expected to impact humans, vectors, and vertebrate hosts and change the epidemiology of Babesia. Although humans are dead-end hosts for tick-transmitted Babesia, human-to-human transmission of Babesia spp. from transfusion of red blood cells and whole blood-derived platelet concentrates has been reported. In most patients, transfusion-transmitted Babesia (TTB) results in a moderate-to-severe illness. Currently, in North America, most cases of TTB have been described in the United States. TTB cases outside North America are rare, but case numbers may change over time with increased recognition of babesiosis and as the epidemiology of Babesia is impacted by climate change. Therefore, TTB is a concern of microbiologists working in blood operator settings, as well as in clinical settings where transfusion occurs. Microbiologists play an important role in deploying blood donor screening assays in Babesia endemic regions, identifying changing risks for Babesia in non-endemic areas, investigating recipients of blood products for TTB, and drafting TTB policies and guidelines. In this review, we provide an overview of the clinical presentation and epidemiology of TTB. We identify approaches and technologies to reduce the risk of collecting blood products from Babesia-infected donors and describe how investigations of TTB are undertaken. We also describe how microbiologists in Babesia non-endemic regions can assess for changing risks of TTB and decide when to focus on laboratory-test-based approaches or pathogen reduction to reduce TTB risk.
Collapse
Affiliation(s)
- Steven J. Drews
- Microbiology, Donation Policy and Studies, Canadian Blood Services, Edmonton, Alberta, Canada
- Department of Laboratory Medicine and Pathology, Division of Diagnostic and Applied Microbiology, University of Alberta, Edmonton, Alberta, Canada
| | - Anne M. Kjemtrup
- California Department of Public Health, Vector-Borne Disease Section, Sacramento, California, USA
| | - Peter J. Krause
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health and Yale School of Medicine, New Haven, Connecticut, USA
| | - Grayson Lambert
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health and Yale School of Medicine, New Haven, Connecticut, USA
| | - David A. Leiby
- Department of Microbiology, Immunology, and Tropical Medicine, George Washington University, Washington, USA
| | - Antoine Lewin
- Epidemiology, Surveillance and Biological Risk Assessment, Medical Affairs and Innovation, Héma-Québec, Montréal, Quebec, Canada
- Département d'Obstétrique et de Gynécologie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Sherbrooke, Quebec, Canada
| | - Sheila F. O'Brien
- Epidemiology and Surveillance, Canadian Blood Services, Donation Policy and Studies, Ottawa, Ontario, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada
| | - Christian Renaud
- Department of Microbiology, CHU Sainte-Justine, Université de Montréal, Montréal, Quebec, Canada
| | - Laura Tonnetti
- American Red Cross, Scientific Affairs, Holland Laboratories for the Biomedical Sciences, Rockville, Maryland, USA
| | - Evan M. Bloch
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| |
Collapse
|
4
|
Schelkunov MI. Mabs, a suite of tools for gene-informed genome assembly. BMC Bioinformatics 2023; 24:377. [PMID: 37794322 PMCID: PMC10548655 DOI: 10.1186/s12859-023-05499-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 09/26/2023] [Indexed: 10/06/2023] Open
Abstract
BACKGROUND Despite constantly improving genome sequencing methods, error-free eukaryotic genome assembly has not yet been achieved. Among other kinds of problems of eukaryotic genome assembly are so-called "haplotypic duplications", which may manifest themselves as cases of alleles being mistakenly assembled as paralogues. Haplotypic duplications are dangerous because they create illusions of gene family expansions and, thus, may lead scientists to incorrect conclusions about genome evolution and functioning. RESULTS Here, I present Mabs, a suite of tools that serve as parameter optimizers of the popular genome assemblers Hifiasm and Flye. By optimizing the parameters of Hifiasm and Flye, Mabs tries to create genome assemblies with the genes assembled as accurately as possible. Tests on 6 eukaryotic genomes showed that in 6 out of 6 cases, Mabs created assemblies with more accurately assembled genes than those generated by Hifiasm and Flye when they were run with default parameters. When assemblies of Mabs, Hifiasm and Flye were postprocessed by a popular tool for haplotypic duplication removal, Purge_dups, genes were better assembled by Mabs in 5 out of 6 cases. CONCLUSIONS Mabs is useful for making high-quality genome assemblies. It is available at https://github.com/shelkmike/Mabs.
Collapse
|
5
|
Vigil K, Aw TG. Comparison of de novo assembly using long-read shotgun metagenomic sequencing of viruses in fecal and serum samples from marine mammals. Front Microbiol 2023; 14:1248323. [PMID: 37808316 PMCID: PMC10556685 DOI: 10.3389/fmicb.2023.1248323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/04/2023] [Indexed: 10/10/2023] Open
Abstract
Introduction Viral diseases of marine mammals are difficult to study, and this has led to a limited knowledge on emerging known and unknown viruses which are ongoing threats to animal health. Viruses are the leading cause of infectious disease-induced mass mortality events among marine mammals. Methods In this study, we performed viral metagenomics in stool and serum samples from California sea lions (Zalophus californianus) and bottlenose dolphins (Tursiops truncates) using long-read nanopore sequencing. Two widely used long-read de novo assemblers, Canu and Metaflye, were evaluated to assemble viral metagenomic sequencing reads from marine mammals. Results Both Metaflye and Canu assembled similar viral contigs of vertebrates, such as Parvoviridae, and Poxviridae. Metaflye assembled viral contigs that aligned with one viral family that was not reproduced by Canu, while Canu assembled viral contigs that aligned with seven viral families that was not reproduced by Metaflye. Only Canu assembled viral contigs from dolphin and sea lion fecal samples that matched both protein and nucleotide RefSeq viral databases using BLASTx and BLASTn for Anelloviridae, Parvoviridae and Circoviridae families. Viral contigs assembled with Canu aligned with torque teno viruses and anelloviruses from vertebrate hosts. Viruses associated with invertebrate hosts including densoviruses, Ambidensovirus, and various Circoviridae isolates were also aligned. Some of the invertebrate and vertebrate viruses reported here are known to potentially cause mortality events and/or disease in different seals, sea stars, fish, and bivalve species. Discussion Canu performed better by producing the most viral contigs as compared to Metaflye with assemblies aligning to both protein and nucleotide databases. This study suggests that marine mammals can be used as important sentinels to surveil marine viruses that can potentially cause diseases in vertebrate and invertebrate hosts.
Collapse
Affiliation(s)
| | - Tiong Gim Aw
- Department of Environmental Health Sciences, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, United States
| |
Collapse
|
6
|
Lee Y, Woo DU, Kang YJ. SoyDBean: a database for SNPs reconciliation by multiple versions of soybean reference genomes. Sci Rep 2023; 13:15712. [PMID: 37735613 PMCID: PMC10514325 DOI: 10.1038/s41598-023-42898-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 09/15/2023] [Indexed: 09/23/2023] Open
Abstract
Due to the development of sequence technology and decreased cost, many whole genome sequences have been obtained. As a result, extensive genetic variations have been discovered from many populations and germplasms to understand the genetic diversity of soybean (Glycine max [L.] Merr.). However, assessing the quality of variation is essential because the published variants were collected using different bioinformatic methods and parameters. Furthermore, despite the enhanced genome contiguity and more efficient filling of "N" stretches in the new reference genome, there remains a dearth of endeavors to verify the caliber of variations present in it. The primary goal of this research was to discern a dependable set of SNPs that can withstand reconciliation across multiple reference genomes. Additionally, the investigation aimed to reconfirm the variations through the utilization of numerous whole genome sequencing data obtained from publicly available databases. Based on the result, we created datasets that comprised the thoroughly verified SNP coordinates between the reference assemblies. The resulting "SoyDBean" database is now publicly accessible through the following URL: http://soydbean.plantprofile.net/ .
Collapse
Affiliation(s)
- Yejin Lee
- Division of Bio and Medical Bigdata Department (BK4 Program), Gyeongsang National University, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea
- Division of Life Science Department, Gyeongsang National University, Jinju, Republic of Korea
| | - Dong U Woo
- Division of Bio and Medical Bigdata Department (BK4 Program), Gyeongsang National University, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea
- Division of Life Science Department, Gyeongsang National University, Jinju, Republic of Korea
| | - Yang Jae Kang
- Division of Bio and Medical Bigdata Department (BK4 Program), Gyeongsang National University, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea.
- Division of Life Science Department, Gyeongsang National University, Jinju, Republic of Korea.
| |
Collapse
|
7
|
Safar HA, Alatar F, Nasser K, Al-Ajmi R, Alfouzan W, Mustafa AS. The impact of applying various de novo assembly and correction tools on the identification of genome characterization, drug resistance, and virulence factors of clinical isolates using ONT sequencing. BMC Biotechnol 2023; 23:26. [PMID: 37525145 PMCID: PMC10391896 DOI: 10.1186/s12896-023-00797-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 07/21/2023] [Indexed: 08/02/2023] Open
Abstract
Oxford Nanopore sequencing technology (ONT) is currently widely used due to its affordability, simplicity, and reliability. Despite the advantage ONT has over next-generation sequencing in detecting resistance genes in mobile genetic elements, its relatively high error rate (10-15%) is still a deterrent. Several bioinformatic tools are freely available for raw data processing and obtaining complete and more accurate genome assemblies. In this study, we evaluated the impact of using mix-and-matched read assembly (Flye, Canu, Wtdbg2, and NECAT) and read correction (Medaka, NextPolish, and Racon) tools in generating complete and accurate genome assemblies, and downstream genomic analysis of nine clinical Escherichia coli isolates. Flye and Canu assemblers were the most robust in genome assembly, and Medaka and Racon correction tools significantly improved assembly parameters. Flye functioned well in pan-genome analysis, while Medaka increased the number of core genes detected. Flye, Canu, and NECAT assembler functioned well in detecting antimicrobial resistance genes (AMR), while Wtdbg2 required correction tools for better detection. Flye was the best assembler for detecting and locating both virulence and AMR genes (i.e., chromosomal vs. plasmid). This study provides insight into the performance of several read assembly and read correction tools for analyzing ONT sequencing reads for clinical isolates.
Collapse
Affiliation(s)
- Hussain A Safar
- OMICS Research Unit, Health Science Centre, Kuwait University, Hawalli Governorate, Kuwait
| | - Fatemah Alatar
- Serology and Molecular Microbiology Reference Laboratory, Mubarak Al-Kabeer Hospital, Ministry of Health, Hawalli Governorate, Kuwait
| | - Kother Nasser
- Serology and Molecular Microbiology Reference Laboratory, Mubarak Al-Kabeer Hospital, Ministry of Health, Hawalli Governorate, Kuwait
| | - Rehab Al-Ajmi
- Department of Microbiology, Faculty of Medicine, Kuwait University, Hawalli Governorate, Kuwait
| | - Wadha Alfouzan
- Department of Microbiology, Faculty of Medicine, Kuwait University, Hawalli Governorate, Kuwait
- Microbiology Unit, Farwaniya Hospital, Ministry of Health, Al Farwaniyah Governorate, Kuwait
| | - Abu Salim Mustafa
- Department of Microbiology, Faculty of Medicine, Kuwait University, Hawalli Governorate, Kuwait.
| |
Collapse
|
8
|
Spealman P, De T, Chuong JN, Gresham D. Best Practices in Microbial Experimental Evolution: Using Reporters and Long-Read Sequencing to Identify Copy Number Variation in Experimental Evolution. J Mol Evol 2023; 91:356-368. [PMID: 37012421 PMCID: PMC10275804 DOI: 10.1007/s00239-023-10102-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 02/21/2023] [Indexed: 04/05/2023]
Abstract
Copy number variants (CNVs), comprising gene amplifications and deletions, are a pervasive class of heritable variation. CNVs play a key role in rapid adaptation in both natural, and experimental, evolution. However, despite the advent of new DNA sequencing technologies, detection and quantification of CNVs in heterogeneous populations has remained challenging. Here, we summarize recent advances in the use of CNV reporters that provide a facile means of quantifying de novo CNVs at a specific locus in the genome, and nanopore sequencing, for resolving the often complex structures of CNVs. We provide guidance for the engineering and analysis of CNV reporters and practical guidelines for single-cell analysis of CNVs using flow cytometry. We summarize recent advances in nanopore sequencing, discuss the utility of this technology, and provide guidance for the bioinformatic analysis of these data to define the molecular structure of CNVs. The combination of reporter systems for tracking and isolating CNV lineages and long-read DNA sequencing for characterizing CNV structures enables unprecedented resolution of the mechanisms by which CNVs are generated and their evolutionary dynamics.
Collapse
Affiliation(s)
- Pieter Spealman
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Titir De
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Julie N Chuong
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - David Gresham
- Department of Biology, New York University, New York, NY, 10003, USA.
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|
9
|
De La Cerda GY, Landis JB, Eifler E, Hernandez AI, Li F, Zhang J, Tribble CM, Karimi N, Chan P, Givnish T, Strickler SR, Specht CD. Balancing read length and sequencing depth: Optimizing Nanopore long-read sequencing for monocots with an emphasis on the Liliales. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11524. [PMID: 37342170 PMCID: PMC10278932 DOI: 10.1002/aps3.11524] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/20/2023] [Accepted: 01/30/2023] [Indexed: 06/22/2023]
Abstract
PREMISE We present approaches used to generate long-read Nanopore sequencing reads for the Liliales and demonstrate how modifications to standard protocols directly impact read length and total output. The goal is to help those interested in generating long-read sequencing data determine which steps may be necessary for optimizing output and results. METHODS Four species of Calochortus (Liliaceae) were sequenced. Modifications made to sodium dodecyl sulfate (SDS) extractions and cleanup protocols included grinding with a mortar and pestle, using cut or wide-bore tips, chloroform cleaning, bead cleaning, eliminating short fragments, and using highly purified DNA. RESULTS Steps taken to maximize read length can decrease overall output. Notably, the number of pores in a flow cell is correlated with the overall output, yet we did not see an association between the pore number and the read length or the number of reads produced. DISCUSSION Many factors contribute to the overall success of a Nanopore sequencing run. We showed the direct impact that several modifications to the DNA extraction and cleaning steps have on the total sequencing output, read size, and number of reads generated. We show a tradeoff between read length and the number of reads and, to a lesser extent, the total sequencing output, all of which are important factors for successful de novo genome assembly.
Collapse
Affiliation(s)
- Gisel Y. De La Cerda
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| | - Jacob B. Landis
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Evan Eifler
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Adriana I. Hernandez
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| | - Fay‐Wei Li
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Jing Zhang
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Carrie M. Tribble
- School of Life SciencesUniversity of Hawaiʻi, MānoaHonoluluHawaiʻi96822USA
| | - Nisa Karimi
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Patricia Chan
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Thomas Givnish
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Susan R. Strickler
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
- Present address:
Plant Science and ConservationChicago Botanic GardenGlencoeIllinois60022USA
- Present address:
Plant Biology and Conservation ProgramNorthwestern UniversityEvanstonIllinois60208USA
| | - Chelsea D. Specht
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| |
Collapse
|
10
|
Wang J, Chen K, Yang J, Zhang S, Li Y, Liu G, Luo J, Yin H, Wang G, Guan G. Comparative genomic analysis of Babesia duncani responsible for human babesiosis. BMC Biol 2022; 20:153. [PMID: 35790982 PMCID: PMC9258201 DOI: 10.1186/s12915-022-01361-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 06/23/2022] [Indexed: 11/29/2022] Open
Abstract
Background Human babesiosis, caused by parasites of the genus Babesia, is an emerging and re-emerging tick-borne disease that is mainly transmitted by tick bites and infected blood transfusion. Babesia duncani has caused majority of human babesiosis in Canada; however, limited data are available to correlate its genomic information and biological features. Results We generated a B. duncani reference genome using Oxford Nanopore Technology (ONT) and Illumina sequencing technology and uncovered its biological features and phylogenetic relationship with other Apicomplexa parasites. Phylogenetic analyses revealed that B. duncani form a clade distinct from B. microti, Babesia spp. infective to bovine and ovine species, and Theileria spp. infective to bovines. We identified the largest species-specific gene family that could be applied as diagnostic markers for this pathogen. In addition, two gene families show signals of significant expansion and several genes that present signatures of positive selection in B. duncani, suggesting their possible roles in the capability of this parasite to infect humans or tick vectors. Conclusions Using ONT sequencing and Illumina sequencing technologies, we provide the first B. duncani reference genome and confirm that B. duncani forms a phylogenetically distinct clade from other Piroplasm parasites. Comparative genomic analyses show that two gene families are significantly expanded in B. duncani and may play important roles in host cell invasion and virulence of B. duncani. Our study provides basic information for further exploring B. duncani features, such as host-parasite and tick-parasite interactions. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01361-9.
Collapse
Affiliation(s)
- Jinming Wang
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China.
| | - Kai Chen
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Jifei Yang
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China
| | - Shangdi Zhang
- Department of Clinical Laboratory, The Second Hospital of Lanzhou University, Lanzhou, 730030, China
| | - Youquan Li
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China
| | - Guangyuan Liu
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China
| | - Jianxun Luo
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China
| | - Hong Yin
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China.,Jiangsu Co-Innovation Center for the Prevention and Control of Important Animal Infectious Disease and Zoonoses, Yangzhou University, Yangzhou, 225009, China
| | - Guangying Wang
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China.
| | - Guiquan Guan
- State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Science, Lanzhou, 730046, Gansu, China.
| |
Collapse
|
11
|
Faulk C. De novo sequencing, diploid assembly, and annotation of the black carpenter ant, Camponotus pennsylvanicus, and its symbionts by one person for $1000, using nanopore sequencing. Nucleic Acids Res 2022; 51:17-28. [PMID: 35724982 PMCID: PMC9841434 DOI: 10.1093/nar/gkac510] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 05/19/2022] [Accepted: 05/31/2022] [Indexed: 02/07/2023] Open
Abstract
The black carpenter ant (Camponotus pennsylvanicus) is a pest species found widely throughout North America. From a single individual I used long-read nanopore sequencing to assemble a phased diploid genome of 306 Mb and 60X coverage, with quality assessed by a 97.0% BUSCO score, improving upon other ant assemblies. The mitochondrial genome reveals minor rearrangements from other ants. The reads also allowed assembly of parasitic and symbiont genomes. I include a complete Wolbachia bacterial assembly with a size of 1.2 Mb, as well as a commensal symbiont Blochmannia pennsylvanicus, at 791 kb. DNA methylation and hydroxymethylation were measured at base-pair resolution level from the same reads and confirmed extremely low levels seen in the Formicidae family. There was moderate heterozygosity, with 0.16% of bases being biallelic from the parental haplotypes. Protein prediction yielded 14 415 amino acid sequences with 95.8% BUSCO score and 86% matching to previously known proteins. All assemblies were derived from a single MinION flow cell generating 20 Gb of sequence for a cost of $1047 including consumable reagents. Adding fixed costs for equipment brings the total for an ant-sized genome to less than $5000. All analyses were performed in 1 week on a single desktop computer.
Collapse
|
12
|
Petersen C, Sørensen T, Westphal KR, Fechete LI, Sondergaard TE, Sørensen JL, Nielsen KL. High molecular weight DNA extraction methods lead to high quality filamentous ascomycete fungal genome assemblies using Oxford Nanopore sequencing. Microb Genom 2022; 8. [PMID: 35438621 PMCID: PMC9453082 DOI: 10.1099/mgen.0.000816] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
During the last two decades, whole-genome sequencing has revolutionized genetic research in all kingdoms, including fungi. More than 1000 fungal genomes have been submitted to sequence databases, mostly obtained through second generation short-read DNA sequencing. As a result, highly fragmented genome drafts have typically been obtained. However, with the emergence of third generation long-read DNA sequencing, the assembly challenge can be overcome and highly contiguous assemblies obtained. Such attractive results, however, are extremely dependent on the ability to extract highly purified high molecular weight (HMW) DNA. Extraction of such DNA is currently a significant challenge for all species with cell walls, not least fungi. In this study, four isolates of filamentous ascomycetes (Apiospora pterospermum, Aspergillus sp. (subgen. Cremei), Aspergillus westerdijkiae, and Penicillium aurantiogriseum) were used to develop extraction and purification methods that result in HMW DNA suitable for third generation sequencing. We have tested and propose two straightforward extraction methods based on treatment with either a commercial kit or traditional phenol-chloroform extraction both in combination with a single commercial purification method that result in high quality HMW DNA from filamentous ascomycetes. Our results demonstrated that using these DNA extraction methods and coverage, above 75 x of our haploid filamentous ascomycete fungal genomes result in complete and contiguous assemblies.
Collapse
Affiliation(s)
- Celine Petersen
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| | - Trine Sørensen
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| | - Klaus R Westphal
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| | - Lavinia I Fechete
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| | - Teis E Sondergaard
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| | - Jens L Sørensen
- Department of Chemistry and Bioscience, Niels-Bohrs Vej 8, 6700 Esbjerg, Denmark, Aalborg University
| | - Kåre L Nielsen
- Department of Chemistry and Bioscience, Fredrik Bajers Vej 7H, 9220 Aalborg, Denmark, Aalborg University
| |
Collapse
|