1
|
Ong CT, Blackall PJ, Boe-Hansen GB, deWet S, Hayes BJ, Indjein L, Korolik V, Minchin C, Nguyen LT, Nordin Y, Siddle H, Turni C, Venus B, Westman ME, Zhang Z, Tabor AE. Whole-genome comparison using complete genomes from Campylobacter fetus strains revealed single nucleotide polymorphisms on non-genomic islands for subspecies differentiation. Front Microbiol 2024; 15:1452564. [PMID: 39328909 PMCID: PMC11424552 DOI: 10.3389/fmicb.2024.1452564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 08/27/2024] [Indexed: 09/28/2024] Open
Abstract
Introduction Bovine Genital Campylobacteriosis (BGC), caused by Campylobacter fetus subsp. venerealis, is a sexually transmitted bacterium that significantly impacts cattle reproductive performance. However, current detection methods lack consistency and reliability due to the close genetic similarity between C. fetus subsp. venerealis and C. fetus subsp. fetus. Therefore, this study aimed to utilize complete genome analysis to distinguish genetic features between C. fetus subsp. venerealis and other subspecies, thereby enhancing BGC detection for routine screening and epidemiological studies. Methods and results This study reported the complete genomes of four C. fetus subsp. fetus and five C. fetus subsp. venerealis, sequenced using long-read sequencing technologies. Comparative whole-genome analyses (n = 25) were conducted, incorporating an additional 16 complete C. fetus genomes from the NCBI database, to investigate the genomic differences between these two closely related C. fetus subspecies. Pan-genomic analyses revealed a core genome consisting of 1,561 genes and an accessory pangenome of 1,064 genes between the two C. fetus subspecies. However, no unique predicted genes were identified in either subspecies. Nonetheless, whole-genome single nucleotide polymorphisms (SNPs) analysis identified 289 SNPs unique to one or the C. fetus subspecies. After the removal of SNPs located on putative genomic islands, recombination sites, and those causing synonymous amino acid changes, the remaining 184 SNPs were functionally annotated. Candidate SNPs that were annotated with the KEGG "Peptidoglycan Biosynthesis" pathway were recruited for further analysis due to their potential association with the glycine intolerance characteristic of C. fetus subsp. venerealis and its biovar variant. Verification with 58 annotated C. fetus genomes, both complete and incomplete, from RefSeq, successfully classified these seven SNPs into two groups, aligning with their phenotypic identification as CFF (Campylobacter fetus subsp. fetus) or CFV/CFVi (Campylobacter fetus subsp. venerealis and its biovar variant). Furthermore, we demonstrated the application of mraY SNPs for detecting C. fetus subspecies using a quantitative PCR assay. Discussion Our results highlighted the high genetic stability of C. fetus subspecies. Nevertheless, Campylobacter fetus subsp. venerealis and its biovar variants encoded common SNPs in genes related to glycine intolerance, which differentiates them from C. fetus subsp. fetus. This discovery highlights the potential of employing a multiple-SNP assay for the precise differentiation of C. fetus subspecies.
Collapse
Affiliation(s)
- Chian Teng Ong
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Patrick J Blackall
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, Dutton Park, QLD, Australia
| | - Gry B Boe-Hansen
- School of Veterinary Science, The University of Queensland, Gatton, QLD, Australia
| | - Sharon deWet
- Department of Agriculture and Fisheries, Biosecurity Sciences Laboratory, Coopers Plains, QLD, Australia
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Lea Indjein
- School of Veterinary Science, The University of Queensland, Gatton, QLD, Australia
| | - Victoria Korolik
- Institute for Glycomics, Griffith University, Nathan, QLD, Australia
| | - Catherine Minchin
- Department of Agriculture and Fisheries, Agri-Science Queensland, Animal Science, Dutton Park, QLD, Australia
| | - Loan To Nguyen
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Yusralimuna Nordin
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Hannah Siddle
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Conny Turni
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, Dutton Park, QLD, Australia
| | - Bronwyn Venus
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Mark E Westman
- Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Menangle, NSW, Australia
| | - Zhetao Zhang
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
| | - Ala E Tabor
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, QLD, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD, Australia
| |
Collapse
|
2
|
Sapan V, Simsek SZ, Filoğlu G, Bulbul O. Forensic DNA phenotyping using Oxford Nanopore Sequencing system. Electrophoresis 2024. [PMID: 38794987 DOI: 10.1002/elps.202300252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 03/27/2024] [Accepted: 05/08/2024] [Indexed: 05/27/2024]
Abstract
In forensic science, the demand for precision, consistency, and cost-effectiveness has driven the exploration of next-generation sequencing technologies. This study investigates the potential of Oxford Nanopore Sequencing (ONT) Technology for analyzing the HIrisPlex-S panel, a set of 41 single nucleotide polymorphism (SNP) markers used to predict eye, hair, and skin color. Using ONT sequencing, we assessed the accuracy and reliability of ONT-generated data by comparing it with conventional capillary electrophoresis (CE) in 18 samples. The Guppy v6.1 was used as a basecaller, and sample profiles were obtained using Burrows-Wheeler Aligner, Samtools, BCFtools, and Python. Comparing accuracy with CE, we found that 62% of SNPs in ONT-unligated samples were correctly genotyped, with 36% showing allele dropout, and 2% being incorrectly genotyped. In the ONT-ligated samples, 85% of SNPs were correctly genotyped, with 10% showing allele dropout, and 5% being incorrectly genotyped. Our findings indicate that ONT, particularly when combined with ligation, enhances genotyping accuracy and coverage, thereby reducing allele dropouts. However, challenges associated with the technology's error rates and the impact on genotyping accuracy are recognized. Phenotype predictions based on ONT data demonstrate varying degrees of success, with the technology showing high accuracy in several cases. Although ONT technology holds promise in forensic genetics, further optimization and quality control measures are essential to harness its full potential. This study contributes to the ongoing efforts to refine sequence read tuning and improve correction tools in the context of ONT technology's application in forensic genetics.
Collapse
Affiliation(s)
- Veysel Sapan
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Sumeyye Zulal Simsek
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Gonul Filoğlu
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Ozlem Bulbul
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| |
Collapse
|
3
|
Gayathiri E, Prakash P, Kumaravel P, Jayaprakash J, Ragunathan MG, Sankar S, Pandiaraj S, Thirumalaivasan N, Thiruvengadam M, Govindasamy R. Computational approaches for modeling and structural design of biological systems: A comprehensive review. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2023; 185:17-32. [PMID: 37821048 DOI: 10.1016/j.pbiomolbio.2023.08.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 08/14/2023] [Accepted: 08/27/2023] [Indexed: 10/13/2023]
Abstract
The convergence of biology and computational science has ushered in a revolutionary era, revolutionizing our understanding of biological systems and providing novel solutions to global problems. The field of genetic engineering has facilitated the manipulation of genetic codes, thus providing opportunities for the advancement of innovative disease therapies and environmental enhancements. The emergence of bio-molecular simulation represents a significant advancement in this particular field, as it offers the ability to gain microscopic insights into molecular-level biological processes over extended periods. Biomolecular simulation plays a crucial role in advancing our comprehension of organismal mechanisms by establishing connections between molecular structures, interactions, and biological functions. The field of computational biology has demonstrated its significance in deciphering intricate biological enigmas through the utilization of mathematical models and algorithms. The process of decoding the human genome has resulted in the advancement of therapies for a wide range of genetic disorders, while the simulation of biological systems contributes to the identification of novel pharmaceutical compounds. The potential of biomolecular simulation and computational biology is vast and limitless. As the exploration of the underlying principles that govern living organisms progresses, the potential impact of this understanding on cancer treatment, environmental restoration, and other domains is anticipated to be transformative. This review examines the notable advancements achieved in the field of computational biology, emphasizing its potential to revolutionize the comprehension and enhancement of biological systems.
Collapse
Affiliation(s)
- Ekambaram Gayathiri
- Department of Plant Biology and Plant Biotechnology, Guru Nanak College (Autonomous), Chennai, 42, Tamil Nadu, India
| | - Palanisamy Prakash
- Department of Botany, Periyar University, Periyar Palkalai Nagar, Salem, 636011, Tamil Nadu, India
| | - Priya Kumaravel
- Department of Biotechnology, St. Joseph College (Arts & Science), Kovur, Chennai, Tamil Nadu, India
| | - Jayanthi Jayaprakash
- Department of Advanced Zoology and Biotechnology, Guru Nanak College, Chennai, Tamil Nadu, India
| | | | - Sharmila Sankar
- Department of Advanced Zoology and Biotechnology, Guru Nanak College, Chennai, Tamil Nadu, India
| | - Saravanan Pandiaraj
- Department of Self-Development Skills, King Saud University, P.O. Box 2455, Riyadh, 11451, Saudi Arabia
| | - Natesan Thirumalaivasan
- Department of Periodontics, Saveetha Dental College, and Hospitals, Saveetha Institute of Medical and Technical Sciences (SIMTAS), Chennai, 600077, Tamil Nadu, India
| | - Muthu Thiruvengadam
- Department of Applied Bioscience, College of Life and Environmental Sciences, Konkuk University, Seoul, 05029, South Korea
| | - Rajakumar Govindasamy
- Department of Orthodontics, Saveetha Dental College and Hospitals, Saveetha University, Chennai, India.
| |
Collapse
|
4
|
González-Recio O, López-Catalina A, Peiró-Pastor R, Nieto-Valle A, Castro M, Fernández A. Evaluating the potential of (epi)genotype-by-low pass nanopore sequencing in dairy cattle: a study on direct genomic value and methylation analysis. J Anim Sci Biotechnol 2023; 14:98. [PMID: 37434255 PMCID: PMC10337168 DOI: 10.1186/s40104-023-00896-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 05/17/2023] [Indexed: 07/13/2023] Open
Abstract
BACKGROUND Genotype-by-sequencing has been proposed as an alternative to SNP genotyping arrays in genomic selection to obtain a high density of markers along the genome. It requires a low sequencing depth to be cost effective, which may increase the error at the genotype assigment. Third generation nanopore sequencing technology offers low cost sequencing and the possibility to detect genome methylation, which provides added value to genotype-by-sequencing. The aim of this study was to evaluate the performance of genotype-by-low pass nanopore sequencing for estimating the direct genomic value in dairy cattle, and the possibility to obtain methylation marks simultaneously. RESULTS Latest nanopore chemistry (LSK14 and Q20) achieved a modal base calling accuracy of 99.55%, whereas previous kit (LSK109) achieved slightly lower accuracy (99.1%). The direct genomic value accuracy from genotype-by-low pass sequencing ranged between 0.79 and 0.99, depending on the trait (milk, fat or protein yield), with a sequencing depth as low as 2 × and using the latest chemistry (LSK114). Lower sequencing depth led to biased estimates, yet with high rank correlations. The LSK109 and Q20 achieved lower accuracies (0.57-0.93). More than one million high reliable methylated sites were obtained, even at low sequencing depth, located mainly in distal intergenic (87%) and promoter (5%) regions. CONCLUSIONS This study showed that the latest nanopore technology in useful in a LowPass sequencing framework to estimate direct genomic values with high reliability. It may provide advantages in populations with no available SNP chip, or when a large density of markers with a wide range of allele frequencies is needed. In addition, low pass sequencing provided nucleotide methylation status of > 1 million nucleotides at ≥ 10 × , which is an added value for epigenetic studies.
Collapse
Affiliation(s)
- Oscar González-Recio
- Dpt. Mejora Genética Animal, INIA-CSIC, Ctra La Coruña Km 7.5, 28040, Madrid, Spain.
| | | | - Ramón Peiró-Pastor
- Dpt. Mejora Genética Animal, INIA-CSIC, Ctra La Coruña Km 7.5, 28040, Madrid, Spain
| | - Alicia Nieto-Valle
- ETSIAAB, Universidad Politécnica de Madrid. Ciudad Universitaria S/N, 28040, Madrid, Spain
| | - Monica Castro
- Dpt. Mejora Genética Animal, INIA-CSIC, Ctra La Coruña Km 7.5, 28040, Madrid, Spain
| | - Almudena Fernández
- Dpt. Mejora Genética Animal, INIA-CSIC, Ctra La Coruña Km 7.5, 28040, Madrid, Spain
| |
Collapse
|
5
|
Gurgul A, Jasielczuk I, Szmatoła T, Sawicki S, Semik-Gurgul E, Długosz B, Bugno-Poniewierska M. Application of Nanopore Sequencing for High Throughput Genotyping in Horses. Animals (Basel) 2023; 13:2227. [PMID: 37444025 DOI: 10.3390/ani13132227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 07/03/2023] [Accepted: 07/04/2023] [Indexed: 07/15/2023] Open
Abstract
Nanopore sequencing is a third-generation biopolymer sequencing technique that relies on monitoring the changes in an electrical current that occur as nucleic acids are passed through a protein nanopore. Increasing quality of reads generated by nanopore sequencing systems encourages their application in genome-wide polymorphism detection and genotyping. In this study, we employed nanopore sequencing to identify genome-wide polymorphisms in the horse genome. To reduce the size and complexity of genome fragments for sequencing in a simple and cost-efficient manner, we amplified random DNA fragments using a modified DOP-PCR and sequenced the resulting products using the MinION system. After initial filtering, this generated 28,426 polymorphisms, which were validated at a 3% error rate. Upon further filtering for polymorphism and reproducibility, we identified 9495 SNPs that reflected the horse population structure. To conclude, the use of nanopore sequencing, in conjunction with a genome enrichment step, is a promising tool that can be practical in a variety of applications, including genotyping, population genomics, association studies, linkage mapping, and potentially genomic selection.
Collapse
Affiliation(s)
- Artur Gurgul
- Center of Experimental and Innovative Medicine, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| | - Igor Jasielczuk
- Center of Experimental and Innovative Medicine, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| | - Tomasz Szmatoła
- Center of Experimental and Innovative Medicine, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| | - Sebastian Sawicki
- Department of Animal Reproduction, Anatomy and Genomics, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| | - Ewelina Semik-Gurgul
- Department of Animal Molecular Biology, National Research Institute of Animal Production, Krakowska 1, 32-083 Balice, Poland
| | - Bogusława Długosz
- Department of Animal Reproduction, Anatomy and Genomics, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| | - Monika Bugno-Poniewierska
- Department of Animal Reproduction, Anatomy and Genomics, University of Agriculture in Krakow, al. Mickiewicza 24/28, 30-059 Krakow, Poland
| |
Collapse
|
6
|
Bai S, Han X, Feng D. Shoot-root signal circuit: Phytoremediation of heavy metal contaminated soil. FRONTIERS IN PLANT SCIENCE 2023; 14:1139744. [PMID: 36890896 PMCID: PMC9987563 DOI: 10.3389/fpls.2023.1139744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 02/08/2023] [Indexed: 06/18/2023]
Abstract
High concentrations of heavy metals in the environment will cause serious harm to ecosystems and human health. It is urgent to develop effective methods to control soil heavy metal pollution. Phytoremediation has advantages and potential for soil heavy metal pollution control. However, the current hyperaccumulators have the disadvantages of poor environmental adaptability, single enrichment species and small biomass. Based on the concept of modularity, synthetic biology makes it possible to design a wide range of organisms. In this paper, a comprehensive strategy of "microbial biosensor detection - phytoremediation - heavy metal recovery" for soil heavy metal pollution control was proposed, and the required steps were modified by using synthetic biology methods. This paper summarizes the new experimental methods that promote the discovery of synthetic biological elements and the construction of circuits, and combs the methods of producing transgenic plants to facilitate the transformation of constructed synthetic biological vectors. Finally, the problems that should be paid more attention to in the remediation of soil heavy metal pollution based on synthetic biology were discussed.
Collapse
Affiliation(s)
- Shiyan Bai
- College of Biological Science and Engineering, Fuzhou University, Fujian, China
| | - Xiao Han
- College of Biological Science and Engineering, Fuzhou University, Fujian, China
| | - Dan Feng
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
7
|
Khan A, Korban SS. Breeding and genetics of disease resistance in temperate fruit trees: challenges and new opportunities. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:3961-3985. [PMID: 35441862 DOI: 10.1007/s00122-022-04093-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 03/29/2022] [Indexed: 06/14/2023]
Abstract
Climate change, large monocultures of disease-susceptible cultivars, overuse of pesticides, and the emergence of new pathogens or pathogenic strains causing economic losses are all major threats to our environment, health, food, and nutritional supply. Temperate tree fruit crops belonging to the Rosaceae family are the most economically important and widely grown fruit crops. These long-lived crops are under attack from many different pathogens, incurring major economic losses. Multiple chemical sprays to control various diseases annually is a common practice, resulting in significant input costs, as well as environmental and health concerns. Breeding for disease resistance has been undertaken primarily in pome fruit crops (apples and pears) for a few fungal and bacterial diseases, and to a lesser extent in some stone fruit crops. These breeding efforts have taken multiple decades due to the biological constraints and complex genetics of these tree fruit crops. Over the past couple of decades, major advances have been made in genetic and physical mapping, genomics, biotechnology, genome sequencing, and phenomics, along with accumulation of large germplasm collections in repositories. These valuable resources offer opportunities to make significant advances in greatly reducing the time needed to either develop new cultivars or modify existing economic cultivars for enhanced resistance to multiple diseases. This review will cover current knowledge, challenges, and opportunities in breeding for disease resistance in temperate tree fruit crops.
Collapse
Affiliation(s)
- Awais Khan
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, NY, 14456, USA.
| | - Schuyler S Korban
- Department of Natural Sciences and Environmental Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| |
Collapse
|
8
|
Liu G, Mullan D, Zhang A, Liu H, Liu D, Yan G. Identification of KASP markers and putative genes for pre-harvest sprouting resistance in common wheat (Triticum aestivum L.). THE CROP JOURNAL 2022. [DOI: 10.1016/j.cj.2022.09.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
9
|
Wakita S, Hara M, Kitabatake Y, Kawatani K, Kurahashi H, Hashizume R. Experimental method for haplotype phasing across the entire length of chromosome 21 in trisomy 21 cells using a chromosome elimination technique. J Hum Genet 2022; 67:565-572. [PMID: 35637312 PMCID: PMC9510051 DOI: 10.1038/s10038-022-01049-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 04/25/2022] [Accepted: 05/12/2022] [Indexed: 11/09/2022]
Abstract
Modern sequencing technologies produce a single consensus sequence without distinguishing between homologous chromosomes. Haplotype phasing solves this limitation by identifying alleles on the maternal and paternal chromosomes. This information is critical for understanding gene expression models in genetic disease research. Furthermore, the haplotype phasing of three homologous chromosomes in trisomy cells is more complicated than that in disomy cells. In this study, we attempted the accurate and complete haplotype phasing of chromosome 21 in trisomy 21 cells. To separate homologs, we established three corrected disomy cell lines (ΔPaternal chromosome, ΔMaternal chromosome 1, and ΔMaternal chromosome 2) from trisomy 21 induced pluripotent stem cells by eliminating one chromosome 21 utilizing the Cre-loxP system. These cells were then whole-genome sequenced by a next-generation sequencer. By simply comparing the base information of the whole-genome sequence data at the same position between each corrected disomy cell line, we determined the base on the eliminated chromosome and performed phasing. We phased 51,596 single nucleotide polymorphisms (SNPs) on chromosome 21, randomly selected seven SNPs spanning the entire length of the chromosome, and confirmed that there was no contradiction by direct sequencing.
Collapse
Affiliation(s)
- Sachiko Wakita
- Department of Pathology and Matrix Biology, Mie University Graduate School of Medicine, Mie, Japan
| | - Mari Hara
- Department of Pathology and Matrix Biology, Mie University Graduate School of Medicine, Mie, Japan
| | - Yasuji Kitabatake
- Department of Pediatrics, Graduate School of Medicine, Osaka University, Suita, Osaka, Japan
| | - Keiji Kawatani
- Department of Pediatrics, Graduate School of Medicine, Osaka University, Suita, Osaka, Japan.,Department of Neuroscience, Mayo Clinic, Scottsdale, AZ, USA
| | - Hiroki Kurahashi
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, Japan
| | - Ryotaro Hashizume
- Department of Pathology and Matrix Biology, Mie University Graduate School of Medicine, Mie, Japan. .,Department of Genomic Medicine, Mie University Hospital, Mie, Japan.
| |
Collapse
|
10
|
Dorfner M, Ott T, Ott P, Oberprieler C. Long-read genotyping with SLANG (Simple Long-read loci Assembly of Nanopore data for Genotyping). APPLICATIONS IN PLANT SCIENCES 2022; 10:e11484. [PMID: 35774992 PMCID: PMC9215276 DOI: 10.1002/aps3.11484] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 04/05/2022] [Accepted: 04/11/2022] [Indexed: 06/15/2023]
Abstract
Premise Most phylogenomic library preparation methods and bioinformatic analysis tools in restriction site-associated DNA sequencing (RADseq)/genotyping-by-sequencing (GBS) studies are designed for use with Illumina data. The lack of alternative bioinformatic pipelines hinders the exploration of long-read multi-locus data from other sequencing platforms. The Simple Long-read loci Assembly of Nanopore data for Genotyping (SLANG) pipeline enables locus assembly, orthology estimation, and single-nucleotide polymorphism (SNP) calling using Nanopore-sequenced multi-locus data. Methods and Results Two test libraries (Leucanthemum spp., Senecio spp.; Compositae) were prepared using an amplified fragment length polymorphism (AFLP)-based method to reduce genome complexity, then Nanopore-sequenced, and analyzed with SLANG. We identified 704 and 448 orthologous loci with 12,368 and 10,048 SNPs, respectively. The constructed phylogenetic networks were identical to a GBS network produced using Leucanthemum Illumina data and were consistent with Senecio species circumscriptions based on morphology. Conclusions SLANG identifies orthologous loci and extracts SNPs from long-read multi-locus Nanopore data for phylogenetic inference, population genetics, or phylogeographical studies. Combined with an AFLP-based library preparation, SLANG provides an easily scalable, cost-effective, and affordable alternative to Illumina-based RADseq/GBS procedures.
Collapse
Affiliation(s)
- Marco Dorfner
- Institute of Plant Sciences, University of Regensburg, Universitätsstraße 31RegensburgBYD‐93053Germany
| | - Tankred Ott
- Institute of Plant Sciences, University of Regensburg, Universitätsstraße 31RegensburgBYD‐93053Germany
| | - Philipp Ott
- Institute of Plant Sciences, University of Regensburg, Universitätsstraße 31RegensburgBYD‐93053Germany
| | - Christoph Oberprieler
- Institute of Plant Sciences, University of Regensburg, Universitätsstraße 31RegensburgBYD‐93053Germany
| |
Collapse
|
11
|
Huang B, Wei G, Wang B, Ju F, Zhong Y, Shi Z, Sun S, Bu D. Filling gaps of genome scaffolds via probabilistic searching optical maps against assembly graph. BMC Bioinformatics 2021; 22:533. [PMID: 34717539 PMCID: PMC8557617 DOI: 10.1186/s12859-021-04448-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 10/18/2021] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Optical maps record locations of specific enzyme recognition sites within long genome fragments. This long-distance information enables aligning genome assembly contigs onto optical maps and ordering contigs into scaffolds. The generated scaffolds, however, often contain a large amount of gaps. To fill these gaps, a feasible way is to search genome assembly graph for the best-matching contig paths that connect boundary contigs of gaps. The combination of searching and evaluation procedures might be "searching followed by evaluation", which is infeasible for long gaps, or "searching by evaluation", which heavily relies on heuristics and thus usually yields unreliable contig paths. RESULTS We here report an accurate and efficient approach to filling gaps of genome scaffolds with aids of optical maps. Using simulated data from 12 species and real data from 3 species, we demonstrate the successful application of our approach in gap filling with improved accuracy and completeness of genome scaffolds. CONCLUSION Our approach applies a sequential Bayesian updating technique to measure the similarity between optical maps and candidate contig paths. Using this similarity to guide path searching, our approach achieves higher accuracy than the existing "searching by evaluation" strategy that relies on heuristics. Furthermore, unlike the "searching followed by evaluation" strategy enumerating all possible paths, our approach prunes the unlikely sub-paths and extends the highly-probable ones only, thus significantly increasing searching efficiency.
Collapse
Affiliation(s)
- Bin Huang
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Guozheng Wei
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Bing Wang
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Fusong Ju
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Yi Zhong
- School of Computer Science, University of Washington, Seattle, 98195 USA
| | - Zhuozheng Shi
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, 92093 USA
| | - Shiwei Sun
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
- Zhongke Big Data Academy, Zhengzhou, 450046 Henan China
| | - Dongbo Bu
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
- Institute of Biology, University of Chinese Academy of Sciences, Beijing, 100049 China
- Zhongke Big Data Academy, Zhengzhou, 450046 Henan China
| |
Collapse
|
12
|
Begum G, Albanna A, Bankapur A, Nassir N, Tambi R, Berdiev BK, Akter H, Karuvantevida N, Kellam B, Alhashmi D, Sung WWL, Thiruvahindrapuram B, Alsheikh-Ali A, Scherer SW, Uddin M. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. Int J Mol Sci 2021; 22:2060. [PMID: 33669700 PMCID: PMC7923155 DOI: 10.3390/ijms22042060] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 01/21/2021] [Accepted: 01/27/2021] [Indexed: 12/17/2022] Open
Abstract
The advent of long-read sequencing offers a new assessment method of detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorders (ASD) cases where pathogenic variants fail to be found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing on an Emirati family having three children with ASD using long and short-read sequencing technology. A series of analytical pipelines were established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, we observed that long-read sequencing technology (987 variants) detected a significantly higher number of SVs when compared to variants detected using short-read technology (509 variants) (p-value < 1.1020 × 10-57). Further comparison showed 97.9% of long-read sequencing variants were spanning within the 1-100 kb size range (p-value < 9.080 × 10-67) and impacting over 5000 genes. Moreover, long-read variants detected 604 non-coding RNAs (p-value < 9.02 × 10-9), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has shown to be a reliable technology in detecting SVs impacting complex elements of the genome.
Collapse
Affiliation(s)
- Ghausia Begum
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Ammar Albanna
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
- Department of Psychiatry, Al Jalila Children’s Specialty Hospital, Dubai 7662, United Arab Emirates
| | - Asma Bankapur
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Nasna Nassir
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Richa Tambi
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Bakhrom K. Berdiev
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Hosneara Akter
- Genetics and Genomic Medicine Centre, NeuroGen Children’s Healthcare, Dhaka 1205, Bangladesh;
- Department of Biochemistry and Molecular Biology, Dhaka University, Dhaka 1000, Bangladesh
| | - Noushad Karuvantevida
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
- Department of Biotechnology, Bharathidasan University, Tiruchirappalli 620024, India
| | - Barbara Kellam
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, ON M5S 1A1, Canada; (B.K.); (W.W.L.S.); (B.T.)
| | - Deena Alhashmi
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Wilson W. L. Sung
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, ON M5S 1A1, Canada; (B.K.); (W.W.L.S.); (B.T.)
| | - Bhooma Thiruvahindrapuram
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, ON M5S 1A1, Canada; (B.K.); (W.W.L.S.); (B.T.)
| | - Alawi Alsheikh-Ali
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| | - Stephen W. Scherer
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, ON M5S 1A1, Canada; (B.K.); (W.W.L.S.); (B.T.)
- Department of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 1X8, Canada
- McLaughlin Centre and Department of Molecular Genetics, University of Toronto, Toronto, ON M5S, Canada
| | - Mohammed Uddin
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai 505055, United Arab Emirates; (G.B.); (A.A.); (A.B.); (N.N.); (R.T.); (B.K.B.); (N.K.); (D.A.); (A.A.-A.)
| |
Collapse
|
13
|
Giraldo PA, Shinozuka H, Spangenberg GC, Smith KF, Cogan NOI. Rapid and Detailed Characterization of Transgene Insertion Sites in Genetically Modified Plants via Nanopore Sequencing. FRONTIERS IN PLANT SCIENCE 2021; 11:602313. [PMID: 33613582 PMCID: PMC7889508 DOI: 10.3389/fpls.2020.602313] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 12/24/2020] [Indexed: 05/15/2023]
Abstract
Molecular characterization of genetically modified plants can provide crucial information for the development of detection and identification methods, to comply with traceability, and labeling requirements prior to commercialization. Detailed description of the genetic modification was previously a challenging step in the safety assessment, since it required the use of laborious and time-consuming techniques. In this study an accurate, simple, and fast method was developed for molecular characterization of genetically modified (GM) plants, following a user-friendly workflow for researchers with limited bioinformatic capabilities. Three GM events from a diverse array of crop species-perennial ryegrass, white clover, and canola-were used to test the approach that exploits long-read sequencing by the MinION device, from Oxford Nanopore Technologies. The method delivered a higher degree of resolution of the transgenic events within the host genome than has previously been possible with the standard Illumina short-range sequencing strategies. The flanking sequences, copy number, and presence of backbone sequences, and overall transgene insertion structure were determined for each of the plant genomes, with the additional identification of moderate-sized secondary insertions that would have previously been missed. The proposed workflow takes only about 1 week from DNA extraction to analyzed result, and the method will complement the existing approaches for molecular characterization of GM plants, since it makes the process faster, simpler, and more cost-effective.
Collapse
Affiliation(s)
- Paula A. Giraldo
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, Australia
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Hiroshi Shinozuka
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
| | - German C. Spangenberg
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
- AgriBio, The Centre for AgriBioscience, School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Kevin F. Smith
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, Australia
- Agriculture Victoria Research, Hamilton, VIC, Australia
| | - Noel O. I. Cogan
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
- AgriBio, The Centre for AgriBioscience, School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| |
Collapse
|
14
|
Lamb HJ, Hayes BJ, Nguyen LT, Ross EM. The Future of Livestock Management: A Review of Real-Time Portable Sequencing Applied to Livestock. Genes (Basel) 2020; 11:E1478. [PMID: 33317066 PMCID: PMC7763041 DOI: 10.3390/genes11121478] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 11/10/2020] [Accepted: 12/01/2020] [Indexed: 12/12/2022] Open
Abstract
Oxford Nanopore Technologies' MinION has proven to be a valuable tool within human and microbial genetics. Its capacity to produce long reads in real time has opened up unique applications for portable sequencing. Examples include tracking the recent African swine fever outbreak in China and providing a diagnostic tool for disease in the cassava plant in Eastern Africa. Here we review the current applications of Oxford Nanopore sequencing in livestock, then focus on proposed applications in livestock agriculture for rapid diagnostics, base modification detection, reference genome assembly and genomic prediction. In particular, we propose a future application: 'crush-side genotyping' for real-time on-farm genotyping for extensive industries such as northern Australian beef production. An initial in silico experiment to assess the feasibility of crush-side genotyping demonstrated promising results. SNPs were called from simulated Nanopore data, that included the relatively high base call error rate that is characteristic of the data, and calling parameters were varied to understand the feasibility of SNP calling at low coverages in a heterozygous population. With optimised genotype calling parameters, over 85% of the 10,000 simulated SNPs were able to be correctly called with coverages as low as 6×. These results provide preliminary evidence that Oxford Nanopore sequencing has potential to be used for real-time SNP genotyping in extensive livestock operations.
Collapse
Affiliation(s)
- Harrison J. Lamb
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD 4067, Australia; (B.J.H.); (L.T.N.); (E.M.R.)
| | | | | | | |
Collapse
|
15
|
Xie Y, Zhong Y, Chang J, Kwan HS. Chromosome-level de novo assembly of Coprinopsis cinerea A43mut B43mut pab1-1 #326 and genetic variant identification of mutants using Nanopore MinION sequencing. Fungal Genet Biol 2020; 146:103485. [PMID: 33253902 DOI: 10.1016/j.fgb.2020.103485] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 10/22/2020] [Accepted: 11/13/2020] [Indexed: 11/26/2022]
Abstract
The homokaryotic Coprinopsis cinerea strain A43mut B43mut pab1-1 #326 is a widely used experimental model for developmental studies in mushroom-forming fungi. It can grow on defined artificial media and complete the whole lifecycle within two weeks. The mutations in mating type factors A and B result in the special feature of clamp formation and fruiting without mating. This feature allows investigations and manipulations with a homokaryotic genetic background. Current genome assembly of strain #326 was based on short-read sequencing data and was highly fragmented, leading to the bias in gene annotation and downstream analyses. Here, we report a chromosome-level genome assembly of strain #326. Oxford Nanopore Technology (ONT) MinION sequencing was used to get long reads. Illumina short reads was used to polish the sequences. A combined assembly yield 13 chromosomes and a mitochondrial genome as individual scaffolds. The assembly has 15,250 annotated genes with a high synteny with the C. cinerea strain Okayama-7 #130. This assembly has great improvement on contiguity and annotations. It is a suitable reference for further genomic studies, especially for the genetic, genomic and transcriptomic analyses in ONT long reads. Single nucleotide variants and structural variants in six mutagenized and cisplatin-screened mutants could be identified and validated. A 66 bp deletion in Ras GTPase-activating protein (RasGAP) was found in all mutants. To make a better use of ONT sequencing platform, we modified a high-molecular-weight genomic DNA isolation protocol based on magnetic beads for filamentous fungi. This study showed the use of MinION to construct a fungal reference genome and to perform downstream studies in an individual laboratory. An experimental workflow was proposed, from DNA isolation and whole genome sequencing, to genome assembly and variant calling. Our results provided solutions and parameters for fungal genomic analysis on MinION sequencing platform.
Collapse
Affiliation(s)
- Yichun Xie
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region
| | - Yiyi Zhong
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region
| | - Jinhui Chang
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region; The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, China
| | - Hoi Shan Kwan
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region.
| |
Collapse
|
16
|
Dumschott K, Schmidt MHW, Chawla HS, Snowdon R, Usadel B. Oxford Nanopore sequencing: new opportunities for plant genomics? JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:5313-5322. [PMID: 32459850 PMCID: PMC7501810 DOI: 10.1093/jxb/eraa263] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 05/25/2020] [Indexed: 05/06/2023]
Abstract
DNA sequencing was dominated by Sanger's chain termination method until the mid-2000s, when it was progressively supplanted by new sequencing technologies that can generate much larger quantities of data in a shorter time. At the forefront of these developments, long-read sequencing technologies (third-generation sequencing) can produce reads that are several kilobases in length. This greatly improves the accuracy of genome assemblies by spanning the highly repetitive segments that cause difficulty for second-generation short-read technologies. Third-generation sequencing is especially appealing for plant genomes, which can be extremely large with long stretches of highly repetitive DNA. Until recently, the low basecalling accuracy of third-generation technologies meant that accurate genome assembly required expensive, high-coverage sequencing followed by computational analysis to correct for errors. However, today's long-read technologies are more accurate and less expensive, making them the method of choice for the assembly of complex genomes. Oxford Nanopore Technologies (ONT), a third-generation platform for the sequencing of native DNA strands, is particularly suitable for the generation of high-quality assemblies of highly repetitive plant genomes. Here we discuss the benefits of ONT, especially for the plant science community, and describe the issues that remain to be addressed when using ONT for plant genome sequencing.
Collapse
Affiliation(s)
- Kathryn Dumschott
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
| | - Maximilian H-W Schmidt
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Rod Snowdon
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Björn Usadel
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
- Institute for Biological Data Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| |
Collapse
|