1
|
Zang H, Guo S, Dong S, Song Y, Li K, Fan X, Qiu J, Zheng Y, Jiang H, Wu Y, Lü Y, Chen D, Guo R. Construction of a Full-Length Transcriptome of Western Honeybee Midgut Tissue and Improved Genome Annotation. Genes (Basel) 2024; 15:728. [PMID: 38927663 PMCID: PMC11202838 DOI: 10.3390/genes15060728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 05/22/2024] [Accepted: 05/26/2024] [Indexed: 06/28/2024] Open
Abstract
Honeybees are an indispensable pollinator in nature with pivotal ecological, economic, and scientific value. However, a full-length transcriptome for Apis mellifera, assembled with the advanced third-generation nanopore sequencing technology, has yet to be reported. Here, nanopore sequencing of the midgut tissues of uninoculated and Nosema ceranae-inoculated A. mellifera workers was conducted, and the full-length transcriptome was then constructed and annotated based on high-quality long reads. Next followed improvement of sequences and annotations of the current reference genome of A. mellifera. A total of 5,942,745 and 6,664,923 raw reads were produced from midguts of workers at 7 days post-inoculation (dpi) with N. ceranae and 10 dpi, while 7,100,161 and 6,506,665 raw reads were generated from the midguts of corresponding uninoculated workers. After strict quality control, 6,928,170, 6,353,066, 5,745,048, and 6,416,987 clean reads were obtained, with a length distribution ranging from 1 kb to 10 kb. Additionally, 16,824, 17,708, 15,744, and 18,246 full-length transcripts were respectively detected, including 28,019 nonredundant ones. Among these, 43,666, 30,945, 41,771, 26,442, and 24,532 full-length transcripts could be annotated to the Nr, KOG, eggNOG, GO, and KEGG databases, respectively. Additionally, 501 novel genes (20,326 novel transcripts) were identified for the first time, among which 401 (20,255), 193 (13,365), 414 (19,186), 228 (12,093), and 202 (11,703) were respectively annotated to each of the aforementioned five databases. The expression and sequences of three randomly selected novel transcripts were confirmed by RT-PCR and Sanger sequencing. The 5' UTR of 2082 genes, the 3' UTR of 2029 genes, and both the 5' and 3' UTRs of 730 genes were extended. Moreover, 17,345 SSRs, 14,789 complete ORFs, 1224 long non-coding RNAs (lncRNAs), and 650 transcription factors (TFs) from 37 families were detected. Findings from this work not only refine the annotation of the A. mellifera reference genome, but also provide a valuable resource and basis for relevant molecular and -omics studies.
Collapse
Affiliation(s)
- He Zang
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
- National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China
- Apitherapy Research Institute of Fujian Province, Fuzhou 350002, China
| | - Sijia Guo
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
| | - Shunan Dong
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
| | - Yuxuan Song
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
| | - Kunze Li
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
| | - Xiaoxue Fan
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
- National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China
- Apitherapy Research Institute of Fujian Province, Fuzhou 350002, China
| | - Jianfeng Qiu
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
- National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China
- Apitherapy Research Institute of Fujian Province, Fuzhou 350002, China
| | - Yidi Zheng
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
| | - Haibin Jiang
- Apiculture Science Institute of Jilin Province, Jilin 132000, China; (H.J.); (Y.W.)
| | - Ying Wu
- Apiculture Science Institute of Jilin Province, Jilin 132000, China; (H.J.); (Y.W.)
| | - Yang Lü
- Mudanjiang Branch of Heilongjiang Academy of Agricultural Sciences, Mudanjiang 157000, China;
| | - Dafu Chen
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
- National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China
- Apitherapy Research Institute of Fujian Province, Fuzhou 350002, China
| | - Rui Guo
- College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (H.Z.); (S.G.); (S.D.); (Y.S.); (K.L.); (X.F.); (J.Q.); (Y.Z.); (D.C.)
- National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China
- Apitherapy Research Institute of Fujian Province, Fuzhou 350002, China
| |
Collapse
|
2
|
Next generation sequencing technologies to explore the diversity of germplasm resources: achievements and trends in tomato. Comput Struct Biotechnol J 2022; 20:6250-6258. [PMID: 36420160 PMCID: PMC9676195 DOI: 10.1016/j.csbj.2022.11.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 11/10/2022] [Accepted: 11/10/2022] [Indexed: 11/14/2022] Open
Abstract
Tomato is one of the major vegetable crops grown worldwide and a model species for genetic and biological research. Progress in genomic technologies made possible the development of forefront methods for high-scale sequencing, providing comprehensive insight into the genetic architecture of germplasm resources. This review revisits next-generation sequencing strategies and applications to investigate the diversity of tomato, describing the common platforms used for SNP genotyping of large collections, de novo sequencing, and whole genome resequencing. Significant findings in evolutionary history are outlined, thus discussing how genomics has provided new hints about the processes behind domestication. Finally, achievement and perspectives on pan-genome construction and graphical pan-genome development toward precise mining of the natural variation to be exploited for breeding purposes are presented.
Collapse
|
3
|
Marcozzi A, Jager M, Elferink M, Straver R, van Ginkel JH, Peltenburg B, Chen LT, Renkens I, van Kuik J, Terhaard C, de Bree R, Devriese LA, Willems SM, Kloosterman WP, de Ridder J. Accurate detection of circulating tumor DNA using nanopore consensus sequencing. NPJ Genom Med 2021; 6:106. [PMID: 34887408 PMCID: PMC8660781 DOI: 10.1038/s41525-021-00272-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 11/09/2021] [Indexed: 12/26/2022] Open
Abstract
Levels of circulating tumor DNA (ctDNA) in liquid biopsies may serve as a sensitive biomarker for real-time, minimally-invasive tumor diagnostics and monitoring. However, detecting ctDNA is challenging, as much fewer than 5% of the cell-free DNA in the blood typically originates from the tumor. To detect lowly abundant ctDNA molecules based on somatic variants, extremely sensitive sequencing methods are required. Here, we describe a new technique, CyclomicsSeq, which is based on Oxford Nanopore sequencing of concatenated copies of a single DNA molecule. Consensus calling of the DNA copies increased the base-calling accuracy ~60×, enabling accurate detection of TP53 mutations at frequencies down to 0.02%. We demonstrate that a TP53-specific CyclomicsSeq assay can be successfully used to monitor tumor burden during treatment for head-and-neck cancer patients. CyclomicsSeq can be applied to any genomic locus and offers an accurate diagnostic liquid biopsy approach that can be implemented in clinical workflows.
Collapse
Affiliation(s)
- Alessio Marcozzi
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands.,Cyclomics, Utrecht, CG, The Netherlands
| | - Myrthe Jager
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Martin Elferink
- Department of Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Roy Straver
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Joost H van Ginkel
- Department of pathology, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands.,Department of Oral and Maxillofacial Surgery, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Boris Peltenburg
- Department of Radiotherapy, UMC Utrecht Cancer Center, University Medical Center Utrecht, Utrecht, The Netherlands.,Department of Head and Neck Surgical Oncology, UMC Utrecht Cancer Center, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Li-Ting Chen
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Ivo Renkens
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Joyce van Kuik
- Department of pathology, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands
| | - Chris Terhaard
- Department of Radiotherapy, UMC Utrecht Cancer Center, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Remco de Bree
- Department of Head and Neck Surgical Oncology, UMC Utrecht Cancer Center, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Lot A Devriese
- Department of Medical Oncology, UMC Utrecht Cancer Center, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Stefan M Willems
- Department of pathology, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands.,Department of Pathology and Medical Biology, University Medical Center Groningen, Rijksuniversiteit Groningen, Groningen, GZ, The Netherlands
| | - Wigard P Kloosterman
- Cyclomics, Utrecht, CG, The Netherlands. .,Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands.
| | - Jeroen de Ridder
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Utrecht University, Utrecht, CX, The Netherlands. .,Cyclomics, Utrecht, CG, The Netherlands.
| |
Collapse
|
4
|
Anifandis G, Tempest HG, Oliva R, Swanson GM, Simopoulou M, Easley CA, Primig M, Messini CI, Turek PJ, Sutovsky P, Ory SJ, Krawetz SA. COVID-19 and human reproduction: A pandemic that packs a serious punch. Syst Biol Reprod Med 2021; 67:3-23. [PMID: 33719829 DOI: 10.1080/19396368.2020.1855271] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The COVID-19 pandemic has led to a worldwide health emergency that has impacted 188 countries at last count. The rapid community transmission and relatively high mortality rates with COVID-19 in modern times are relatively unique features of this flu pandemic and have resulted in an unparalleled global health crisis. SARS-CoV-2, being a respiratory virus, mainly affects the lungs, but is capable of infecting other vital organs, such as brain, heart and kidney. Emerging evidence suggests that the virus also targets male and female reproductive organs that express its main receptor ACE2, although it is as yet unclear if this has any implications for human fertility. Furthermore, professional bodies have recommended discontinuing fertility services during the pandemic such that reproductive services have also been affected. Although increased safety measures have helped to mitigate the propagation of COVID-19 in a number of countries, it seems that there is no predictable timeline to containment of the virus, a goal likely to remain elusive until an effective vaccine becomes available and widely distributed across the globe. In parallel, research on reproduction has been postponed for obvious reasons, while diagnostic tests that detect the virus or antibodies against it are of vital importance to support public health policies, such as social distancing and our obligation to wear masks in public spaces. This review aims to provide an overview of critical research and ethics issues that have been continuously emerging in the field of reproductive medicine as the COVID-19 pandemic tragically unfolds.Abbreviations: ACE2: angiotensin- converting enzyme 2; ART: Assisted reproductive technology; ASRM: American Society for Reproductive Medicine; CCR9: C-C Motif Chemokine Receptor 9; CDC: Centers for Disease Control and Prevention; COVID-19: Coronavirus disease 2019; Ct: Cycle threshold; CXCR6: C-X-C Motif Chemokine Receptor 6; ELISA: enzyme-linked immunosorbent assay; ESHRE: European Society of Human Reproduction and Embryology; ET: Embryo transfer; FSH: Follicle Stimulating Hormone; FFPE: formalin fixed paraffin embedded; FYCO1: FYVE And Coiled-Coil Domain Autophagy Adaptor 1; IFFS: International Federation of Fertility Societies; IUI: Intrauterine insemination; IVF: In vitro fertilization; LH: Luteinizing Hormone; LZTFL1: Leucine Zipper Transcription Factor Like 1; MAR: medically assisted reproduction services; MERS: Middle East Respiratory syndrome; NGS: Next Generation Sequencing; ORF: Open Reading Frame; PPE: personal protective equipment; RE: RNA Element; REDa: RNA Element Discovery algorithm; RT-PCR: Reverse=trascriptase transcriptase-polymerase chain reaction; SARS: Severe acute respiratory syndrome; SARS-CoV-2: Severe Acute Respiratory Syndrome Coronavirus 2; SLC6A20: Solute Carrier Family 6 Member 20; SMS: Single Molecule Sequencing; T: Testosterone; TMPRSS2: transmembrane serine protease 2; WHO: World Health Organization; XCR1: X-C Motif Chemokine Receptor.
Collapse
Affiliation(s)
- George Anifandis
- Department of Obstetrics and Gynecology, School of Health Sciences, Faculty of Medicine, University of Thessaly, Larisa, Greece
| | - Helen G Tempest
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA.,Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
| | - Rafael Oliva
- Molecular Biology of Reproduction and Development Research Group, Institut d'Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Universitat De Barcelona, and Hospital Clinic from Barcelona, Spain
| | - Grace M Swanson
- Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, USA.,Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan, USA
| | - Mara Simopoulou
- Department of Experimental Physiology, School of Health Sciences, Faculty of Medicine, National and Kapodistrian University of Athens, Athens, Greece, Athens, Greece
| | - Charles A Easley
- Department of Environmental Health Science, College of Public Health, University of Georgia, Athens, GA, USA.,Regenerative Bioscience Center, University of Georgia, Athens, GA, USA
| | - Michael Primig
- Inserm, EHESP, Irset (Institut De Recherche En Santé, Environnement Et Travail), Rennes, France
| | - Christina I Messini
- Department of Obstetrics and Gynecology, School of Health Sciences, Faculty of Medicine, University of Thessaly, Larisa, Greece
| | - Paul J Turek
- It is a private Clinic, The Turek Clinic, Beverly Hills, CA, USA
| | - Peter Sutovsky
- Division of Animal Sciences and the Department of Obstetrics, Gynecology and Women's Health, University of Missouri, Columbia, MO, USA
| | - Steve J Ory
- It is a private Clinic, IVF Florida Reproductive Institutes, Margate, FL, USA.,Department of Obstetrics and Gynecology, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
| | - Stephen A Krawetz
- Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, USA.,Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan, USA.,Department of Obstetrics and Gynecology and Center of Molecular Medicine and Genetics, C.S. Mott Center for Human Growth and Development, Wayne State University, Detroit, MI, USA
| |
Collapse
|
5
|
Low-complexity and highly robust barcodes for error-rich single molecular sequencing. 3 Biotech 2021; 11:78. [PMID: 33505833 DOI: 10.1007/s13205-020-02607-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/23/2020] [Indexed: 12/28/2022] Open
Abstract
DNA barcodes are frequently corrupted due to insertion, deletion, and substitution errors during DNA synthesis, amplification and sequencing, resulting in index hopping. In this paper, we propose a new DNA barcode construction scheme that combines a cyclic block code with a predetermined pseudo-random sequence bit by bit to form bit pairs, and then converts the bit pairs to bases, i.e., the DNA barcodes. Then, we present a barcode identification scheme for noisy sequencing reads, which uses a combination of cyclic shifting and traditional dynamic programming to mark the insertion and deletion positions, and then performs erasure-and-error-correction decoding on the corrupted codewords. Furthermore, we verify the identification error rate of barcodes for multiple errors and evaluate the reliability of the barcodes in DNA context. This method can be easily generalized for constructing long barcodes, which may be used in scenarios with serious errors. Simulation results show that the bit error rate after identifying insertions/deletions is greatly reduced using the combination of cyclic shift and dynamic programming compared to using dynamic programming only. It indicates that the proposed method can effectively improve the accuracy for estimating insertion/deletion errors. And the overall identification error rate of the proposed method is lower than 10 - 5 when the probability of each base mutation is less than 0.1, which is the typical scenario in third-generation sequencing.
Collapse
|
6
|
Hatfield RG, Batista FM, Bean TP, Fonseca VG, Santos A, Turner AD, Lewis A, Dean KJ, Martinez-Urtaza J. The Application of Nanopore Sequencing Technology to the Study of Dinoflagellates: A Proof of Concept Study for Rapid Sequence-Based Discrimination of Potentially Harmful Algae. Front Microbiol 2020; 11:844. [PMID: 32457722 PMCID: PMC7227484 DOI: 10.3389/fmicb.2020.00844] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 04/08/2020] [Indexed: 01/05/2023] Open
Abstract
Harmful algal blooms (HABs) are a naturally occurring global phenomena that have the potential to impact fisheries, leisure and ecosystems, as well as posing a significant hazard to animal and human health. There is significant interest in the development and application of methodologies to study all aspects of the causative organisms and toxins associated with these events. This paper reports the first application of nanopore sequencing technology for the detection of eukaryotic harmful algal bloom organisms. The MinION sequencing platform from Oxford Nanopore technologies provides long read sequencing capabilities in a compact, low cost, and portable format. In this study we used the MinION to sequence long-range PCR amplicons from multiple dinoflagellate species with a focus on the genus Alexandrium. Primers applicable to a wide range of dinoflagellates were selected, meaning that although the study was primarily focused on Alexandrium the applicability to three additional genera of toxic algae, namely; Gonyaulax, Prorocentrum, and Lingulodinium was also demonstrated. The amplicon generated here spanned approximately 3 kb of the rDNA cassette, including most of the 18S, the complete ITS1, 5.8S, ITS2 and regions D1 and D2 of the 28S. The inclusion of barcode genes as well as highly conserved regions resulted in identification of organisms to the species level. The analysis of reference cultures resulted in over 99% of all sequences being attributed to the correct species with an average identity above 95% from a reference list of over 200 species (see Supplementary Material 1). The use of mock community analysis within environmental samples highlighted that complex matrices did not prevent the ability to distinguish between phylogenetically similar species. Successful identification of causative organisms in environmental samples during natural toxic events further highlighted the potential of the assay. This study proves the suitability of nanopore sequencing technology for taxonomic identification of harmful algal bloom organisms and acquisition of data relevant to the World Health Organisations "one health" approach to marine monitoring.
Collapse
Affiliation(s)
- Robert G. Hatfield
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Frederico M. Batista
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | | | - Vera G. Fonseca
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Andres Santos
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
- Scientific and Technological Bioresource Nucleus (BIOREN), Universidad de La Frontera, Temuco, Chile
| | - Andrew D. Turner
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Adam Lewis
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Karl J. Dean
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | | |
Collapse
|
7
|
Knot IE, Zouganelis GD, Weedall GD, Wich SA, Rae R. DNA Barcoding of Nematodes Using the MinION. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.00100] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
|
8
|
Sequencing barcode construction and identification methods based on block error-correction codes. SCIENCE CHINA-LIFE SCIENCES 2020; 63:1580-1592. [PMID: 32303959 DOI: 10.1007/s11427-019-1651-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 02/11/2020] [Indexed: 02/07/2023]
Abstract
Multiplexed sequencing relies on specific sample labels, the barcodes, to tag DNA fragments belonging to different samples and to separate the output of the sequencers. However, the barcodes are often corrupted by insertion, deletion and substitution errors introduced during sequencing, which may lead to sample misassignment. In this paper, we propose a barcode construction method, which combines a block error-correction code with a predetermined pseudorandom sequence to generate a base sequence for labeling different samples. Furthermore, to identify the corrupted barcodes for assigning reads to their respective samples, we present a soft decision identification method that consists of inner decoding and outer decoding. The inner decoder establishes the hidden Markov model (HMM) for base insertion/deletion estimation with the pseudorandom sequence, and adapts the forward-backward (FB) algorithm to output the soft information of each bit in the block code. The outer decoder performs soft decision decoding using the soft information to effectively correct multiple errors in the barcodes. Simulation results show that the proposed methods are highly robust to high error rates of insertions, deletions and substitutions in the barcodes. In addition, compared with the inner decoding algorithm of the barcodes based on watermarks, the proposed inner decoding algorithm can greatly reduce the decoding complexity.
Collapse
|
9
|
Krehenwinkel H, Pomerantz A, Prost S. Genetic Biomonitoring and Biodiversity Assessment Using Portable Sequencing Technologies: Current Uses and Future Directions. Genes (Basel) 2019; 10:E858. [PMID: 31671909 PMCID: PMC6895800 DOI: 10.3390/genes10110858] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 10/18/2019] [Accepted: 10/25/2019] [Indexed: 12/12/2022] Open
Abstract
We live in an era of unprecedented biodiversity loss, affecting the taxonomic composition of ecosystems worldwide. The immense task of quantifying human imprints on global ecosystems has been greatly simplified by developments in high-throughput DNA sequencing technology (HTS). Approaches like DNA metabarcoding enable the study of biological communities at unparalleled detail. However, current protocols for HTS-based biodiversity exploration have several drawbacks. They are usually based on short sequences, with limited taxonomic and phylogenetic information content. Access to expensive HTS technology is often restricted in developing countries. Ecosystems of particular conservation priority are often remote and hard to access, requiring extensive time from field collection to laboratory processing of specimens. The advent of inexpensive mobile laboratory and DNA sequencing technologies show great promise to facilitate monitoring projects in biodiversity hot-spots around the world. Recent attention has been given to portable DNA sequencing studies related to infectious organisms, such as bacteria and viruses, yet relatively few studies have focused on applying these tools to Eukaryotes, such as plants and animals. Here, we outline the current state of genetic biodiversity monitoring of higher Eukaryotes using Oxford Nanopore Technology's MinION portable sequencing platform, as well as summarize areas of recent development.
Collapse
Affiliation(s)
| | - Aaron Pomerantz
- Department of Integrative Biology, University of California, Berkeley, CA-94720, USA.
- Marine Biology Laboratory, Woods Hole, MA-02543, USA.
| | - Stefan Prost
- LOEWE-Centre for Translational Biodiversity Genomics, Senckenberg Museum, 60325 Frankfurt, Germany.
- South African National Biodiversity Institute, National Zoological Garden, Pretoria 0002, South Africa.
| |
Collapse
|