Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Buschmann T, Zhang R, Brash DE, Bystrykh LV. Enhancing the detection of barcoded reads in high throughput DNA sequencing data by controlling the false discovery rate. BMC Bioinformatics 2014;15:264. [PMID: 25099007 PMCID: PMC4133078 DOI: 10.1186/1471-2105-15-264] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 07/19/2014] [Indexed: 02/05/2023] Open

For:	Buschmann T, Zhang R, Brash DE, Bystrykh LV. Enhancing the detection of barcoded reads in high throughput DNA sequencing data by controlling the false discovery rate. BMC Bioinformatics 2014;15:264. [PMID: 25099007 PMCID: PMC4133078 DOI: 10.1186/1471-2105-15-264] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 07/19/2014] [Indexed: 02/05/2023] Open

Number

Cited by Other Article(s)

Menon V, Brash DE. Next-generation sequencing methodologies to detect low-frequency mutations: "Catch me if you can". MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023;792:108471. [PMID: 37716438 PMCID: PMC10843083 DOI: 10.1016/j.mrrev.2023.108471] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 09/06/2023] [Accepted: 09/07/2023] [Indexed: 09/18/2023]

Abstract

Mutations, the irreversible changes in an organism's DNA sequence, are present in tissues at a variant allele frequency (VAF) ranging from ∼10-8 per bp for a founder mutation to ∼10-3 for a histologically normal tissue sample containing several independent clones - compared to 1%- 50% for a heterozygous tumor mutation or a polymorphism. The rarity of these events poses a challenge for accurate clinical diagnosis and prognosis, toxicology, and discovering new disease etiologies. Standard Next-Generation Sequencing (NGS) technologies report VAFs as low as 0.5% per nt, but reliably observing rarer precursor events requires additional sophistication to measure ultralow-frequency mutations. We detail the challenge; define terms used to characterize the results, which vary between laboratories and sometimes conflict between biologists and bioinformaticists; and describe recent innovations to improve standard NGS methodologies including: single-strand consensus sequence methods such as Safe-SeqS and SiMSen-Seq; tandem-strand consensus sequence methods such as o2n-Seq and SMM-Seq; and ultrasensitive parent-strand consensus sequence methods such as DuplexSeq, PacBio HiFi, SinoDuplex, OPUSeq, EcoSeq, BotSeqS, Hawk-Seq, NanoSeq, SaferSeq, and CODEC. Practical applications are also noted. Several methods quantify VAF down to 10-5 at a nt and mutation frequency (MF) in a target region down to 10-7 per nt. By expanding to > 1 Mb of sites never observed twice, thus forgoing VAF, other methods quantify MF < 10-9 per nt or < 15 errors per haploid genome. Clonal expansion cannot be directly distinguished from independent mutations by sequencing, so it is essential for a paper to report whether its MF counted only different mutations - the minimum independent-mutation frequency MFminI - or all mutations observed including recurrences - the larger maximum independent-mutation frequency MFmaxI which may reflect clonal expansion. Ultrasensitive methods reveal that, without their use, even mutations with VAF 0.5-1% are usually spurious.

Collapse

Sequencing barcode construction and identification methods based on block error-correction codes. SCIENCE CHINA-LIFE SCIENCES 2020;63:1580-1592. [PMID: 32303959 DOI: 10.1007/s11427-019-1651-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 02/11/2020] [Indexed: 02/07/2023]

Sanhueza D, Guégan JF, Jordan H, Chevillon C. Environmental Variations in Mycobacterium ulcerans Transcriptome: Absence of Mycolactone Expression in Suboptimal Environments. Toxins (Basel) 2019;11:E146. [PMID: 30836720 PMCID: PMC6468629 DOI: 10.3390/toxins11030146] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Revised: 02/18/2019] [Accepted: 02/27/2019] [Indexed: 12/30/2022] Open

Wang B, Zheng X, Zhou S, Zhou C, Wei X, Zhang Q, Wei Z. Constructing DNA Barcode Sets Based on Particle Swarm Optimization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:999-1002. [PMID: 28287980 DOI: 10.1109/tcbb.2017.2679004] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Groot-Kormelink PJ, Ferrand S, Kelley N, Bill A, Freuler F, Imbert PE, Marelli A, Gerwin N, Sivilotti LG, Miraglia L, Orth AP, Oakeley EJ, Schopfer U, Siehler S. High Throughput Random Mutagenesis and Single Molecule Real Time Sequencing of the Muscle Nicotinic Acetylcholine Receptor. PLoS One 2016;11:e0163129. [PMID: 27649498 PMCID: PMC5029940 DOI: 10.1371/journal.pone.0163129] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 09/03/2016] [Indexed: 12/15/2022] Open

Effects of early feeding on the host rumen transcriptome and bacterial diversity in lambs. Sci Rep 2016;6:32479. [PMID: 27576848 PMCID: PMC5006043 DOI: 10.1038/srep32479] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 08/08/2016] [Indexed: 11/08/2022] Open

Embryonal Control of Yellow Seed Coat Locus ECY1 Is Related to Alanine and Phenylalanine Metabolism in the Seed Embryo of Brassica napus. G3-GENES GENOMES GENETICS 2016;6:1073-81. [PMID: 26896439 PMCID: PMC4825642 DOI: 10.1534/g3.116.027110] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Abstract

Seed coat color is determined by the type of pigment deposited in the seed coat cells. It is related to important agronomic traits of seeds such as seed dormancy, longevity, oil content, protein content and fiber content. In Brassica napus, inheritance of seed coat color is related to maternal effects and pollen effects (xenia effects). In this research we isolated a mutation of yellow seeded B. napus controlled by a single Mendelian locus, which is named Embryonal Control of Yellow seed coat 1 (Ecy1). Microscopy of transverse sections of the mature seed show that pigment is deposited only in the outer layer of the seed coat. Using Illumina Hisequation 2000 sequencing technology, a total of 12 GB clean data, 116× coverage of coding sequences of B. napus, was achieved from seeds 26 d after pollination (DAP). It was assembled into 172,238 independent transcripts, and 55,637 unigenes. A total of 139 orthologous genes of Arabidopsis transparent testa (TT) genes were mapped in silico to 19 chromosomes of B. napus. Only 49 of the TT orthologous genes are transcribed in seeds. However transcription of all orthologs was independent of embryonal control of seed coat color. Only 55 genes were found to be differentially expressed between brown seeds and the yellow mutant. Of these 55, 50 were upregulated and five were downregulated in yellow seeds as compared to their brown counterparts. By KEGG classification, 14 metabolic pathways were significantly enriched. Of these, five pathways: phenylpropanoid biosynthesis, cyanoamino acid metabolism, plant hormone signal transduction, metabolic pathways, and biosynthesis of secondary metabolites, were related with seed coat pigmentation. Free amino acid quantification showed that Ala and Phe were present at higher levels in the embryos of yellow seeds as compared to those of brown seeds. This increase was not observed in the seed coat. Moreover, the excess amount of free Ala was exactly twice that of Phe in the embryo. The pigment substrate chalcone is synthesized from two molecules of Ala and one molecule of Phe. The correlation between accumulation of Ala and Phe, and disappearance of pigment in the yellow seeded mutant, suggests that embryonal control of seed coat color is related with Phe and Ala metabolism in the embryo of B. napus.

Collapse

Tapia E, Spetale F, Krsticevic F, Angelone L, Bulacio P. DNA Barcoding through Quaternary LDPC Codes. PLoS One 2015;10:e0140459. [PMID: 26492348 PMCID: PMC4619643 DOI: 10.1371/journal.pone.0140459] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 09/23/2015] [Indexed: 12/04/2022] Open

Kracht D, Schober S. Insertion and deletion correcting DNA barcodes based on watermarks. BMC Bioinformatics 2015;16:50. [PMID: 25887410 PMCID: PMC4339740 DOI: 10.1186/s12859-015-0482-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 01/29/2015] [Indexed: 01/12/2023] Open

Abstract

Background

Barcode multiplexing is a key strategy for sharing the rising capacity of next-generation sequencing devices: Synthetic DNA tags, called barcodes, are attached to natural DNA fragments within the library preparation procedure. Different libraries, can individually be labeled with barcodes for a joint sequencing procedure. A post-processing step is needed to sort the sequencing data according to their origin, utilizing these DNA labels. The final separation step is called demultiplexing and is mainly determined by the characteristics of the DNA code words used as labels.

Currently, we are facing two different strategies for barcoding: One is based on the Hamming distance, the other uses the edit metric to measure distances of code words. The theory of channel coding provides well-known code constructions for Hamming metric. They provide a large number of code words with variable lengths and maximal correction capability regarding substitution errors. However, some sequencing platforms are known to have exceptional high numbers of insertion or deletion errors. Barcodes based on the edit distance can take insertion and deletion errors into account in the decoding process. Unfortunately, there is no explicit code-construction known that gives optimal codes for edit metric.

Results

In the present work we focus on an entirely different perspective to obtain DNA barcodes. We consider a concatenated code construction, producing so-called watermark codes, which were first proposed by Davey and Mackay, to communicate via binary channels with synchronization errors. We adapt and extend the concepts of watermark codes to use them for DNA sequencing. Moreover, we provide an exemplary set of barcodes that are experimentally compatible with common next-generation sequencing platforms. Finally, a realistic simulation scenario is use to evaluate the proposed codes to show that the watermark concept is suitable for DNA sequencing applications.

Conclusion

Our adaption of watermark codes enables the construction of barcodes that are capable of correcting substitutions, insertion and deletion errors. The presented approach has the advantage of not needing any markers or technical sequences to recover the position of the barcode in the sequencing reads, which poses a significant restriction with other approaches.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0482-7) contains supplementary material, which is available to authorized users.

Collapse