101
|
Abstract
The $1000 genome may still be years away, but with the arrival of next-generation sequencing (NGS) technologies that are much faster and cheaper than the traditional Sanger method, large-scale sequencing of hundreds or even thousands of human genomes is fast becoming reality.
Collapse
|
102
|
|
103
|
|
104
|
Skol AD, Scott LJ, Abecasis GR, Boehnke M. Optimal designs for two-stage genome-wide association studies. Genet Epidemiol 2008; 31:776-88. [PMID: 17549752 DOI: 10.1002/gepi.20240] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genome-wide association (GWA) studies require genotyping hundreds of thousands of markers on thousands of subjects, and are expensive at current genotyping costs. To conserve resources, many GWA studies are adopting a staged design in which a proportion of the available samples are genotyped on all markers in stage 1, and a proportion of these markers are genotyped on the remaining samples in stage 2. We describe a strategy for designing cost-effective two-stage GWA studies. Our strategy preserves much of the power of the corresponding one-stage design and minimizes the genotyping cost of the study while allowing for differences in per genotyping cost between stages 1 and 2. We show that the ratio of stage 2 to stage 1 per genotype cost can strongly influence both the optimal design and the genotyping cost of the study. Increasing the stage 2 per genotype cost shifts more of the genotyping and study cost to stage 1, and increases the cost of the study. This higher cost can be partially mitigated by adopting a design with reduced power while preserving the false positive rate or by increasing the false positive rate while preserving power. For example, reducing the power preserved in the two-stage design from 99 to 95% that of the one-stage design decreases the two-stage study cost by approximately 15%. Alternatively, the same cost savings can be had by relaxing the false positive rate by 2.5-fold, for example from 1/300,000 to 2.5/300,000, while retaining the same power.
Collapse
|
105
|
Xiao P, Huang H, Zhou G, Lu Z. Gel immobilization of acrylamide-modified single-stranded DNA template for pyrosequencing. Electrophoresis 2007; 28:1903-12. [PMID: 17487922 DOI: 10.1002/elps.200600794] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A novel two-step process was developed to prepare ssDNA templates for pyrosequencing. First, PCR-amplified DNA templates modified with an acrylamide group and acrylamide monomers were copolymerized in 0.1 M NaOH solution to form polyacrylamide gel spots. Second, ssDNA templates for pyrosequencing were prepared by removing electrophoretically unbound complementary strands, unmodified PCR primers, inorganic pyrophosphate (PPi), and excess deoxyribonucleotides under alkali conditions. The results show that the 3-D polyacrylamide gel network has a high immobilization capacity and the modified PCR fragments are efficiently captured. After electrophoresis, gel spots copolymerized from 10 microL of the crude PCR products and the acrylamide monomers contain template molecules on the order of pmol, which generate enough light to be detected by a regular photomultiplier tube. The porous structure of gel spots facilitated the fast transportation of the enzyme, dNTPs and other reagents, and the solution-mimicking microenvironment guaranteed polymerase efficiency for pyrosequencing. Successful genotyping from the crude PCR products was demonstrated. This method can be applied in any laboratory; it is cheap, fast, simple, and has the potential to be incorporated into a DNA-chip format for high-throughput pyrosequencing analysis.
Collapse
|
106
|
Wang YZ, Wu GX, Luo LB, Chen M, Ruan LH. Oligonucleotide chip, real-time PCR and sequencing for genotyping of hepatitis B virus. World J Gastroenterol 2007; 13:4260-3. [PMID: 17696258 PMCID: PMC4250628 DOI: 10.3748/wjg.v13.i31.4260] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
AIM: To compare the oligonucleotide chip, real-time PCR and sequencing for genotyping of hepatitis B virus in Chinese patients with chronic hepatitis B.
METHODS: Mixture of samples with different genotypes and clinical serum samples from 126 chronic hepatitis B patients was tested for hepatitis B virus genotypes by oligonucleotide chip, real-time PCR and sequencing of PCR products, respectively. Clinical performances, time required and costs of the three assays were evaluated.
RESULTS: Oligonucleotide chips and real-time PCR detected 1% and 0.1% genotypes, respectively, in mixed samples. Of the 126 clinical samples from patients with chronic hepatitis B, genotype B was detected in 41 (33%), 41 (33%) and 45 (36%) samples, and genotype C in 76 (60%), 76 (60%) and 81 (64%) samples, by oligonucleotide chip, real-time PCR and sequencing, respectively. Oligonucleotide chip and real-time PCR detected mixed genotypes B and C in 9 samples. Real-time PCR was the rapidest and cheapest among the three assays.
CONCLUSION: Oligonucleotide chip and real-time PCR are able to detect mixed genotypes, while sequencing only detects the dominant genotype in clinical samples.
Collapse
|
107
|
|
108
|
|
109
|
Baudhuin LM, Highsmith WE, Skierka J, Holtegaard L, Moore BE, O'Kane DJ. Comparison of three methods for genotyping the UGT1A1 (TA)n repeat polymorphism. Clin Biochem 2007; 40:710-7. [PMID: 17459361 DOI: 10.1016/j.clinbiochem.2007.03.007] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2006] [Revised: 03/05/2007] [Accepted: 03/06/2007] [Indexed: 11/20/2022]
Abstract
OBJECTIVES The UGT1A1 promoter contains a (TA)n repeat polymorphism. The 7 repeat allele is associated with decreased enzyme activity and patients homozygous for this allele treated with irinotecan may experience life-threatening toxicity. Here, we have compared three methods [DNA sequencing, fragment analysis, and the Invader assay (Third Wave Technologies)] for genotyping this polymorphism. RESULTS All of the DNA samples (n=119) had concordant genotype calls between the sequencing and size-based methods. The Invader method was also concordant if the genotypes were 6/6, 6/7, or 7/7. Both the size-based method and the Invader method had straightforward data analysis, while interpretation of the sequencing results was occasionally more challenging. The Invader method required more concentrated DNA for analysis, was more expensive, and had a limited genotyping spectrum. CONCLUSION All three methods were valuable for genotyping the UGT1A1 (TA)n repeat, with the sequencing and size-based assays having the fewest drawbacks.
Collapse
|
110
|
Abstract
New technologies will soon enable re-sequencing of human genomes at a much lower cost. The $1,000 human genome is an important goal for research and clinical diagnostics. A new generation of DNA-sequencing platforms will become commercially available over the next few years. These instruments will enable re-sequencing of human genomes at a previously unimagined throughput and low cost. Here, I examine why the $1,000 human genome is an important goal for research and clinical diagnostics, and what will be required to achieve it.
Collapse
|
111
|
|
112
|
Needleman D, Adam D, Detwiler M, Escobar H, Kieleczawa J, Pershard R, Schweitzer P, Wiebe G. DNA sequencing research group: 2006 general survey of DNA sequencing facilities. J Biomol Tech 2007; 18:113-9. [PMID: 17496223 PMCID: PMC2062535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Over the past few years, technological advances in automated DNA sequencing have had a profound effect on the nature of DNA sequencing laboratories. To characterize the changes occurring within DNA sequencing facilities, the DNA Sequencing Research Group conducted three previous studies, in 1998, 2000, and 2003. A new general survey has been designed and conducted by the DSRG to capture the current status of DNA sequencing facilities in all sectors. Included were questions regarding facility administration, pricing, instrumentation, technology, protocols, and operation. The results of the survey are presented here, accompanied by comparisons to the previous surveys. These comparisons formed a basis for the discussion of trends within the facilities in response to the dynamics of a changing technology.
Collapse
|
113
|
Rapid, low-cost DNA testing. Biotechnol J 2007; 2:303. [PMID: 17715508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
|
114
|
Benlloch S, Payá A, Alenda C, Bessa X, Andreu M, Jover R, Castells A, Llor X, Aranda FI, Massutí B. Detection of BRAF V600E mutation in colorectal cancer: comparison of automatic sequencing and real-time chemistry methodology. J Mol Diagn 2007; 8:540-3. [PMID: 17065421 PMCID: PMC1876165 DOI: 10.2353/jmoldx.2006.060070] [Citation(s) in RCA: 122] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Mutation V600E of BRAF, a kinase-encoding gene from the RAS/RAF/MAPK pathway, in colorectal carcinoma (CRC) suggests a sporadic origin of the disease, providing an exclusion criterion for hereditary nonpolyposis colorectal cancer. Here we describe detection of this mutation by real-time chemistry TaqMan MGB probes, confirmed by direct DNA sequencing as the gold standard. DNA was extracted from paraffin-embedded tissue from 112 tumors obtained from the EPICOLON study. Seventy-two tumors were CRC with defective DNA mismatch repair (MMR; microsatellite instability and/or loss of protein expression by immunohistochemical analysis), and 40 were proficient MMR controls. BRAF mutation was detected in 20/72 (27.8%) CRC with defective MMR and in 3/40 (7.5%) proficient MMR controls (P = 0.011). BRAF mutation was detected in 19/51 (37.3%) tumors with loss of MLH1 expression and in none of the tumors with loss of MSH2 expression (0/13). BRAF mutation was not found in cases with germline mutation of MLH1 (4/112) or MSH2 (3/112) genes. The sensitivity and specificity of our real-time chemistry were both 100% for detecting the V600E mutation. Because real-time chemistry methodology has advantages in cost, time, and labor, we consider it a valuable alternative to automatic direct sequencing, particularly for serial measurements.
Collapse
|
115
|
|
116
|
Stensvold CR, Traub RJ, von Samson-Himmelstjerna G, Jespersgaard C, Nielsen HV, Thompson RCA. Blastocystis: subtyping isolates using pyrosequencing technology. Exp Parasitol 2006; 116:111-9. [PMID: 17266951 DOI: 10.1016/j.exppara.2006.12.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2006] [Revised: 12/01/2006] [Accepted: 12/08/2006] [Indexed: 10/23/2022]
Abstract
Blastocystis is a prevalent single-celled enteric parasite of unresolved clinical significance. Efforts based on molecular methodologies to establish whether pathogenicity is linked to specific isolates of the genetically diverse genus of Blastocystis have been scarce and so far yielded ambiguous results which can be difficult to interpret. To alleviate some of the problems related to unravelling the molecular epidemiology of Blastocystis infections we developed and evaluated a simple and high-throughput sequence analysis (SQA) pyrosequencing technique based on the detection of genotype-specific nucleotide polymorphisms in the 18S small subunit rRNA gene for a rapid and cost-effective post-PCR screening of Blastocystis genotypes. The method was effectively capable of genotyping 48/48 isolates positive by nested PCR in approximately one hour, and in 94% of the cases the isolate detected by PCR and pyrosequencing was also detected by one of two different PCR assays with subsequent dideoxy sequencing.
Collapse
|
117
|
Abstract
In 2004, the NIH set a remarkable challenge: the 1000 dollars genome. Roughly speaking, success would provide, by 2015, the ability to sequence the complete genome of an individual human, quickly and at an accessible price. An intermediate goal of a 100,000 dollars genome was set for 2010. While the cost of Sanger sequencing has dropped dramatically over the past two decades, it is unlikely that the 100,000 dollars genome will be achieved by this means. New massively parallel technologies will push the cost of sequencing towards this mark, but it is doubtful whether these efforts will match the 1000 dollars goal. The best bets for ultrarapid, low-cost sequencing are single-molecule approaches.
Collapse
|
118
|
Boudewijns M, Bakkers JM, Sturm PDJ, Melchers WJG. 16S rRNA gene sequencing and the routine clinical microbiology laboratory: a perfect marriage? J Clin Microbiol 2006; 44:3469-70. [PMID: 16954306 PMCID: PMC1594676 DOI: 10.1128/jcm.01017-06] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
119
|
|
120
|
Cheung F, Haas BJ, Goldberg SMD, May GD, Xiao Y, Town CD. Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology. BMC Genomics 2006; 7:272. [PMID: 17062153 PMCID: PMC1635983 DOI: 10.1186/1471-2164-7-272] [Citation(s) in RCA: 195] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2006] [Accepted: 10/24/2006] [Indexed: 11/25/2022] Open
Abstract
Background In this study, we addressed whether a single 454 Life Science GS20 sequencing run provides new gene discovery from a normalized cDNA library, and whether the short reads produced via this technology are of value in gene structure annotation. Results A single 454 GS20 sequencing run on adapter-ligated cDNA, from a normalized cDNA library, generated 292,465 reads that were reduced to 252,384 reads with an average read length of 92 nucleotides after cleaning. After clustering and assembly, a total of 184,599 unique sequences were generated containing over 400 SSRs. The 454 sequences generated hits to more genes than a comparable amount of sequence from MtGI. Although short, the 454 reads are of sufficient length to map to a unique genome location as effectively as longer ESTs produced by conventional sequencing. Functional interpretation of the sequences was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of GO categories. 53,796 assemblies and singletons (29%) had no match in the existing MtGI. Within the previously unobserved Medicago transcripts, thousands had matches in a comprehensive protein database and one or more of the TIGR Plant Gene Indices. Approximately 20% of these novel sequences could be found in the Medicago genome sequence. A total of 70,026 reads generated by the 454 technology were mapped to 785 Medicago finished BACs using PASA and over 1,000 gene models required modification. In parallel to 454 sequencing, 4,445 5'-prime reads were generated by conventional sequencing using the same library and from the assembled sequences it was shown to contain about 52% full length cDNAs encoding proteins from 50 to over 500 amino acids in length. Conclusion Due to the large number of reads afforded by the 454 DNA sequencing technology, it is effective in revealing the expression of transcripts from a broad range of GO categories and contains many rare transcripts in normalized cDNA libraries, although only a limited portion of their sequence is uncovered. As with longer ESTs, 454 reads can be mapped uniquely onto genomic sequence to provide support for, and modifications of, gene predictions.
Collapse
|
121
|
Yu J, Ni P, Wong GKS. Comparing the whole-genome-shotgun and map-based sequences of the rice genome. TRENDS IN PLANT SCIENCE 2006; 11:387-91. [PMID: 16843033 DOI: 10.1016/j.tplants.2006.06.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2006] [Revised: 05/03/2006] [Accepted: 06/28/2006] [Indexed: 05/10/2023]
Abstract
The rice genome has now been sequenced using whole-genome-shotgun and map-based methods. The relative merits of the two methods are the subject of debate, as they were in the human genome project. In this Opinion article, we will show that the serious discrepancies between the resultant sequences are mostly found in the large transposable elements such as copia and gypsy that populate the intergenic regions of plant genomes. Differences in published gene counts and polymorphism rates are similarly resolved by considering how transposable elements affect the sequence analysis.
Collapse
|
122
|
Wade N. The quest for the $1,000 human genome: DNA sequencing in the doctor's office? At birth? It may be coming closer. THE NEW YORK TIMES ON THE WEB 2006:F1, F3. [PMID: 16874933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
|
123
|
Abbott WGH, Tukuitonga CF, Ofanoa M, Munn SR, Gane EJ. Low-cost, simultaneous, single-sequence genotyping of the HLA-A, HLA-B and HLA-C loci. ACTA ACUST UNITED AC 2006; 68:28-37. [PMID: 16774537 DOI: 10.1111/j.1399-0039.2006.00620.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
New automated DNA sequencing technology has enabled the development of an assay for genotyping the three major HLA class 1 loci from a single sequence of each gene containing exon 3, intron 2 and exon 2. The assay allows 31 subjects (with 3 negative controls) to be genotyped at all three loci simultaneously, using a 96-well plate format. Genotypes were assigned by comparing each sequence to a database of 307 HLA-A, 563 HLA-B and 166 HLA-C alleles. Unequivocal, 4-digit allele assignments were made for 40 of 130 HLA-A genes, 82 of 130 HLA-B genes and 97 of 130 HLA-C genes from 21 European, 20 Tongan and 24 Niuean subjects. Ambiguity in interpretation of the sequence contributed to 66 of the 170 equivocal allele assignments, and 105 equivocal assignments were due to polymorphisms outside exons 2 and 3. All known alternative interpretations of ambiguous genotypes were identified, and seven HLA-B and two HLA-C ambiguities were resolved by reading the out-of-phase exon 2 sequence that followed an indel in intron 2. The genotypes of a subgroup of 27 heterozygous subjects, whose genotypes contained all of the alleles identified in this study, were confirmed with commercial, generic PCR-SSP typing. In European subjects, the repertoire of HLA-B/HLA-C haplotypes was almost identical to previously published data. We identified five new HLA-B/HLA-C haplotypes in the Polynesian subjects, and the remaining haplotypes were of Asian origin. In summary, we are describing a low-cost, sequencing assay for the three major HLA class I loci that provides a level of resolution that is comparable with a commercial PCR-SSP assay.
Collapse
|
124
|
|
125
|
|
126
|
|
127
|
Thakur S, Gebreyes WA. Campylobacter coli in swine production: antimicrobial resistance mechanisms and molecular epidemiology. J Clin Microbiol 2005; 43:5705-14. [PMID: 16272508 PMCID: PMC1287812 DOI: 10.1128/jcm.43.11.5705-5714.2005] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2005] [Revised: 07/05/2005] [Accepted: 08/15/2005] [Indexed: 11/20/2022] Open
Abstract
The aim of this study was to determine antimicrobial resistance, to evaluate and compare the use of two genotyping methods for molecular epidemiology purposes, and to determine the genotypic diversity of Campylobacter coli of porcine origin. A total of 100 C. coli isolates from swine were tested for susceptibility to six antimicrobials using the agar dilution method and genotyped using two high-resolution fingerprinting approaches: multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE). Evaluation of the methods was based on their resistance patterns, discriminatory indexes (DI), high test throughputs, costs, and turnaround times. Resistance to erythromycin and tetracycline was the most common. Both genotypic methods were found to have high discriminatory power, although MLST had a higher DI (0.936) than PFGE (DI = 0.889). It also had a higher throughput than PFGE. Isolates were clustered into 27 groups by MLST compared to 11 by PFGE. MLST was able to further discriminate the isolates grouped under the same cluster by PFGE. Out of the 65 MLST sequence types (STs) identified among the total isolates, 50 were reported for the first time. Most STs were found to be specific to the farm (n = 38) and to slaughter (n = 22). Resistance against tetracycline and erythromycin was encoded by the tet(O) gene and a A2075G point mutation in the 23S rRNA gene, respectively. A high ciprofloxacin MIC (>64 microg/liter) was conferred by a point mutation in the gyrA gene. The weak clonal structure of the C. coli population among swine was further highlighted by the index of association value of 0.293. The findings of this study indicate that multidrug-resistant diverse C. coli strains exhibiting resistance to ciprofloxacin and erythromycin are concerning, since these are the drugs of choice for treating invasive campylobacteriosis cases in humans.
Collapse
|
128
|
|
129
|
Sequence specificity. Nat Genet 2005; 37:1013. [PMID: 16195710 DOI: 10.1038/ng1005-1013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
130
|
|
131
|
Normile D. Painstaking Approach Pays Off for Rice Sequencing Project. Science 2005; 309:997. [PMID: 16099955 DOI: 10.1126/science.309.5737.997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
132
|
|
133
|
Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, Wang MD, Zhang K, Mitra RD, Church GM. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 2005; 309:1728-32. [PMID: 16081699 DOI: 10.1126/science.1117389] [Citation(s) in RCA: 788] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents.
Collapse
|
134
|
Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Goodwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM. Genome sequencing in microfabricated high-density picolitre reactors. Nature 2005; 437:376-80. [PMID: 16056220 PMCID: PMC1464427 DOI: 10.1038/nature03959] [Citation(s) in RCA: 4967] [Impact Index Per Article: 261.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2005] [Accepted: 06/10/2005] [Indexed: 02/06/2023]
Abstract
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.
Collapse
|
135
|
DeWoody JA, Schupp J, Kenefic L, Busch J, Murfitt L, Keim P. Universal method for producing ROX-labeled size standards suitable for automated genotyping. Biotechniques 2005; 37:348, 350, 352. [PMID: 15470886 DOI: 10.2144/04373bm02] [Citation(s) in RCA: 130] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
136
|
Langaee T, Ronaghi M. Genetic variation analyses by Pyrosequencing. Mutat Res 2005; 573:96-102. [PMID: 15829240 DOI: 10.1016/j.mrfmmm.2004.07.023] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2004] [Accepted: 07/16/2004] [Indexed: 10/25/2022]
Abstract
Pyrosequencing is a real-time bioluminometric technique for determination of nucleic acid sequence. Here, we review recent advances and discuss new applications of this technique. Cost reduction efforts and future potentials of this technique for large-scale genotyping applications will also be discussed.
Collapse
|
137
|
Meaburn E, Butcher LM, Liu L, Fernandes C, Hansen V, Al-Chalabi A, Plomin R, Craig I, Schalkwyk LC. Genotyping DNA pools on microarrays: tackling the QTL problem of large samples and large numbers of SNPs. BMC Genomics 2005; 6:52. [PMID: 15811185 PMCID: PMC1079828 DOI: 10.1186/1471-2164-6-52] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2004] [Accepted: 04/05/2005] [Indexed: 11/10/2022] Open
Abstract
Background Quantitative trait locus (QTL) theory predicts that genetic influence on complex traits involves multiple genes of small effect size. To detect QTL associations of small effect size, large samples and systematic screens of thousands of DNA markers are required. An efficient solution is to genotype case and control DNA pools using SNP microarrays. We demonstrate that this is practical using DNA pools of 100 individuals. Results Using standard microarray protocols for the Affymetrix GeneChip® Mapping 10 K Array Xba 131, we show that relative allele signal (RAS) values provide a quantitative index of allele frequencies in pooled DNA that correlate 0.986 with allele frequencies for 104 SNPs that were genotyped individually for 100 individuals. The sensitivity of the assay was demonstrated empirically in a spiking experiment in which 15% and 20% of one individual's DNA was added to a DNA pool. Conclusion We conclude that this approach, which we call SNP-MaP (SNP microarrays and pooling), is rapid, cost effective and promises to be a valuable initial screening method in the hunt for QTLs.
Collapse
|
138
|
Yang GS, Stott JM, Smailus D, Barber SA, Balasundaram M, Marra MA, Holt RA. High-throughput sequencing: a failure mode analysis. BMC Genomics 2005; 6:2. [PMID: 15631628 PMCID: PMC546001 DOI: 10.1186/1471-2164-6-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2004] [Accepted: 01/04/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Basic manufacturing principles are becoming increasingly important in high-throughput sequencing facilities where there is a constant drive to increase quality, increase efficiency, and decrease operating costs. While high-throughput centres report failure rates typically on the order of 10%, the causes of sporadic sequencing failures are seldom analyzed in detail and have not, in the past, been formally reported. RESULTS Here we report the results of a failure mode analysis of our production sequencing facility based on detailed evaluation of 9,216 ESTs generated from two cDNA libraries. Two categories of failures are described; process-related failures (failures due to equipment or sample handling) and template-related failures (failures that are revealed by close inspection of electropherograms and are likely due to properties of the template DNA sequence itself). CONCLUSIONS Preventative action based on a detailed understanding of failure modes is likely to improve the performance of other production sequencing pipelines.
Collapse
|
139
|
Zwick ME, Mcafee F, Cutler DJ, Read TD, Ravel J, Bowman GR, Galloway DR, Mateczun A. Microarray-based resequencing of multiple Bacillus anthracis isolates. Genome Biol 2004; 6:R10. [PMID: 15642093 PMCID: PMC549062 DOI: 10.1186/gb-2004-6-1-r10] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2004] [Revised: 10/18/2004] [Accepted: 11/19/2004] [Indexed: 11/15/2022] Open
Abstract
Custom-designed resequencing arrays were used to generate 3.1 Mb of genomic sequence from a panel of 56 Bacillus anthracis strains. Sequence quality was shown to be very high by replication and by comparison to independently generated shotgun sequence We used custom-designed resequencing arrays to generate 3.1 Mb of genomic sequence from a panel of 56 Bacillus anthracis strains. Sequence quality was shown to be very high by replication (discrepancy rate of 7.4 × 10-7) and by comparison to independently generated shotgun sequence (discrepancy rate < 2.5 × 10-6). Population genomics studies of microbial pathogens using rapid resequencing technologies such as resequencing arrays are critical for recognizing newly emerging or genetically engineered strains.
Collapse
|
140
|
Yao T. [World trends of systems biology related to the genome network project]. TANPAKUSHITSU KAKUSAN KOSO. PROTEIN, NUCLEIC ACID, ENZYME 2004; 49:2993-3000. [PMID: 15669290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
|
141
|
Takatsu K, Yokomaku T, Kurata S, Kanagawa T. A FRET-based analysis of SNPs without fluorescent probes. Nucleic Acids Res 2004; 32:e156. [PMID: 15534363 PMCID: PMC528829 DOI: 10.1093/nar/gnh155] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Fluorescence resonance energy transfer (FRET) is a simple procedure for detecting specific DNA sequences, and is therefore used in many fields. However, the cost is relatively high, because FRET-based methods usually require fluorescent probes. We have designed a cost-effective way of using FRET, and developed a novel approach for the genotyping of single nucleotide polymorphisms (SNPs) and allele frequency estimation. The key feature of this method is that it uses a DNA-binding fluorogenic molecule, SYBR Green I, as an energy donor for FRET. In this method, single base extension is performed with dideoxynucleotides labeled with an orange dye and a red dye in the presence of SYBR Green I. The dyes incorporated into the extended products accept energy from SYBR Green I and emit fluorescence. We have validated the method with ten SNPs, which were successfully discriminated by end-point measurements of orange and red fluorescence intensity in a microplate fluorescence reader. Using a mixture of homozygous samples, we also confirmed the potential of this method for estimation of allele frequency. Application of this strategy to large-scale studies will reduce the time and cost of genotyping a vast number of SNPs.
Collapse
|
142
|
Blakesley RW, Hansen NF, Mullikin JC, Thomas PJ, McDowell JC, Maskeri B, Young AC, Benjamin B, Brooks SY, Coleman BI, Gupta J, Ho SL, Karlins EM, Maduro QL, Stantripop S, Tsurgeon C, Vogt JL, Walker MA, Masiello CA, Guan X, Bouffard GG, Green ED. An intermediate grade of finished genomic sequence suitable for comparative analyses. Genome Res 2004; 14:2235-44. [PMID: 15479945 PMCID: PMC525681 DOI: 10.1101/gr.2648404] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2004] [Accepted: 08/16/2004] [Indexed: 11/25/2022]
Abstract
Although the cost of generating draft-quality genomic sequence continues to decline, refining that sequence by the process of "sequence finishing" remains expensive. Near-perfect finished sequence is an appropriate goal for the human genome and a small set of reference genomes; however, such a high-quality product cannot be cost-justified for large numbers of additional genomes, at least for the foreseeable future. Here we describe the generation and quality of an intermediate grade of finished genomic sequence (termed comparative-grade finished sequence), which is tailored for use in multispecies sequence comparisons. Our analyses indicate that this sequence is very high quality (with the residual gaps and errors mostly falling within repetitive elements) and reflects 99% of the total sequence. Importantly, comparative-grade sequence finishing requires approximately 40-fold less reagents and approximately 10-fold less personnel effort compared to the generation of near-perfect finished sequence, such as that produced for the human genome. Although applied here to finishing sequence derived from individual bacterial artificial chromosome (BAC) clones, one could envision establishing routines for refining sequences emanating from whole-genome shotgun sequencing projects to a similar quality level. Our experience to date demonstrates that comparative-grade sequence finishing represents a practical and affordable option for sequence refinement en route to comparative analyses.
Collapse
|
143
|
Aquilante CL, Lobmeyer MT, Langaee TY, Johnson JA. Comparison of Cytochrome P450 2C9 Genotyping Methods and Implications for the Clinical Laboratory. Pharmacotherapy 2004; 24:720-6. [PMID: 15222661 DOI: 10.1592/phco.24.8.720.36074] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
STUDY OBJECTIVE To compare the accuracy, speed, and cost of two methodologies used for genotyping known variants in the cytochrome P450 (CYP) 2C9 metabolizing enzyme gene. DESIGN Comparative study. SETTING University research center. SAMPLES Fifteen-milliliter mouthwash samples collected from 253 subjects participating in a warfarin pharmacogenomic study. INTERVENTION Genotyping for the isoleucine-to-leucine change at codon 359 (Ile359Leu [*3] polymorphism) was performed by using the Pyrosequencing and polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) methods in all 253 samples. Genotyping for the arginine-to-cysteine change at codon 144 (Arg144Cys [*2] polymorphism) was performed by using Pyrosequencing in all samples and by PCR-RFLP in a random subset of 136 samples. MEASUREMENTS AND MAIN RESULTS Comparisons of genotyping success rates, time efficiency, and cost analyses were conducted for Pyrosequencing and PCR-RFLP at each variant site. Pyrosequencing and PCR-RFLP produced similar success rates on the first genotyping attempt for the Arg144Cys variant (93.3% vs 90.4%, respectively) and the Ile359Leu variant (83.8% vs 79.1%, respectively). With Pyrosequencing, genotyping 96 samples for either polymorphism could be performed in 1 hour. In contrast, genotyping 96 samples by RFLP took 10 hours for the Arg144Cys variant and 20 hours for the Ile359Leu variant. Total cost/sample for Arg144Cys genotyping was dollars 1.90 with PCR-Pyrosequencing and dollars 3.14 with PCR-RFLP. Total cost/sample for Ile359Leu genotyping was dollars 1.88 with PCR-Pyrosequencing and dollars 10.18 with PCR-RFLP CONCLUSION: Compared with RFLP, genotype determination by Pyrosequencing is a more time-efficient, cost-effective, and robust method for CYP2C9 genotyping. Because of its wide applicability and ease of use, Pyrosequencing is a promising technology for future pharmacogenomic investigations.
Collapse
|
144
|
Blondal T, Waage BG, Smarason SV, Jonsson F, Fjalldal SB, Stefansson K, Gulcher J, Smith AV. A novel MALDI-TOF based methodology for genotyping single nucleotide polymorphisms. Nucleic Acids Res 2004; 31:e155. [PMID: 14654708 PMCID: PMC291883 DOI: 10.1093/nar/gng156] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A new MALDI-TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3'-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis.
Collapse
|
145
|
Rahim A, Coutelle C, Harbottle R. High-throughput Pyrosequencing of a phage display library for the identification of enriched target-specific peptides. Biotechniques 2003; 35:317-20, 322, 324. [PMID: 12951773 DOI: 10.2144/03352st04] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Gene therapy clinical trials have highlighted the importance of specific cellular/tissue targeting of gene delivery vectors. Phage display libraries are powerful tools for the selection of novel peptide ligands as targeting moieties because of their high-throughput screening potential. However, a severe rate-limiting step in this procedure in terms of time, numbers, and cost is the sequence identification of selected phages. Here we describe the application of Pyrosequencing technology for sequencing phage isolates after panning a random 7-mer peptide expressing phage library against the A549 bronchial epithelial cell line to search for enrichment of possible targeting peptides. Pyrosequencing allows sequencing of 96 phages at one time in approximately 45 min at only a sixth of the cost of conventional sequencing methods. Using this technology, we have identified four sequences of interest. A phage binding assay revealed that three of the four sequences show a significant increase in binding abilities and specificity for A549 cells when compared to an unrelated cell line.
Collapse
|
146
|
Jobs M, Howell WM, Stromqvist L, Mayr T, Brookes AJ. DASH-2: flexible, low-cost, and high-throughput SNP genotyping by dynamic allele-specific hybridization on membrane arrays. Genome Res 2003; 13:916-24. [PMID: 12727908 PMCID: PMC430914 DOI: 10.1101/gr.801103] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Genotyping technologies need to be continually improved in terms of their flexibility, cost-efficiency, and throughput, to push forward genome variation analysis. To this end, we have leveraged the inherent simplicity of dynamic allele-specific hybridization (DASH) and coupled it to recent innovations of centrifugal arrays and iFRET. We have thereby created a new genotyping platform we term DASH-2, which we demonstrate and evaluate in this report. The system is highly flexible in many ways (any plate format, PCR multiplexing, serial and parallel array processing, spectral-multiplexing of hybridization probes), thus supporting a wide range of application scales and objectives. Precision is demonstrated to be in the range 99.8-100%, and assay costs are 0.05 USD or less per genotype assignment. DASH-2 thus provides a powerful new alternative for genotyping practice, which can be used without the need for expensive robotics support.
Collapse
|
147
|
Robertson JA. The $1000 genome: ethical and legal issues in whole genome sequencing of individuals. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2003; 3:W-IF1. [PMID: 14735880 DOI: 10.1162/152651603322874762] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Progress in gene sequencing could make rapid whole genome sequencing of individuals affordable to millions of persons and useful for many purposes in a future era of genomic medicine. Using the idea of $1000 genome as a focus, this article reviews the main technical, ethical, and legal issues that must be resolved to make mass genotyping of individuals cost-effective and ethically effective. It presents the case for individual ownership of a person's genome and its formation, and shows the implications of that position for rights to informed consent and privacy over sequencing, testing, and disclosing genomic information about identifiable individuals. Legal recognition of a person's right to control his or her genome and the information that it contains is essential for further progress in applying genomic discoveries to human lives.
Collapse
|
148
|
Nelkin D. Personal genome sequencing: the answer to all of our worries. Hastings Cent Rep 2003; 33:9. [PMID: 12613377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
|
149
|
|
150
|
Livesay EA, Liu YH, Luebke KJ, Irick J, Belosludtsev Y, Rayner S, Balog R, Johnston SA. A scalable high-throughput chemical synthesizer. Genome Res 2002; 12:1950-60. [PMID: 12466300 PMCID: PMC187575 DOI: 10.1101/gr.359002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A machine that employs a novel reagent delivery technique for biomolecular synthesis has been developed. This machine separates the addressing of individual synthesis sites from the actual process of reagent delivery by using masks placed over the sites. Because of this separation, this machine is both cost-effective and scalable, and thus the time required to synthesize 384 or 1536 unique biomolecules is very nearly the same. Importantly, the mask design allows scaling of the number of synthesis sites without the addition of new valving. Physical and biological comparisons between DNA made on a commercially available synthesizer and this unit show that it produces DNA of similar quality.
Collapse
|