Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bertone P, Trifonov V, Rozowsky JS, Schubert F, Emanuelsson O, Karro J, Kao MY, Snyder M, Gerstein M. Design optimization methods for genomic DNA tiling arrays. Genome Res 2005;16:271-81. [PMID: 16365382 PMCID: PMC1361723 DOI: 10.1101/gr.4452906] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]

For:	Bertone P, Trifonov V, Rozowsky JS, Schubert F, Emanuelsson O, Karro J, Kao MY, Snyder M, Gerstein M. Design optimization methods for genomic DNA tiling arrays. Genome Res 2005;16:271-81. [PMID: 16365382 PMCID: PMC1361723 DOI: 10.1101/gr.4452906] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]

Number

Cited by Other Article(s)

Nunes R, Storer C, Doleck T, Kawahara AY, Pierce NE, Lohman DJ. Predictors of sequence capture in a large-scale anchored phylogenomics project. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.943361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open

Abstract Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0–111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/μl - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age. Collapse

Dickson ZW, Hackenberger D, Kuch M, Marzok A, Banerjee A, Rossi L, Klowak JA, Fox-Robichaud A, Mossmann K, Miller MS, Surette MG, Golding GB, Poinar H. Probe design for simultaneous, targeted capture of diverse metagenomic targets. CELL REPORTS METHODS 2021;1:100069. [PMID: 35474894 PMCID: PMC9017208 DOI: 10.1016/j.crmeth.2021.100069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 06/10/2021] [Accepted: 08/05/2021] [Indexed: 11/20/2022]

Affiliation(s)

Zachery W. Dickson Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada
Dirk Hackenberger Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada
Melanie Kuch McMaster aDNA Center, Department of Anthropology, McMaster University, Hamilton, ON L8S 4L9, Canada
Art Marzok Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada McMaster Immunology Research Center, McMaster University, Hamilton, ON L8S 4K1, Canada
Arinjay Banerjee Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada McMaster Immunology Research Center, McMaster University, Hamilton, ON L8S 4K1, Canada Department of Pathology and Molecular Medicine, McMaster University, Hamilton, ON L8S 4K1, Canada Vaccine and Infectious Disease Organization, Department of Veterinary Microbiology, University of Saskatchewan, Saskatoon, SK S7N 5E3, Canada
Laura Rossi Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada
Jennifer Ann Klowak Department of Pediatrics, McMaster University, Hamilton, ON L8S 4K1, Canada
Alison Fox-Robichaud Department of Medicine, McMaster University, Hamilton, ON L8S 4K1, Canada
Karen Mossmann Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada McMaster Immunology Research Center, McMaster University, Hamilton, ON L8S 4K1, Canada Department of Medicine, McMaster University, Hamilton, ON L8S 4K1, Canada
Matthew S. Miller Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada McMaster Immunology Research Center, McMaster University, Hamilton, ON L8S 4K1, Canada
Michael G. Surette Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada Department of Medicine, McMaster University, Hamilton, ON L8S 4K1, Canada
Geoffrey Brian Golding Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada
Hendrik Poinar Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8S 4K1, Canada Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON L8S 4K1, Canada McMaster aDNA Center, Department of Anthropology, McMaster University, Hamilton, ON L8S 4L9, Canada

Collapse

Andermann T, Torres Jiménez MF, Matos-Maraví P, Batista R, Blanco-Pastor JL, Gustafsson ALS, Kistler L, Liberal IM, Oxelman B, Bacon CD, Antonelli A. A Guide to Carrying Out a Phylogenomic Target Sequence Capture Project. Front Genet 2020;10:1407. [PMID: 32153629 PMCID: PMC7047930 DOI: 10.3389/fgene.2019.01407] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 12/24/2019] [Indexed: 12/17/2022] Open

Affiliation(s)

Tobias Andermann Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
Maria Fernanda Torres Jiménez Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
Pável Matos-Maraví Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden Institute of Entomology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
Romina Batista Gothenburg Global Biodiversity Centre, Gothenburg, Sweden Programa de Pós-Graduação em Genética, Conservação e Biologia Evolutiva, PPG GCBEv–Instituto Nacional de Pesquisas da Amazônia—INPA Campus II, Manaus, Brazil Coordenação de Zoologia, Museu Paraense Emílio Goeldi, Belém, Brazil
José L. Blanco-Pastor Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden INRAE, Centre Nouvelle-Aquitaine-Poitiers, Lusignan, France
A. Lovisa S. Gustafsson Natural History Museum, University of Oslo, Oslo, Norway
Logan Kistler Department of Anthropology, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States
Isabel M. Liberal Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
Bengt Oxelman Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
Christine D. Bacon Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
Alexandre Antonelli Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden Gothenburg Global Biodiversity Centre, Gothenburg, Sweden Royal Botanic Gardens, Kew, Richmond-Surrey, United Kingdom

Collapse

Veeckman E, Van Glabeke S, Haegeman A, Muylle H, van Parijs FRD, Byrne SL, Asp T, Studer B, Rohde A, Roldán-Ruiz I, Vandepoele K, Ruttink T. Overcoming challenges in variant calling: exploring sequence diversity in candidate genes for plant development in perennial ryegrass (Lolium perenne). DNA Res 2019;26:1-12. [PMID: 30325414 PMCID: PMC6379033 DOI: 10.1093/dnares/dsy033] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Accepted: 09/06/2018] [Indexed: 11/13/2022] Open

Parisot N, Peyretaillade E, Dugat-Bony E, Denonfoux J, Mahul A, Peyret P. Probe Design Strategies for Oligonucleotide Microarrays. Methods Mol Biol 2016;1368:67-82. [PMID: 26614069 DOI: 10.1007/978-1-4939-3136-1_6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Copy Number Variation in Chickens: A Review and Future Prospects. MICROARRAYS 2014;3:24-38. [PMID: 27605028 PMCID: PMC5003453 DOI: 10.3390/microarrays3010024] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 12/15/2013] [Revised: 01/22/2014] [Accepted: 01/23/2014] [Indexed: 12/19/2022]

Empirical assessment of competitive hybridization and noise in ultra high density canine tiling arrays. BMC Bioinformatics 2013;14:231. [PMID: 23870167 PMCID: PMC3733988 DOI: 10.1186/1471-2105-14-231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2012] [Accepted: 07/15/2013] [Indexed: 11/25/2022] Open

Uitdewilligen JGAML, Wolters AMA, D’hoop BB, Borm TJA, Visser RGF, van Eck HJ. A next-generation sequencing method for genotyping-by-sequencing of highly heterozygous autotetraploid potato. PLoS One 2013;8:e62355. [PMID: 23667470 PMCID: PMC3648547 DOI: 10.1371/journal.pone.0062355] [Citation(s) in RCA: 238] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2012] [Accepted: 03/20/2013] [Indexed: 11/23/2022] Open

Abstract

Assessment of genomic DNA sequence variation and genotype calling in autotetraploids implies the ability to distinguish among five possible alternative allele copy number states. This study demonstrates the accuracy of genotyping-by-sequencing (GBS) of a large collection of autotetraploid potato cultivars using next-generation sequencing. It is still costly to reach sufficient read depths on a genome wide scale, across the cultivated gene pool. Therefore, we enriched cultivar-specific DNA sequencing libraries using an in-solution hybridisation method (SureSelect). This complexity reduction allowed to confine our study to 807 target genes distributed across the genomes of 83 tetraploid cultivars and one reference (DM 1–3 511). Indexed sequencing libraries were paired-end sequenced in 7 pools of 12 samples using Illumina HiSeq2000. After filtering and processing the raw sequence data, 12.4 Gigabases of high-quality sequence data was obtained, which mapped to 2.1 Mb of the potato reference genome, with a median average read depth of 63× per cultivar. We detected 129,156 sequence variants and genotyped the allele copy number of each variant for every cultivar. In this cultivar panel a variant density of 1 SNP/24 bp in exons and 1 SNP/15 bp in introns was obtained. The average minor allele frequency (MAF) of a variant was 0.14. Potato germplasm displayed a large number of relatively rare variants and/or haplotypes, with 61% of the variants having a MAF below 0.05. A very high average nucleotide diversity (π = 0.0107) was observed. Nucleotide diversity varied among potato chromosomes. Several genes under selection were identified. Genotyping-by-sequencing results, with allele copy number estimates, were validated with a KASP genotyping assay. This validation showed that read depths of ∼60–80× can be used as a lower boundary for reliable assessment of allele copy number of sequence variants in autotetraploids. Genotypic data were associated with traits, and alleles strongly influencing maturity and flesh colour were identified.

Collapse

Ward M, Wilson M, Barbosa-Morais N, Schmidt D, Stark R, Pan Q, Schwalie P, Menon S, Lukk M, Watt S, Thybert D, Kutter C, Kirschner K, Flicek P, Blencowe B, Odom D. Latent regulatory potential of human-specific repetitive elements. Mol Cell 2013;49:262-72. [PMID: 23246434 PMCID: PMC3560060 DOI: 10.1016/j.molcel.2012.11.013] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2012] [Revised: 09/28/2012] [Accepted: 11/09/2012] [Indexed: 12/26/2022]

Affiliation(s)

Michelle C. Ward University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Michael D. Wilson University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Nuno L. Barbosa-Morais Banting and Best Department of Medical Research and Department of Molecular Genetics, Donnelly Centre, Toronto, ON M5S 3E1, Canada
Dominic Schmidt University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Rory Stark University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Qun Pan Banting and Best Department of Medical Research and Department of Molecular Genetics, Donnelly Centre, Toronto, ON M5S 3E1, Canada
Petra C. Schwalie European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK
Suraj Menon University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Margus Lukk University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Stephen Watt University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
David Thybert European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK
Claudia Kutter University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Kristina Kirschner University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK
Paul Flicek European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK
Benjamin J. Blencowe Banting and Best Department of Medical Research and Department of Molecular Genetics, Donnelly Centre, Toronto, ON M5S 3E1, Canada
Duncan T. Odom University of Cambridge, Cancer Research UK-Cambridge Institute, Robinson Way, Cambridge CB2 0RE, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK

Collapse

Coman D, Gruissem W, Hennig L. Transcript profiling in Arabidopsis with genome tiling microarrays. Methods Mol Biol 2013;1067:35-49. [PMID: 23975784 DOI: 10.1007/978-1-62703-607-8_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Lemetre C, Zhang ZD. A brief introduction to tiling microarrays: principles, concepts, and applications. Methods Mol Biol 2013;1067:3-19. [PMID: 23975782 DOI: 10.1007/978-1-62703-607-8_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Du Y, Murani E, Ponsuksili S, Wimmers K. Flexible and efficient genome tiling design with penalized uniqueness score. BMC Bioinformatics 2012;13:323. [PMID: 23216884 PMCID: PMC3583072 DOI: 10.1186/1471-2105-13-323] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 10/26/2012] [Indexed: 11/24/2022] Open

Cheng JB, Cho RJ. Genetics and epigenetics of the skin meet deep sequence. J Invest Dermatol 2012;132:923-32. [PMID: 22237701 DOI: 10.1038/jid.2011.436] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Hafemeister C, Krause R, Schliep A. Selecting oligonucleotide probes for whole-genome tiling arrays with a cross-hybridization potential. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:1642-1652. [PMID: 21358006 DOI: 10.1109/tcbb.2011.39] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Weinhouse C, Anderson OS, Jones TR, Kim J, Liberman SA, Nahar MS, Rozek LS, Jirtle RL, Dolinoy DC. An expression microarray approach for the identification of metastable epialleles in the mouse genome. Epigenetics 2011;6:1105-13. [PMID: 21829099 DOI: 10.4161/epi.6.9.17103] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Abstract

Genetic loci displaying environmentally responsive epigenetic marks, termed metastable epialleles, offer a solution to the paradox presented by genetically identical yet phenotypically distinct individuals. The murine viable yellow agouti (A (vy) ) metastable epiallele exhibits stochastic DNA methylation and histone modifications associated with coat color variation in isogenic individuals. The distribution of A (vy) variable expressivity shifts following maternal nutritional and environmental exposures. To characterize additional murine metastable epialleles, we utilized genome-wide expression arrays (N = 10 male individuals, 3 tissues per individual) and identified candidates displaying large variability in gene expression among individuals (Vi = inter-individual variance), concomitant with a low variability in gene expression across tissues from the three germ layers (Vt = inter-tissue variance), two features characteristic of the A (vy) metastable epiallele. The CpG island in the promoter of Dnajb1 and two contraoriented ERV class II repeats in Glcci1 were validated to display underlying stochasticity in methylation patterns common to metastable epialleles. Furthermore, liver DNA methylation in mice exposed in utero to 50 mg bisphenol A (BPA)/kg diet (N = 91) or a control diet (N = 79) confirmed environmental lability at validated candidate genes. Significant effects of exposure on mean CpG methylation were observed at the Glcci1 Repeat 1 locus (p < 0.0001). Significant effects of BPA also were observed at the first and fifth CpG sites studied in Glcci1 Repeat 2 (p < 0.0001 and p = 0.004, respectively). BPA did not affect methylation in the promoter of Dnajb1 (p = 0.59). The characterization of metastable epialleles in humans is crucial for the development of novel screening and therapeutic targets for human disease prevention.

Collapse

Dufour YS, Wesenberg GE, Tritt AJ, Glasner JD, Perna NT, Mitchell JC, Donohue TJ. chipD: a web tool to design oligonucleotide probes for high-density tiling arrays. Nucleic Acids Res 2010;38:W321-5. [PMID: 20529880 PMCID: PMC2896189 DOI: 10.1093/nar/gkq517] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Mulle JG, Patel VC, Warren ST, Hegde MR, Cutler DJ, Zwick ME. Empirical evaluation of oligonucleotide probe selection for DNA microarrays. PLoS One 2010;5:e9921. [PMID: 20360966 PMCID: PMC2847945 DOI: 10.1371/journal.pone.0009921] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2009] [Accepted: 12/04/2009] [Indexed: 12/04/2022] Open

Jourdren L, Duclos A, Brion C, Portnoy T, Mathis H, Margeot A, Le Crom S. Teolenn: an efficient and customizable workflow to design high-quality probes for microarray experiments. Nucleic Acids Res 2010;38:e117. [PMID: 20176570 PMCID: PMC2879536 DOI: 10.1093/nar/gkq110] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Høvik H, Chen T. Dynamic probe selection for studying microbial transcriptome with high-density genomic tiling microarrays. BMC Bioinformatics 2010;11:82. [PMID: 20144223 PMCID: PMC2836303 DOI: 10.1186/1471-2105-11-82] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2009] [Accepted: 02/09/2010] [Indexed: 12/27/2022] Open

Ye K, Jia Z, Wang Y, Flicek P, Apweiler R. Mining Unique-m Substrings from Genomes. ACTA ACUST UNITED AC 2010;3:099-103. [PMID: 29657484 PMCID: PMC5894807 DOI: 10.4172/jpb.1000127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Phillippy AM, Deng X, Zhang W, Salzberg SL. Efficient oligonucleotide probe selection for pan-genomic tiling arrays. BMC Bioinformatics 2009;10:293. [PMID: 19758451 PMCID: PMC2753849 DOI: 10.1186/1471-2105-10-293] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 09/16/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Array comparative genomic hybridization is a fast and cost-effective method for detecting, genotyping, and comparing the genomic sequence of unknown bacterial isolates. This method, as with all microarray applications, requires adequate coverage of probes targeting the regions of interest. An unbiased tiling of probes across the entire length of the genome is the most flexible design approach. However, such a whole-genome tiling requires that the genome sequence is known in advance. For the accurate analysis of uncharacterized bacteria, an array must query a fully representative set of sequences from the species' pan-genome. Prior microarrays have included only a single strain per array or the conserved sequences of gene families. These arrays omit potentially important genes and sequence variants from the pan-genome.

RESULTS

This paper presents a new probe selection algorithm (PanArray) that can tile multiple whole genomes using a minimal number of probes. Unlike arrays built on clustered gene families, PanArray uses an unbiased, probe-centric approach that does not rely on annotations, gene clustering, or multi-alignments. Instead, probes are evenly tiled across all sequences of the pan-genome at a consistent level of coverage. To minimize the required number of probes, probes conserved across multiple strains in the pan-genome are selected first, and additional probes are used only where necessary to span polymorphic regions of the genome. The viability of the algorithm is demonstrated by array designs for seven different bacterial pan-genomes and, in particular, the design of a 385,000 probe array that fully tiles the genomes of 20 different Listeria monocytogenes strains with overlapping probes at greater than twofold coverage.

CONCLUSION

PanArray is an oligonucleotide probe selection algorithm for tiling multiple genome sequences using a minimal number of probes. It is capable of fully tiling all genomes of a species on a single microarray chip. These unique pan-genome tiling arrays provide maximum flexibility for the analysis of both known and uncharacterized strains.

Collapse

Sasidharan R, Agarwal A, Rozowsky J, Gerstein M. An approach to comparing tiling array and high throughput sequencing technologies for genomic transcript mapping. BMC Res Notes 2009;2:150. [PMID: 19630981 PMCID: PMC2764720 DOI: 10.1186/1756-0500-2-150] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2009] [Accepted: 07/24/2009] [Indexed: 11/24/2022] Open

Tang H, Therneau TM. Statistical metrics for quality assessment of high-density tiling array data. Biometrics 2009;66:630-5. [PMID: 19645697 DOI: 10.1111/j.1541-0420.2009.01298.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Mita H, Toyota M, Aoki F, Akashi H, Maruyama R, Sasaki Y, Suzuki H, Idogawa M, Kashima L, Yanagihara K, Fujita M, Hosokawa M, Kusano M, Sabau SV, Tatsumi H, Imai K, Shinomura Y, Tokino T. A novel method, digital genome scanning detects KRAS gene amplification in gastric cancers: involvement of overexpressed wild-type KRAS in downstream signaling and cancer cell growth. BMC Cancer 2009;9:198. [PMID: 19545448 PMCID: PMC2717977 DOI: 10.1186/1471-2407-9-198] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2008] [Accepted: 06/23/2009] [Indexed: 01/02/2023] Open

Abstract

Background

Gastric cancer is the third most common malignancy affecting the general population worldwide. Aberrant activation of KRAS is a key factor in the development of many types of tumor, however, oncogenic mutations of KRAS are infrequent in gastric cancer. We have developed a novel quantitative method of analysis of DNA copy number, termed digital genome scanning (DGS), which is based on the enumeration of short restriction fragments, and does not involve PCR or hybridization. In the current study, we used DGS to survey copy-number alterations in gastric cancer cells.

Methods

DGS of gastric cancer cell lines was performed using the sequences of 5000 to 15000 restriction fragments. We screened 20 gastric cancer cell lines and 86 primary gastric tumors for KRAS amplification by quantitative PCR, and investigated KRAS amplification at the DNA, mRNA and protein levels by mutational analysis, real-time PCR, immunoblot analysis, GTP-RAS pull-down assay and immunohistochemical analysis. The effect of KRAS knock-down on the activation of p44/42 MAP kinase and AKT and on cell growth were examined by immunoblot and colorimetric assay, respectively.

Results

DGS analysis of the HSC45 gastric cancer cell line revealed the amplification of a 500-kb region on chromosome 12p12.1, which contains the KRAS gene locus. Amplification of the KRAS locus was detected in 15% (3/20) of gastric cancer cell lines (8–18-fold amplification) and 4.7% (4/86) of primary gastric tumors (8–50-fold amplification). KRAS mutations were identified in two of the three cell lines in which KRAS was amplified, but were not detected in any of the primary tumors. Overexpression of KRAS protein correlated directly with increased KRAS copy number. The level of GTP-bound KRAS was elevated following serum stimulation in cells with amplified wild-type KRAS, but not in cells with amplified mutant KRAS. Knock-down of KRAS in gastric cancer cells that carried amplified wild-type KRAS resulted in the inhibition of cell growth and suppression of p44/42 MAP kinase and AKT activity.

Conclusion

Our study highlights the utility of DGS for identification of copy-number alterations. Using DGS, we identified KRAS as a gene that is amplified in human gastric cancer. We demonstrated that gene amplification likely forms the molecular basis of overactivation of KRAS in gastric cancer. Additional studies using a larger cohort of gastric cancer specimens are required to determine the diagnostic and therapeutic implications of KRAS amplification and overexpression.

Collapse

Thomassen GOS, Rowe AD, Lagesen K, Lindvall JM, Rognes T. Custom design and analysis of high-density oligonucleotide bacterial tiling microarrays. PLoS One 2009;4:e5943. [PMID: 19536279 PMCID: PMC2691959 DOI: 10.1371/journal.pone.0005943] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Accepted: 05/18/2009] [Indexed: 11/21/2022] Open

Abstract

Background

High-density tiling microarrays are a powerful tool for the characterization of complete genomes. The two major computational challenges associated with custom-made arrays are design and analysis. Firstly, several genome dependent variables, such as the genome's complexity and sequence composition, need to be considered in the design to ensure a high quality microarray. Secondly, since tiling projects today very often exceed the limits of conventional array-experiments, researchers cannot use established computer tools designed for commercial arrays, and instead have to redesign previous methods or create novel tools.

Principal Findings

Here we describe the multiple aspects involved in the design of tiling arrays for transcriptome analysis and detail the normalisation and analysis procedures for such microarrays. We introduce a novel design method to make two 280,000 feature microarrays covering the entire genome of the bacterial species Escherichia coli and Neisseria meningitidis, respectively, as well as the use of multiple copies of control probe-sets on tiling microarrays. Furthermore, a novel normalisation and background estimation procedure for tiling arrays is presented along with a method for array analysis focused on detection of short transcripts. The design, normalisation and analysis methods have been applied in various experiments and several of the detected novel short transcripts have been biologically confirmed by Northern blot tests.

Conclusions

Tiling-arrays are becoming increasingly applicable in genomic research, but researchers still lack both the tools for custom design of arrays, as well as the systems and procedures for analysis of the vast amount of data resulting from such experiments. We believe that the methods described herein will be a useful contribution and resource for researchers designing and analysing custom tiling arrays for both bacteria and higher organisms.

Collapse

Lemoine S, Combes F, Le Crom S. An evaluation of custom microarray applications: the oligonucleotide design challenge. Nucleic Acids Res 2009;37:1726-39. [PMID: 19208645 PMCID: PMC2665234 DOI: 10.1093/nar/gkp053] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Lorenzi H, Thiagarajan M, Haas B, Wortman J, Hall N, Caler E. Genome wide survey, discovery and evolution of repetitive elements in three Entamoeba species. BMC Genomics 2008;9:595. [PMID: 19077187 PMCID: PMC2657916 DOI: 10.1186/1471-2164-9-595] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2008] [Accepted: 12/10/2008] [Indexed: 11/14/2022] Open

Abstract

Background

Identification and mapping of repetitive elements is a key step for accurate gene prediction and overall structural annotation of genomes. During the assembly and annotation of three highly repetitive amoeba genomes, Entamoeba histolytica, Entamoeba dispar, and Entamoeba invadens, we performed comparative sequence analysis to identify and map all class I and class II transposable elements in their sequences.

Results

Here, we report the identification of two novel Entamoeba-specific repeats: ERE1 and ERE2; ERE1 is spread across the three genomes and associated with different repeats in a species-specific manner, while ERE2 is unique to E. histolytica. We also report the identification of two novel subfamilies of LINE and SINE retrotransposons in E. dispar and provide evidence for how the different LINE and SINE subfamilies evolved in these species. Additionally, we found a putative transposase-coding gene in E. histolytica and E. dispar related to the mariner transposon Hydargos from E. invadens. The distribution of transposable elements in these genomes is markedly skewed with a tendency of forming clusters. More than 70% of the three genomes have a repeat density below their corresponding average value indicating that transposable elements are not evenly distributed. We show that repeats and repeat-clusters are found at syntenic break points between E. histolytica and E. dispar and hence, could work as recombination hot spots promoting genome rearrangements.

Conclusion

The mapping of all transposable elements found in these parasites shows that repeat coverage is up to three times higher than previously reported. LINE, ERE1 and mariner elements were present in the common ancestor to the three Entamoeba species while ERE2 was likely acquired by E. histolytica after its separation from E. dispar. We demonstrate that E. histolytica and E. dispar share their entire repertoire of LINE and SINE retrotransposons and that Eh_SINE3/Ed_SINE1 originated as a chimeric SINE from Eh/Ed_SINE2 and Eh_SINE1/Ed_SINE3. Our work shows that transposable elements are organized in clusters, frequently found at syntenic break points providing insights into their contribution to chromosome instability and therefore, to genomic variation and speciation in these parasites.

Collapse

Schliep A, Krause R. Efficient algorithms for the computational design of optimal tiling arrays. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2008;5:557-567. [PMID: 18989043 DOI: 10.1109/tcbb.2008.50] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Srivastava GP, Guo J, Shi H, Xu D. PRIMEGENS-v2: genome-wide primer design for analyzing DNA methylation patterns of CpG islands. Bioinformatics 2008;24:1837-42. [PMID: 18579568 DOI: 10.1093/bioinformatics/btn320] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

He H, Wang J, Liu T, Liu XS, Li T, Wang Y, Qian Z, Zheng H, Zhu X, Wu T, Shi B, Deng W, Zhou W, Skogerbø G, Chen R. Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray. Genome Res 2007;17:1471-7. [PMID: 17785534 PMCID: PMC1987347 DOI: 10.1101/gr.6611807] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Affiliation(s)

Housheng He Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China Corresponding author.E-MAIL ; fax 86-10-64889892.E-mail ; fax 86-10-64889892
Jie Wang Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China Corresponding author.E-MAIL ; fax 86-10-64889892.E-mail ; fax 86-10-64889892
Tao Liu Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China Corresponding author.E-MAIL ; fax 86-10-64889892.E-mail ; fax 86-10-64889892
X. Shirley Liu Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA Harvard School of Public Health, Boston, Massachusetts 02115, USA
Tiantian Li Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Yunfei Wang Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Zuwei Qian Affymetrix, Inc., Santa Clara, California 95051, USA
Haixia Zheng Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Xiaopeng Zhu Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Tao Wu Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Baochen Shi Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Graduate School of the Chinese Academy of Science, Beijing 100080, China
Wei Deng Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
Wei Zhou Affymetrix, Inc., Santa Clara, California 95051, USA
Geir Skogerbø Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Corresponding author.E-MAIL ; fax 86-10-64889892.E-mail ; fax 86-10-64889892
Runsheng Chen Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China Bioinformatics Research Group, Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Science, Beijing 100080, China Chinese National Human Genome Center, Beijing 100176, China Corresponding author.E-MAIL ; fax 86-10-64889892.E-mail ; fax 86-10-64889892

Collapse

Gräf S, Nielsen FGG, Kurtz S, Huynen MA, Birney E, Stunnenberg H, Flicek P. Optimized design and assessment of whole genome tiling arrays. ACTA ACUST UNITED AC 2007;23:i195-204. [PMID: 17646297 PMCID: PMC5892713 DOI: 10.1093/bioinformatics/btm200] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Rivals E, Boureux A, Lejeune M, Ottones F, Pecharromàn Pérez O, Tarhio J, Pierrat F, Ruffle F, Commes T, Marti J. Transcriptome annotation using tandem SAGE tags. Nucleic Acids Res 2007;35:e108. [PMID: 17709346 PMCID: PMC2034470 DOI: 10.1093/nar/gkm495] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Affiliation(s)

Eric Rivals Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Anthony Boureux Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Mireille Lejeune Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Florence Ottones Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Oscar Pecharromàn Pérez Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Jorma Tarhio Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Fabien Pierrat Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Florence Ruffle Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France
Thérèse Commes Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France *To whom correspondence should be addressed. +33 4 67 14 42 36+33 4 67 14 42 36 Correspondence may also be addressed to Jacques Marti. +334 67 144241
Jacques Marti Laboratoire d’Informatique, de Robotique et de Microélectronique, UMR 5506 CNRS – Université de Montpellier II, 161 rue Ada, 34392 Montpellier 05, Institut de Génétique Humaine, CNRS UPR 1142, 141 rue de la Cardonille, 34396 Montpellier 05, France, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland and Skuld-Tech, 134, rue du Curat – Bat. Amarante, 34090 Montpellier, France

Collapse

Euskirchen GM, Rozowsky JS, Wei CL, Lee WH, Zhang ZD, Hartman S, Emanuelsson O, Stolc V, Weissman S, Gerstein MB, Ruan Y, Snyder M. Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies. Genome Res 2007;17:898-909. [PMID: 17568005 PMCID: PMC1891348 DOI: 10.1101/gr.5583007] [Citation(s) in RCA: 160] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

An efficient pseudomedian filter for tiling microrrays. BMC Bioinformatics 2007;8:186. [PMID: 17555595 PMCID: PMC1913926 DOI: 10.1186/1471-2105-8-186] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2007] [Accepted: 06/07/2007] [Indexed: 11/17/2022] Open

Abstract

Background

Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, a O(n²logn) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution.

Results

We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(nlogn) from O(n²logn). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets.

Conclusion

Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at .

Collapse

Schnieper-Samec S, Feger G, Wells TN. New biological therapies from the human genome. Expert Opin Drug Discov 2007;2:621-31. [PMID: 23488954 DOI: 10.1517/17460441.2.5.621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Plomin R, Schalkwyk LC. Microarrays. Dev Sci 2007;10:19-23. [PMID: 17181694 PMCID: PMC2776927 DOI: 10.1111/j.1467-7687.2007.00558.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Emanuelsson O, Nagalakshmi U, Zheng D, Rozowsky JS, Urban AE, Du J, Lian Z, Stolc V, Weissman S, Snyder M, Gerstein MB. Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome. Genome Res 2006;17:886-97. [PMID: 17119069 PMCID: PMC1891347 DOI: 10.1101/gr.5014606] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Ryder E, Jackson R, Ferguson-Smith A, Russell S. MAMMOT--a set of tools for the design, management and visualization of genomic tiling arrays. Bioinformatics 2006;22:883-4. [PMID: 16452111 DOI: 10.1093/bioinformatics/btl031] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open