1
|
Fasold M, Binder H. Variation of RNA Quality and Quantity Are Major Sources of Batch Effects in Microarray Expression Data. MICROARRAYS 2014; 3:322-39. [PMID: 27600351 PMCID: PMC4979052 DOI: 10.3390/microarrays3040322] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 09/30/2014] [Accepted: 12/08/2014] [Indexed: 01/03/2023]
Abstract
The great utility of microarrays for genome-scale expression analysis is challenged by the widespread presence of batch effects, which bias expression measurements in particular within large data sets. These unwanted technical artifacts can obscure biological variation and thus significantly reduce the reliability of the analysis results. It is largely unknown which are the predominant technical sources leading to batch effects. We here quantitatively assess the prevalence and impact of several known technical effects on microarray expression results. Particularly, we focus on important factors such as RNA degradation, RNA quantity, and sequence biases including multiple guanine effects. We find that the common variation of RNA quality and RNA quantity can not only yield low-quality expression results, but that both factors also correlate with batch effects and biological characteristics of the samples.
Collapse
Affiliation(s)
- Mario Fasold
- Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Härtelstr. 16-18, 04107 Leipzig, Germany.
- ecSeq Bioinformatics, Brandvorwerkstrasse 43, 04275 Leipzig, Germany.
| | - Hans Binder
- Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Härtelstr. 16-18, 04107 Leipzig, Germany.
- Leipzig Research Center for Civilization Diseases, Universität Leipzig, Philipp-Rosenthal-Straße 27, 04103 Leipzig, Germany.
| |
Collapse
|
2
|
Moshayedi P, Ng G, Kwok JCF, Yeo GSH, Bryant CE, Fawcett JW, Franze K, Guck J. The relationship between glial cell mechanosensitivity and foreign body reactions in the central nervous system. Biomaterials 2014; 35:3919-25. [PMID: 24529901 DOI: 10.1016/j.biomaterials.2014.01.038] [Citation(s) in RCA: 251] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Accepted: 01/15/2014] [Indexed: 12/18/2022]
Abstract
Devices implanted into the body become encapsulated due to a foreign body reaction. In the central nervous system (CNS), this can lead to loss of functionality in electrodes used to treat disorders. Around CNS implants, glial cells are activated, undergo gliosis and ultimately encapsulate the electrodes. The primary cause of this reaction is unknown. Here we show that the mechanical mismatch between nervous tissue and electrodes activates glial cells. Both primary rat microglial cells and astrocytes responded to increasing the contact stiffness from physiological values (G' ∼ 100 Pa) to shear moduli G' ≥ 10 kPa by changes in morphology and upregulation of inflammatory genes and proteins. Upon implantation of composite foreign bodies into rat brains, foreign body reactions were significantly enhanced around their stiff portions in vivo. Our results indicate that CNS glial cells respond to mechanical cues, and suggest that adapting the surface stiffness of neural implants to that of nervous tissue could minimize adverse reactions and improve biocompatibility.
Collapse
Affiliation(s)
- Pouria Moshayedi
- Cavendish Laboratory, Physics Department, University of Cambridge, UK; John van Geest Centre for Brain Repair, University of Cambridge, UK
| | - Gilbert Ng
- Cavendish Laboratory, Physics Department, University of Cambridge, UK
| | - Jessica C F Kwok
- John van Geest Centre for Brain Repair, University of Cambridge, UK
| | - Giles S H Yeo
- Metabolic Research Labs, Institute of Metabolic Science, University of Cambridge, UK
| | - Clare E Bryant
- Department of Veterinary Medicine, University of Cambridge, UK
| | - James W Fawcett
- John van Geest Centre for Brain Repair, University of Cambridge, UK
| | - Kristian Franze
- Cavendish Laboratory, Physics Department, University of Cambridge, UK; Department of Physiology, Development and Neuroscience, University of Cambridge, UK.
| | - Jochen Guck
- Cavendish Laboratory, Physics Department, University of Cambridge, UK; Biotechnology Center, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
3
|
Abstract
The systematic analysis of miRNA expression and its potential mRNA targets constitutes a basal objective in miRNA research in addition to miRNA gene detection and miRNA target prediction. In this chapter we address methodical issues of miRNA expression analysis using self-organizing maps (SOM), a neural network machine learning algorithm with strong visualization and second-level analysis capabilities widely used to categorize large-scale, high-dimensional data. We shortly review selected experimental and theoretical aspects of miRNA expression analysis. Then, the protocol of our SOM method is outlined with special emphasis on miRNA/mRNA coexpression. The method allows extracting differentially expressed RNA transcripts, their functional context, and also characterization of global properties of expression states and profiles. In addition to the separate study of miRNA and mRNA expression landscapes, we propose the combined analysis of both entities using a covariance SOM.
Collapse
Affiliation(s)
- Henry Wirth
- Interdisciplinary Centre for Bioinformatics, University of Leipzig, Leipzig, Germany
| | | | | | | |
Collapse
|
4
|
Zhu X, Wang H, Liu F, Chen L, Luo W, Su P, Li W, Yu L, Yang X, Cai J. Identification of micro-RNA networks in end-stage heart failure because of dilated cardiomyopathy. J Cell Mol Med 2013; 17:1173-87. [PMID: 23998897 PMCID: PMC4118176 DOI: 10.1111/jcmm.12096] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2013] [Accepted: 05/20/2013] [Indexed: 01/12/2023] Open
Abstract
Micro-RNAs regulate gene expression by directly binding to the target mRNAs. The goal of the study was to examine the expression profiling of miRNAs in human failing hearts and identify the key miRNAs that regulate molecular signalling networks and thus contribute to this pathological process. The levels of miRNAs and expressed genes were analysed in myocardial biopsy samples from patients with end-stage heart failure (n = 14) and those from normal heart samples (n = 8). Four networks were built including the Gene regulatory network, Signal-Network, miRNA-GO-Network and miRNA-Gene-Network. According to the fold change in the network and probability values in the microarray cohort, RT-PCR was performed to measure the expression of five of the 72 differentially regulated miRNAs. miR-340 achieved statistically significant. miR-340 was identified for the first time in cardiac pathophysiological condition. We overexpressed miR-340 in cultured neonatal rat cardiomyocytes to identify whether miR-340 plays a determining role in the progression of heart failure. ANP, BNP and caspase-3 were significantly elevated in the miR-340 transfected cells compared with controls (P < 0.05). The cross-sectional area of overexpressing miR-340 cardiomyocytes (1952.22 ± 106.59) was greater (P < 0.0001) than controls (1059.99 ± 45.59) documented by Laser Confocal Microscopy. The changes of cellular structure and the volume were statistical significance. Our study provided a comprehensive miRNA expression profiling in the end-stage heart failure and identified miR-340 as a key miRNA contributing to the occurrence and progression of heart failure. Our discoveries provide novel therapeutic targets for patients with heart failure.
Collapse
Affiliation(s)
- Xiaoming Zhu
- Department of Cardiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Mustroph A, Zanetti ME, Girke T, Bailey-Serres J. Isolation and analysis of mRNAs from specific cell types of plants by ribosome immunopurification. Methods Mol Biol 2013; 959:277-302. [PMID: 23299683 DOI: 10.1007/978-1-62703-221-6_19] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Multiple ribosomes assemble onto an individual mRNA to form a polyribosome (polysome) complex. The epitope tagging of specific ribosomal proteins can enable the immunopurification of polysomes from crude cell extracts derived from cryopreserved tissue samples. Through expression of the epitope-tagged ribosomal protein in cell-type and regional specific domains of Arabidopsis thaliana and other organisms it is feasible to quantitatively assess the mRNAs that are associated with ribosomes with cell-specific resolution. Here we present detailed methods for development of transgenics that express a FLAG-tagged version of ribosomal protein L18 (RPL18) under the direction of individual promoters with specific domains of expression, the immunopurification of ribosomes, and bioinformatic analyses of the resultant datasets obtained by microarray profiling. This methodology provides researchers with the opportunity to assess rapid changes at the organ, tissue, regional or cell-type specific level of mRNAs that are associated with ribosomes and therefore engaged in translation.
Collapse
Affiliation(s)
- Angelika Mustroph
- Department of Plant Physiology, University of Bayreuth, Bayreuth, Germany.
| | | | | | | |
Collapse
|
6
|
Wirth H, von Bergen M, Binder H. Mining SOM expression portraits: feature selection and integrating concepts of molecular function. BioData Min 2012; 5:18. [PMID: 23043905 PMCID: PMC3599960 DOI: 10.1186/1756-0381-5-18] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2011] [Accepted: 09/14/2012] [Indexed: 11/30/2022] Open
Abstract
Background Self organizing maps (SOM) enable the straightforward portraying of high-dimensional data of large sample collections in terms of sample-specific images. The analysis of their texture provides so-called spot-clusters of co-expressed genes which require subsequent significance filtering and functional interpretation. We address feature selection in terms of the gene ranking problem and the interpretation of the obtained spot-related lists using concepts of molecular function. Results Different expression scores based either on simple fold change-measures or on regularized Student’s t-statistics are applied to spot-related gene lists and compared with special emphasis on the error characteristics of microarray expression data. The spot-clusters are analyzed using different methods of gene set enrichment analysis with the focus on overexpression and/or overrepresentation of predefined sets of genes. Metagene-related overrepresentation of selected gene sets was mapped into the SOM images to assign gene function to different regions. Alternatively we estimated set-related overexpression profiles over all samples studied using a gene set enrichment score. It was also applied to the spot-clusters to generate lists of enriched gene sets. We used the tissue body index data set, a collection of expression data of human tissues as an illustrative example. We found that tissue related spots typically contain enriched populations of gene sets well corresponding to molecular processes in the respective tissues. In addition, we display special sets of housekeeping and of consistently weak and high expressed genes using SOM data filtering. Conclusions The presented methods allow the comprehensive downstream analysis of SOM-transformed expression data in terms of cluster-related gene lists and enriched gene sets for functional interpretation. SOM clustering implies the ability to define either new gene sets using selected SOM spots or to verify and/or to amend existing ones.
Collapse
Affiliation(s)
- Henry Wirth
- Interdisciplinary Centre for Bioinformatics of Leipzig University, Härtelstr, 16-18, D-4107, Leipzig, Germany.
| | | | | |
Collapse
|
7
|
Stable feature selection and classification algorithms for multiclass microarray data. Biol Direct 2012; 7:33. [PMID: 23031190 PMCID: PMC3599581 DOI: 10.1186/1745-6150-7-33] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Accepted: 09/07/2012] [Indexed: 01/04/2023] Open
Abstract
Background Recent studies suggest that gene expression profiles are a promising alternative for clinical cancer classification. One major problem in applying DNA microarrays for classification is the dimension of obtained data sets. In this paper we propose a multiclass gene selection method based on Partial Least Squares (PLS) for selecting genes for classification. The new idea is to solve multiclass selection problem with the PLS method and decomposition to a set of two-class sub-problems: one versus rest (OvR) and one versus one (OvO). We use OvR and OvO two-class decomposition for other recently published gene selection method. Ranked gene lists are highly unstable in the sense that a small change of the data set often leads to big changes in the obtained ordered lists. In this paper, we take a look at the assessment of stability of the proposed methods. We use the linear support vector machines (SVM) technique in different variants: one versus one, one versus rest, multiclass SVM (MSVM) and the linear discriminant analysis (LDA) as a classifier. We use balanced bootstrap to estimate the prediction error and to test the variability of the obtained ordered lists. Results This paper focuses on effective identification of informative genes. As a result, a new strategy to find a small subset of significant genes is designed. Our results on real multiclass cancer data show that our method has a very high accuracy rate for different combinations of classification methods, giving concurrently very stable feature rankings. Conclusions This paper shows that the proposed strategies can improve the performance of selected gene sets substantially. OvR and OvO techniques applied to existing gene selection methods improve results as well. The presented method allows to obtain a more reliable classifier with less classifier error. In the same time the method generates more stable ordered feature lists in comparison with existing methods. Reviewers This article was reviewed by Prof Marek Kimmel, Dr Hans Binder (nominated by Dr Tomasz Lipniacki) and Dr Yuriy Gusev
Collapse
|
8
|
Berkovits BD, Wang L, Guarnieri P, Wolgemuth DJ. The testis-specific double bromodomain-containing protein BRDT forms a complex with multiple spliceosome components and is required for mRNA splicing and 3'-UTR truncation in round spermatids. Nucleic Acids Res 2012; 40:7162-75. [PMID: 22570411 PMCID: PMC3424537 DOI: 10.1093/nar/gks342] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2012] [Revised: 04/05/2012] [Accepted: 04/10/2012] [Indexed: 12/02/2022] Open
Abstract
Members of the BET (bromodomain and extra terminal motif) family of proteins have been shown to be chromatin-interacting regulators of transcription. We previously generated a mutation in the testis-specific mammalian BET gene Brdt (bromodomain, testis-specific) that yields protein lacking the first bromodomain (BRDT(ΔBD1)) and observed disrupted spermiogenesis and male sterility. To determine whether BRDT(ΔBD1) protein results in altered transcription, we analyzed the transcriptomes of control versus Brdt(ΔBD1/ΔBD1) round spermatids. Over 400 genes showed statistically significant differential expression, and among the up-regulated genes, there was an enrichment of RNA splicing genes. Over 60% of these splicing genes had transcripts that lacked truncation of their 3'-untranslated region (UTR) typical of round spermatids. We selected four of these genes to characterize: Srsf2, Ddx5, Hnrnpk and Tardbp. The 3'-UTRs of Srsf2, Ddx5 and Hnrnpk mRNAs were longer in mutant round spermatids and resulted in reduced protein levels. Tardbp was transcriptionally up-regulated and a splicing shift toward the longer variant was observed. All four splicing proteins were found to complex with BRDT in control and mutant testes. We thus suggest that, along with modulating transcription, BRDT modulates gene expression as part of the splicing machinery. These modulations alter 3'-UTR processing in round spermatids; importantly, the BD1 is essential for these functions.
Collapse
Affiliation(s)
- Binyamin D. Berkovits
- Department of Genetics and Development, Biomedical Informatics Shared Resources, Bioinformatics Division, The Herbert Irving Comprehensive Cancer Center, The Institute of Human Nutrition and Department of Obstetrics and Gynecology, Columbia University Medical Center, New York, NY 10032, USA
| | - Li Wang
- Department of Genetics and Development, Biomedical Informatics Shared Resources, Bioinformatics Division, The Herbert Irving Comprehensive Cancer Center, The Institute of Human Nutrition and Department of Obstetrics and Gynecology, Columbia University Medical Center, New York, NY 10032, USA
| | - Paolo Guarnieri
- Department of Genetics and Development, Biomedical Informatics Shared Resources, Bioinformatics Division, The Herbert Irving Comprehensive Cancer Center, The Institute of Human Nutrition and Department of Obstetrics and Gynecology, Columbia University Medical Center, New York, NY 10032, USA
| | - Debra J. Wolgemuth
- Department of Genetics and Development, Biomedical Informatics Shared Resources, Bioinformatics Division, The Herbert Irving Comprehensive Cancer Center, The Institute of Human Nutrition and Department of Obstetrics and Gynecology, Columbia University Medical Center, New York, NY 10032, USA
| |
Collapse
|
9
|
Dietz A, Wichmann G. [Translational research in head and neck cancer. Biological characteristics and general aspects]. HNO 2012; 59:874-84. [PMID: 21861150 DOI: 10.1007/s00106-011-2361-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Translational research in head and neck oncology is subject to the same laws as all other solid tumors. It is based on the one hand on a solid framework of well prepared clinical studies and / or workflows according to consensus criteria with comparable documentation of clinical outcomes, while on the other on methodolgically solid and reproducible laboratory research within an effeciently interacting network. Translationally applicable single molecular markers from basic research [with the exception of p16(INK4a) as a surrogate marker for human papillomavirus (HPV)] have not found their way into clinical routine in head and neck squamous cell carcinoma (HNSCC). "Correlated gene sets" and "metagenes", including genetic profiling (omics) within clinically characterized patient groups, play an increasing role in the translational research of HNSCC. Although methodological problems currently hinder clinical oncological research, increasing focus on translational research can be observed.
Collapse
Affiliation(s)
- A Dietz
- Klinik und Poliklinik für HNO-Erkrankungen, Universitätsklinikum Leipzig, Liebigstr. 10-14, 04103, Leipzig, Deutschland.
| | | |
Collapse
|
10
|
Lee-Liu D, Almonacid LI, Faunes F, Melo F, Larrain J. Transcriptomics using next generation sequencing technologies. Methods Mol Biol 2012; 917:293-317. [PMID: 22956096 DOI: 10.1007/978-1-61779-992-1_18] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Next generation sequencing technologies may now be applied to the study of transcriptomics. RNA-Seq or RNA sequencing employs high-throughput sequencing of complementary DNA fragments delivering a transcriptional profile. In this chapter, we aim to provide a starting point for Xenopus researchers planning on starting an RNA-Seq transcriptomics study. We begin by providing a section on template isolation and library preparation. The next section comprises the main bioinformatics procedures that need to be performed for raw data processing, normalization, and differential gene expression. Finally, we have included a section on studying deep sequencing results in Xenopus, which offers general guidance as to what can be done in this model.
Collapse
Affiliation(s)
- Dasfne Lee-Liu
- Center for Aging and Regeneration and Millennium Nucleus in Regenerative Biology, Pontificia Universidad Catolica de Chile, Santiago, Chile
| | | | | | | | | |
Collapse
|
11
|
Herrero MJ, Monleon D, Morales JM, Mata M, Serna E, Aliño SF. Analysis of metabolic and gene expression changes after hydrodynamic DNA injection into mouse liver. Biol Pharm Bull 2011; 34:167-72. [PMID: 21212539 DOI: 10.1248/bpb.34.167] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The hydrodynamic injection in mice tail vein of a plasmid (40 µg DNA) bearing the human α1-antitrypsin gene mediates: a) good liver gene transfer resulting in therapeutic plasma levels of human protein (1 mg/ml, approximately) from days 1-10 after injection; b) low liver injury as demonstrated by a poor and transient increase of aspartate aminotransferase (AST) and alanine transaminase (ALT) in mouse plasma; 3) limited expression and metabolic changes in host liver genes and metabolites as evaluated on days 2 and 10 after injection. Groups of three mice were uninjected (control) or hydrodynamically injected with saline or plasmid DNA and then sacrificed on days 2 and 10 after injection. The results of principal component analysis (PCA) show, both in expression microarray and metabolomic analysis, that changes between control and hydrodynamically injected groups are not dramatic and tend to normalize after 10 d. The differences are even smaller between DNA and saline hydrodynamically injected mice. Hydrodynamic injection induces a complex but limited gene expression and metabolic change which includes variations in molecules related to energy metabolism and stress response. The results contribute to support that hydrodynamic method is a safe procedure of liver gene transfer but the long-term effect of hydrodynamic gene transfer procedure, remains to be studied.
Collapse
Affiliation(s)
- Maria Jose Herrero
- Gene Therapy Unit, Department of Pharmacology, Faculty of Medicine, University of Valencia, Spain
| | | | | | | | | | | |
Collapse
|
12
|
McCormick KP, Willmann MR, Meyers BC. Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments. SILENCE 2011; 2:2. [PMID: 21356093 PMCID: PMC3055805 DOI: 10.1186/1758-907x-2-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Accepted: 02/28/2011] [Indexed: 01/30/2023]
Abstract
Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in the design of sRNA sequencing experiments, including choosing a sequencing platform, inherent biases that affect sRNA measurements and replication. We outline the steps involved in preprocessing sRNA sequencing data and review both the principles behind and the current options for normalization. Finally, we discuss differential expression analysis in the absence and presence of biological replicates. While our focus is on sRNA sequencing experiments, many of the principles discussed are applicable to the sequencing of other RNA populations.
Collapse
Affiliation(s)
- Kevin P McCormick
- Department of Plant and Soil Sciences and Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | - Matthew R Willmann
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Blake C Meyers
- Department of Plant and Soil Sciences and Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| |
Collapse
|
13
|
Fasold M, Stadler PF, Binder H. G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration. BMC Bioinformatics 2010; 11:207. [PMID: 20423484 PMCID: PMC2884167 DOI: 10.1186/1471-2105-11-207] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2010] [Accepted: 04/27/2010] [Indexed: 02/02/2023] Open
Abstract
Background The brightness of the probe spots on expression microarrays intends to measure the abundance of specific mRNA targets. Probes with runs of at least three guanines (G) in their sequence show abnormal high intensities which reflect rather probe effects than target concentrations. This G-bias requires correction prior to downstream expression analysis. Results Longer runs of three or more consecutive G along the probe sequence and in particular triple degenerated G at its solution end ((GGG)1-effect) are associated with exceptionally large probe intensities on GeneChip expression arrays. This intensity bias is related to non-specific hybridization and affects both perfect match and mismatch probes. The (GGG)1-effect tends to increase gradually for microarrays of later GeneChip generations. It was found for DNA/RNA as well as for DNA/DNA probe/target-hybridization chemistries. Amplification of sample RNA using T7-primers is associated with strong positive amplitudes of the G-bias whereas alternative amplification protocols using random primers give rise to much smaller and partly even negative amplitudes. We applied positional dependent sensitivity models to analyze the specifics of probe intensities in the context of all possible short sequence motifs of one to four adjacent nucleotides along the 25meric probe sequence. Most of the longer motifs are adequately described using a nearest-neighbor (NN) model. In contrast, runs of degenerated guanines require explicit consideration of next nearest neighbors (GGG terms). Preprocessing methods such as vsn, RMA, dChip, MAS5 and gcRMA only insufficiently remove the G-bias from data. Conclusions Positional and motif dependent sensitivity models accounts for sequence effects of oligonucleotide probe intensities. We propose a positional dependent NN+GGG hybrid model to correct the intensity bias associated with probes containing poly-G motifs. It is implemented as a single-chip based calibration algorithm for GeneChips which can be applied in a pre-correction step prior to standard preprocessing.
Collapse
Affiliation(s)
- Mario Fasold
- Interdisciplinary Centre for Bioinformatics, University Leipzig, Germany
| | | | | |
Collapse
|
14
|
Burden CJ, Binder H. Physico-chemical modelling of target depletion during hybridization on oligonulceotide microarrays. Phys Biol 2009; 7:016004. [PMID: 20026877 DOI: 10.1088/1478-3975/7/1/016004] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
15
|
Binder H, Krohn K, Preibisch S. "Hook"-calibration of GeneChip-microarrays: chip characteristics and expression measures. Algorithms Mol Biol 2008; 3:11. [PMID: 18759984 PMCID: PMC2543012 DOI: 10.1186/1748-7188-3-11] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2008] [Accepted: 08/29/2008] [Indexed: 11/10/2022] Open
Abstract
Background Microarray experiments rely on several critical steps that may introduce biases and uncertainty in downstream analyses. These steps include mRNA sample extraction, amplification and labelling, hybridization, and scanning causing chip-specific systematic variations on the raw intensity level. Also the chosen array-type and the up-to-dateness of the genomic information probed on the chip affect the quality of the expression measures. In the accompanying publication we presented theory and algorithm of the so-called hook method which aims at correcting expression data for systematic biases using a series of new chip characteristics. Results In this publication we summarize the essential chip characteristics provided by this method, analyze special benchmark experiments to estimate transcript related expression measures and illustrate the potency of the method to detect and to quantify the quality of a particular hybridization. It is shown that our single-chip approach provides expression measures responding linearly on changes of the transcript concentration over three orders of magnitude. In addition, the method calculates a detection call judging the relation between the signal and the detection limit of the particular measurement. The performance of the method in the context of different chip generations and probe set assignments is illustrated. The hook method characterizes the RNA-quality in terms of the 3'/5'-amplification bias and the sample-specific calling rate. We show that the proper judgement of these effects requires the disentanglement of non-specific and specific hybridization which, otherwise, can lead to misinterpretations of expression changes. The consequences of modifying probe/target interactions by either changing the labelling protocol or by substituting RNA by DNA targets are demonstrated. Conclusion The single-chip based hook-method provides accurate expression estimates and chip-summary characteristics using the natural metrics given by the hybridization reaction with the potency to develop new standards for microarray quality control and calibration.
Collapse
|