Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Nykter M, Aho T, Ahdesmäki M, Ruusuvuori P, Lehmussola A, Yli-Harja O. Simulation of microarray data with realistic characteristics. BMC Bioinformatics 2006;7:349. [PMID: 16848902 PMCID: PMC1574357 DOI: 10.1186/1471-2105-7-349] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2005] [Accepted: 07/18/2006] [Indexed: 02/07/2023] Open

For:	Nykter M, Aho T, Ahdesmäki M, Ruusuvuori P, Lehmussola A, Yli-Harja O. Simulation of microarray data with realistic characteristics. BMC Bioinformatics 2006;7:349. [PMID: 16848902 PMCID: PMC1574357 DOI: 10.1186/1471-2105-7-349] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2005] [Accepted: 07/18/2006] [Indexed: 02/07/2023] Open

Number

Cited by Other Article(s)

Joseph SM, Sathidevi PS. An Automated cDNA Microarray Image Analysis for the Determination of Gene Expression Ratios. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:136-150. [PMID: 34910637 DOI: 10.1109/tcbb.2021.3135650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Ma S, Ren B, Mallick H, Moon YS, Schwager E, Maharjan S, Tickle TL, Lu Y, Carmody RN, Franzosa EA, Janson L, Huttenhower C. A statistical model for describing and simulating microbial community profiles. PLoS Comput Biol 2021;17:e1008913. [PMID: 34516542 PMCID: PMC8491899 DOI: 10.1371/journal.pcbi.1008913] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 10/05/2021] [Accepted: 08/19/2021] [Indexed: 12/26/2022] Open

Abstract

Many methods have been developed for statistical analysis of microbial community profiles, but due to the complex nature of typical microbiome measurements (e.g. sparsity, zero-inflation, non-independence, and compositionality) and of the associated underlying biology, it is difficult to compare or evaluate such methods within a single systematic framework. To address this challenge, we developed SparseDOSSA (Sparse Data Observations for the Simulation of Synthetic Abundances): a statistical model of microbial ecological population structure, which can be used to parameterize real-world microbial community profiles and to simulate new, realistic profiles of known structure for methods evaluation. Specifically, SparseDOSSA's model captures marginal microbial feature abundances as a zero-inflated log-normal distribution, with additional model components for absolute cell counts and the sequence read generation process, microbe-microbe, and microbe-environment interactions. Together, these allow fully known covariance structure between synthetic features (i.e. "taxa") or between features and "phenotypes" to be simulated for method benchmarking. Here, we demonstrate SparseDOSSA's performance for 1) accurately modeling human-associated microbial population profiles; 2) generating synthetic communities with controlled population and ecological structures; 3) spiking-in true positive synthetic associations to benchmark analysis methods; and 4) recapitulating an end-to-end mouse microbiome feeding experiment. Together, these represent the most common analysis types in assessment of real microbial community environmental and epidemiological statistics, thus demonstrating SparseDOSSA's utility as a general-purpose aid for modeling communities and evaluating quantitative methods. An open-source implementation is available at http://huttenhower.sph.harvard.edu/sparsedossa2.

Collapse

Affiliation(s)

Siyuan Ma Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Boyu Ren Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Himel Mallick Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Yo Sup Moon Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
Emma Schwager Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
Sagun Maharjan Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Timothy L. Tickle Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Yiren Lu Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
Rachel N. Carmody Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
Eric A. Franzosa Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America
Lucas Janson Department of Statistics, Harvard University, Cambridge, Massachusetts, United States of America
Curtis Huttenhower Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America Broad Institute, Cambridge, Massachusetts, United States of America Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America

Collapse

Larriba Y, Rueda C, Fernández MA, Peddada SD. Microarray Data Normalization and Robust Detection of Rhythmic Features. Methods Mol Biol 2019;1986:207-225. [PMID: 31115890 DOI: 10.1007/978-1-4939-9442-7_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Abstract

Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it removes systematic variations across arrays. A wide variety of normalization methods are available in the literature. However, from our experience in the study of rhythmic expression patterns in oscillatory systems (e.g. cell-cycle, circadian clock), the choice of the normalization method may substantially impair the identification of rhythmic genes. Hence, the identification of a gene as rhythmic could be just as an artefact of how the data were normalized. Yet, gene rhythmicity detection is crucial in modern toxicological and pharmacological studies, thus a procedure to truly identify rhythmic genes that are robust to the choice of a normalization method is required.To perform the task of detecting rhythmic features, we propose a rhythmicity measure based on bootstrap methodology to robustly identify rhythmic genes in oscillatory systems. Although our methodology can be extended to any high-throughput experiment, in this chapter, we illustrate how to apply it to a publicly available circadian clock microarray gene-expression data and give full details (both statistical and computational) so that the methodology can be used in an easy way. We will show that the choice of normalization method has very little effect on the proposed methodology since the results derived from the bootstrap-based rhythmicity measure are highly rank correlated for any pair of normalization methods considered. This suggests, on the one hand, that the rhythmicity measure proposed is robust to the choice of the normalization method, and on the other hand, that gene rhythmicity detected using this measure is potentially not a mere artefact of the normalization method used. In this way the researcher using this methodology will be protected against the possible effect of different normalizations, as the conclusions obtained will not depend so strongly on them. Additionally, the described bootstrap methodology can also be employed as a tool to simulate gene-expression participating in an oscillatory system from a reference data set.

Collapse

Larriba Y, Rueda C, Fernández MA, Peddada SD. A Bootstrap Based Measure Robust to the Choice of Normalization Methods for Detecting Rhythmic Features in High Dimensional Data. Front Genet 2018;9:24. [PMID: 29456555 PMCID: PMC5801422 DOI: 10.3389/fgene.2018.00024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 01/17/2018] [Indexed: 01/01/2023] Open

Abstract

Motivation: Gene-expression data obtained from high throughput technologies are subject to various sources of noise and accordingly the raw data are pre-processed before formally analyzed. Normalization of the data is a key pre-processing step, since it removes systematic variations across arrays. There are numerous normalization methods available in the literature. Based on our experience, in the context of oscillatory systems, such as cell-cycle, circadian clock, etc., the choice of the normalization method may substantially impact the determination of a gene to be rhythmic. Thus rhythmicity of a gene can purely be an artifact of how the data were normalized. Since the determination of rhythmic genes is an important component of modern toxicological and pharmacological studies, it is important to determine truly rhythmic genes that are robust to the choice of a normalization method.

Results: In this paper we introduce a rhythmicity measure and a bootstrap methodology to detect rhythmic genes in an oscillatory system. Although the proposed methodology can be used for any high-throughput gene expression data, in this paper we illustrate the proposed methodology using several publicly available circadian clock microarray gene-expression datasets. We demonstrate that the choice of normalization method has very little effect on the proposed methodology. Specifically, for any pair of normalization methods considered in this paper, the resulting values of the rhythmicity measure are highly correlated. Thus it suggests that the proposed measure is robust to the choice of a normalization method. Consequently, the rhythmicity of a gene is potentially not a mere artifact of the normalization method used. Lastly, as demonstrated in the paper, the proposed bootstrap methodology can also be used for simulating data for genes participating in an oscillatory system using a reference dataset.

Availability: A user friendly code implemented in R language can be downloaded from http://www.eio.uva.es/~miguel/robustdetectionprocedure.html

Collapse

Kang S, Song J. Robust gene selection methods using weighting schemes for microarray data analysis. BMC Bioinformatics 2017;18:389. [PMID: 28865426 PMCID: PMC5581932 DOI: 10.1186/s12859-017-1810-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Accepted: 08/27/2017] [Indexed: 11/10/2022] Open

Abstract

Background

A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates.

Results

We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays.

Conclusions

The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.

Electronic supplementary material

The online version of this article (10.1186/s12859-017-1810-x) contains supplementary material, which is available to authorized users.

Collapse

Katsigiannis S, Zacharia E, Maroulis D. MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU. IEEE J Biomed Health Inform 2016;21:867-874. [PMID: 26960232 DOI: 10.1109/jbhi.2016.2537922] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

A Synthetic Kinome Microarray Data Generator. MICROARRAYS 2015;4:432-53. [PMID: 27600233 PMCID: PMC4996406 DOI: 10.3390/microarrays4040432] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Revised: 08/25/2015] [Accepted: 10/10/2015] [Indexed: 02/02/2023]

Mizeranschi A, Zheng H, Thompson P, Dubitzky W. Evaluating a common semi-mechanistic mathematical model of gene-regulatory networks. BMC SYSTEMS BIOLOGY 2015;9 Suppl 5:S2. [PMID: 26356485 PMCID: PMC4565562 DOI: 10.1186/1752-0509-9-s5-s2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

Hendrickx DM, Jennen DGJ, Briedé JJ, Cavill R, de Kok TM, Kleinjans JCS. Pattern recognition methods to relate time profiles of gene expression with phenotypic data: a comparative study. Bioinformatics 2015;31:2115-22. [DOI: 10.1093/bioinformatics/btv108] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Accepted: 02/16/2015] [Indexed: 12/13/2022] Open

Katsigiannis S, Zacharia E, Maroulis D. Grow-cut based automatic cDNA microarray image segmentation. IEEE Trans Nanobioscience 2014;14:138-45. [PMID: 25438323 DOI: 10.1109/tnb.2014.2369961] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

SITDEM: a simulation tool for disease/endpoint models of association studies based on single nucleotide polymorphism genotypes. Comput Biol Med 2013;45:136-42. [PMID: 24480173 DOI: 10.1016/j.compbiomed.2013.11.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2013] [Revised: 11/24/2013] [Accepted: 11/26/2013] [Indexed: 01/29/2023]

Flores JL, Inza I, Larrañaga P, Calvo B. A new measure for gene expression biclustering based on non-parametric correlation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2013;112:367-397. [PMID: 24079964 DOI: 10.1016/j.cmpb.2013.07.025] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2012] [Revised: 06/14/2013] [Accepted: 07/26/2013] [Indexed: 06/02/2023]

Hedjazi L, Le Lann MV, Kempowsky T, Dalenc F, Aguilar-Martin J, Favre G. Symbolic data analysis to defy low signal-to-noise ratio in microarray data for breast cancer prognosis. J Comput Biol 2013;20:610-20. [PMID: 23899014 DOI: 10.1089/cmb.2012.0249] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Giannakeas N, Karvelis PS, Exarchos TP, Kalatzis FG, Fotiadis DI. Segmentation of microarray images using pixel classification—Comparison with clustering-based methods. Comput Biol Med 2013;43:705-16. [DOI: 10.1016/j.compbiomed.2013.03.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2012] [Revised: 07/26/2012] [Accepted: 03/14/2013] [Indexed: 11/16/2022]

Dembélé D. A Flexible Microarray Data Simulation Model. MICROARRAYS 2013;2:115-30. [PMID: 27605184 PMCID: PMC5003477 DOI: 10.3390/microarrays2020115] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Revised: 04/07/2013] [Accepted: 04/15/2013] [Indexed: 11/16/2022]

Yoo C, Brilz EM, Wilcox M, Pershouse MA, Putnam EA. Gene Pathways Discovery in Asbestos-Related Diseases using Local Causal Discovery Algorithm. COMMUN STAT-SIMUL C 2012. [DOI: 10.1080/03610918.2011.621573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Zhang J, Coombes KR. Sources of variation in false discovery rate estimation include sample size, correlation, and inherent differences between groups. BMC Bioinformatics 2012;13 Suppl 13:S1. [PMID: 23320794 PMCID: PMC3426804 DOI: 10.1186/1471-2105-13-s13-s1] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Zacharia E, Maroulis DE. 3-D Spot Modeling for Automatic Segmentation of cDNA Microarray Images. IEEE Trans Nanobioscience 2010;9:181-92. [DOI: 10.1109/tnb.2010.2050900] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Zeisel A, Amir A, Köstler WJ, Domany E. Intensity dependent estimation of noise in microarrays improves detection of differentially expressed genes. BMC Bioinformatics 2010;11:400. [PMID: 20663218 PMCID: PMC2920277 DOI: 10.1186/1471-2105-11-400] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2010] [Accepted: 07/27/2010] [Indexed: 11/10/2022] Open

Daskalakis A, Glotsos D, Kostopoulos S, Cavouras D, Nikiforidis G. A comparative study of individual and ensemble majority vote cDNA microarray image segmentation schemes, originating from a spot-adjustable based restoration framework. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2009;95:72-88. [PMID: 19278747 DOI: 10.1016/j.cmpb.2009.01.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Revised: 09/23/2008] [Accepted: 01/12/2009] [Indexed: 05/27/2023]

Shan WJ, Tong CF, Shi JS. [Comparison of statistical methods for detecting differential expression in microarray data]. YI CHUAN = HEREDITAS 2009;30:1640-6. [PMID: 19073583 DOI: 10.3724/sp.j.1005.2008.01640] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Distribution modeling and simulation of gene expression data. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.03.023] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Marbach D, Schaffter T, Mattiussi C, Floreano D. Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods. J Comput Biol 2009;16:229-39. [PMID: 19183003 DOI: 10.1089/cmb.2008.09tt] [Citation(s) in RCA: 300] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Siegmund K, Ahlborn C, Richert C. ChipCheckII - predicting binding curves for multiple analyte strands on small DNA microarrays. NUCLEOSIDES NUCLEOTIDES & NUCLEIC ACIDS 2008;27:376-88. [PMID: 18404572 DOI: 10.1080/15257770801944147] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Xiong H, Zhang D, Martyniuk CJ, Trudeau VL, Xia X. Using generalized procrustes analysis (GPA) for normalization of cDNA microarray data. BMC Bioinformatics 2008;9:25. [PMID: 18199333 PMCID: PMC2275243 DOI: 10.1186/1471-2105-9-25] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 01/16/2008] [Indexed: 01/16/2023] Open

Giannakeas N, Karvelis PS, Fotiadis DI. A classification-based segmentation of cDNA microarray images using Support Vector Machines. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2008;2008:875-878. [PMID: 19162796 DOI: 10.1109/iembs.2008.4649293] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Kim HY, Lee SE, Kim MJ, Han JI, Kim BK, Lee YS, Lee YS, Kim JH. Characterization and simulation of cDNA microarray spots using a novel mathematical model. BMC Bioinformatics 2007;8:485. [PMID: 18096047 PMCID: PMC2267720 DOI: 10.1186/1471-2105-8-485] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2007] [Accepted: 12/20/2007] [Indexed: 11/30/2022] Open

Abstract

Background

The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images.

Results

We developed a governing equation of cDNA deposition during evaporation of a drop in the microarray spotting process. The governing equation included four parameters: the surface site density on the support, the extrapolated equilibrium constant for the binding of cDNA molecules with surface sites on glass slides, the macromolecular interaction factor, and the volume constant of a drop of cDNA solution. We simulated cDNA deposition from the single model equation by varying the value of the parameters. The morphology of the resulting cDNA deposit can be classified into three types: a doughnut shape, a peak shape, and a volcano shape. The spot morphology can be changed into a flat shape by varying the experimental conditions while considering the parameters of the governing equation of cDNA deposition. The four parameters were estimated by fitting the governing equation to the real microarray images. With the results of the simulation and the parameter estimation, the phenomenon of the formation of cDNA deposits in each type was investigated.

Conclusion

This study explains how various spot shapes can exist and suggests which parameters are to be adjusted for obtaining a good spot. This system is able to explore the cDNA microarray spotting process in a predictable, manageable and descriptive manner. We hope it can provide a way to predict the incidents that can occur during a real cDNA microarray experiment, and produce useful data for several research applications involving cDNA microarrays.

Collapse

Daskalakis A, Cavouras D, Bougioukos P, Kostopoulos S, Georgiadis P, Kalatzis I, Kagadis G, Nikiforidis G. Genes expression level quantification using a spot-based algorithmic pipeline. ACTA ACUST UNITED AC 2007;2007:1148-51. [PMID: 18002165 DOI: 10.1109/iembs.2007.4352499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Daskalakis A, Cavouras D, Bougioukos P, Kostopoulos S, Glotsos D, Kalatzis I, Kagadis GC, Argyropoulos C, Nikiforidis G. Improving gene quantification by adjustable spot-image restoration. Bioinformatics 2007;23:2265-72. [PMID: 17599935 DOI: 10.1093/bioinformatics/btm337] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Abstract

MOTIVATION

One of the major factors that complicate the task of microarray image analysis is that microarray images are distorted by various types of noise. In this study a robust framework is proposed, designed to take into account the effect of noise in microarray images in order to assist the demanding task of microarray image analysis. The proposed framework, incorporates in the microarray image processing pipeline a novel combination of spot adjustable image analysis and processing techniques and consists of the following stages: (1) gridding for facilitating spot identification, (2) clustering (unsupervised discrimination between spot and background pixels) applied to spot image for automatic local noise assessment, (3) modeling of local image restoration process for spot image conditioning (adjustable wiener restoration using an empirically determined degradation function), (4) automatic spot segmentation employing seeded-region-growing, (5) intensity extraction and (6) assessment of the reproducibility (real data) and the validity (simulated data) of the extracted gene expression levels.

RESULTS

Both simulated and real microarray images were employed in order to assess the performance of the proposed framework against well-established methods implemented in publicly available software packages (Scanalyze and SPOT). Regarding simulated images, the novel combination of techniques, introduced in the proposed framework, rendered the detection of spot areas and the extraction of spot intensities more accurate. Furthermore, on real images the proposed framework proved of better stability across replicates. Results indicate that the proposed framework improves spots' segmentation and, consequently, quantification of gene expression levels.

AVAILABILITY

All algorithms were implemented in Matlab (The Mathworks, Inc., Natick, MA, USA) environment. The codes that implement microarray gridding, adaptive spot restoration and segmentation/intensity extraction are available upon request. Supplementary results and the simulated microarray images used in this study are available for download from: ftp://users:bioinformatics@mipa.med.upatras.gr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Aho T, Smolander OP, Niemi J, Yli-Harja O. RMBNToolbox: random models for biochemical networks. BMC SYSTEMS BIOLOGY 2007;1:22. [PMID: 17524136 PMCID: PMC1896132 DOI: 10.1186/1752-0509-1-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2007] [Accepted: 05/24/2007] [Indexed: 11/10/2022]