Reference Citation Analysis

For: Conlon EM, Song JJ, Liu A. Bayesian meta-analysis models for microarray data: a comparative study. BMC Bioinformatics 2007;8:80. [PMID: 17343745 PMCID: PMC1851021 DOI: 10.1186/1471-2105-8-80] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Received: 08/15/2006] [Accepted: 03/07/2007] [Indexed: 11/10/2022]

Cited by Other Article(s)

Pandey D, Perumal P. O. Improved meta-analysis pipeline ameliorates distinctive gene regulators of diabetic vasculopathy in human endothelial cell (hECs) RNA-Seq data. PLoS One 2023;18:e0293939. [PMID: 37943808 PMCID: PMC10635490 DOI: 10.1371/journal.pone.0293939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Accepted: 10/21/2023] [Indexed: 11/12/2023]

Abstract

Enormous amounts of gene expression data generated through next-generation sequencing (NGS) technologies are accessible to the scientific community via public repositories. The data harboured in these repositories are foundational for data integrative studies enabling large-scale data analysis whose potential is yet to be fully realized. Prudent integration of individual gene expression data, i.e., RNA-Seq datasets, is remarkably challenging, as it encompasses a series of data analysis steps that must be completed before arriving at meaningful insights into biological questions. These insights are latent within the data and are not usually revealed by individual data analyses owing to the limited number of biological samples in individual studies. Nevertheless, a sensibly designed meta-analysis of select individual studies would not only maximize the sample size of the analysis but also significantly improve its statistical power, thereby revealing the latent insights. In the present study, a custom-built meta-analysis pipeline is presented for the integration of multiple datasets from different origins. As a case study, we tested the integration of two relevant datasets pertaining to diabetic vasculopathy retrieved from the open-source domain. We report that the meta-analysis ameliorated distinctive and latent gene regulators of diabetic vasculopathy and uncovered a total of 975 gene signatures, i.e., 930 up-regulated and 45 down-regulated. Further investigation revealed a subset of 14 DEGs, including CTLA4, CALR, G0S2, CALCR, OMA1, and DNAJC3, as latent, i.e., novel, as these signatures have not been reported earlier. Moreover, downstream investigations, including enrichment analysis and protein-protein interaction (PPI) network analysis of DEGs, revealed durable disease association, signifying their potential as novel transcriptomic biomarkers of diabetic vasculopathy. While, to our knowledge, this is the first meta-analysis of individual whole-transcriptome datasets for diabetic vasculopathy, the novel meta-analysis pipeline could well be extended to study the mechanistic links of DEGs in other disease conditions.

Mishra A, Chanchal S, Ashraf MZ. Host-Viral Interactions Revealed among Shared Transcriptomics Signatures of ARDS and Thrombosis: A Clue into COVID-19 Pathogenesis. TH Open 2020;4:e403-e412. [PMID: 33354650 PMCID: PMC7746517 DOI: 10.1055/s-0040-1721706] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 06/03/2020] [Accepted: 11/02/2020] [Indexed: 01/07/2023]

Abstract

Severe novel coronavirus disease 2019 (COVID-19) infection is associated with a considerable activation of coagulation pathways, endothelial damage, and subsequent thrombotic microvascular injuries. These consistent observations may have serious implications for the treatment and management of this highly pathogenic disease. As a consequence, anticoagulant therapeutic strategies, such as low molecular weight heparin, have shown some encouraging results. A cytokine burst leading to sepsis, which is one of the primary drivers of acute respiratory distress syndrome (ARDS), could be worsened by the accumulation of coagulation factors in the lungs of COVID-19 patients. However, the obscurity of this syndrome remains a hurdle in making decisive treatment choices. Therefore, an attempt to characterize shared biological mechanisms between ARDS and thrombosis using comprehensive transcriptomics meta-analysis is made. We conducted an integrated gene expression meta-analysis of two independent, publicly available datasets of ARDS and venous thromboembolism (VTE). Datasets GSE76293 and GSE19151, derived from the National Center for Biotechnology Information–Gene Expression Omnibus (NCBI-GEO) database, were used for ARDS and VTE, respectively. The integrative meta-analysis of expression data (INMEX) tool was used to preprocess the datasets, and effect size combination with random-effects modeling was used to obtain differentially expressed genes (DEGs). Network construction was done for hub genes and pathway enrichment analysis. Our meta-analysis identified a total of 1,878 significant DEGs among the datasets, which, when subjected to enrichment analysis, suggested inflammation–coagulation–hypoxemia convolutions in COVID-19 pathogenesis. The top hub genes of our study, such as tumor protein 53 (TP53), lysine acetyltransferase 2B (KAT2B), DExH-box helicase 9 (DHX9), REL-associated protein (RELA), RING-box protein 1 (RBX1), and proteasome 20S subunit beta 2 (PSMB2), gave insights into the genes known to participate in host–virus interactions, which could pave the way to understanding the various strategies deployed by the virus to improve its replication and spread.
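
As a rough illustration of the "effect size combination with random effect modeling" mentioned in the abstract above, the sketch below pools per-study effect sizes for one gene with the DerSimonian-Laird estimator. This is an assumption: the abstract does not say which estimator INMEX applies, and the function name `random_effects_combine` and its inputs are hypothetical.

```python
import math


def random_effects_combine(effects, variances):
    """Pool per-study effect sizes for one gene (DerSimonian-Laird sketch).

    effects   -- one standardized effect size per study (hypothetical input)
    variances -- the within-study variance of each effect size
    Returns (pooled_effect, standard_error, tau_squared).
    """
    k = len(effects)
    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1.0 / v for v in variances]
    sum_w = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sum_w
    # Cochran's Q measures between-study heterogeneity.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sum_w - sum(wi ** 2 for wi in w) / sum_w
    # Method-of-moments estimate of between-study variance, truncated at 0.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Random-effects weights add tau2 to every within-study variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2
```

A gene would then typically be called differentially expressed when `pooled / se` is large in absolute value after multiple-testing correction; that decision rule is also an assumption, not something stated in the abstract.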

Vennou KE, Piovani D, Kontou PI, Bonovas S, Bagos PG. Multiple outcome meta-analysis of gene-expression data in inflammatory bowel disease. Genomics 2020;112:1761-1767. [DOI: 10.1016/j.ygeno.2019.09.019] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 08/28/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 01/02/2023]

Meta-analysis of gene expression profiles in preeclampsia. Pregnancy Hypertens 2020;19:52-60. [DOI: 10.1016/j.preghy.2019.12.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 08/30/2019] [Accepted: 12/18/2019] [Indexed: 01/12/2023]

Kontou P, Pavlopoulou A, Braliou G, Bogiatzi S, Dimou N, Bangalore S, Bagos P. Identification of gene expression profiles in myocardial infarction: a systematic review and meta-analysis. BMC Med Genomics 2018;11:109. [PMID: 30482209 PMCID: PMC6260684 DOI: 10.1186/s12920-018-0427-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Received: 09/20/2018] [Accepted: 11/07/2018] [Indexed: 12/20/2022]

Abstract

BACKGROUND

Myocardial infarction (MI) is a multifactorial disease with complex pathogenesis, mainly the result of the interplay of genetic and environmental risk factors. The regulation of thrombosis, inflammation, and cholesterol and lipid metabolism are the main factors that have been proposed thus far to be involved in the pathogenesis of MI. Traditional risk-estimation tools depend largely on conventional risk factors, but there is a need for identification of novel biochemical and genetic markers. The aim of the study is to identify differentially expressed genes that are consistently associated with the incidence of MI, which could potentially be incorporated into traditional cardiovascular disease risk factor models.

METHODS

The biomedical literature and gene expression databases, PubMed and GEO, respectively, were searched following the PRISMA guidelines. The key inclusion criteria were gene expression data derived from case-control studies on MI patients from blood samples. Gene expression datasets regarding the effect of medicinal drugs on MI were excluded. The t-test was applied to gene expression data from case-control studies in MI patients.

RESULTS

A total of 162 articles and 174 gene expression datasets were retrieved. Of those, 4 gene expression datasets met the inclusion criteria; they contained data on 31,180 loci in 93 MI patients and 89 healthy individuals. Collectively, 626 differentially expressed genes were detected in MI patients as compared to non-affected individuals at an FDR q-value = 0.01. Of those, 88 genes/gene products were interconnected in an interaction network. In total, 15 genes were identified as hubs of the network.

CONCLUSIONS

Functional enrichment analyses revealed that the DEGs are mainly involved in inflammatory/wound healing, RNA processing/transport mechanisms and a not yet fully characterized pathway implicated in RNA transport and nuclear pore proteins. The overlap between the DEGs identified in this study and the genes identified through genetic-association studies is minimal. These data could be useful in future studies on the molecular mechanisms of MI as well as diagnostic and prognostic markers.
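
The abstract above reports DEGs "at an FDR q-value = 0.01" after per-gene t-tests. A minimal sketch of how per-gene p-values can be turned into q-values is given below, assuming the Benjamini-Hochberg procedure; the abstract does not name the FDR method actually used, and `benjamini_hochberg` is a hypothetical helper name.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (q-values), in input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone q-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues


# Hypothetical usage: keep genes whose q-value is at most 0.01.
example_p = [0.00001, 0.2, 0.003, 0.5]
selected = [i for i, q in enumerate(benjamini_hochberg(example_p)) if q <= 0.01]
```

With real data, the input would be one p-value per locus (31,180 in the study described above), so the correction is far stricter than in this four-value example.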

Kontou PI, Pavlopoulou A, Bagos PG. Methods of Analysis and Meta-Analysis for Identifying Differentially Expressed Genes. Methods Mol Biol 2018;1793:183-210. [PMID: 29876898 DOI: 10.1007/978-1-4939-7868-7_12] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Indexed: 02/06/2023]

Wei Z, Wang X, Conlon EM. Parallel Markov chain Monte Carlo for Bayesian dynamic item response models in educational testing. Stat (Int Stat Inst) 2017. [DOI: 10.1002/sta4.164] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/08/2022]

Li N, McCall MN, Wu Z. Establishing Informative Prior for Gene Expression Variance from Public Databases. Statistics in Biosciences 2017. [DOI: 10.1007/s12561-016-9172-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/24/2022]

Wang T, Zhang L, Tian P, Tian S. Identification of differentially-expressed genes between early-stage adenocarcinoma and squamous cell carcinoma lung cancer using meta-analysis methods. Oncol Lett 2017;13:3314-3322. [PMID: 28521438 DOI: 10.3892/ol.2017.5838] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Received: 05/27/2015] [Accepted: 10/06/2016] [Indexed: 01/04/2023]

Kavakiotis I, Xochelli A, Agathangelidis A, Tsoumakas G, Maglaveras N, Stamatopoulos K, Hadzidimitriou A, Vlahavas I, Chouvarda I. Integrating multiple immunogenetic data sources for feature extraction and mining somatic hypermutation patterns: the case of "towards analysis" in chronic lymphocytic leukaemia. BMC Bioinformatics 2016;17 Suppl 5:173. [PMID: 27295298 PMCID: PMC4905615 DOI: 10.1186/s12859-016-1044-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 05/29/2023]

Li B, Sun Z, He Q, Zhu Y, Qin ZS. Bayesian inference with historical data-based informative priors improves detection of differentially expressed genes. Bioinformatics 2016;32:682-9. [PMID: 26519502 DOI: 10.1093/bioinformatics/btv631] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 04/15/2015] [Accepted: 10/26/2015] [Indexed: 12/13/2022]

Bergon A, Belzeaux R, Comte M, Pelletier F, Hervé M, Gardiner EJ, Beveridge NJ, Liu B, Carr V, Scott RJ, Kelly B, Cairns MJ, Kumarasinghe N, Schall U, Blin O, Boucraut J, Tooney PA, Fakra E, Ibrahim EC. CX3CR1 is dysregulated in blood and brain from schizophrenia patients. Schizophr Res 2015;168:434-43. [PMID: 26285829 DOI: 10.1016/j.schres.2015.08.010] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Received: 05/25/2015] [Revised: 08/05/2015] [Accepted: 08/06/2015] [Indexed: 12/31/2022]

Affiliation(s)

Aurélie Bergon INSERM, TAGC UMR_S 1090, 13288 Marseille Cedex 09, France; Aix Marseille Université, TAGC UMR_S 1090, 13288 Marseille Cedex 09, France
Raoul Belzeaux Aix Marseille Université, CNRS, CRN2M UMR 7286, 13344 Marseille Cedex 15, France; FondaMental, Fondation de Recherche et de Soins en Santé Mentale, 94000 Créteil, France; AP-HM, Hôpital Sainte Marguerite, Pôle de Psychiatrie Universitaire Solaris, 13009 Marseille, France
Magali Comte Aix-Marseille Université, CNRS, Institut de Neurosciences de la Timone UMR 7289, 13005 Marseille, France
Florence Pelletier Aix Marseille Université, CNRS, CRN2M UMR 7286, 13344 Marseille Cedex 15, France; FondaMental, Fondation de Recherche et de Soins en Santé Mentale, 94000 Créteil, France
Mylène Hervé Aix Marseille Université, CNRS, CRN2M UMR 7286, 13344 Marseille Cedex 15, France; FondaMental, Fondation de Recherche et de Soins en Santé Mentale, 94000 Créteil, France
Erin J Gardiner School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Natalie J Beveridge School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Bing Liu School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Kids Cancer Alliance, Cancer Institute NSW, Sydney, Australia
Vaughan Carr Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia; School of Psychiatry, University of New South Wales, Randwick, NSW 2301, Australia; Department of Psychiatry, Monash University, Clayton, VIC 3168, Australia
Rodney J Scott School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Brian Kelly School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia
Murray J Cairns School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Nishantha Kumarasinghe School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia; University of Sri Jayewardenepura, Nugegoda, Sri Lanka; National Institute of Mental Health, Angoda, Sri Lanka
Ulrich Schall School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Olivier Blin CIC-UPCET et Pharmacologie Clinique, Hôpital de la Timone, 13005 Marseille, France
José Boucraut Aix Marseille Université, CNRS, CRN2M UMR 7286, 13344 Marseille Cedex 15, France; FondaMental, Fondation de Recherche et de Soins en Santé Mentale, 94000 Créteil, France
Paul A Tooney School of Biomedical Sciences and Pharmacy and School of Medicine and Public Health, Faculty of Health, The University of Newcastle, University Drive, Callaghan, NSW 2308 Australia; Centre for Translational Neuroscience and Mental Health, The University of Newcastle, Callaghan, NSW 2308 Australia; Hunter Medical Research Institute, New Lambton Heights, NSW 2305, Australia; Schizophrenia Research Institute, Darlinghurst, NSW 2010 Australia
Eric Fakra Aix-Marseille Université, CNRS, Institut de Neurosciences de la Timone UMR 7289, 13005 Marseille, France; CHU de Saint-Etienne, Pôle de Psychiatrie, 42100 Saint-Etienne, France
El Chérif Ibrahim Aix Marseille Université, CNRS, CRN2M UMR 7286, 13344 Marseille Cedex 15, France; FondaMental, Fondation de Recherche et de Soins en Santé Mentale, 94000 Créteil, France.

Feng F, Kepler TB. Bayesian Estimation of the Active Concentration and Affinity Constants Using Surface Plasmon Resonance Technology. PLoS One 2015;10:e0130812. [PMID: 26098764 PMCID: PMC4476803 DOI: 10.1371/journal.pone.0130812] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 12/25/2014] [Accepted: 05/25/2015] [Indexed: 11/19/2022]

Zollinger A, Davison AC, Goldstein DR. Meta-analysis of incomplete microarray studies. Biostatistics 2015;16:686-700. [PMID: 25987649 DOI: 10.1093/biostatistics/kxv014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 12/19/2013] [Accepted: 03/12/2015] [Indexed: 12/18/2022]

Parker JD, Torchin ME, Hufbauer RA, Lemoine NP, Alba C, Blumenthal DM, Bossdorf O, Byers JE, Dunn AM, Heckman RW, Hejda M, Jarošík V, Kanarek AR, Martin LB, Perkins SE, Pyšek P, Schierenbeck K, Schlöder C, van Klinken R, Vaughn KJ, Williams W, Wolfe LM. Do invasive species perform better in their new ranges? Ecology 2013;94:985-94. [DOI: 10.1890/12-1810.1] [Citation(s) in RCA: 183] [Impact Index Per Article: 16.6] [Indexed: 11/18/2022]

Conlon EM, Postier BL, Methé BA, Nevin KP, Lovley DR. A Bayesian model for pooling gene expression studies that incorporates co-regulation information. PLoS One 2012;7:e52137. [PMID: 23284902 PMCID: PMC3532429 DOI: 10.1371/journal.pone.0052137] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 09/12/2012] [Accepted: 11/13/2012] [Indexed: 12/01/2022]

Wang Y, Hu YL, Cao J, He M. Bioinformatic screening of key genes expressed in both human and mouse hepatocellular carcinoma. Shijie Huaren Xiaohua Zazhi 2012;20:1012-1017. [DOI: 10.11569/wcjd.v20.i12.1012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/06/2023]

Tseng GC, Ghosh D, Feingold E. Comprehensive literature review and statistical considerations for microarray meta-analysis. Nucleic Acids Res 2012;40:3785-99. [PMID: 22262733 PMCID: PMC3351145 DOI: 10.1093/nar/gkr1265] [Citation(s) in RCA: 266] [Impact Index Per Article: 22.2] [Indexed: 01/18/2023]

Fierro AC, Vandenbussche F, Engelen K, Van de Peer Y, Marchal K. Meta Analysis of Gene Expression Data within and Across Species. Curr Genomics 2011;9:525-34. [PMID: 19516959 PMCID: PMC2694560 DOI: 10.2174/138920208786847935] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Received: 06/28/2008] [Revised: 07/07/2008] [Accepted: 07/18/2008] [Indexed: 01/15/2023]

Abstract

Since the second half of the 1990s, a large number of genome-wide analyses have been described that study gene expression at the transcript level. To this end, two major strategies have been adopted: the first relies on hybridization techniques such as microarrays, and the second is based on sequencing techniques such as serial analysis of gene expression (SAGE), cDNA-AFLP, and analysis based on expressed sequence tags (ESTs). Despite both types of profiling experiments becoming routine techniques in many research groups, their application remains costly and laborious. As a result, the number of conditions profiled in individual studies is still relatively small and usually varies from only two to a few hundred samples for the largest experiments. More and more, scientific journals require the deposit of these high-throughput experiments in public databases upon publication. Mining the information present in these databases offers molecular biologists the possibility to view their own small-scale analysis in the light of what is already available. However, so far, the richness of the public information remains largely unexploited. Several obstacles, such as the correct association of ESTs and microarray probes with the corresponding gene transcript, the incompleteness and inconsistency in the annotation of experimental conditions, and the lack of standardized experimental protocols to generate gene expression data, all impede the successful mining of these data. Here, we review the potential and difficulties of combining publicly available expression data from EST analyses and microarray experiments, respectively. With examples from the literature, we show how meta-analysis of expression profiling experiments can be used to study expression behavior in a single organism or between organisms, across a wide range of experimental conditions. We also provide an overview of the methods and tools that can aid molecular biologists in exploiting these public data.

Li J, Tseng GC. An adaptively weighted statistic for detecting differential gene expression when combining multiple transcriptomic studies. Ann Appl Stat 2011. [DOI: 10.1214/10-aoas393] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.5] [Indexed: 11/19/2022]

Feng F, Sales AP, Kepler TB. A Bayesian approach for estimating calibration curves and unknown concentrations in immunoassays. Bioinformatics 2010;27:707-12. [PMID: 21149344 DOI: 10.1093/bioinformatics/btq686] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 11/12/2022]

Nguyen TT, Almon RR, Dubois DC, Jusko WJ, Androulakis IP. Comparative analysis of acute and chronic corticosteroid pharmacogenomic effects in rat liver: transcriptional dynamics and regulatory structures. BMC Bioinformatics 2010;11:515. [PMID: 20946642 PMCID: PMC2973961 DOI: 10.1186/1471-2105-11-515] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Received: 02/04/2010] [Accepted: 10/14/2010] [Indexed: 12/11/2022]

Abstract

Background

Comprehensively understanding corticosteroid pharmacogenomic effects is an essential step towards an insight into the underlying molecular mechanisms for both beneficial and detrimental clinical effects. Nevertheless, even in a single tissue different methods of corticosteroid administration can induce different patterns of expression and regulatory control structures. Therefore, rich in vivo datasets of pharmacological time-series with two dosing regimens sampled from rat liver are examined for temporal patterns of changes in gene expression and their regulatory commonalities.

Results

The study addresses two issues, including (1) identifying significant transcriptional modules coupled with dynamic expression patterns and (2) predicting relevant common transcriptional controls to better understand the underlying mechanisms of corticosteroid adverse effects. Following the orientation of meta-analysis, an extended computational approach that explores the concept of agreement matrix from consensus clustering has been proposed with the aims of identifying gene clusters that share common expression patterns across multiple dosing regimens as well as handling challenges in the analysis of microarray data from heterogeneous sources, e.g. different platforms and time-grids in this study. Six significant transcriptional modules coupled with typical patterns of expression have been identified. Functional analysis reveals that virtually all enriched functions (gene ontologies, pathways) in these modules are shown to be related to metabolic processes, implying the importance of these modules in adverse effects under the administration of corticosteroids. Relevant putative transcriptional regulators (e.g. RXRF, FKHD, SP1F) are also predicted to provide another source of information towards better understanding the complexities of expression patterns and the underlying regulatory mechanisms of those modules.

Conclusions

We have proposed a framework to identify significant coexpressed clusters of genes across multiple conditions experimented from different microarray platforms, time-grids, and also tissues if applicable. Analysis on rich in vivo datasets of corticosteroid time-series yielded significant insights into the pharmacogenomic effects of corticosteroids, especially the relevance to metabolic side-effects. This has been illustrated through enriched metabolic functions in those transcriptional modules and the presence of GRE binding motifs in those enriched pathways, providing significant modules for further analysis on pharmacogenomic corticosteroid effects.
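
The abstract above builds on "the concept of agreement matrix from consensus clustering". A minimal sketch of that concept follows: entry (i, j) is the fraction of clustering runs in which genes i and j land in the same cluster. This illustrates only the general idea; the extended approach proposed by the authors is not reproduced here, and `agreement_matrix` and its input layout are hypothetical.

```python
def agreement_matrix(clusterings):
    """Fraction of clusterings in which each pair of genes shares a cluster.

    clusterings -- list of label lists, one per run or dataset; every list
                   assigns a cluster label to the same genes in the same order.
    """
    n_genes = len(clusterings[0])
    counts = [[0] * n_genes for _ in range(n_genes)]
    for labels in clusterings:
        for i in range(n_genes):
            for j in range(n_genes):
                # Count a co-assignment when both genes carry the same label.
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    n_runs = len(clusterings)
    return [[c / n_runs for c in row] for row in counts]


# Hypothetical usage: three genes clustered under two dosing regimens.
A = agreement_matrix([[0, 0, 1], [0, 1, 1]])
```

Gene pairs with agreement close to 1 are candidates for a shared module; the threshold for calling a module is a design choice that this sketch leaves open.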

Lin S. Rank aggregation methods. WIREs Comput Stat 2010. [DOI: 10.1002/wics.111] [Citation(s) in RCA: 109] [Impact Index Per Article: 7.8] [Indexed: 11/07/2022]

Gholami AM, Fellenberg K. Cross-species common regulatory network inference without requirement for prior gene affiliation. Bioinformatics 2010;26:1082-90. [PMID: 20200011 DOI: 10.1093/bioinformatics/btq096] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 11/13/2022]

Yang X, Sun X. Meta-analysis of cancer gene-profiling data. Methods Mol Biol 2010;576:409-26. [PMID: 19882274 DOI: 10.1007/978-1-59745-545-9_21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 02/19/2023]

Wang K, Narayanan M, Zhong H, Tompa M, Schadt EE, Zhu J. Meta-analysis of inter-species liver co-expression networks elucidates traits associated with common human diseases. PLoS Comput Biol 2009;5:e1000616. [PMID: 20019805 PMCID: PMC2787626 DOI: 10.1371/journal.pcbi.1000616] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Received: 04/03/2009] [Accepted: 11/16/2009] [Indexed: 12/02/2022]

Abstract

Co-expression networks are routinely used to study human diseases like obesity and diabetes. Systematic comparison of these networks between species has the potential to elucidate common mechanisms that are conserved between human and rodent species, as well as those that are species-specific characterizing evolutionary plasticity. We developed a semi-parametric meta-analysis approach for combining gene-gene co-expression relationships across expression profile datasets from multiple species. The simulation results showed that the semi-parametric method is robust against noise. When applied to human, mouse, and rat liver co-expression networks, our method out-performed existing methods in identifying gene pairs with coherent biological functions. We identified a network conserved across species that highlighted cell-cell signaling, cell-adhesion and sterol biosynthesis as main biological processes represented in genome-wide association study candidate gene sets for blood lipid levels. We further developed a heterogeneity statistic to test for network differences among multiple datasets, and demonstrated that genes with species-specific interactions tend to be under positive selection throughout evolution. Finally, we identified a human-specific sub-network regulated by RXRG, which has been validated to play a different role in hyperlipidemia and Type 2 diabetes between human and mouse. Taken together, our approach represents a novel step forward in integrating gene co-expression networks from multiple large scale datasets to leverage not only common information but also differences that are dataset-specific.

Two important aspects of drug development are drug target identification and biomarker discovery for early disease detection, disease progression, drug efficacy, and drug toxicity. Recently, many single nucleotide polymorphisms (SNPs) associated with human diseases have been discovered through large genome-wide association studies (GWAS). However, it is still largely unclear how these candidate SNPs may cause human diseases. The ultimate aim of this paper is to put these GWAS candidate SNPs and their associated genes into a network context to understand their mechanism of action in human diseases. In addition to large-scale human data sets, which are often heterogeneous in terms of genetic and environmental factors, many high-quality data sets in rodents exist and are frequently used to model human diseases. To leverage such information, we developed a method for combining and contrasting gene networks between humans and rodents, specifically to elucidate how GWAS candidate SNPs may contribute to human diseases. By identifying mechanisms that are conserved or divergent between humans and rodents, we can also predict which disease-causal genes can be studied using rodent models and which may not.

Lu S, Li J, Song C, Shen K, Tseng GC. Biomarker detection in the integration of multiple multi-class genomic studies. Bioinformatics 2009;26:333-40. [PMID: 19965884 DOI: 10.1093/bioinformatics/btp669] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Indexed: 11/13/2022]

Scharpf RB, Tjelmeland H, Parmigiani G, Nobel AB. A Bayesian model for cross-study differential gene expression. J Am Stat Assoc 2009;104:1295-1310. [PMID: 21127725 DOI: 10.1198/jasa.2009.ap07611] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Indexed: 12/29/2022]

Conlon EM, Postier BL, Methé BA, Nevin KP, Lovley DR. Hierarchical Bayesian meta-analysis models for cross-platform microarray studies. J Appl Stat 2009. [DOI: 10.1080/02664760802562480] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 10/20/2022]

Marot G, Foulley JL, Mayer CD, Jaffrézic F. Moderated effect size and P-value combinations for microarray meta-analyses. Bioinformatics 2009;25:2692-9. [PMID: 19628502 DOI: 10.1093/bioinformatics/btp444] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.7] [Indexed: 11/13/2022] Open

Wren JD. A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide. Bioinformatics 2009;25:1694-701. [PMID: 19447786 DOI: 10.1093/bioinformatics/btp290] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Indexed: 11/14/2022]

Abstract

MOTIVATION

Approximately 9334 (37%) of human genes have no publications documenting their function and, for those that are published, the number of publications per gene is highly skewed. Furthermore, for reasons not clear, the entry of new gene names into the literature has slowed in recent years. If we are to better understand human/mammalian biology and complete the catalog of human gene function, it is important to finish predicting putative functions for these genes based upon existing experimental evidence.

RESULTS

A global meta-analysis (GMA) of all publicly available GEO two-channel human microarray datasets (3551 experiments total) was conducted to identify genes with recurrent, reproducible patterns of co-regulation across different conditions. Patterns of co-expression were divided into parallel (i.e. genes are up and down-regulated together) and anti-parallel. Several ranking methods to predict a gene's function based on its top 20 co-expressed gene pairs were compared. In the best method, 34% of predicted Gene Ontology (GO) categories matched exactly with the known GO categories for approximately 5000 genes analyzed versus only 3% for random gene sets. Only 2.4% of co-expressed gene pairs were found as co-occurring gene pairs in MEDLINE.

CONCLUSIONS

Via a GO enrichment analysis, genes co-expressed in parallel with the query gene were frequently associated with the same GO categories, whereas anti-parallel genes were not. Combining parallel and anti-parallel genes for analysis resulted in fewer significant GO categories, suggesting they are best analyzed separately. Expression databases contain much unexpected genetic knowledge that has not yet been reported in the literature. A total of 1642 human genes with unknown function were differentially expressed in at least 30 experiments.

AVAILABILITY

Data matrix available upon request.
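
The parallel versus anti-parallel split described in the RESULTS of this abstract can be illustrated with a minimal sketch. The abstract does not say how co-expression was scored, so the Pearson correlation used here is an assumption, and the names `pearson`, `top_coexpressed`, and `profiles` are hypothetical. The sketch only shows how a query gene's partners might be ranked separately in each direction with a top-k cutoff (k = 20 in the abstract); it is not the author's GMA procedure.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant profile carries no co-expression signal
    return sxy / math.sqrt(sxx * syy)


def top_coexpressed(query, profiles, k=20):
    """Rank partners of `query` separately for each direction.

    `profiles` maps gene name -> list of expression values (hypothetical input).
    Returns (parallel, anti_parallel), each a list of at most k gene names,
    strongest association first.
    """
    scores = {
        gene: pearson(profiles[query], values)
        for gene, values in profiles.items()
        if gene != query
    }
    # Parallel: positively correlated, highest correlation first.
    parallel = sorted(
        (g for g, r in scores.items() if r > 0), key=lambda g: -scores[g]
    )[:k]
    # Anti-parallel: negatively correlated, most negative first.
    anti_parallel = sorted(
        (g for g, r in scores.items() if r < 0), key=lambda g: scores[g]
    )[:k]
    return parallel, anti_parallel
```

Keeping the two lists separate mirrors the abstract's conclusion that parallel and anti-parallel partners are best analyzed separately.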

Ma S, Huang J. Regularized gene selection in cancer microarray meta-analysis. BMC Bioinformatics 2009;10:1. [PMID: 19118496 PMCID: PMC2631520 DOI: 10.1186/1471-2105-10-1] [Citation(s) in RCA: 140] [Impact Index Per Article: 9.3] [Received: 09/18/2008] [Accepted: 01/01/2009] [Indexed: 11/10/2022] Open

Blangiardo M, Richardson S. A Bayesian calibration model for combining different pre-processing methods in Affymetrix chips. BMC Bioinformatics 2008;9:512. [PMID: 19046434 PMCID: PMC2639433 DOI: 10.1186/1471-2105-9-512] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 05/12/2008] [Accepted: 12/01/2008] [Indexed: 11/10/2022] Open

Meta-analysis of genome-wide expression patterns associated with behavioral maturation in honey bees. BMC Genomics 2008;9:503. [PMID: 18950506 PMCID: PMC2582039 DOI: 10.1186/1471-2164-9-503] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Received: 07/08/2008] [Accepted: 10/24/2008] [Indexed: 11/22/2022] Open

Abstract

Background

The information from multiple microarray experiments can be integrated in an objective manner via meta-analysis. However, multiple meta-analysis approaches are available and their relative strengths have not been directly compared using experimental data in the context of different gene expression scenarios and studies with different degrees of relationship. This study investigates the complementary advantages of meta-analysis approaches to integrate information across studies, and further mine the transcriptome for genes that are associated with complex processes such as behavioral maturation in honey bees. Behavioral maturation and division of labor in honey bees are related to changes in the expression of hundreds of genes in the brain. The information from various microarray studies comparing the expression of genes at different maturation stages in honey bee brains was integrated using complementary meta-analysis approaches.

Results

Comparison of lists of genes with significant differential expression across studies failed to identify genes with consistent patterns of expression that were below the selected significance threshold, or identified genes with significant yet inconsistent patterns. The meta-analytical framework supported the identification of genes with consistent overall expression patterns and eliminated genes that exhibited contradictory expression patterns across studies. Sample-level meta-analysis of normalized gene-expression can detect more differentially expressed genes than the study-level meta-analysis of estimates for genes that were well described by similar model parameter estimates across studies and had small variation across studies. Furthermore, study-level meta-analysis was well suited for genes that exhibit consistent patterns across studies, genes that had substantial variation across studies, and genes that did not conform to the assumptions of the sample-level meta-analysis. Meta-analyses confirmed previously reported genes and helped identify genes (e.g. Tomosyn, Chitinase 5, Adar, Innexin 2, Transferrin 1, Sick, Oatp26F) and Gene Ontology categories (e.g. purine nucleotide binding) not previously associated with maturation in honey bees.

Conclusion

This study demonstrated that a combination of meta-analytical approaches best addresses the highly dimensional nature of genome-wide microarray studies. As expected, the integration of gene expression information from microarray studies using meta-analysis enhanced the characterization of the transcriptome of complex biological processes.
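
To make the idea of a study-level meta-analysis concrete, here is a minimal sketch of one common way to pool per-study estimates for a single gene: a fixed-effect, inverse-variance weighted average. The abstract does not state which estimator the study used, so this choice is an assumption, and the function name `fixed_effect_meta` and its inputs are hypothetical.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Pool per-study effect estimates for one gene (fixed-effect model).

    `estimates` are per-study effects (e.g. log fold changes) and
    `std_errors` their standard errors; both are hypothetical inputs.
    Returns (pooled_estimate, pooled_se, z_score, two_sided_p).
    """
    if not estimates or len(estimates) != len(std_errors):
        raise ValueError("need one standard error per estimate")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled / pooled_se
    # Two-sided p-value under a standard normal reference distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled, pooled_se, z, p
```

Under this weighting, two studies with equal precision and opposite signs pool to an effect of zero, which is one way a gene with contradictory expression patterns across studies drops out, while a gene with consistent but individually non-significant effects gains precision from the smaller pooled standard error.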

Liang Y, Kelemen A. Bayesian models and meta analysis for multiple tissue gene expression data following corticosteroid administration. BMC Bioinformatics 2008;9:354. [PMID: 18755028 PMCID: PMC2579308 DOI: 10.1186/1471-2105-9-354] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Received: 01/02/2008] [Accepted: 08/28/2008] [Indexed: 11/29/2022] Open

Abstract

Background

This paper addresses key biological problems and statistical issues in the analysis of large gene expression data sets that describe systemic temporal response cascades to therapeutic doses in multiple tissues, such as liver, skeletal muscle, and kidney, from the same animals. Affymetrix U34A time-course gene expression data were obtained from three tissues: kidney, liver, and muscle. Our goal is not only to find the concordance of genes across tissues and to identify the genes that are commonly differentially expressed over time, but also to examine the reproducibility of the findings by integrating the results from multiple tissues through meta-analysis, in order to gain a significant increase in the power to detect differentially expressed genes over time and to find how the three tissues differ in their response to the drug.

Results and conclusion

A Bayesian categorical model for estimating the proportion of the 'call' is used for pre-screening genes. A hierarchical Bayesian mixture model is further developed to identify differentially expressed genes across time and dynamic clusters. The deviance information criterion is applied to determine the number of components for model comparison and selection. The Bayesian mixture model produces the gene-specific posterior probability of differential/non-differential expression and the 95% credible interval, which is the basis for our further Bayesian meta-inference. Meta-analysis is performed to identify commonly expressed genes from multiple tissues that may serve as ideal targets for novel treatment strategies and to integrate the results across separate studies. We found genes that are commonly expressed in the three tissues. However, the up/down/no regulation of these common genes differs across time points. Moreover, the most differentially expressed genes were found in the liver, followed by kidney and then muscle.

Combining transcriptional datasets using the generalized singular value decomposition. BMC Bioinformatics 2008;9:335. [PMID: 18687147 PMCID: PMC2562393 DOI: 10.1186/1471-2105-9-335] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Received: 01/02/2008] [Accepted: 08/08/2008] [Indexed: 11/17/2022] Open

Abstract

Background

Both microarrays and quantitative real-time PCR are convenient tools for studying the transcriptional levels of genes. The former is preferable for large scale studies while the latter is a more targeted technique. Because of platform-dependent systematic effects, simple comparisons or merging of datasets obtained by these technologies are difficult, even though they may often be desirable. These difficulties are exacerbated if there is only partial overlap between the experimental conditions and genes probed in the two datasets.

Results

We show here that the generalized singular value decomposition provides a practical tool for merging a small, targeted dataset obtained by quantitative real-time PCR of specific genes with a much larger microarray dataset. The technique permits, for the first time, the identification of genes present in only one dataset co-expressed with a target gene present exclusively in the other dataset, even when experimental conditions for the two datasets are not identical. With the rapidly increasing number of publicly available large-scale microarray datasets, the latter is frequently the case. The method enables us to discover putative candidate genes involved in the biosynthesis of the (1,3;1,4)-β-D-glucan polysaccharide found in plant cell walls.

Conclusion

We show that the generalized singular value decomposition provides a viable tool for a combined analysis of two gene expression datasets with only partial overlap of both gene sets and experimental conditions. We illustrate how the decomposition can be optimized self-consistently by using a judicious choice of genes to define it. The ability of the technique to seamlessly define a concept of "co-expression" across both datasets provides an avenue for meaningful data integration. We believe that it will prove to be particularly useful for exploiting large, publicly available, microarray datasets for species with unsequenced genomes by complementing them with more limited in-house expression measurements.

Sivaganesan M, Seifring S, Varma M, Haugland RA, Shanks OC. A Bayesian method for calculating real-time quantitative PCR calibration curves using absolute plasmid DNA standards. BMC Bioinformatics 2008;9:120. [PMID: 18298858 PMCID: PMC2292693 DOI: 10.1186/1471-2105-9-120] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Received: 08/15/2007] [Accepted: 02/25/2008] [Indexed: 11/10/2022] Open

Conlon EM. A Bayesian mixture model for metaanalysis of microarray studies. Funct Integr Genomics 2007;8:43-53. [PMID: 17879102 DOI: 10.1007/s10142-007-0058-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Received: 06/22/2007] [Revised: 08/10/2007] [Accepted: 08/11/2007] [Indexed: 10/22/2022]