Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Al-Shahrour F, Díaz-Uriarte R, Dopazo J. Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information. Bioinformatics 2005;21:2988-93. [PMID: 15840702 DOI: 10.1093/bioinformatics/bti457] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

For:	Al-Shahrour F, Díaz-Uriarte R, Dopazo J. Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information. Bioinformatics 2005;21:2988-93. [PMID: 15840702 DOI: 10.1093/bioinformatics/bti457] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Hui TX, Kasim S, Aziz IA, Fudzee MFM, Haron NS, Sutikno T, Hassan R, Mahdin H, Sen SC. Robustness evaluations of pathway activity inference methods on gene expression data. BMC Bioinformatics 2024;25:23. [PMID: 38216898 PMCID: PMC10785356 DOI: 10.1186/s12859-024-05632-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 01/02/2024] [Indexed: 01/14/2024] Open

Shah I, Bundy J, Chambers B, Everett LJ, Haggard D, Harrill J, Judson RS, Nyffeler J, Patlewicz G. Navigating Transcriptomic Connectivity Mapping Workflows to Link Chemicals with Bioactivities. Chem Res Toxicol 2022;35:1929-1949. [PMID: 36301716 PMCID: PMC10483698 DOI: 10.1021/acs.chemrestox.2c00245] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Abstract

Screening new compounds for potential bioactivities against cellular targets is vital for drug discovery and chemical safety. Transcriptomics offers an efficient approach for assessing global gene expression changes, but interpreting chemical mechanisms from these data is often challenging. Connectivity mapping is a potential data-driven avenue for linking chemicals to mechanisms based on the observation that many biological processes are associated with unique gene expression signatures (gene signatures). However, mining the effects of a chemical on gene signatures for biological mechanisms is challenging because transcriptomic data contain thousands of noisy genes. New connectivity mapping approaches seeking to distinguish signal from noise continue to be developed, spurred by the promise of discovering chemical mechanisms, new drugs, and disease targets from burgeoning transcriptomic data. Here, we analyze these approaches in terms of diverse transcriptomic technologies, public databases, gene signatures, pattern-matching algorithms, and statistical evaluation criteria. To navigate the complexity of connectivity mapping, we propose a harmonized scheme to coherently organize and compare published workflows. We first standardize concepts underlying transcriptomic profiles and gene signatures based on various transcriptomic technologies such as microarrays, RNA-Seq, and L1000 and discuss the widely used data sources such as Gene Expression Omnibus, ArrayExpress, and MSigDB. Next, we generalize connectivity mapping as a pattern-matching task for finding similarity between a query (e.g., transcriptomic profile for new chemical) and a reference (e.g., gene signature of known target). Published pattern-matching approaches fall into two main categories: vector-based use metrics like correlation, Jaccard index, etc., and aggregation-based use parametric and nonparametric statistics (e.g., gene set enrichment analysis). The statistical methods for evaluating the performance of different approaches are described, along with comparisons reported in the literature on benchmark transcriptomic data sets. Lastly, we review connectivity mapping applications in toxicology and offer guidance on evaluating chemical-induced toxicity with concentration-response transcriptomic data. In addition to serving as a high-level guide and tutorial for understanding and implementing connectivity mapping workflows, we hope this review will stimulate new algorithms for evaluating chemical safety and drug discovery using transcriptomic data.

Collapse

DHULI KRISTJANA, BONETTI GABRIELE, ANPILOGOV KYRYLO, HERBST KARENL, CONNELLY STEPHENTHADDEUS, BELLINATO FRANCESCO, GISONDI PAOLO, BERTELLI MATTEO. Validating methods for testing natural molecules on molecular pathways of interest in silico and in vitro. JOURNAL OF PREVENTIVE MEDICINE AND HYGIENE 2022;63:E279-E288. [PMID: 36479497 PMCID: PMC9710400 DOI: 10.15167/2421-4248/jpmh2022.63.2s3.2770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Grassi M, Tarantino B. SEMgsa: topology-based pathway enrichment analysis with structural equation models. BMC Bioinformatics 2022;23:344. [PMID: 35978279 PMCID: PMC9385099 DOI: 10.1186/s12859-022-04884-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 08/09/2022] [Indexed: 11/25/2022] Open

Abstract

BACKGROUND

Pathway enrichment analysis is extensively used in high-throughput experimental studies to gain insight into the functional roles of pre-defined subsets of genes, proteins and metabolites. Methods that leverages information on the topology of the underlying pathways outperform simpler methods that only consider pathway membership, leading to improved performance. Among all the proposed software tools, there's the need to combine high statistical power together with a user-friendly framework, making it difficult to choose the best method for a particular experimental environment.

RESULTS

We propose SEMgsa, a topology-based algorithm developed into the framework of structural equation models. SEMgsa combine the SEM p values regarding node-specific group effect estimates in terms of activation or inhibition, after statistically controlling biological relations among genes within pathways. We used SEMgsa to identify biologically relevant results in a Coronavirus disease (COVID-19) RNA-seq dataset (GEO accession: GSE172114) together with a frontotemporal dementia (FTD) DNA methylation dataset (GEO accession: GSE53740) and compared its performance with some existing methods. SEMgsa is highly sensitive to the pathways designed for the specific disease, showing low p values ([Formula: see text]) and ranking in high positions, outperforming existing software tools. Three pathway dysregulation mechanisms were used to generate simulated expression data and evaluate the performance of methods in terms of type I error followed by their statistical power. Simulation results confirm best overall performance of SEMgsa.

CONCLUSIONS

SEMgsa is a novel yet powerful method for identifying enrichment with regard to gene expression data. It takes into account topological information and exploits pathway perturbation statistics to reveal biological information. SEMgsa is implemented in the R package SEMgraph, easily available at https://CRAN.R-project.org/package=SEMgraph .

Collapse

NMR in Metabolomics: From Conventional Statistics to Machine Learning and Neural Network Approaches. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12062824] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Fifteen Years of Gene Set Analysis for High-Throughput Genomic Data: A Review of Statistical Approaches and Future Challenges. ENTROPY 2020;22:e22040427. [PMID: 33286201 PMCID: PMC7516904 DOI: 10.3390/e22040427] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 03/18/2020] [Accepted: 04/03/2020] [Indexed: 12/22/2022]

Nguyen TM, Shafi A, Nguyen T, Draghici S. Identifying significantly impacted pathways: a comprehensive review and assessment. Genome Biol 2019;20:203. [PMID: 31597578 PMCID: PMC6784345 DOI: 10.1186/s13059-019-1790-4] [Citation(s) in RCA: 96] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 08/13/2019] [Indexed: 01/01/2023] Open

Amadoz A, Hidalgo MR, Çubuk C, Carbonell-Caballero J, Dopazo J. A comparison of mechanistic signaling pathway activity analysis methods. Brief Bioinform 2019;20:1655-1668. [PMID: 29868818 PMCID: PMC6917216 DOI: 10.1093/bib/bby040] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Revised: 03/31/2018] [Indexed: 12/11/2022] Open

Li Y, Wu Y, Zhang X, Bai Y, Akthar LM, Lu X, Shi M, Zhao J, Jiang Q, Li Y. SCIA: A Novel Gene Set Analysis Applicable to Data With Different Characteristics. Front Genet 2019;10:598. [PMID: 31293623 PMCID: PMC6603225 DOI: 10.3389/fgene.2019.00598] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2019] [Accepted: 06/05/2019] [Indexed: 01/06/2023] Open

Statistical approach for selection of biologically informative genes. Gene 2018;655:71-83. [PMID: 29458166 DOI: 10.1016/j.gene.2018.02.044] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Revised: 11/26/2017] [Accepted: 02/14/2018] [Indexed: 11/23/2022]

Abstract

Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes from high dimensional expression data for breeding and system biology studies.

Collapse

Statistical Approach for Gene Set Analysis with Trait Specific Quantitative Trait Loci. Sci Rep 2018;8:2391. [PMID: 29402907 PMCID: PMC5799309 DOI: 10.1038/s41598-018-19736-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Accepted: 12/06/2017] [Indexed: 11/20/2022] Open

Gelli M, Konda AR, Liu K, Zhang C, Clemente TE, Holding DR, Dweikat IM. Validation of QTL mapping and transcriptome profiling for identification of candidate genes associated with nitrogen stress tolerance in sorghum. BMC PLANT BIOLOGY 2017;17:123. [PMID: 28697783 PMCID: PMC5505042 DOI: 10.1186/s12870-017-1064-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 06/25/2017] [Indexed: 05/10/2023]

Abstract

BACKGROUND

Quantitative trait loci (QTLs) detected in one mapping population may not be detected in other mapping populations at all the time. Therefore, before being used for marker assisted breeding, QTLs need to be validated in different environments and/or genetic backgrounds to rule out statistical anomalies. In this regard, we mapped the QTLs controlling various agronomic traits in a recombinant inbred line (RIL) population in response to Nitrogen (N) stress and validated these with the reported QTLs in our earlier study to find the stable and consistent QTLs across populations. Also, with Illumina RNA-sequencing we checked the differential expression of gene (DEG) transcripts between parents and pools of RILs with high and low nitrogen use efficiency (NUE) and overlaid these DEGs on to the common validated QTLs to find candidate genes associated with N-stress tolerance in sorghum.

RESULTS

An F₇ RIL population derived from a cross between CK60 (N-stress sensitive) and San Chi San (N-stress tolerant) inbred sorghum lines was used to map QTLs for 11 agronomic traits tested under different N-levels. Composite interval mapping analysis detected a total of 32 QTLs for 11 agronomic traits. Validation of these QTLs revealed that of the detected, nine QTLs from this population were consistent with the reported QTLs in earlier study using CK60/China17 RIL population. The validated QTLs were located on chromosomes 1, 6, 7, 8, and 9. In addition, root transcriptomic profiling detected 55 and 20 differentially expressed gene (DEG) transcripts between parents and pools of RILs with high and low NUE respectively. Also, overlay of these DEG transcripts on to the validated QTLs found candidate genes transcripts for NUE and also showed the expected differential expression. For example, DEG transcripts encoding Lysine histidine transporter 1 (LHT1) had abundant expression in San Chi San and the tolerant RIL pool, whereas DEG transcripts encoding seed storage albumin, transcription factor IIIC (TFIIIC) and dwarfing gene (DW2) encoding multidrug resistance-associated protein-9 homolog showed abundant expression in CK60 parent, similar to earlier study.

CONCLUSIONS

The validated QTLs among different mapping populations would be the most reliable and stable QTLs across germplasm. The DEG transcripts found in the validated QTL regions will serve as future candidate genes for enhancing NUE in sorghum using molecular approaches.

Collapse

Ren X, Hu Q, Liu S, Wang J, Miecznikowski JC. Gene set analysis controlling for length bias in RNA-seq experiments. BioData Min 2017;10:5. [PMID: 28184252 PMCID: PMC5294840 DOI: 10.1186/s13040-017-0125-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Accepted: 01/11/2017] [Indexed: 01/29/2023] Open

Du J, Li M, Yuan Z, Guo M, Song J, Xie X, Chen Y. A decision analysis model for KEGG pathway analysis. BMC Bioinformatics 2016;17:407. [PMID: 27716040 PMCID: PMC5053338 DOI: 10.1186/s12859-016-1285-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Accepted: 09/28/2016] [Indexed: 11/18/2022] Open

Abstract

Background

The knowledge base-driven pathway analysis is becoming the first choice for many investigators, in that it not only can reduce the complexity of functional analysis by grouping thousands of genes into just several hundred pathways, but also can increase the explanatory power for the experiment by identifying active pathways in different conditions. However, current approaches are designed to analyze a biological system assuming that each pathway is independent of the other pathways.

Results

A decision analysis model is developed in this article that accounts for dependence among pathways in time-course experiments and multiple treatments experiments. This model introduces a decision coefficient—a designed index, to identify the most relevant pathways in a given experiment by taking into account not only the direct determination factor of each Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway itself, but also the indirect determination factors from its related pathways. Meanwhile, the direct and indirect determination factors of each pathway are employed to demonstrate the regulation mechanisms among KEGG pathways, and the sign of decision coefficient can be used to preliminarily estimate the impact direction of each KEGG pathway. The simulation study of decision analysis demonstrated the application of decision analysis model for KEGG pathway analysis.

Conclusions

A microarray dataset from bovine mammary tissue over entire lactation cycle was used to further illustrate our strategy. The results showed that the decision analysis model can provide the promising and more biologically meaningful results. Therefore, the decision analysis model is an initial attempt of optimizing pathway analysis methodology.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1285-1) contains supplementary material, which is available to authorized users.

Collapse

Ma J, Shojaie A, Michailidis G. Network-based pathway enrichment analysis with incomplete network information. Bioinformatics 2016;32:3165-3174. [PMID: 27357170 DOI: 10.1093/bioinformatics/btw410] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Accepted: 06/22/2016] [Indexed: 11/12/2022] Open

Mass spectrometry analysis and transcriptome sequencing reveal glowing squid crystal proteins are in the same superfamily as firefly luciferase. Sci Rep 2016;6:27638. [PMID: 27279452 PMCID: PMC4899746 DOI: 10.1038/srep27638] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 05/18/2016] [Indexed: 01/14/2023] Open

Rue-Albrecht K, McGettigan PA, Hernández B, Nalpas NC, Magee DA, Parnell AC, Gordon SV, MacHugh DE. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data. BMC Bioinformatics 2016;17:126. [PMID: 26968614 PMCID: PMC4788925 DOI: 10.1186/s12859-016-0971-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Accepted: 02/25/2016] [Indexed: 02/06/2023] Open

Abstract

Background

Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors.

Results

We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples.

Conclusions

GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-0971-3) contains supplementary material, which is available to authorized users.

Collapse

Alonso R, Salavert F, Garcia-Garcia F, Carbonell-Caballero J, Bleda M, Garcia-Alonso L, Sanchis-Juan A, Perez-Gil D, Marin-Garcia P, Sanchez R, Cubuk C, Hidalgo MR, Amadoz A, Hernansaiz-Ballesteros RD, Alemán A, Tarraga J, Montaner D, Medina I, Dopazo J. Babelomics 5.0: functional interpretation for new generations of genomic data. Nucleic Acids Res 2015;43:W117-21. [PMID: 25897133 PMCID: PMC4489263 DOI: 10.1093/nar/gkv384] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2015] [Accepted: 04/11/2015] [Indexed: 02/02/2023] Open

Affiliation(s)

Roberto Alonso Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain Computational Genomics Chair, Bull-CIPF, Valencia, 46012, Spain
Francisco Salavert Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, 46012, Spain
Francisco Garcia-Garcia Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Jose Carbonell-Caballero Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Marta Bleda Department of Medicine, University of Cambridge, School of Clinical Medicine, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK
Luz Garcia-Alonso Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Alba Sanchis-Juan Fundación Investigación Clínico de Valencia-INCLIVA, Valencia, 46010, Spain
Daniel Perez-Gil Fundación Investigación Clínico de Valencia-INCLIVA, Valencia, 46010, Spain
Pablo Marin-Garcia Fundación Investigación Clínico de Valencia-INCLIVA, Valencia, 46010, Spain
Ruben Sanchez Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain Functional Genomics Node, (INB) at CIPF, Valencia, 46012, Spain
Cankut Cubuk Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Marta R Hidalgo Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Alicia Amadoz Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Rosa D Hernansaiz-Ballesteros Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Alejandro Alemán Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, 46012, Spain
Joaquin Tarraga Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
David Montaner Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain
Ignacio Medina HPC Services, University of Cambridge, Cambridge, CB3 0RB UK
Joaquin Dopazo Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, 46012, Spain Computational Genomics Chair, Bull-CIPF, Valencia, 46012, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, 46012, Spain Functional Genomics Node, (INB) at CIPF, Valencia, 46012, Spain

Collapse

Rizza S, Conesa A, Juarez J, Catara A, Navarro L, Duran-Vila N, Ancillo G. Microarray analysis of Etrog citron (Citrus medica L.) reveals changes in chloroplast, cell wall, peroxidase and symporter activities in response to viroid infection. MOLECULAR PLANT PATHOLOGY 2012;13:852-64. [PMID: 22420919 PMCID: PMC6638686 DOI: 10.1111/j.1364-3703.2012.00794.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Ontological Analysis and Pathway Modelling in Drug Discovery. Pharmaceut Med 2012. [DOI: 10.1007/bf03256689] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Wit EC, Bakewell DJG. Borrowing strength: a likelihood ratio test for related sparse signals. Bioinformatics 2012;28:1980-9. [PMID: 22668791 DOI: 10.1093/bioinformatics/bts316] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Ibrahim MAH, Jassim S, Cawthorne MA, Langlands K. A topology-based score for pathway enrichment. J Comput Biol 2012;19:563-73. [PMID: 22468678 DOI: 10.1089/cmb.2011.0182] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

KVIST JOUNI, WHEAT CHRISTOPHERW, KALLIONIEMI EVELIINA, SAASTAMOINEN MARJO, HANSKI ILKKA, FRILANDER MIKKOJ. Temperature treatments during larval development reveal extensive heritable and plastic variation in gene expression and life history traits. Mol Ecol 2012;22:602-19. [DOI: 10.1111/j.1365-294x.2012.05521.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 2012;8:e1002375. [PMID: 22383865 PMCID: PMC3285573 DOI: 10.1371/journal.pcbi.1002375] [Citation(s) in RCA: 1005] [Impact Index Per Article: 83.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open

Ji RR, Ott KH, Yordanova R, Bruccoleri RE. FDR-FET: an optimizing gene set enrichment analysis method. Adv Appl Bioinform Chem 2011;4:37-42. [PMID: 21918636 PMCID: PMC3169954 DOI: 10.2147/aabc.s15840] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Gallego-Bartolomé J, Alabadí D, Blázquez MA. DELLA-induced early transcriptional changes during etiolated development in Arabidopsis thaliana. PLoS One 2011;6:e23918. [PMID: 21904598 PMCID: PMC3164146 DOI: 10.1371/journal.pone.0023918] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 08/01/2011] [Indexed: 11/24/2022] Open

Natural selection on functional modules, a genome-wide analysis. PLoS Comput Biol 2011;7:e1001093. [PMID: 21390268 PMCID: PMC3048381 DOI: 10.1371/journal.pcbi.1001093] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2010] [Accepted: 01/27/2011] [Indexed: 12/24/2022] Open

Functional analysis: evaluation of response intensities--tailoring ANOVA for lists of expression subsets. BMC Bioinformatics 2010;11:510. [PMID: 20942918 PMCID: PMC2964684 DOI: 10.1186/1471-2105-11-510] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2010] [Accepted: 10/13/2010] [Indexed: 02/06/2023] Open

Abstract

Background

Microarray data is frequently used to characterize the expression profile of a whole genome and to compare the characteristics of that genome under several conditions. Geneset analysis methods have been described previously to analyze the expression values of several genes related by known biological criteria (metabolic pathway, pathology signature, co-regulation by a common factor, etc.) at the same time and the cost of these methods allows for the use of more values to help discover the underlying biological mechanisms.

Results

As several methods assume different null hypotheses, we propose to reformulate the main question that biologists seek to answer. To determine which genesets are associated with expression values that differ between two experiments, we focused on three ad hoc criteria: expression levels, the direction of individual gene expression changes (up or down regulation), and correlations between genes. We introduce the FAERI methodology, tailored from a two-way ANOVA to examine these criteria. The significance of the results was evaluated according to the self-contained null hypothesis, using label sampling or by inferring the null distribution from normally distributed random data. Evaluations performed on simulated data revealed that FAERI outperforms currently available methods for each type of set tested. We then applied the FAERI method to analyze three real-world datasets on hypoxia response. FAERI was able to detect more genesets than other methodologies, and the genesets selected were coherent with current knowledge of cellular response to hypoxia. Moreover, the genesets selected by FAERI were confirmed when the analysis was repeated on two additional related datasets.

Conclusions

The expression values of genesets are associated with several biological effects. The underlying mathematical structure of the genesets allows for analysis of data from several genes at the same time. Focusing on expression levels, the direction of the expression changes, and correlations, we showed that two-step data reduction allowed us to significantly improve the performance of geneset analysis using a modified two-way ANOVA procedure, and to detect genesets that current methods fail to detect.

Collapse

Minguez P, Dopazo J. Functional genomics and networks: new approaches in the extraction of complex gene modules. Expert Rev Proteomics 2010;7:55-63. [PMID: 20121476 DOI: 10.1586/epr.09.103] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Montaner D, Dopazo J. Multidimensional gene set analysis of genomic data. PLoS One 2010;5:e10348. [PMID: 20436964 PMCID: PMC2860497 DOI: 10.1371/journal.pone.0010348] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 03/30/2010] [Indexed: 11/27/2022] Open

Dopazo J. Functional profiling methods in cancer. Methods Mol Biol 2010;576:363-374. [PMID: 19882272 DOI: 10.1007/978-1-59745-545-9_19] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]

Stavang JA, Gallego-Bartolomé J, Gómez MD, Yoshida S, Asami T, Olsen JE, García-Martínez JL, Alabadí D, Blázquez MA. Hormonal regulation of temperature-induced growth in Arabidopsis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2009;60:589-601. [PMID: 19686536 DOI: 10.1111/j.1365-313x.2009.03983.x] [Citation(s) in RCA: 192] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]

Bartholomé K, Kreutz C, Timmer J. Estimation of gene induction enables a relevance-based ranking of gene sets. J Comput Biol 2009;16:959-67. [PMID: 19580524 DOI: 10.1089/cmb.2008.0226] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open

Zhang L, Hammell M, Kudlow BA, Ambros V, Han M. Systematic analysis of dynamic miRNA-target interactions during C. elegans development. Development 2009;136:3043-55. [PMID: 19675127 PMCID: PMC2730362 DOI: 10.1242/dev.039008] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/24/2009] [Indexed: 11/20/2022]

Moreno-Manzano V, Rodríguez-Jiménez FJ, García-Roselló M, Laínez S, Erceg S, Calvo MT, Ronaghi M, Lloret M, Planells-Cases R, Sánchez-Puelles JM, Stojkovic M. Activated spinal cord ependymal stem cells rescue neurological function. Stem Cells 2009;27:733-43. [PMID: 19259940 DOI: 10.1002/stem.24] [Citation(s) in RCA: 115] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Guan P, Huang D, He M, Zhou B. Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method. JOURNAL OF EXPERIMENTAL & CLINICAL CANCER RESEARCH : CR 2009;28:103. [PMID: 19615083 PMCID: PMC2719616 DOI: 10.1186/1756-9966-28-103] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2009] [Accepted: 07/18/2009] [Indexed: 01/13/2023]

Nueda MJ, Sebastián P, Tarazona S, García-García F, Dopazo J, Ferrer A, Conesa A. Functional assessment of time course microarray data. BMC Bioinformatics 2009;10 Suppl 6:S9. [PMID: 19534758 PMCID: PMC2697656 DOI: 10.1186/1471-2105-10-s6-s9] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Abstract

Motivation

Time-course microarray experiments study the progress of gene expression along time across one or several experimental conditions. Most developed analysis methods focus on the clustering or the differential expression analysis of genes and do not integrate functional information. The assessment of the functional aspects of time-course transcriptomics data requires the use of approaches that exploit the activation dynamics of the functional categories to where genes are annotated.

Methods

We present three novel methodologies for the functional assessment of time-course microarray data. i) maSigFun derives from the maSigPro method, a regression-based strategy to model time-dependent expression patterns and identify genes with differences across series. maSigFun fits a regression model for groups of genes labeled by a functional class and selects those categories which have a significant model. ii) PCA-maSigFun fits a PCA model of each functional class-defined expression matrix to extract orthogonal patterns of expression change, which are then assessed for their fit to a time-dependent regression model. iii) ASCA-functional uses the ASCA model to rank genes according to their correlation to principal time expression patterns and assess functional enrichment on a GSA fashion. We used simulated and experimental datasets to study these novel approaches. Results were compared to alternative methodologies.

Results

Synthetic and experimental data showed that the different methods are able to capture different aspects of the relationship between genes, functions and co-expression that are biologically meaningful. The methods should not be considered as competitive but they provide different insights into the molecular and functional dynamic events taking place within the biological system under study.

Collapse

Medina I, Montaner D, Bonifaci N, Pujana MA, Carbonell J, Tarraga J, Al-Shahrour F, Dopazo J. Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies. Nucleic Acids Res 2009;37:W340-4. [PMID: 19502494 PMCID: PMC2703970 DOI: 10.1093/nar/gkp481] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Montaner D, Minguez P, Al-Shahrour F, Dopazo J. Gene set internal coherence in the context of functional profiling. BMC Genomics 2009;10:197. [PMID: 19397819 PMCID: PMC2680416 DOI: 10.1186/1471-2164-10-197] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2008] [Accepted: 04/27/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pathways, etc.) whose representation in a pre-defined list of genes is further studied. In the most popular type of microarray experimental designs (e.g. up- or down-regulated genes, clusters of co-expressing genes, etc.) or in other genomic experiments (e.g. Chip-on-chip, epigenomics, etc.) these lists are composed by genes with a high degree of co-expression. Therefore, an implicit assumption in the application of functional profiling methods within this context is that the genes corresponding to the modules tested are effectively defining sets of co-expressing genes. Nevertheless not all the functional modules are biologically coherent entities in terms of co-expression, which will eventually hinder its detection with conventional methods of functional enrichment.

RESULTS

Using a large collection of microarray data we have carried out a detailed survey of internal correlation in GO terms and KEGG pathways, providing a coherence index to be used for measuring functional module co-regulation. An unexpected low level of internal correlation was found among the modules studied. Only around 30% of the modules defined by GO terms and 57% of the modules defined by KEGG pathways display an internal correlation higher than the expected by chance.This information on the internal correlation of the genes within the functional modules can be used in the context of a logistic regression model in a simple way to improve their detection in gene expression experiments.

CONCLUSION

For the first time, an exhaustive study on the internal co-expression of the most popular functional categories has been carried out. Interestingly, the real level of coexpression within many of them is lower than expected (or even inexistent), which will preclude its detection by means of most conventional functional profiling methods. If the gene-to-function correlation information is used in functional profiling methods, the results obtained improve the ones obtained by conventional enrichment methods.

Collapse

Jantus Lewintre E, Reinoso Martín C, Montaner D, Marín M, José Terol M, Farrás R, Benet I, Calvete JJ, Dopazo J, García-Conde J. Analysis of chronic lymphotic leukemia transcriptomic profile: differences between molecular subgroups. Leuk Lymphoma 2009;50:68-79. [PMID: 19127482 DOI: 10.1080/10428190802541807] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Zhu M, Yu M, Zhao S. Understanding quantitative genetics in the systems biology era. Int J Biol Sci 2009;5:161-70. [PMID: 19173038 PMCID: PMC2631226 DOI: 10.7150/ijbs.5.161] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Accepted: 01/21/2009] [Indexed: 01/06/2023] Open

Hamid JS, Hu P, Roslin NM, Ling V, Greenwood CMT, Beyene J. Data integration in genetics and genomics: methods and challenges. HUMAN GENOMICS AND PROTEOMICS : HGP 2009;2009. [PMID: 20948564 PMCID: PMC2950414 DOI: 10.4061/2009/869093] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2008] [Accepted: 12/01/2008] [Indexed: 01/18/2023]

Bonifaci N, Berenguer A, Díez J, Reina O, Medina I, Dopazo J, Moreno V, Pujana MA. Biological processes, properties and molecular wiring diagrams of candidate low-penetrance breast cancer susceptibility genes. BMC Med Genomics 2008;1:62. [PMID: 19094230 PMCID: PMC2628924 DOI: 10.1186/1755-8794-1-62] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Accepted: 12/18/2008] [Indexed: 12/24/2022] Open

Abstract

Background

Recent advances in whole-genome association studies (WGASs) for human cancer risk are beginning to provide the part lists of low-penetrance susceptibility genes. However, statistical analysis in these studies is complicated by the vast number of genetic variants examined and the weak effects observed, as a result of which constraints must be incorporated into the study design and analytical approach. In this scenario, biological attributes beyond the adjusted statistics generally receive little attention and, more importantly, the fundamental biological characteristics of low-penetrance susceptibility genes have yet to be determined.

Methods

We applied an integrative approach for identifying candidate low-penetrance breast cancer susceptibility genes, their characteristics and molecular networks through the analysis of diverse sources of biological evidence.

Results

First, examination of the distribution of Gene Ontology terms in ordered WGAS results identified asymmetrical distribution of Cell Communication and Cell Death processes linked to risk. Second, analysis of 11 different types of molecular or functional relationships in genomic and proteomic data sets defined the "omic" properties of candidate genes: i/ differential expression in tumors relative to normal tissue; ii/ somatic genomic copy number changes correlating with gene expression levels; iii/ differentially expressed across age at diagnosis; and iv/ expression changes after BRCA1 perturbation. Finally, network modeling of the effects of variants on germline gene expression showed higher connectivity than expected by chance between novel candidates and with known susceptibility genes, which supports functional relationships and provides mechanistic hypotheses of risk.

Conclusion

This study proposes that cell communication and cell death are major biological processes perturbed in risk of breast cancer conferred by low-penetrance variants, and defines the common omic properties, molecular interactions and possible functional effects of candidate genes and proteins.

Collapse

Tintle NL, Best AA, DeJongh M, Van Bruggen D, Heffron F, Porwollik S, Taylor RC. Gene set analyses for interpreting microarray experiments on prokaryotic organisms. BMC Bioinformatics 2008;9:469. [PMID: 18986519 PMCID: PMC2587482 DOI: 10.1186/1471-2105-9-469] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Accepted: 11/05/2008] [Indexed: 11/10/2022] Open

Larsson O, Diebold D, Fan D, Peterson M, Nho RS, Bitterman PB, Henke CA. Fibrotic myofibroblasts manifest genome-wide derangements of translational control. PLoS One 2008;3:e3220. [PMID: 18795102 PMCID: PMC2528966 DOI: 10.1371/journal.pone.0003220] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Accepted: 08/20/2008] [Indexed: 11/19/2022] Open

Abstract

Background

As a group, fibroproliferative disorders of the lung, liver, kidney, heart, vasculature and integument are common, progressive and refractory to therapy. They can emerge following toxic insults, but are frequently idiopathic. Their enigmatic propensity to resist therapy and progress to organ failure has focused attention on the myofibroblast–the primary effector of the fibroproliferative response. We have recently shown that aberrant beta 1 integrin signaling in fibrotic fibroblasts results in defective PTEN function, unrestrained Akt signaling and subsequent activation of the translation initiation machinery. How this pathological integrin signaling alters the gene expression pathway has not been elucidated.

Results

Using a systems approach to study this question in a prototype fibrotic disease, Idiopathic Pulmonary Fibrosis (IPF); here we show organized changes in the gene expression pathway of primary lung myofibroblasts that persist for up to 9 sub-cultivations in vitro. When comparing IPF and control myofibroblasts in a 3-dimensional type I collagen matrix, more genes differed at the level of ribosome recruitment than at the level of transcript abundance, indicating pathological translational control as a major characteristic of IPF myofibroblasts. To determine the effect of matrix state on translational control, myofibroblasts were permitted to contract the matrix. Ribosome recruitment in control myofibroblasts was relatively stable. In contrast, IPF cells manifested large alterations in the ribosome recruitment pattern. Pathological studies suggest an epithelial origin for IPF myofibroblasts through the epithelial to mesenchymal transition (EMT). In accord with this, we found systems-level indications for TGF-β -driven EMT as one source of IPF myofibroblasts.

Conclusions

These findings establish the power of systems level genome-wide analysis to provide mechanistic insights into fibrotic disorders such as IPF. Our data point to derangements of translational control downstream of aberrant beta 1 integrin signaling as a fundamental component of IPF pathobiology and indicates that TGF-β -driven EMT is one source for IPF myofibroblasts.

Collapse

Conesa A, Bro R, García-García F, Prats JM, Götz S, Kjeldahl K, Montaner D, Dopazo J. Direct functional assessment of the composite phenotype through multivariate projection strategies. Genomics 2008;92:373-83. [PMID: 18652888 DOI: 10.1016/j.ygeno.2008.05.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2008] [Revised: 05/26/2008] [Accepted: 05/28/2008] [Indexed: 01/11/2023]

Dopazo J. Formulating and testing hypotheses in functional genomics. Artif Intell Med 2008;45:97-107. [PMID: 18789659 DOI: 10.1016/j.artmed.2008.08.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2008] [Revised: 08/04/2008] [Accepted: 08/04/2008] [Indexed: 01/08/2023]

Agudelo-Romero P, Carbonell P, de la Iglesia F, Carrera J, Rodrigo G, Jaramillo A, Pérez-Amador MA, Elena SF. Changes in the gene expression profile of Arabidopsis thaliana after infection with Tobacco etch virus. Virol J 2008;5:92. [PMID: 18684336 PMCID: PMC2518140 DOI: 10.1186/1743-422x-5-92] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2008] [Accepted: 08/07/2008] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

Tobacco etch potyvirus (TEV) has been extensively used as model system for the study of positive-sense RNA virus infecting plants. TEV ability to infect Arabidopsis thaliana varies among ecotypes. In this study, changes in gene expression of A. thaliana ecotype Ler infected with TEV have been explored using long-oligonucleotide arrays. A. thaliana Ler is a susceptible host that allows systemic movement, although the viral load is low and syndrome induced ranges from asymptomatic to mild. Gene expression profiles were monitored in whole plants 21 days post-inoculation (dpi). Microarrays contained 26,173 protein-coding genes and 87 miRNAs.

RESULTS

Expression analysis identified 1727 genes that displayed significant and consistent changes in expression levels either up or down, in infected plants. Identified TEV-responsive genes encode a diverse array of functional categories that include responses to biotic (such as the systemic acquired resistance pathway and hypersensitive responses) and abiotic stresses (droughtness, salinity, temperature, and wounding). The expression of many different transcription factors was also significantly affected, including members of the R2R3-MYB family and ABA-inducible TFs. In concordance with several other plant and animal viruses, the expression of heat-shock proteins (HSP) was also increased. Finally, we have associated functional GO categories with KEGG biochemical pathways, and found that many of the altered biological functions are controlled by changes in basal metabolism.

CONCLUSION

TEV infection significantly impacts a wide array of cellular processes, in particular, stress-response pathways, including the systemic acquired resistance and hypersensitive responses. However, many of the observed alterations may represent a global response to viral infection rather than being specific of TEV.

Collapse

Wei P, Pan W. Incorporating gene functions into regression analysis of DNA-protein binding data and gene expression data to construct transcriptional networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2008;5:401-415. [PMID: 18670043 DOI: 10.1109/tcbb.2007.1062] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Denton AM, Wu J, Townsend MK, Sule P, Prüss BM. Relating gene expression data on two-component systems to functional annotations in Escherichia coli. BMC Bioinformatics 2008;9:294. [PMID: 18578884 PMCID: PMC2478693 DOI: 10.1186/1471-2105-9-294] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Accepted: 06/25/2008] [Indexed: 11/30/2022] Open