Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tsai CA, Liu LYD. Identifying gene set association enrichment using the coefficient of intrinsic dependence. PLoS One 2013;8:e58851. [PMID: 23516564 PMCID: PMC3597597 DOI: 10.1371/journal.pone.0058851] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2012] [Accepted: 02/08/2013] [Indexed: 11/18/2022] Open

For:	Tsai CA, Liu LYD. Identifying gene set association enrichment using the coefficient of intrinsic dependence. PLoS One 2013;8:e58851. [PMID: 23516564 PMCID: PMC3597597 DOI: 10.1371/journal.pone.0058851] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2012] [Accepted: 02/08/2013] [Indexed: 11/18/2022] Open

Number

Cited by Other Article(s)

Miao S, Ju G, Jiang C, Xue B, Zhao L, Zhang R, Diao H, Yu X, Zhang L, Pan X, Zhang H, Zang L, Wang L, Zhou T. Identification of DYNLT1 associated with proliferation, relapse, and metastasis in breast cancer. Front Med (Lausanne) 2023;10:1167676. [PMID: 37081842 PMCID: PMC10110886 DOI: 10.3389/fmed.2023.1167676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 03/13/2023] [Indexed: 04/07/2023] Open

Balestra C, Maj C, Müller E, Mayr A. Redundancy-aware unsupervised ranking based on game theory: Ranking pathways in collections of gene sets. PLoS One 2023;18:e0282699. [PMID: 36893181 PMCID: PMC9997904 DOI: 10.1371/journal.pone.0282699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 02/13/2023] [Indexed: 03/10/2023] Open

Abstract

In Genetics, gene sets are grouped in collections concerning their biological function. This often leads to high-dimensional, overlapping, and redundant families of sets, thus precluding a straightforward interpretation of their biological meaning. In Data Mining, it is often argued that techniques to reduce the dimensionality of data could increase the maneuverability and consequently the interpretability of large data. In the past years, moreover, we witnessed an increasing consciousness of the importance of understanding data and interpretable models in the machine learning and bioinformatics communities. On the one hand, there exist techniques aiming to aggregate overlapping gene sets to create larger pathways. While these methods could partly solve the large size of the collections' problem, modifying biological pathways is hardly justifiable in this biological context. On the other hand, the representation methods to increase interpretability of collections of gene sets that have been proposed so far have proved to be insufficient. Inspired by this Bioinformatics context, we propose a method to rank sets within a family of sets based on the distribution of the singletons and their size. We obtain sets' importance scores by computing Shapley values; Making use of microarray games, we do not incur the typical exponential computational complexity. Moreover, we address the challenge of constructing redundancy-aware rankings where, in our case, redundancy is a quantity proportional to the size of intersections among the sets in the collections. We use the obtained rankings to reduce the dimension of the families, therefore showing lower redundancy among sets while still preserving a high coverage of their elements. We finally evaluate our approach for collections of gene sets and apply Gene Sets Enrichment Analysis techniques to the now smaller collections: As expected, the unsupervised nature of the proposed rankings allows for unremarkable differences in the number of significant gene sets for specific phenotypic traits. In contrast, the number of performed statistical tests can be drastically reduced. The proposed rankings show a practical utility in bioinformatics to increase interpretability of the collections of gene sets and a step forward to include redundancy-awareness into Shapley values computations.

Collapse

Liu LYD, Hsiao YC, Chen HC, Yang YW, Chang MC. Construction of gene causal regulatory networks using microarray data with the coefficient of intrinsic dependence. BOTANICAL STUDIES 2019;60:22. [PMID: 31512008 PMCID: PMC6738364 DOI: 10.1186/s40529-019-0268-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 08/17/2019] [Indexed: 06/10/2023]

Abstract

BACKGROUND

In the past two decades, biologists have been able to identify the gene signatures associated with various phenotypes through the monitoring of gene expressions with high-throughput biotechnologies. These gene signatures have in turn been successfully applied to drug development, disease prevention, crop improvement, etc. However, ignoring the interactions among genes has weakened the predictive power of gene signatures in practical applications. Gene regulatory networks, in which genes are represented by nodes and the associations between genes are represented by edges, are typically constructed to analyze and visualize such gene interactions. More specifically, the present study sought to measure gene-gene associations by using the coefficient of intrinsic dependence (CID) to capture more nonlinear as well as cause-effect gene relationships.

RESULTS

A stepwise procedure using the CID along with the partial coefficient of intrinsic dependence (pCID) was demonstrated for the rebuilding of simulated networks and the well-known CBF-COR pathway under cold stress using Arabidopsis microarray data. The procedure was also applied to the construction of bHLH gene regulatory pathways under abiotic stresses using rice microarray data, in which OsbHLH104, a putative phytochrome-interacting factor (OsPIF14), and OsbHLH060, a positive regulator of iron homeostasis (OsPRI1) were inferred as the most affiliated genes. The inferred regulatory pathways were verified through literature reviews.

CONCLUSIONS

The proposed method can efficiently decipher gene regulatory pathways and may assist in achieving higher predictive power in practical applications. The lack of any mention in the literature of some of the regulatory event may have been due to the high complexity of the regulatory systems in the plant transcription, a possibility which could potentially be confirmed in the near future given ongoing rapid developments in bio-technology.

Collapse

Shen PC, Hour AL, Liu LYD. Microarray meta-analysis to explore abiotic stress-specific gene expression patterns in Arabidopsis. BOTANICAL STUDIES 2017;58:22. [PMID: 28510204 PMCID: PMC5432924 DOI: 10.1186/s40529-017-0176-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 05/05/2017] [Indexed: 05/26/2023]

Abstract

BACKGROUND

Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data.

RESULTS

By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search.

CONCLUSIONS

Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.

Collapse

Hsiao YC, Liu LYD. A Stepwise Approach of Finding Dependent Variables via Coefficient of Intrinsic Dependence. J Comput Biol 2015;23:42-55. [PMID: 26645623 DOI: 10.1089/cmb.2015.0150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open