2
|
Rowhanimanesh A. A novel approach for the analysis of time-course gene expression data based on computing with words. J Biomed Inform 2021; 120:103868. [PMID: 34271172 DOI: 10.1016/j.jbi.2021.103868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 06/15/2021] [Accepted: 07/10/2021] [Indexed: 10/20/2022]
Abstract
In this paper, a novel approach is proposed for the analysis of time-course gene expression data based on the path-breaking work of Zadeh, Computing with Words. This method can automatically discover the patterns of temporal gene expression profile in terms of two distinguishing descriptions: linguistic description that is understandable and interpretable for human inference; and type-2 fuzzy description that is suitable for robust machine inference in the presence of uncertainty. In contrast to conventional static data mining methods which focus on the steady-state gene expression levels, the proposed scheme is a new time-series pattern mining technique for dynamical modeling of gene expression. To evaluate the performance of this paradigm, it is applied to a case study dataset from Gene Expression Omnibus (GEO) which includes the temporal transcriptional profile of human colon cancer cells. The goal is to investigate the pharmacodynamics of two anticancer drugs. The transient and steady-state analysis of the transcriptional response clearly demonstrates the ability of the proposed approach to reveal the pharmacodynamical effects of drug type and dosage on the expression of genes.
Collapse
Affiliation(s)
- Alireza Rowhanimanesh
- Intelligent Systems Laboratory, Department of Electrical Engineering, University of Neyshabur, Neyshabur, Iran.
| |
Collapse
|
3
|
Doostparast Torshizi A, Duan J, Wang K. A computational method for direct imputation of cell type-specific expression profiles and cellular compositions from bulk-tissue RNA-Seq in brain disorders. NAR Genom Bioinform 2021; 3:lqab056. [PMID: 34169279 PMCID: PMC8219045 DOI: 10.1093/nargab/lqab056] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/24/2021] [Accepted: 06/21/2021] [Indexed: 02/06/2023] Open
Abstract
The importance of cell type-specific gene expression in disease-relevant tissues is increasingly recognized in genetic studies of complex diseases. However, most gene expression studies are conducted on bulk tissues, without examining cell type-specific expression profiles. Several computational methods are available for cell type deconvolution (i.e. inference of cellular composition) from bulk RNA-Seq data, but few of them impute cell type-specific expression profiles. We hypothesize that with external prior information such as single cell RNA-seq and population-wide expression profiles, it can be computationally tractable to estimate both cellular composition and cell type-specific expression from bulk RNA-Seq data. Here we introduce CellR, which addresses cross-individual gene expression variations to adjust the weights of cell-specific gene markers. It then transforms the deconvolution problem into a linear programming model while taking into account inter/intra cellular correlations and uses a multi-variate stochastic search algorithm to estimate the cell type-specific expression profiles. Analyses on several complex diseases such as schizophrenia, Alzheimer’s disease, Huntington’s disease and type 2 diabetes validated the efficiency of CellR, while revealing how specific cell types contribute to different diseases. In summary, CellR compares favorably against competing approaches, enabling cell type-specific re-analysis of gene expression data on bulk tissues in complex diseases.
Collapse
Affiliation(s)
- Abolfazl Doostparast Torshizi
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Jubao Duan
- Center for Psychiatric Genetics, NorthShore University HealthSystem, Evanston, IL 60201, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| |
Collapse
|
4
|
Liang F, Lv K, Wang Y, Yuan Y, Lu L, Feng Q, Jing X, Wang H, Liu C, Rayner S, Ling S, Chen H, Wan Y, Zhou W, He L, Wu B, Qu L, Chen S, Xiong J, Li Y. Personalized Epigenome Remodeling Under Biochemical and Psychological Changes During Long-Term Isolation Environment. Front Physiol 2019; 10:932. [PMID: 31417412 PMCID: PMC6684777 DOI: 10.3389/fphys.2019.00932] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 07/09/2019] [Indexed: 12/16/2022] Open
Abstract
It has been reported that several aspects of human health could be disturbed during a long-term isolated environment (for instance, the Mars-500 mission), including psychiatric disorders, circadian disruption, temporal dynamics of gut microbiota, immune responses, and physical-activity-related neuromuscular performance. Nevertheless, the mechanisms underlying these disturbances and the interactions among different aspects of human adaptation to extreme environments remain to be elucidated. Epigenetic features, like DNA methylation, might be a linking mechanism that explains the involvement of environmental factors between the human genome and the outcome of health. We conducted an exploration of personalized longitudinal DNA methylation patterns of the peripheral whole blood cells, profiling six subjects across six sampling points in the Mars-500 mission. Specifically, we developed a Personalized Epigenetic-Phenotype Synchronization Analysis (PeSa) algorithm to explore glucose- and mood-state-synchronized DNA methylation sites, focusing on finding the dynamic associations between epigenetic patterns and phenotypes in each individual, and exploring the underling epigenetic connections between glucose and mood-state disturbance. Results showed that DMPs (differentially methylated-probes) were significantly enriched in pathways related to glucose metabolism (Type II diabetes mellitus pathway), mood state (Long-term depression) and circadian rhythm (Circadian entrainment pathway) during the mission. Furthermore, our data revealed individualized glucose-synchronized and mood-state-synchronized DNA methylation sites, and PTPRN2 was found to be associated with both glucose and mood state disturbances across all six subjects. Our findings suggest that personalized phenotype-synchronized epigenetic features could reflect the effects on the human body, including the disturbances of glucose and mood-states. The association analysis of DNA methylation and phenotypes, like the PeSa analysis, could provide new possibilities in understanding the intrinsic relationship between phenotypic changes of the human body adapting to long-term isolation environmental factors.
Collapse
Affiliation(s)
- Fengji Liang
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China.,Lab of Epigenetics and Health Prediction, SPACEnter Space Science and Technology Institute, Shenzhen, China
| | - Ke Lv
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China.,Lab of Epigenetics and Health Prediction, SPACEnter Space Science and Technology Institute, Shenzhen, China
| | - Yue Wang
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Yanhong Yuan
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Liang Lu
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Qiang Feng
- Lab of Epigenetics and Health Prediction, SPACEnter Space Science and Technology Institute, Shenzhen, China
| | - Xiaolu Jing
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Honghui Wang
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Changning Liu
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Simon Rayner
- Department of Medical Genetics, Oslo University Hospital, University of Oslo, Oslo, Norway
| | - Shukuan Ling
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Hailong Chen
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Yumin Wan
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Wanlong Zhou
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Li He
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Bin Wu
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Lina Qu
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Shanguang Chen
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China
| | - Jianghui Xiong
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China.,Lab of Epigenetics and Health Prediction, SPACEnter Space Science and Technology Institute, Shenzhen, China
| | - Yinghui Li
- State Key Laboratory of Space Medicine Fundamentals and Application, China Astronaut Research and Training Center, Beijing, China.,Lab of Epigenetics and Health Prediction, SPACEnter Space Science and Technology Institute, Shenzhen, China
| |
Collapse
|
5
|
Abstract
We propose a novel method for adaptive K-means clustering. The proposed method overcomes the problems of the traditional K-means algorithm. Specifically, the proposed method does not require prior knowledge of the number of clusters. Additionally, the initial identification of the cluster elements has no negative impact on the final generated clusters. Inspired by cell cloning in microorganism cultures, each added data sample causes the existing cluster ‘colonies’ to evaluate, with the other clusters, various merging or splitting actions in order for reaching the optimum cluster set. The proposed algorithm is adequate for clustering data in isolated or overlapped compact spherical clusters. Experimental results support the effectiveness of this clustering algorithm.
Collapse
|
6
|
Torshizi AD, Petzold L. Sparse Pathway-Induced Dynamic Network Biomarker Discovery for Early Warning Signal Detection in Complex Diseases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1028-1034. [PMID: 28368826 DOI: 10.1109/tcbb.2017.2687925] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In many complex diseases, the transition process from the healthy stage to the catastrophic stage does not occur gradually. Recent studies indicate that the initiation and progression of such diseases are comprised of three steps including healthy stage, pre-disease stage, and disease stage. It has been demonstrated that a certain set of trajectories can be observed in the genetic signatures at the molecular level, which might be used to detect the pre-disease stage and to take necessary medical interventions. In this paper, we propose two optimization-based algorithms for extracting the dynamic network biomarkers responsible for catastrophic transition into the disease stage, and to open new horizons to reverse the disease progression at an early stage through pinpointing molecular signatures provided by high-throughput microarray data. The first algorithm relies on meta-heuristic intelligent search to characterize dynamic network biomarkers represented as a complete graph. The second algorithm induces sparsity on the adjacency matrix of the genes by taking into account the biological signaling and metabolic pathways, since not all the genes in the ineractome are biologically linked. Comprehensive numerical and meta-analytical experiments verify the effectiveness of the results of the proposed approaches in terms of network size, biological meaningfulness, and verifiability.
Collapse
|
7
|
Doostparast Torshizi A, Petzold LR. Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification. J Am Med Inform Assoc 2018; 25:99-108. [PMID: 28505320 PMCID: PMC7647127 DOI: 10.1093/jamia/ocx032] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Revised: 02/08/2017] [Accepted: 03/14/2017] [Indexed: 11/14/2022] Open
Abstract
Objective Data integration methods that combine data from different molecular levels such as genome, epigenome, transcriptome, etc., have received a great deal of interest in the past few years. It has been demonstrated that the synergistic effects of different biological data types can boost learning capabilities and lead to a better understanding of the underlying interactions among molecular levels. Methods In this paper we present a graph-based semi-supervised classification algorithm that incorporates latent biological knowledge in the form of biological pathways with gene expression and DNA methylation data. The process of graph construction from biological pathways is based on detecting condition-responsive genes, where 3 sets of genes are finally extracted: all condition responsive genes, high-frequency condition-responsive genes, and P-value-filtered genes. Results The proposed approach is applied to ovarian cancer data downloaded from the Human Genome Atlas. Extensive numerical experiments demonstrate superior performance of the proposed approach compared to other state-of-the-art algorithms, including the latest graph-based classification techniques. Conclusions Simulation results demonstrate that integrating various data types enhances classification performance and leads to a better understanding of interrelations between diverse omics data types. The proposed approach outperforms many of the state-of-the-art data integration algorithms.
Collapse
Affiliation(s)
| | - Linda R Petzold
- Department of Computer Science, University of California, Santa Barbara, CA, USA
| |
Collapse
|