1
|
Dong H, Ma B, Meng Y, Wu Y, Liu Y, Zeng T, Huang J. GRNMOPT: Inference of gene regulatory networks based on a multi-objective optimization approach. Comput Biol Chem 2024; 113:108223. [PMID: 39340962 DOI: 10.1016/j.compbiolchem.2024.108223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 08/21/2024] [Accepted: 09/20/2024] [Indexed: 09/30/2024]
Abstract
BACKGROUND AND OBJECTIVE The reconstruction of gene regulatory networks (GRNs) stands as a vital approach in deciphering complex biological processes. The application of nonlinear ordinary differential equations (ODEs) models has demonstrated considerable efficacy in predicting GRNs. Notably, the decay rate and time delay are pivotal in authentic gene regulation, yet their systematic determination in ODEs models remains underexplored. The development of a comprehensive optimization framework for the effective estimation of these key parameters is essential for accurate GRN inference. METHOD This study introduces GRNMOPT, an innovative methodology for inferring GRNs from time-series and steady-state data. GRNMOPT employs a combined use of decay rate and time delay in constructing ODEs models to authentically represent gene regulatory processes. It incorporates a multi-objective optimization approach, optimizing decay rate and time delay concurrently to derive Pareto optimal sets for these factors, thereby maximizing accuracy metrics such as AUROC (Area Under the Receiver Operating Characteristic curve) and AUPR (Area Under the Precision-Recall curve). Additionally, the use of XGBoost for calculating feature importance aids in identifying potential regulatory gene links. RESULTS Comprehensive experimental evaluations on two simulated datasets from DREAM4 and three real gene expression datasets (Yeast, In vivo Reverse-engineering and Modeling Assessment [IRMA], and Escherichia coli [E. coli]) reveal that GRNMOPT performs commendably across varying network scales. Furthermore, cross-validation experiments substantiate the robustness of GRNMOPT. CONCLUSION We propose a novel approach called GRNMOPT to infer GRNs based on a multi-objective optimization framework, which effectively improves inference accuracy and provides a powerful tool for GRNs inference.
Collapse
Affiliation(s)
- Heng Dong
- School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - Baoshan Ma
- School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China.
| | - Yangyang Meng
- School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - Yiming Wu
- School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - Yongjing Liu
- Biomedical big data center, the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China; Zhejiang Provincial Key Laboratory of Pancreatic Disease, Zhejiang University School of Medicine First Affiliated Hospital, Hangzhou 310003, China; Zhejiang University Cancer Center, Zhejiang University, Hangzhou 310058, China
| | - Tao Zeng
- Biomedical big data center, the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China; Zhejiang Provincial Key Laboratory of Pancreatic Disease, Zhejiang University School of Medicine First Affiliated Hospital, Hangzhou 310003, China; Zhejiang University Cancer Center, Zhejiang University, Hangzhou 310058, China
| | - Jinyan Huang
- Biomedical big data center, the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China; Zhejiang Provincial Key Laboratory of Pancreatic Disease, Zhejiang University School of Medicine First Affiliated Hospital, Hangzhou 310003, China; Zhejiang University Cancer Center, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
2
|
Zhu W, Li W, Zhang H, Li L. Big data and artificial intelligence-aided crop breeding: Progress and prospects. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2024. [PMID: 39467106 DOI: 10.1111/jipb.13791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 08/25/2024] [Accepted: 09/10/2024] [Indexed: 10/30/2024]
Abstract
The past decade has witnessed rapid developments in gene discovery, biological big data (BBD), artificial intelligence (AI)-aided technologies, and molecular breeding. These advancements are expected to accelerate crop breeding under the pressure of increasing demands for food. Here, we first summarize current breeding methods and discuss the need for new ways to support breeding efforts. Then, we review how to combine BBD and AI technologies for genetic dissection, exploring functional genes, predicting regulatory elements and functional domains, and phenotypic prediction. Finally, we propose the concept of intelligent precision design breeding (IPDB) driven by AI technology and offer ideas about how to implement IPDB. We hope that IPDB will enhance the predictability, efficiency, and cost of crop breeding compared with current technologies. As an example of IPDB, we explore the possibilities offered by CropGPT, which combines biological techniques, bioinformatics, and breeding art from breeders, and presents an open, shareable, and cooperative breeding system. IPDB provides integrated services and communication platforms for biologists, bioinformatics experts, germplasm resource specialists, breeders, dealers, and farmers, and should be well suited for future breeding.
Collapse
Affiliation(s)
- Wanchao Zhu
- Key Laboratory of Biology and Genetic Improvement of Maize in Arid Area of Northwest Region, College of Agronomy, Northwest A&F University, Yangling, 712100, China
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Weifu Li
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
- Engineering Research Center of Intelligent Technology for Agriculture, Ministry of Education, Wuhan, 430070, China
| | - Hongwei Zhang
- State Key Laboratory of Crop Gene Resources and Breeding, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Lin Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
3
|
Yang G, Hu W, He L, Dou L. Nonlinear causal network learning via Granger causality based on extreme support vector regression. CHAOS (WOODBURY, N.Y.) 2024; 34:023127. [PMID: 38377295 DOI: 10.1063/5.0183537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 01/22/2024] [Indexed: 02/22/2024]
Abstract
For complex networked systems, based on the consideration of nonlinearity and causality, a novel general method of nonlinear causal network learning, termed extreme support vector regression Granger causality (ESVRGC), is proposed. The nonuniform time-delayed influence of the driving nodes on the target node is particularly considered. Then, the restricted model and the unrestricted model of Granger causality are, respectively, formulated based on extreme support vector regression, which uses the selected time-delayed components of system variables as the inputs of kernel functions. The nonlinear conditional Granger causality index is finally calculated to confirm the strength of a causal interaction. Generally, based on the simulation of a nonlinear vector autoregressive model and nonlinear discrete time-delayed dynamic systems, ESVRGC demonstrates better performance than other popular methods. Also, the validity and robustness of ESVRGC are also verified by the different cases of network types, sample sizes, noise intensities, and coupling strengths. Finally, the superiority of ESVRGC is successful verified by the experimental study on real benchmark datasets.
Collapse
Affiliation(s)
- Guanxue Yang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Weiwei Hu
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Lidong He
- School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Liya Dou
- Department of Automation, Beijing University of Chemical Technology, Beijing 100029, China
| |
Collapse
|
4
|
Gao Z, Tang J, Xia J, Zheng CH, Wei PJ. CNNGRN: A Convolutional Neural Network-Based Method for Gene Regulatory Network Inference From Bulk Time-Series Expression Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2853-2861. [PMID: 37267145 DOI: 10.1109/tcbb.2023.3282212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Gene regulatory networks (GRNs) participate in many biological processes, and reconstructing them plays an important role in systems biology. Although many advanced methods have been proposed for GRN reconstruction, their predictive performance is far from the ideal standard, so it is urgent to design a more effective method to reconstruct GRN. Moreover, most methods only consider the gene expression data, ignoring the network structure information contained in GRN. In this study, we propose a supervised model named CNNGRN, which infers GRN from bulk time-series expression data via convolutional neural network (CNN) model, with a more informative feature. Bulk time series gene expression data imply the intricate regulatory associations between genes, and the network structure feature of ground-truth GRN contains rich neighbor information. Hence, CNNGRN integrates the above two features as model inputs. In addition, CNN is adopted to extract intricate features of genes and infer the potential associations between regulators and target genes. Moreover, feature importance visualization experiments are implemented to seek the key features. Experimental results show that CNNGRN achieved competitive performance on benchmark datasets compared to the state-of-the-art computational methods. Finally, hub genes identified based on CNNGRN have been confirmed to be involved in biological processes through literature.
Collapse
|
5
|
Marku M, Pancaldi V. From time-series transcriptomics to gene regulatory networks: A review on inference methods. PLoS Comput Biol 2023; 19:e1011254. [PMID: 37561790 PMCID: PMC10414591 DOI: 10.1371/journal.pcbi.1011254] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023] Open
Abstract
Inference of gene regulatory networks has been an active area of research for around 20 years, leading to the development of sophisticated inference algorithms based on a variety of assumptions and approaches. With the ever increasing demand for more accurate and powerful models, the inference problem remains of broad scientific interest. The abstract representation of biological systems through gene regulatory networks represents a powerful method to study such systems, encoding different amounts and types of information. In this review, we summarize the different types of inference algorithms specifically based on time-series transcriptomics, giving an overview of the main applications of gene regulatory networks in computational biology. This review is intended to give an updated reference of regulatory networks inference tools to biologists and researchers new to the topic and guide them in selecting the appropriate inference method that best fits their questions, aims, and experimental data.
Collapse
Affiliation(s)
- Malvina Marku
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| | - Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
- Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
6
|
Yan J, Wang X. Machine learning bridges omics sciences and plant breeding. TRENDS IN PLANT SCIENCE 2023; 28:199-210. [PMID: 36153276 DOI: 10.1016/j.tplants.2022.08.018] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 08/15/2022] [Accepted: 08/23/2022] [Indexed: 06/16/2023]
Abstract
Some of the biological knowledge obtained from fundamental research will be implemented in applied plant breeding. To bridge basic research and breeding practice, machine learning (ML) holds great promise to translate biological knowledge and omics data into precision-designed plant breeding. Here, we review ML for multi-omics analysis in plants, including data dimensionality reduction, inference of gene-regulation networks, and gene discovery and prioritization. These applications will facilitate understanding trait regulation mechanisms and identifying target genes potentially applicable to knowledge-driven molecular design breeding. We also highlight applications of deep learning in plant phenomics and ML in genomic selection-assisted breeding, such as various ML algorithms that model the correlations among genotypes (genes), phenotypes (traits), and environments, to ultimately achieve data-driven genomic design breeding.
Collapse
Affiliation(s)
- Jun Yan
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China
| | - Xiangfeng Wang
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China.
| |
Collapse
|
7
|
Liu Q, Li J, Dong M, Liu M, Chai Y. Identification of Gene Regulatory Networks Using Variational Bayesian Inference in the Presence of Missing Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:399-409. [PMID: 35061589 DOI: 10.1109/tcbb.2022.3144418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The identification of gene regulatory networks (GRN) from gene expression time series data is a challenge and open problem in system biology. This paper considers the structure inference of GRN from the incomplete and noisy gene expression data, which is a not well-studied issue for GRN inference. In this paper, the dynamical behavior of the gene expression process is described by a stochastic nonlinear state-space model with unknown noise information. A variational Bayesian (VB) framework are proposed to estimate the parameters and gene expression levels simultaneously. One of the advantages of this method is that it can easily handle the missing observations by generating the prediction values. Considering the sparsity of GRN, the smoothed gene data are modeled by the extreme gradient boosting tree, and the regulatory interactions among genes are identified by the importance scores based on the tree model. The proposed method is tested on the artificial DREAM4 datasets and one real gene expression dataset of yeast. The comparative results show that the proposed method can effectively recover the regulatory interactions of GRN in the presence of missing observations and outperforms the existing methods for GRN identification.
Collapse
|
8
|
Lei J, Cai Z, He X, Zheng W, Liu J. An approach of gene regulatory network construction using mixed entropy optimizing context-related likelihood mutual information. Bioinformatics 2022; 39:6808612. [PMID: 36342190 PMCID: PMC9805593 DOI: 10.1093/bioinformatics/btac717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 09/18/2022] [Accepted: 11/04/2022] [Indexed: 11/09/2022] Open
Abstract
MOTIVATION The question of how to construct gene regulatory networks has long been a focus of biological research. Mutual information can be used to measure nonlinear relationships, and it has been widely used in the construction of gene regulatory networks. However, this method cannot measure indirect regulatory relationships under the influence of multiple genes, which reduces the accuracy of inferring gene regulatory networks. APPROACH This work proposes a method for constructing gene regulatory networks based on mixed entropy optimizing context-related likelihood mutual information (MEOMI). First, two entropy estimators were combined to calculate the mutual information between genes. Then, distribution optimization was performed using a context-related likelihood algorithm to eliminate some indirect regulatory relationships and obtain the initial gene regulatory network. To obtain the complex interaction between genes and eliminate redundant edges in the network, the initial gene regulatory network was further optimized by calculating the conditional mutual inclusive information (CMI2) between gene pairs under the influence of multiple genes. The network was iteratively updated to reduce the impact of mutual information on the overestimation of the direct regulatory intensity. RESULTS The experimental results show that the MEOMI method performed better than several other kinds of gene network construction methods on DREAM challenge simulated datasets (DREAM3 and DREAM5), three real Escherichia coli datasets (E.coli SOS pathway network, E.coli SOS DNA repair network and E.coli community network) and two human datasets. AVAILABILITY AND IMPLEMENTATION Source code and dataset are available at https://github.com/Dalei-Dalei/MEOMI/ and http://122.205.95.139/MEOMI/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jimeng Lei
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China,Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan 430070, China,College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Zongheng Cai
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China,Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan 430070, China,College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Xinyi He
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wanting Zheng
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | | |
Collapse
|
9
|
Chen G, Liu ZP. Inferring causal gene regulatory network via GreyNet: From dynamic grey association to causation. Front Bioeng Biotechnol 2022; 10:954610. [PMID: 36237217 PMCID: PMC9551017 DOI: 10.3389/fbioe.2022.954610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 08/15/2022] [Indexed: 11/23/2022] Open
Abstract
Gene regulatory network (GRN) provides abundant information on gene interactions, which contributes to demonstrating pathology, predicting clinical outcomes, and identifying drug targets. Existing high-throughput experiments provide rich time-series gene expression data to reconstruct the GRN to further gain insights into the mechanism of organisms responding to external stimuli. Numerous machine-learning methods have been proposed to infer gene regulatory networks. Nevertheless, machine learning, especially deep learning, is generally a “black box,” which lacks interpretability. The causality has not been well recognized in GRN inference procedures. In this article, we introduce grey theory integrated with the adaptive sliding window technique to flexibly capture instant gene–gene interactions in the uncertain regulatory system. Then, we incorporate generalized multivariate Granger causality regression methods to transform the dynamic grey association into causation to generate directional regulatory links. We evaluate our model on the DREAM4 in silico benchmark dataset and real-world hepatocellular carcinoma (HCC) time-series data. We achieved competitive results on the DREAM4 compared with other state-of-the-art algorithms and gained meaningful GRN structure on HCC data respectively.
Collapse
Affiliation(s)
- Guangyi Chen
- Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
| | - Zhi-Ping Liu
- Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
- Center for Intelligent Medicine, Shandong University, Jinan, Shandong, China
- *Correspondence: Zhi-Ping Liu,
| |
Collapse
|
10
|
Liu X, Shi N, Wang Y, Ji Z, He S. Data-Driven Boolean Network Inference Using a Genetic Algorithm With Marker-Based Encoding. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1558-1569. [PMID: 33513105 DOI: 10.1109/tcbb.2021.3055646] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The inference of Boolean networks is crucial for analyzing the topology and dynamics of gene regulatory networks. Many data-driven approaches using evolutionary algorithms have been proposed based on time-series data. However, the ability to infer both network topology and dynamics is restricted by their inflexible encoding schemes. To address this problem, we propose a novel Boolean network inference algorithm for inferring both network topology and dynamics simultaneously. The main idea is that, we use a marker-based genetic algorithm to encode both regulatory nodes and logical operators in a chromosome. By using the markers and introducing more logical operators, the proposed algorithm can infer more diverse candidate Boolean functions. The proposed algorithm is applied to five networks, including two artificial Boolean networks and three real-world gene regulatory networks. Compared with other algorithms, the experimental results demonstrate that our proposed algorithm infers more accurate topology and dynamics.
Collapse
|
11
|
Shen L, Lee S, Joo JC, Hong E, Cui ZY, Jo E, Park SJ, Jang HJ. Chelidonium majus Induces Apoptosis of Human Ovarian Cancer Cells via ATF3-Mediated Regulation of Foxo3a by Tip60. J Microbiol Biotechnol 2022; 32:493-503. [PMID: 35283423 PMCID: PMC9628819 DOI: 10.4014/jmb.2109.09030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 01/16/2022] [Accepted: 02/14/2022] [Indexed: 12/15/2022]
Abstract
Forkhead transcription factor 3a (Foxo3a) is believed to be a tumor suppressor as its inactivation leads to cell transformation and tumor development. However, further investigation is required regarding the involvement of the activating transcription factor 3 (ATF3)-mediated Tat-interactive protein 60 (Tip60)/Foxo3a pathway in cancer cell apoptosis. This study demonstrated that Chelidonium majus upregulated the expression of ATF3 and Tip60 and promoted Foxo3a nuclear translocation, ultimately increasing the level of Bcl-2-associated X protein (Bax) protein. ATF3 overexpression stimulated Tip60 expression, while ATF3 inhibition by siRNA repressed Tip60 expression. Furthermore, siRNA-mediated Tip60 inhibition significantly promoted Foxo3a phosphorylation, leading to blockade of Foxo3a translocation into the nucleus. Thus, we were able to deduce that ATF3 mediates the regulation of Foxo3a by Tip60. Moreover, siRNA-mediated Foxo3a inhibition suppressed the expression of Bax and subsequent apoptosis. Taken together, our data demonstrate that Chelidonium majus induces SKOV-3 cell death by increasing ATF3 levels and its downstream proteins Tip60 and Foxo3a. This suggests a potential therapeutic role of Chelidonium majus against ovarian cancer.
Collapse
Affiliation(s)
- Lei Shen
- Aerospace Center Hospital, Beijing 100049, P.R. China
| | - Soon Lee
- Division of Analytical Science, Korea Basic Science Institute, Daejeon 34133, Republic of Korea,Division of Analytical Science, University of Science and Technology, Daejeon 34113, Republic of Korea
| | - Jong Cheon Joo
- Department of Sasang Constitutional Medicine, College of Korean Medicine, Wonkwang University, Iksan 54538, Republic of Korea
| | - Eunmi Hong
- Division of Analytical Science, Korea Basic Science Institute, Daejeon 34133, Republic of Korea
| | - Zhen Yang Cui
- Rehabilitation Medicine College, Weifang Medical University, Weifang 261042, P.R. China
| | - Eunbi Jo
- Department of Life Science and Research Institute for Natural Sciences, College of Natural Sciences, Hanyang University, Seoul 04763, Republic of Korea
| | - Soo Jung Park
- Department of Sasang Constitutional Medicine, College of Korean Medicine, Woosuk University, Jeonju 54987, Republic of Korea,
S.J. Park Phone: +82-63-220-8676 E-mail:
| | - Hyun-Jin Jang
- Laboratory of Chemical Biology and Genomics, Korea Research Institute of Bioscience and Biotechnology, Daejeon 34141, Republic of Korea,Corresponding authors H.J. Jang Phone: +42-860-4563 E-mail:
| |
Collapse
|
12
|
Abstract
Cancer is a genetic disease in which multiple genes are perturbed. Thus, information about the regulatory relationships between genes is necessary for the identification of biomarkers and therapeutic targets. In this review, methods for inference of gene regulatory networks (GRNs) from transcriptomics data that are used in cancer research are introduced. The methods are classified into three categories according to the analysis model. The first category includes methods that use pair-wise measures between genes, including correlation coefficient and mutual information. The second category includes methods that determine the genetic regulatory relationship using multivariate measures, which consider the expression profiles of all genes concurrently. The third category includes methods using supervised and integrative approaches. The supervised approach estimates the regulatory relationship using a supervised learning method that constructs a regression or classification model for predicting whether there is a regulatory relationship between genes with input data of gene expression profiles and class labels of prior biological knowledge. The integrative method is an expansion of the supervised method and uses more data and biological knowledge for predicting the regulatory relationship. Furthermore, simulation and experimental validation of the estimated GRNs are also discussed in this review. This review identified that most GRN inference methods are not specific for cancer transcriptome data, and such methods are required for better understanding of cancer pathophysiology. In addition, more systematic methods for validation of the estimated GRNs need to be developed in the context of cancer biology.
Collapse
|
13
|
Pirgazi J, Olyaee MH, Khanteymoori A. KFGRNI: A robust method to inference gene regulatory network from time-course gene data based on ensemble Kalman filter. J Bioinform Comput Biol 2021; 19:2150002. [PMID: 33657986 DOI: 10.1142/s0219720021500025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
A central problem of systems biology is the reconstruction of Gene Regulatory Networks (GRNs) by the use of time series data. Although many attempts have been made to design an efficient method for GRN inference, providing a best solution is still a challenging task. Existing noise, low number of samples, and high number of nodes are the main reasons causing poor performance of existing methods. The present study applies the ensemble Kalman filter algorithm to model a GRN from gene time series data. The inference of a GRN is decomposed with p genes into p subproblems. In each subproblem, the ensemble Kalman filter algorithm identifies the weight of interactions for each target gene. With the use of the ensemble Kalman filter, the expression pattern of the target gene is predicted from the expression patterns of all the remaining genes. The proposed method is compared with several well-known approaches. The results of the evaluation indicate that the proposed method improves inference accuracy and demonstrates better regulatory relations with noisy data.
Collapse
Affiliation(s)
- Jamshid Pirgazi
- Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran Behshahr, Iran
| | - Mohammad Hossein Olyaee
- Department of Computer Engineering, Engineering Faculty, University of Gonabad, Gonabad, Iran
| | - Alireza Khanteymoori
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Germany.,Department of Computer Engineering, Engineering Faculty, University of Zanjan Zanjan Province, Iran
| |
Collapse
|
14
|
Causal Network Inference for Neural Ensemble Activity. Neuroinformatics 2021; 19:515-527. [PMID: 33393054 PMCID: PMC8233245 DOI: 10.1007/s12021-020-09505-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/03/2020] [Indexed: 11/11/2022]
Abstract
Interactions among cellular components forming a mesoscopic scale brain network (microcircuit) display characteristic neural dynamics. Analysis of microcircuits provides a system-level understanding of the neurobiology of health and disease. Causal discovery aims to detect causal relationships among variables based on observational data. A key barrier in causal discovery is the high dimensionality of the variable space. A method called Causal Inference for Microcircuits (CAIM) is proposed to reconstruct causal networks from calcium imaging or electrophysiology time series. CAIM combines neural recording, Bayesian network modeling, and neuron clustering. Validation experiments based on simulated data and a real-world reaching task dataset demonstrated that CAIM accurately revealed causal relationships among neural clusters.
Collapse
|
15
|
Jang HJ, Yang JH, Hong E, Jo E, Lee S, Lee S, Choi JS, Yoo HS, Kang H. Chelidonine Induces Apoptosis via GADD45a-p53 Regulation in Human Pancreatic Cancer Cells. Integr Cancer Ther 2021; 20:15347354211006191. [PMID: 33884928 PMCID: PMC8077490 DOI: 10.1177/15347354211006191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Chelidonium majus has been used as a traditional medicine in China and western countries for various diseases, including inflammation and cancer. However, the anti-cancer effect of chelidonine, a major compound of C. majus extracts, on pancreatic cancer remains poorly understood. In this study, we found that treatment with chelidonine inhibited proliferation of BxPC-3 and MIA PaCa-2 human pancreatic cancer cells. Annexin-V/propidium iodide staining assay showed that this growth inhibitory effect of chelidonine was induced through apoptosis. We found that chelidonine treatment upregulated mRNA levels and transcription factor activity in both cell lines. Increases in protein expression levels of p53, GADD45A, p21 and cleaved caspase-3 were also observed, with more distinct changes in MIA PaCa-2 cells compared to the BxPC-3 cells. These results suggest that chelidonine induces pancreatic cancer apoptosis through the p53 and GADD45A pathways. Our findings provide new insights into the use of chelidonine for the treatment of pancreatic cancer.
Collapse
Affiliation(s)
- Hyun-Jin Jang
- Korea Basic Science Institute, Daejeon,
Republic of Korea
- Sungkyunkwan University, Suwon,
Republic of Korea
| | - Jae Ho Yang
- Daejeon Korean Medicine Hospital of
Daejeon University, Seoul, Republic of Korea
| | - Eunmi Hong
- Korea Basic Science Institute, Daejeon,
Republic of Korea
| | - Eunbi Jo
- Korea Basic Science Institute, Daejeon,
Republic of Korea
- Hanyang University, Seoul, Republic of
Korea
| | - Soon Lee
- Korea Basic Science Institute, Daejeon,
Republic of Korea
- University of Science and Technology,
Daejeon, Republic of Korea
| | - Sanghun Lee
- Korea Institute of Oriental Medicine,
Daejeon, Republic of Korea
| | - Jong Soon Choi
- Korea Basic Science Institute, Daejeon,
Republic of Korea
| | - Hwa Seung Yoo
- Daejeon Korean Medicine Hospital of
Daejeon University, Seoul, Republic of Korea
- Hwa Seung Yoo, East West Cancer Center,
Seoul Korean Medicine Hospital of Daejeon University, Seoul 05836, Rep. of
Korea.
| | - Hyuno Kang
- Korea Basic Science Institute, Daejeon,
Republic of Korea
- Hyuno Kang, Division of Analytical Science,
Korea Basic Science Institute, 169-148, Gwahak-ro, Yuseong-gu, Daejeon 34133,
Republic of Korea.
| |
Collapse
|
16
|
Barman S, Kwon YK. A neuro-evolution approach to infer a Boolean network from time-series gene expressions. Bioinformatics 2020; 36:i762-i769. [PMID: 33381823 DOI: 10.1093/bioinformatics/btaa840] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/14/2020] [Indexed: 11/14/2022] Open
Abstract
SUMMARY In systems biology, it is challenging to accurately infer a regulatory network from time-series gene expression data, and a variety of methods have been proposed. Most of them were computationally inefficient in inferring very large networks, though, because of the increasing number of candidate regulatory genes. Although a recent approach called GABNI (genetic algorithm-based Boolean network inference) was presented to resolve this problem using a genetic algorithm, there is room for performance improvement because it employed a limited representation model of regulatory functions.In this regard, we devised a novel genetic algorithm combined with a neural network for the Boolean network inference, where a neural network is used to represent the regulatory function instead of an incomplete Boolean truth table used in the GABNI. In addition, our new method extended the range of the time-step lag parameter value between the regulatory and the target genes for more flexible representation of the regulatory function. Extensive simulations with the gene expression datasets of the artificial and real networks were conducted to compare our method with five well-known existing methods including GABNI. Our proposed method significantly outperformed them in terms of both structural and dynamics accuracy. CONCLUSION Our method can be a promising tool to infer a large-scale Boolean regulatory network from time-series gene expression data. AVAILABILITY AND IMPLEMENTATION The source code is freely available at https://github.com/kwon-uou/NNBNI. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shohag Barman
- Department of Computer Science, American International University-Bangladesh (AIUB), Dhaka 1229, Bangladesh
| | - Yung-Keun Kwon
- School of IT Convergence, University of Ulsan, Ulsan 44610, Republic of Korea
| |
Collapse
|
17
|
Ma B, Fang M, Jiao X. Inference of gene regulatory networks based on nonlinear ordinary differential equations. Bioinformatics 2020; 36:4885-4893. [DOI: 10.1093/bioinformatics/btaa032] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 12/30/2019] [Accepted: 01/15/2020] [Indexed: 01/05/2023] Open
Abstract
Abstract
Motivation
Gene regulatory networks (GRNs) capture the regulatory interactions between genes, resulting from the fundamental biological process of transcription and translation. In some cases, the topology of GRNs is not known, and has to be inferred from gene expression data. Most of the existing GRNs reconstruction algorithms are either applied to time-series data or steady-state data. Although time-series data include more information about the system dynamics, steady-state data imply stability of the underlying regulatory networks.
Results
In this article, we propose a method for inferring GRNs from time-series and steady-state data jointly. We make use of a non-linear ordinary differential equations framework to model dynamic gene regulation and an importance measurement strategy to infer all putative regulatory links efficiently. The proposed method is evaluated extensively on the artificial DREAM4 dataset and two real gene expression datasets of yeast and Escherichia coli. Based on public benchmark datasets, the proposed method outperforms other popular inference algorithms in terms of overall score. By comparing the performance on the datasets with different scales, the results show that our method still keeps good robustness and accuracy at a low computational complexity.
Availability and implementation
The proposed method is written in the Python language, and is available at: https://github.com/lab319/GRNs_nonlinear_ODEs
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Baoshan Ma
- College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - Mingkun Fang
- College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - Xiangtian Jiao
- College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
| |
Collapse
|
18
|
Yang M, Petralia F, Li Z, Li H, Ma W, Song X, Kim S, Lee H, Yu H, Lee B, Bae S, Heo E, Kaczmarczyk J, Stępniak P, Warchoł M, Yu T, Calinawan AP, Boutros PC, Payne SH, Reva B, Boja E, Rodriguez H, Stolovitzky G, Guan Y, Kang J, Wang P, Fenyö D, Saez-Rodriguez J, Aderinwale T, Afyounian E, Agrawal P, Ali M, Amadoz A, Azuaje F, Bachman J, Bae S, Bhalla S, Carbonell-Caballero J, Chakraborty P, Chaudhary K, Choi Y, Choi Y, Çubuk C, Dhanda SK, Dopazo J, Elo LL, Fóthi Á, Gevaert O, Granberg K, Greiner R, Heo E, Hidalgo MR, Jayaswal V, Jeon H, Jeon M, Kalmady SV, Kambara Y, Kang J, Kang K, Kaoma T, Kaur H, Kazan H, Kesar D, Kesseli J, Kim D, Kim K, Kim SY, Kim S, Kumar S, Lee B, Lee H, Liu Y, Luethy R, Mahajan S, Mahmoudian M, Muller A, Nazarov PV, Nguyen H, Nykter M, Okuda S, Park S, Pal Singh Raghava G, Rajapakse JC, Rantapero T, Ryu H, Salavert F, Saraei S, Sharma R, Siitonen A, Sokolov A, Subramanian K, Suni V, Suomi T, Tranchevent LC, Usmani SS, Välikangas T, Vega R, Zhong H. Community Assessment of the Predictability of Cancer Protein and Phosphoprotein Levels from Genomics and Transcriptomics. Cell Syst 2020; 11:186-195.e9. [PMID: 32710834 DOI: 10.1016/j.cels.2020.06.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 03/12/2020] [Accepted: 06/29/2020] [Indexed: 10/23/2022]
|
19
|
Jang HJ, Yang KE, Oh WK, Lee SI, Hwang IH, Ban KT, Yoo HS, Choi JS, Yeo EJ, Jang IS. Nectandrin B-mediated activation of the AMPK pathway prevents cellular senescence in human diploid fibroblasts by reducing intracellular ROS levels. Aging (Albany NY) 2020; 11:3731-3749. [PMID: 31199782 PMCID: PMC6594796 DOI: 10.18632/aging.102013] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 05/31/2019] [Indexed: 12/13/2022]
Abstract
Nectandrin B (NecB) is a bioactive lignan compound isolated from Myristica fragrans (nutmeg), which functions as an activator of AMP-activated protein kinase (AMPK). Because we recently found that treatment with NecB increased the cell viability of old human diploid fibroblasts (HDFs), the underlying molecular mechanism was investigated. NecB treatment in old HDFs reduced the activity staining of senescence-associated β-galactosidase and the levels of senescence markers, such as the Ser15 phosphorylated p53, caveolin-1, p21waf1, p16ink4a, p27kip1, and cyclin D1. NecB treatment increased that in S phase, indicating a enhancement of cell cycle entry. Interestingly, NecB treatment ameliorated age-dependent activation of AMPK in old HDFs. Moreover, NecB reversed the age-dependent expression and/or activity changes of certain sirtuins (SIRT1-5), and cell survival/death-related proteins. The transcriptional activity of Yin-Yang 1 and the expression of downstream proteins were elevated in NecB-treated old HDFs. In addition, NecB treatment exerted a radical scavenging effect in vitro, reduced cellular ROS levels, and increased antioxidant enzymes in old HDFs. Moreover, NecB-mediated activation of the AMPK pathway reduced intracellular ROS levels. These results suggest that NecB-induced protection against cellular senescence is mediated by ROS-scavenging through activation of AMPK. NecB might be useful in ameliorating age-related diseases and extending human lifespan.
Collapse
Affiliation(s)
- Hyun-Jin Jang
- Drug & Disease Target Group, Division of Bioconvergence Analysis, Korea Basic Science Institute, Daejeon 305-333, Republic of Korea.,Department of Biological Sciences, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Kyeong Eun Yang
- Drug & Disease Target Group, Division of Bioconvergence Analysis, Korea Basic Science Institute, Daejeon 305-333, Republic of Korea
| | - Won Keun Oh
- Korea Bioactive Natural Material Bank, College of Pharmacy, Seoul National University, Seoul 08826, Republic of Korea
| | - Song-I Lee
- Department of Health Sciences and Technology, GAIHST, Gachon University, Incheon 21999, Republic of Korea
| | - In-Hu Hwang
- Neuroscience Research Institute, Korea University College of Medicine, Seoul 136-705, Republic of Korea
| | - Kyung-Tae Ban
- East-West Cancer Center, Daejeon University, Daejeon, 302-120, Republic of Korea
| | - Hwa-Seung Yoo
- East-West Cancer Center, Daejeon University, Daejeon, 302-120, Republic of Korea
| | - Jong-Soon Choi
- Drug & Disease Target Group, Division of Bioconvergence Analysis, Korea Basic Science Institute, Daejeon 305-333, Republic of Korea
| | - Eui-Ju Yeo
- Department of Health Sciences and Technology, GAIHST, Gachon University, Incheon 21999, Republic of Korea.,Department of Biochemistry, College of Medicine, Gachon University, Incheon 21999, Republic of Korea
| | - Ik-Soon Jang
- Drug & Disease Target Group, Division of Bioconvergence Analysis, Korea Basic Science Institute, Daejeon 305-333, Republic of Korea.,Division of Analytical Science, University of Science and Technology, Daejeon 34113, Republic of Korea
| |
Collapse
|
20
|
Yamada TG, Hiki Y, Hiroi NF, Shagimardanova E, Gusev O, Cornette R, Kikawada T, Funahashi A. Identification of a master transcription factor and a regulatory mechanism for desiccation tolerance in the anhydrobiotic cell line Pv11. PLoS One 2020; 15:e0230218. [PMID: 32191739 PMCID: PMC7082025 DOI: 10.1371/journal.pone.0230218] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 02/24/2020] [Indexed: 01/10/2023] Open
Abstract
Water is essential for living organisms. Terrestrial organisms are incessantly exposed to the stress of losing water, desiccation stress. Avoiding the mortality caused by desiccation stress, many organisms acquired molecular mechanisms to tolerate desiccation. Larvae of the African midge, Polypedilum vanderplanki, and its embryonic cell line Pv11 tolerate desiccation stress by entering an ametabolic state, anhydrobiosis, and return to active life after rehydration. The genes related to desiccation tolerance have been comprehensively analyzed, but transcriptional regulatory mechanisms to induce these genes after desiccation or rehydration remain unclear. Here, we comprehensively analyzed the gene regulatory network in Pv11 cells and compared it with that of Drosophila melanogaster, a desiccation sensitive species. We demonstrated that nuclear transcription factor Y subunit gamma-like, which is important for drought stress tolerance in plants, and its transcriptional regulation of downstream positive feedback loops have a pivotal role in regulating various anhydrobiosis-related genes. This study provides an initial insight into the systemic mechanism of desiccation tolerance.
Collapse
Affiliation(s)
- Takahiro G. Yamada
- Department of Biosciences and Informatics, Keio University, Yokohama, Kanagawa, Japan
| | - Yusuke Hiki
- Department of Biosciences and Informatics, Keio University, Yokohama, Kanagawa, Japan
| | - Noriko F. Hiroi
- Faculty of Pharmaceutical Science, Sanyo-Onoda City University, Sanyo-Onoda, Yamaguchi, Japan
| | | | - Oleg Gusev
- Kazan Federal University, Kazan, Russia
- RIKEN Cluster for Science, Technology and Innovation Hub, RIKEN, Yokohama, Kanagawa, Japan
| | - Richard Cornette
- Institute of Agrobiological Sciences, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki, Japan
| | - Takahiro Kikawada
- Institute of Agrobiological Sciences, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki, Japan
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
- * E-mail: (TK); (AF)
| | - Akira Funahashi
- Department of Biosciences and Informatics, Keio University, Yokohama, Kanagawa, Japan
- * E-mail: (TK); (AF)
| |
Collapse
|
21
|
Turki T, Taguchi YH. SCGRNs: Novel supervised inference of single-cell gene regulatory networks of complex diseases. Comput Biol Med 2020; 118:103656. [PMID: 32174324 DOI: 10.1016/j.compbiomed.2020.103656] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 02/06/2020] [Accepted: 02/07/2020] [Indexed: 12/19/2022]
|
22
|
Iacobas DA, Iacobas S, Lee PR, Cohen JE, Fields RD. Coordinated Activity of Transcriptional Networks Responding to the Pattern of Action Potential Firing in Neurons. Genes (Basel) 2019; 10:genes10100754. [PMID: 31561430 PMCID: PMC6826514 DOI: 10.3390/genes10100754] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Revised: 09/17/2019] [Accepted: 09/20/2019] [Indexed: 12/13/2022] Open
Abstract
Transcriptional responses to the appropriate temporal pattern of action potential firing are essential for long-term adaption of neuronal properties to the functional activity of neural circuits and environmental experience. However, standard transcriptome analysis methods can be too limited in identifying critical aspects that coordinate temporal coding of action potential firing with transcriptome response. A Pearson correlation analysis was applied to determine how pairs of genes in the mouse dorsal root ganglion (DRG) neurons are coordinately expressed in response to stimulation producing the same number of action potentials by two different temporal patterns. Analysis of 4728 distinct gene-pairs related to calcium signaling, 435,711 pairs of transcription factors, 820 pairs of voltage-gated ion channels, and 86,862 pairs of calcium signaling genes with transcription factors indicated that genes become coordinately activated by distinct action potential firing patterns and this depends on the duration of stimulation. Moreover, a measure of expression variance revealed that the control of transcripts abundances is sensitive to the pattern of stimulation. Thus, action potentials impact intracellular signaling and the transcriptome in dynamic manner that not only alter gene expression levels significantly (as previously reported) but also affects the control of their expression fluctuations and profoundly remodel the transcriptional networks.
Collapse
Affiliation(s)
- Dumitru A Iacobas
- Personalized Genomics Laboratory, Center for Computational Systems Biology, Prairie View A&M University, Prairie View, TX 77446, USA.
- DP Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY 10461, USA.
| | - Sanda Iacobas
- Department of Pathology, New York Medical College, Valhalla, NY 10595, USA.
| | - Philip R Lee
- Section on Nervous System Development and Plasticity, the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), NIH, Bethesda, MD 20892, USA.
| | - Jonathan E Cohen
- Division of Medical Imaging Products, U.S. Food and Drug Administration, Silver Spring, 20993 MD, USA.
| | - R Douglas Fields
- Section on Nervous System Development and Plasticity, the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), NIH, Bethesda, MD 20892, USA.
| |
Collapse
|
23
|
Ahn H, Jo K, Jeong D, Pak M, Hur J, Jung W, Kim S. PropaNet: Time-Varying Condition-Specific Transcriptional Network Construction by Network Propagation. FRONTIERS IN PLANT SCIENCE 2019; 10:698. [PMID: 31258543 PMCID: PMC6587906 DOI: 10.3389/fpls.2019.00698] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 05/09/2019] [Indexed: 06/09/2023]
Abstract
Transcription factor (TF) has a significant influence on the state of a cell by regulating multiple down-stream genes. Thus, experimental and computational biologists have made great efforts to construct TF gene networks for regulatory interactions between TFs and their target genes. Now, an important research question is how to utilize TF networks to investigate the response of a plant to stress at the transcription control level using time-series transcriptome data. In this article, we present a new computational network, PropaNet, to investigate dynamics of TF networks from time-series transcriptome data using two state-of-the-art network analysis techniques, influence maximization and network propagation. PropaNet uses the influence maximization technique to produce a ranked list of TFs, in the order of TF that explains differentially expressed genes (DEGs) better at each time point. Then, a network propagation technique is used to select a group of TFs that explains DEGs best as a whole. For the analysis of Arabidopsis time series datasets from AtGenExpress, we used PlantRegMap as a template TF network and performed PropaNet analysis to investigate transcriptional dynamics of Arabidopsis under cold and heat stress. The time varying TF networks showed that Arabidopsis responded to cold and heat stress quite differently. For cold stress, bHLH and bZIP type TFs were the first responding TFs and the cold signal influenced histone variants, various genes involved in cell architecture, osmosis and restructuring of cells. However, the consequences of plants under heat stress were up-regulation of genes related to accelerating differentiation and starting re-differentiation. In terms of energy metabolism, plants under heat stress show elevated metabolic process and resulting in an exhausted status. We believe that PropaNet will be useful for the construction of condition-specific time-varying TF network for time-series data analysis in response to stress. PropaNet is available at http://biohealth.snu.ac.kr/software/PropaNet.
Collapse
Affiliation(s)
- Hongryul Ahn
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
| | - Kyuri Jo
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
| | - Dabin Jeong
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
| | - Minwoo Pak
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| | - Jihye Hur
- Department of Crop Science, Konkuk University, Seoul, South Korea
| | - Woosuk Jung
- Department of Crop Science, Konkuk University, Seoul, South Korea
| | - Sun Kim
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| |
Collapse
|
24
|
Zhang W, Li W, Zhang J, Wang N. Data Integration of Hybrid Microarray and Single Cell Expression Data to Enhance Gene Network Inference. Curr Bioinform 2019. [DOI: 10.2174/1574893614666190104142228] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Background:
Gene Regulatory Network (GRN) inference algorithms aim to explore
casual interactions between genes and transcriptional factors. High-throughput transcriptomics
data including DNA microarray and single cell expression data contain complementary
information in network inference.
Objective:
To enhance GRN inference, data integration across various types of expression data
becomes an economic and efficient solution.
Method:
In this paper, a novel E-alpha integration rule-based ensemble inference algorithm is
proposed to merge complementary information from microarray and single cell expression data.
This paper implements a Gradient Boosting Tree (GBT) inference algorithm to compute
importance scores for candidate gene-gene pairs. The proposed E-alpha rule quantitatively
evaluates the credibility levels of each information source and determines the final ranked list.
Results:
Two groups of in silico gene networks are applied to illustrate the effectiveness of the
proposed E-alpha integration. Experimental outcomes with size50 and size100 in silico gene
networks suggest that the proposed E-alpha rule significantly improves performance metrics
compared with single information source.
Conclusion:
In GRN inference, the integration of hybrid expression data using E-alpha rule
provides a feasible and efficient way to enhance performance metrics than solely increasing
sample sizes.
Collapse
Affiliation(s)
- Wei Zhang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Wenchao Li
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Jianming Zhang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Ning Wang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| |
Collapse
|
25
|
Haque S, Ahmad JS, Clark NM, Williams CM, Sozzani R. Computational prediction of gene regulatory networks in plant growth and development. CURRENT OPINION IN PLANT BIOLOGY 2019; 47:96-105. [PMID: 30445315 DOI: 10.1016/j.pbi.2018.10.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/05/2018] [Accepted: 10/18/2018] [Indexed: 05/22/2023]
Abstract
Plants integrate a wide range of cellular, developmental, and environmental signals to regulate complex patterns of gene expression. Recent advances in genomic technologies enable differential gene expression analysis at a systems level, allowing for improved inference of the network of regulatory interactions between genes. These gene regulatory networks, or GRNs, are used to visualize the causal regulatory relationships between regulators and their downstream target genes. Accordingly, these GRNs can represent spatial, temporal, and/or environmental regulations and can identify functional genes. This review summarizes recent computational approaches applied to different types of gene expression data to infer GRNs in the context of plant growth and development. Three stages of GRN inference are described: first, data collection and analysis based on the dataset type; second, network inference application based on data availability and proposed hypotheses; and third, validation based on in silico, in vivo, and in planta methods. In addition, this review relates data collection strategies to biological questions, organizes inference algorithms based on statistical methods and data types, discusses experimental design considerations, and provides guidelines for GRN inference with an emphasis on the benefits of integrative approaches, especially when a priori information is limited. Finally, this review concludes that computational frameworks integrating large-scale heterogeneous datasets are needed for a more accurate (e.g. fewer false interactions), detailed (e.g. discrimination between direct versus indirect interactions), and comprehensive (e.g. genetic regulation under various conditions and spatial locations) inference of GRNs.
Collapse
Affiliation(s)
- Samiul Haque
- Electrical and Computer Engineering, North Carolina State University, Raleigh, USA
| | - Jabeen S Ahmad
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA
| | - Natalie M Clark
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA
| | - Cranos M Williams
- Electrical and Computer Engineering, North Carolina State University, Raleigh, USA.
| | - Rosangela Sozzani
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA.
| |
Collapse
|