1
|
Lin J, Zeng S, Chen Q, Liu G, Pan S, Liu X. Identification of disease-related genes in Plasmodium berghei by network module analysis. BMC Microbiol 2023; 23:264. [PMID: 37735351 PMCID: PMC10512555 DOI: 10.1186/s12866-023-03019-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 09/12/2023] [Indexed: 09/23/2023] Open
Abstract
BACKGROUND Plasmodium berghei has been used as a preferred model for studying human malaria, but only a limited number of disease-associated genes of P. berghei have been reported to date. Identification of new disease-related genes as many as possible will provide a landscape for better understanding the pathogenesis of P. berghei. METHODS Network module analysis method was developed and applied to identify disease-related genes in P. berghei genome. Sequence feature identification, gene ontology annotation, and T-cell epitope analysis were performed on these genes to illustrate their functions in the pathogenesis of P. berghei. RESULTS 33,314 genes were classified into 4,693 clusters. 4,127 genes shared by six malaria parasites were identified and are involved in many aspects of biological processes. Most of the known essential genes belong to shared genes. A total of 63 clusters consisting of 405 P. berghei genes were enriched in rodent malaria parasites. These genes participate in various stages of parasites such as liver stage development and immune evasion. Combination of these genes might be responsible for P. berghei infecting mice. Comparing with P. chabaudi, none of the clusters were specific to P. berghei. P. berghei lacks some proteins belonging to P. chabaudi and possesses some specific T-cell epitopes binding by class-I MHC, which might together contribute to the occurrence of experimental cerebral malaria (ECM). CONCLUSIONS We successfully identified disease-associated P. berghei genes by network module analysis. These results will deepen understanding of the pathogenesis of P. berghei and provide candidate parasite genes for further ECM investigation.
Collapse
Affiliation(s)
- Junhao Lin
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Shan Zeng
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Qiong Chen
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Guanghui Liu
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Suyue Pan
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China.
| | - Xuewu Liu
- Department of Neurology, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China.
| |
Collapse
|
2
|
Guil F, Hidalgo JF, García JM. Boosting the extraction of elementary flux modes in genome-scale metabolic networks using the linear programming approach. Bioinformatics 2020; 36:4163-4170. [PMID: 32348455 PMCID: PMC7390993 DOI: 10.1093/bioinformatics/btaa280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/16/2020] [Accepted: 04/22/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Elementary flux modes (EFMs) are a key tool for analyzing genome-scale metabolic networks, and several methods have been proposed to compute them. Among them, those based on solving linear programming (LP) problems are known to be very efficient if the main interest lies in computing large enough sets of EFMs. RESULTS Here, we propose a new method called EFM-Ta that boosts the efficiency rate by analyzing the information provided by the LP solver. We base our method on a further study of the final tableau of the simplex method. By performing additional elementary steps and avoiding trivial solutions consisting of two cycles, we obtain many more EFMs for each LP problem posed, improving the efficiency rate of previously proposed methods by more than one order of magnitude. AVAILABILITY AND IMPLEMENTATION Software is freely available at https://github.com/biogacop/Boost_LP_EFM. CONTACT fguil@um.es. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Francisco Guil
- Departamento de Ingeniería y Tecnología de Computadores, Universidad de Murcia, Murcia 30080, Spain
| | - José F Hidalgo
- Departamento de Ingeniería y Tecnología de Computadores, Universidad de Murcia, Murcia 30080, Spain
| | - José M García
- Departamento de Ingeniería y Tecnología de Computadores, Universidad de Murcia, Murcia 30080, Spain
| |
Collapse
|
3
|
Arabzadeh M, Sedighi M, Saheb Zamani M, Marashi SA. A system architecture for parallel analysis of flux-balanced metabolic pathways. Comput Biol Chem 2020; 88:107309. [PMID: 32650065 DOI: 10.1016/j.compbiolchem.2020.107309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 05/05/2020] [Accepted: 06/13/2020] [Indexed: 10/24/2022]
Abstract
Elementary flux mode (EFM) analysis is a well-studied method in constraint-based modeling of metabolic networks. In EFM analysis, a network is decomposed into minimal functional pathways based on the assumption of balanced metabolic fluxes. In this paper, a system architecture is proposed that approximately models the functionality of metabolic networks. The AND/OR graph model is used to represent the metabolic network and each processing element in the system emulates the functionality of a metabolite. The system is implemented on a graphics processing unit (GPU) as the hardware platform using CUDA environment. The proposed architecture takes advantage of the inherent parallelism in the network structure in terms of both pathway and metabolite traversal. The function of each element is defined such that it can find flux-balanced pathways. Pathways in both small and large metabolic networks are applied to the proposed architecture and the results are discussed.
Collapse
Affiliation(s)
- Mona Arabzadeh
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Mehdi Sedighi
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Morteza Saheb Zamani
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Sayed-Amir Marashi
- Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran.
| |
Collapse
|
4
|
Hu Z, Jiao R, Wang P, Zhu Y, Zhao J, De Jager P, Bennett DA, Jin L, Xiong M. Shared Causal Paths underlying Alzheimer's dementia and Type 2 Diabetes. Sci Rep 2020; 10:4107. [PMID: 32139775 PMCID: PMC7058072 DOI: 10.1038/s41598-020-60682-3] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 02/03/2020] [Indexed: 12/19/2022] Open
Abstract
Although Alzheimer's disease (AD) is a central nervous system disease and type 2 diabetes MELLITUS (T2DM) is a metabolic disorder, an increasing number of genetic epidemiological studies show clear link between AD and T2DM. The current approach to uncovering the shared pathways between AD and T2DM involves association analysis; however such analyses lack power to discover the mechanisms of the diseases. As an alternative, we developed novel causal inference methods for genetic studies of AD and T2DM and pipelines for systematic multi-omic casual analysis to infer multilevel omics causal networks for the discovery of common paths from genetic variants to AD and T2DM. The proposed pipelines were applied to 448 individuals from the ROSMAP Project. We identified 13 shared causal genes, 16 shared causal pathways between AD and T2DM, and 754 gene expression and 101 gene methylation nodes that were connected to both AD and T2DM in multi-omics causal networks.
Collapse
Affiliation(s)
- Zixin Hu
- State Key Laboratory of Genetic Engineering and Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Human Phenome Institute, Fudan University, Shanghai, China
| | - Rong Jiao
- Department of Biostatistics and Data Science, School of Public Health, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Panpan Wang
- State Key Laboratory of Genetic Engineering and Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yun Zhu
- Department of Epidemiology, University of Florida, Florida, USA
| | - Jinying Zhao
- Department of Epidemiology, University of Florida, Florida, USA
| | - Phil De Jager
- Center for Translational & Computational Neuroimmunology, Department of Neurology, Columbia University Medical Center, New York, 10033, USA
| | - David A Bennett
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, 60612, USA
| | - Li Jin
- State Key Laboratory of Genetic Engineering and Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Human Phenome Institute, Fudan University, Shanghai, China
| | - Momiao Xiong
- Department of Biostatistics and Data Science, School of Public Health, University of Texas Health Science Center at Houston, Houston, Texas, USA.
| |
Collapse
|
5
|
Song HS, Goldberg N, Mahajan A, Ramkrishna D. Sequential computation of elementary modes and minimal cut sets in genome-scale metabolic networks using alternate integer linear programming. Bioinformatics 2018; 33:2345-2353. [PMID: 28369193 DOI: 10.1093/bioinformatics/btx171] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 03/23/2017] [Indexed: 01/22/2023] Open
Abstract
Motivation Elementary (flux) modes (EMs) have served as a valuable tool for investigating structural and functional properties of metabolic networks. Identification of the full set of EMs in genome-scale networks remains challenging due to combinatorial explosion of EMs in complex networks. It is often, however, that only a small subset of relevant EMs needs to be known, for which optimization-based sequential computation is a useful alternative. Most of the currently available methods along this line are based on the iterative use of mixed integer linear programming (MILP), the effectiveness of which significantly deteriorates as the number of iterations builds up. To alleviate the computational burden associated with the MILP implementation, we here present a novel optimization algorithm termed alternate integer linear programming (AILP). Results Our algorithm was designed to iteratively solve a pair of integer programming (IP) and linear programming (LP) to compute EMs in a sequential manner. In each step, the IP identifies a minimal subset of reactions, the deletion of which disables all previously identified EMs. Thus, a subsequent LP solution subject to this reaction deletion constraint becomes a distinct EM. In cases where no feasible LP solution is available, IP-derived reaction deletion sets represent minimal cut sets (MCSs). Despite the additional computation of MCSs, AILP achieved significant time reduction in computing EMs by orders of magnitude. The proposed AILP algorithm not only offers a computational advantage in the EM analysis of genome-scale networks, but also improves the understanding of the linkage between EMs and MCSs. Availability and Implementation The software is implemented in Matlab, and is provided as supplementary information . Contact hyunseob.song@pnnl.gov. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hyun-Seob Song
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Noam Goldberg
- Department of Management, Bar-Ilan University, Ramat Gan 52900, Israel
| | - Ashutosh Mahajan
- Industrial Engineering and Operations Research, IIT Bombay, Powai, Mumbai 400076, India
| | | |
Collapse
|
6
|
Arabzadeh M, Saheb Zamani M, Sedighi M, Marashi SA. A graph-based approach to analyze flux-balanced pathways in metabolic networks. Biosystems 2018; 165:40-51. [PMID: 29337084 DOI: 10.1016/j.biosystems.2017.12.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Revised: 11/02/2017] [Accepted: 12/05/2017] [Indexed: 10/18/2022]
Abstract
An elementary flux mode (EFM) is a pathway with minimum set of reactions that are functional in steady-state constrained space. Due to the high computational complexity of calculating EFMs, different approaches have been proposed to find these flux-balanced pathways. In this paper, an approach to find a subset of EFMs is proposed based on a graph data model. The given metabolic network is mapped to the graph model and decisions for reaction inclusion can be made based on metabolites and their associated reactions. This notion makes the approach more convenient to categorize the output pathways. Implications of the proposed method on metabolic networks are discussed.
Collapse
Affiliation(s)
- Mona Arabzadeh
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Morteza Saheb Zamani
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Mehdi Sedighi
- Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran.
| | - Sayed-Amir Marashi
- Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran.
| |
Collapse
|
7
|
Abd Algfoor Z, Shahrizal Sunar M, Abdullah A, Kolivand H. Identification of metabolic pathways using pathfinding approaches: a systematic review. Brief Funct Genomics 2017; 16:87-98. [PMID: 26969656 DOI: 10.1093/bfgp/elw002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Metabolic pathways have become increasingly available for various microorganisms. Such pathways have spurred the development of a wide array of computational tools, in particular, mathematical pathfinding approaches. This article can facilitate the understanding of computational analysis of metabolic pathways in genomics. Moreover, stoichiometric and pathfinding approaches in metabolic pathway analysis are discussed. Three major types of studies are elaborated: stoichiometric identification models, pathway-based graph analysis and pathfinding approaches in cellular metabolism. Furthermore, evaluation of the outcomes of the pathways with mathematical benchmarking metrics is provided. This review would lead to better comprehension of metabolism behaviors in living cells, in terms of computed pathfinding approaches.
Collapse
Affiliation(s)
- Zeyad Abd Algfoor
- MaGIC-X (Media and Games Innovation Centre of Excellence), UTM-IRDA Digital Media Centre, Universiti Teknologi Malaysia, Johor Bahru, Malaysia
| | - Mohd Shahrizal Sunar
- MaGIC-X (Media and Games Innovation Centre of Excellence), UTM-IRDA Digital Media Centre, Universiti Teknologi Malaysia, Johor Bahru, Malaysia
| | - Afnizanfaizal Abdullah
- Boston University School of Medicine, Boston Medical Center, Boston, MA, USA.,Duke Global Health Institute, Duke University, Durham, NC, USA.,Global Health Program, Duke Kunshan University, Jiangsu, China
| | - Hoshang Kolivand
- Department of Computer Science, Liverpool John Moores University, Liverpool, UK
| |
Collapse
|
8
|
Wang R, Xu Y, Liu B. Recombination spot identification Based on gapped k-mers. Sci Rep 2016; 6:23934. [PMID: 27030570 PMCID: PMC4814916 DOI: 10.1038/srep23934] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 03/16/2016] [Indexed: 12/14/2022] Open
Abstract
Recombination is crucial for biological evolution, which provides many new combinations of genetic diversity. Accurate identification of recombination spots is useful for DNA function study. To improve the prediction accuracy, researchers have proposed several computational methods for recombination spot identification. The k-mer feature is one of the most useful features for modeling the properties and function of DNA sequences. However, it suffers from the inherent limitation. If the value of word length k is large, the occurrences of k-mers are closed to a binary variable, with a few k-mers present once and most k-mers are absent. This usually causes the sparse problem and reduces the classification accuracy. To solve this problem, we add gaps into k-mer and introduce a new feature called gapped k-mer (GKM) for identification of recombination spots. By using this feature, we present a new predictor called SVM-GKM, which combines the gapped k-mers and Support Vector Machine (SVM) for recombination spot identification. Experimental results on a widely used benchmark dataset show that SVM-GKM outperforms other highly related predictors. Therefore, SVM-GKM would be a powerful predictor for computational genomics.
Collapse
Affiliation(s)
- Rong Wang
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
| | - Yong Xu
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
| | - Bin Liu
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
| |
Collapse
|
9
|
Tobalina L, Pey J, Planes FJ. Direct calculation of minimal cut sets involving a specific reaction knock-out. Bioinformatics 2016; 32:2001-7. [PMID: 27153694 DOI: 10.1093/bioinformatics/btw072] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 02/02/2016] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION The concept of Minimal Cut Sets (MCSs) is used in metabolic network modeling to describe minimal groups of reactions or genes whose simultaneous deletion eliminates the capability of the network to perform a specific task. Previous work showed that MCSs where closely related to Elementary Flux Modes (EFMs) in a particular dual problem, opening up the possibility to use the tools developed for computing EFMs to compute MCSs. Until recently, however, there existed no method to compute an EFM with some specific characteristic, meaning that, in the case of MCSs, the only strategy to obtain them was to enumerate them using, for example, the standard K-shortest EFMs algorithm. RESULTS In this work, we adapt the recently developed theory to compute EFMs satisfying several constraints to the calculation of MCSs involving a specific reaction knock-out. Importantly, we emphasize that not all the EFMs in the dual problem correspond to real MCSs, and propose a new formulation capable of correctly identifying the MCS wanted. Furthermore, this formulation brings interesting insights about the relationship between the primal and the dual problem of the MCS computation. AVAILABILITY AND IMPLEMENTATION A Matlab-Cplex implementation of the proposed algorithm is available as a supplementary material CONTACT fplanes@ceit.es SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Luis Tobalina
- CEIT and Tecnun, University of Navarra, San Sebastián 20018, Spain
| | - Jon Pey
- CEIT and Tecnun, University of Navarra, San Sebastián 20018, Spain
| | | |
Collapse
|
10
|
Folch-Fortuny A, Marques R, Isidro IA, Oliveira R, Ferrer A. Principal elementary mode analysis (PEMA). MOLECULAR BIOSYSTEMS 2016; 12:737-46. [PMID: 26905301 DOI: 10.1039/c5mb00828j] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Principal component analysis (PCA) has been widely applied in fluxomics to compress data into a few latent structures in order to simplify the identification of metabolic patterns. These latent structures lack a direct biological interpretation due to the intrinsic constraints associated with a PCA model. Here we introduce a new method that significantly improves the interpretability of the principal components with a direct link to metabolic pathways. This method, called principal elementary mode analysis (PEMA), establishes a bridge between a PCA-like model, aimed at explaining the maximum variance in flux data, and the set of elementary modes (EMs) of a metabolic network. It provides an easy way to identify metabolic patterns in large fluxomics datasets in terms of the simplest pathways of the organism metabolism. The results using a real metabolic model of Escherichia coli show the ability of PEMA to identify the EMs that generated the different simulated flux distributions. Actual flux data of E. coli and Pichia pastoris cultures confirm the results observed in the simulated study, providing a biologically meaningful model to explain flux data of both organisms in terms of the EM activation. The PEMA toolbox is freely available for non-commercial purposes on http://mseg.webs.upv.es.
Collapse
Affiliation(s)
- Abel Folch-Fortuny
- Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, 46022 València, Spain.
| | | | | | | | | |
Collapse
|
11
|
Tagore S, De RK. Evolutionary growth of certain metabolic pathways involved in the functioning of GAD and INS genes in Type 1 Diabetes Mellitus: Their architecture and stability. Comput Biol Med 2015; 61:19-35. [DOI: 10.1016/j.compbiomed.2015.03.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 03/12/2015] [Accepted: 03/13/2015] [Indexed: 11/25/2022]
|