1
|
Shrestha AMS, Gonzales MEM, Ong PCL, Larmande P, Lee HS, Jeung JU, Kohli A, Chebotarov D, Mauleon RP, Lee JS, McNally KL. RicePilaf: a post-GWAS/QTL dashboard to integrate pangenomic, coexpression, regulatory, epigenomic, ontology, pathway, and text-mining information to provide functional insights into rice QTLs and GWAS loci. Gigascience 2024; 13:giae013. [PMID: 38832465 PMCID: PMC11148593 DOI: 10.1093/gigascience/giae013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Revised: 02/21/2024] [Accepted: 03/12/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND As the number of genome-wide association study (GWAS) and quantitative trait locus (QTL) mappings in rice continues to grow, so does the already long list of genomic loci associated with important agronomic traits. Typically, loci implicated by GWAS/QTL analysis contain tens to hundreds to thousands of single-nucleotide polmorphisms (SNPs)/genes, not all of which are causal and many of which are in noncoding regions. Unraveling the biological mechanisms that tie the GWAS regions and QTLs to the trait of interest is challenging, especially since it requires collating functional genomics information about the loci from multiple, disparate data sources. RESULTS We present RicePilaf, a web app for post-GWAS/QTL analysis, that performs a slew of novel bioinformatics analyses to cross-reference GWAS results and QTL mappings with a host of publicly available rice databases. In particular, it integrates (i) pangenomic information from high-quality genome builds of multiple rice varieties, (ii) coexpression information from genome-scale coexpression networks, (iii) ontology and pathway information, (iv) regulatory information from rice transcription factor databases, (v) epigenomic information from multiple high-throughput epigenetic experiments, and (vi) text-mining information extracted from scientific abstracts linking genes and traits. We demonstrate the utility of RicePilaf by applying it to analyze GWAS peaks of preharvest sprouting and genes underlying yield-under-drought QTLs. CONCLUSIONS RicePilaf enables rice scientists and breeders to shed functional light on their GWAS regions and QTLs, and it provides them with a means to prioritize SNPs/genes for further experiments. The source code, a Docker image, and a demo version of RicePilaf are publicly available at https://github.com/bioinfodlsu/rice-pilaf.
Collapse
Affiliation(s)
- Anish M S Shrestha
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, College of Computer Studies, De La Salle University, Manila 1004, Philippines
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| | - Mark Edward M Gonzales
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, College of Computer Studies, De La Salle University, Manila 1004, Philippines
| | - Phoebe Clare L Ong
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, College of Computer Studies, De La Salle University, Manila 1004, Philippines
| | - Pierre Larmande
- DIADE, Univ Montpellier, Cirad, IRD, 34394 Montpellier, France
| | - Hyun-Sook Lee
- National Institute of Crop Science, Wanju-gun 55365, Republic of Korea
| | - Ji-Ung Jeung
- National Institute of Crop Science, Wanju-gun 55365, Republic of Korea
| | - Ajay Kohli
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| | - Dmytro Chebotarov
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| | - Ramil P Mauleon
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| | - Jae-Sung Lee
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| | - Kenneth L McNally
- International Rice Research Institute (IRRI), Metro Manila 1301, Philippines
| |
Collapse
|
2
|
Loers JU, Vermeirssen V. SUBATOMIC: a SUbgraph BAsed mulTi-OMIcs clustering framework to analyze integrated multi-edge networks. BMC Bioinformatics 2022; 23:363. [PMID: 36064320 PMCID: PMC9442970 DOI: 10.1186/s12859-022-04908-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 08/24/2022] [Indexed: 11/02/2022] Open
Abstract
BACKGROUND Representing the complex interplay between different types of biomolecules across different omics layers in multi-omics networks bears great potential to gain a deep mechanistic understanding of gene regulation and disease. However, multi-omics networks easily grow into giant hairball structures that hamper biological interpretation. Module detection methods can decompose these networks into smaller interpretable modules. However, these methods are not adapted to deal with multi-omics data nor consider topological features. When deriving very large modules or ignoring the broader network context, interpretability remains limited. To address these issues, we developed a SUbgraph BAsed mulTi-OMIcs Clustering framework (SUBATOMIC), which infers small and interpretable modules with a specific topology while keeping track of connections to other modules and regulators. RESULTS SUBATOMIC groups specific molecular interactions in composite network subgraphs of two and three nodes and clusters them into topological modules. These are functionally annotated, visualized and overlaid with expression profiles to go from static to dynamic modules. To preserve the larger network context, SUBATOMIC investigates statistically the connections in between modules as well as between modules and regulators such as miRNAs and transcription factors. We applied SUBATOMIC to analyze a composite Homo sapiens network containing transcription factor-target gene, miRNA-target gene, protein-protein, homologous and co-functional interactions from different databases. We derived and annotated 5586 modules with diverse topological, functional and regulatory properties. We created novel functional hypotheses for unannotated genes. Furthermore, we integrated modules with condition specific expression data to study the influence of hypoxia in three cancer cell lines. We developed two prioritization strategies to identify the most relevant modules in specific biological contexts: one considering GO term enrichments and one calculating an activity score reflecting the degree of differential expression. Both strategies yielded modules specifically reacting to low oxygen levels. CONCLUSIONS We developed the SUBATOMIC framework that generates interpretable modules from integrated multi-omics networks and applied it to hypoxia in cancer. SUBATOMIC can infer and contextualize modules, explore condition or disease specific modules, identify regulators and functionally related modules, and derive novel gene functions for uncharacterized genes. The software is available at https://github.com/CBIGR/SUBATOMIC .
Collapse
Affiliation(s)
- Jens Uwe Loers
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Vanessa Vermeirssen
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium. .,Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium. .,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| |
Collapse
|
3
|
Wu G, Li X, Guo W, Wei Z, Hu T, Shan Y, Gu J. JEBIN: analyzing gene co-expressions across multiple datasets by joint network embedding. Brief Bioinform 2022; 23:6519533. [PMID: 35134135 DOI: 10.1093/bib/bbab603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 12/15/2021] [Accepted: 12/27/2021] [Indexed: 11/13/2022] Open
Abstract
The inference of gene co-expression associations is one of the fundamental tasks for large-scale transcriptomic data analysis. Due to the high dimensionality and high noises in transcriptomic data, it is difficult to infer stable gene co-expression associations from single dataset. Meta-analysis of multisource data can effectively tackle this problem. We proposed Joint Embedding of multiple BIpartite Networks (JEBIN) to learn the low-dimensional consensus representation for genes by integrating multiple expression datasets. JEBIN infers gene co-expression associations in a nonlinear and global similarity manner and can integrate datasets with different distributions in linear time complexity with the gene and total sample size. The effectiveness and scalability of JEBIN were verified by simulation experiments, and its superiority over the commonly used integration methods was proved by three indexes on real biological datasets. Then, JEBIN was applied to study the gene co-expression patterns of hepatocellular carcinoma (HCC) based on multiple expression datasets of HCC and adjacent normal tissues, and further on latest HCC single-cell RNA-seq data. Results show that gene co-expressions are highly different between bulk and single-cell datasets. Finally, many differentially co-expressed ligand-receptor pairs were discovered by comparing HCC with adjacent normal data, providing candidate HCC targets for abnormal cell-cell communications.
Collapse
Affiliation(s)
- Guiying Wu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Xiangyu Li
- School of Software Engineering, Beijing Jiaotong University, Beijing 100044, China
| | - Wenbo Guo
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Zheng Wei
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Tao Hu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Yiran Shan
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Jin Gu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| |
Collapse
|
4
|
Lee S, Zhang C, Liu Z, Klevstig M, Mukhopadhyay B, Bergentall M, Cinar R, Ståhlman M, Sikanic N, Park JK, Deshmukh S, Harzandi AM, Kuijpers T, Grøtli M, Elsässer SJ, Piening BD, Snyder M, Smith U, Nielsen J, Bäckhed F, Kunos G, Uhlen M, Boren J, Mardinoglu A. Network analyses identify liver-specific targets for treating liver diseases. Mol Syst Biol 2017; 13:938. [PMID: 28827398 PMCID: PMC5572395 DOI: 10.15252/msb.20177703] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 07/19/2017] [Accepted: 07/24/2017] [Indexed: 01/02/2023] Open
Abstract
We performed integrative network analyses to identify targets that can be used for effectively treating liver diseases with minimal side effects. We first generated co-expression networks (CNs) for 46 human tissues and liver cancer to explore the functional relationships between genes and examined the overlap between functional and physical interactions. Since increased de novo lipogenesis is a characteristic of nonalcoholic fatty liver disease (NAFLD) and hepatocellular carcinoma (HCC), we investigated the liver-specific genes co-expressed with fatty acid synthase (FASN). CN analyses predicted that inhibition of these liver-specific genes decreases FASN expression. Experiments in human cancer cell lines, mouse liver samples, and primary human hepatocytes validated our predictions by demonstrating functional relationships between these liver genes, and showing that their inhibition decreases cell growth and liver fat content. In conclusion, we identified liver-specific genes linked to NAFLD pathogenesis, such as pyruvate kinase liver and red blood cell (PKLR), or to HCC pathogenesis, such as PKLR, patatin-like phospholipase domain containing 3 (PNPLA3), and proprotein convertase subtilisin/kexin type 9 (PCSK9), all of which are potential targets for drug development.
Collapse
Affiliation(s)
- Sunjae Lee
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Cheng Zhang
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Zhengtao Liu
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Martina Klevstig
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Bani Mukhopadhyay
- Laboratory of Physiologic Studies, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Bethesda, MD, USA
| | - Mattias Bergentall
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Resat Cinar
- Laboratory of Physiologic Studies, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Bethesda, MD, USA
| | - Marcus Ståhlman
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Natasha Sikanic
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Joshua K Park
- Laboratory of Physiologic Studies, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Bethesda, MD, USA
| | - Sumit Deshmukh
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Azadeh M Harzandi
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Tim Kuijpers
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Morten Grøtli
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
| | - Simon J Elsässer
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Brian D Piening
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Michael Snyder
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Ulf Smith
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Jens Nielsen
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Fredrik Bäckhed
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - George Kunos
- Laboratory of Physiologic Studies, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Bethesda, MD, USA
| | - Mathias Uhlen
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Jan Boren
- Department of Molecular and Clinical Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Adil Mardinoglu
- Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
5
|
Benfeitas R, Uhlen M, Nielsen J, Mardinoglu A. New Challenges to Study Heterogeneity in Cancer Redox Metabolism. Front Cell Dev Biol 2017; 5:65. [PMID: 28744456 PMCID: PMC5504267 DOI: 10.3389/fcell.2017.00065] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 06/26/2017] [Indexed: 12/13/2022] Open
Abstract
Reactive oxygen species (ROS) are important pathophysiological molecules involved in vital cellular processes. They are extremely harmful at high concentrations because they promote the generation of radicals and the oxidation of lipids, proteins, and nucleic acids, which can result in apoptosis. An imbalance of ROS and a disturbance of redox homeostasis are now recognized as a hallmark of complex diseases. Considering that ROS levels are significantly increased in cancer cells due to mitochondrial dysfunction, ROS metabolism has been targeted for the development of efficient treatment strategies, and antioxidants are used as potential chemotherapeutic drugs. However, initial ROS-focused clinical trials in which antioxidants were supplemented to patients provided inconsistent results, i.e., improved treatment or increased malignancy. These different outcomes may result from the highly heterogeneous redox responses of tumors in different patients. Hence, population-based treatment strategies are unsuitable and patient-tailored therapeutic approaches are required for the effective treatment of patients. Moreover, due to the crosstalk between ROS, reducing equivalents [e.g., NAD(P)H] and central metabolism, which is heterogeneous in cancer, finding the best therapeutic target requires the consideration of system-wide approaches that are capable of capturing the complex alterations observed in all of the associated pathways. Systems biology and engineering approaches may be employed to overcome these challenges, together with tools developed in personalized medicine. However, ROS- and redox-based therapies have yet to be addressed by these methodologies in the context of disease treatment. Here, we review the role of ROS and their coupled redox partners in tumorigenesis. Specifically, we highlight some of the challenges in understanding the role of hydrogen peroxide (H2O2), one of the most important ROS in pathophysiology in the progression of cancer. We also discuss its interplay with antioxidant defenses, such as the coupled peroxiredoxin/thioredoxin and glutathione/glutathione peroxidase systems, and its reducing equivalent metabolism. Finally, we highlight the need for system-level and patient-tailored approaches to clarify the roles of these systems and identify therapeutic targets through the use of the tools developed in personalized medicine.
Collapse
Affiliation(s)
- Rui Benfeitas
- Science for Life Laboratory, KTH Royal Institute of TechnologyStockholm, Sweden
| | - Mathias Uhlen
- Science for Life Laboratory, KTH Royal Institute of TechnologyStockholm, Sweden
| | - Jens Nielsen
- Science for Life Laboratory, KTH Royal Institute of TechnologyStockholm, Sweden.,Department of Biology and Biological Engineering, Chalmers University of TechnologyGothenburg, Sweden
| | - Adil Mardinoglu
- Science for Life Laboratory, KTH Royal Institute of TechnologyStockholm, Sweden.,Department of Biology and Biological Engineering, Chalmers University of TechnologyGothenburg, Sweden
| |
Collapse
|