1
|
Hwangbo S, Lee S, Hosain MM, Goo T, Lee S, Kim I, Park T. Kernel-based hierarchical structural component models for pathway analysis on survival phenotype. Genes Genomics 2024:10.1007/s13258-024-01569-9. [PMID: 39327384 DOI: 10.1007/s13258-024-01569-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Accepted: 09/07/2024] [Indexed: 09/28/2024]
Abstract
BACKGROUND High-throughput sequencing, particularly RNA-sequencing (RNA-seq), has advanced differential gene expression analysis, revealing pathways involved in various biological conditions. Traditional pathway-based methods generally consider pathways independently, overlooking the correlations among them and ignoring quite a few overlapping biomarkers between pathways. In addition, most pathway-based approaches assume that biomarkers have linear effects on the phenotype of interest. OBJECTIVE This study aims to develop the HisCoM-KernelS model to identify survival phenotype-related pathways by accommodating complex, nonlinear relationships between genes and survival outcomes, while accounting for inter-pathway correlations. METHODS We applied HisCoM-KernelS model to the TCGA pancreatic ductal adenocarcinoma (PDAC) RNA-seq dataset, comprising 4,498 protein-coding genes mapped to 186 KEGG pathways from 148 PDAC samples. Kernel machine regression was used to model pathway effects on survival outcomes, incorporating hierarchical gene-pathway structures. Model parameters were estimated using the alternating least squares algorithm, and the significance of pathways was assessed through a permutation test. RESULTS HisCoM-KernelS identified several pathways significantly associated with pancreatic cancer survival, including those corroborated by previous studies. HisCoM-KernelS, especially with the Gaussian kernel, showed a better balance of detection rate and number of significant pathways compared to four other existing pathway-based methods: HisCoM-PAGE, Global Test, GSEA, and CoxKM. CONCLUSION HisCoM-KernelS successfully extends pathway-based analysis to survival outcomes, capturing complex nonlinear gene effects and inter-pathway correlations. Its application to the TCGA PDAC dataset emphasizes its utility in identifying biologically relevant pathways, offering a robust tool for survival phenotype research in high-throughput sequencing data.
Collapse
Affiliation(s)
- Suhyun Hwangbo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-747, Korea
- Department of Genomic Medicine, Seoul National University Hospital, Seoul, 03080, Korea
| | - Sungyoung Lee
- Department of Genomic Medicine, Seoul National University Hospital, Seoul, 03080, Korea
| | - Md Mozaffar Hosain
- Department of Statistics, Seoul National University, Seoul, 151-747, Korea
| | - Taewan Goo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-747, Korea
| | - Seungyeoun Lee
- Department of Mathematics and Statistics, Sejong University, Sejong, 05006, Korea
| | - Inyoung Kim
- Department of Statistics, Virginia Tech, Blacksburg, VA, 24060, USA
| | - Taesung Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-747, Korea.
- Department of Statistics, Seoul National University, Seoul, 151-747, Korea.
| |
Collapse
|
2
|
Hwangbo S, Lee S, Lee S, Hwang H, Kim I, Park T. Kernel-based hierarchical structural component models for pathway analysis. Bioinformatics 2022; 38:3078-3086. [PMID: 35460238 DOI: 10.1093/bioinformatics/btac276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 04/08/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Pathway analyses have led to more insight into the underlying biological functions related to the phenotype of interest in various types of omics data. Pathway-based statistical approaches have been actively developed, but most of them do not consider correlations among pathways. Because it is well known that there are quite a few biomarkers that overlap between pathways, these approaches may provide misleading results. In addition, most pathway-based approaches tend to assume that biomarkers within a pathway have linear associations with the phenotype of interest, even though the relationships are more complex. RESULTS To model complex effects including nonlinear effects, we propose a new approach, Hierarchical structural CoMponent analysis using Kernel (HisCoM-Kernel). The proposed method models nonlinear associations between biomarkers and phenotype by extending the kernel machine regression and analyzes entire pathways simultaneously by using the biomarker-pathway hierarchical structure. HisCoM-Kernel is a flexible model that can be applied to various omics data. It was successfully applied to three omics datasets generated by different technologies. Our simulation studies showed that HisCoM-Kernel provided higher statistical power than other existing pathway-based methods in all datasets. The application of HisCoM-Kernel to three types of omics dataset showed its superior performance compared to existing methods in identifying more biologically meaningful pathways, including those reported in previous studies. AVAILABILITY AND IMPLEMENTATION Freely available at http://statgen.snu.ac.kr/software/HisCom-Kernel/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Suhyun Hwangbo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-747, Korea.,Department of Genomic Medicine, Seoul National University Hospital, Seoul, 03080, Korea
| | - Sungyoung Lee
- Department of Genomic Medicine, Seoul National University Hospital, Seoul, 03080, Korea
| | - Seungyeoun Lee
- Department of Mathematics and Statistics, Sejong University, Sejong, 05006, Korea
| | - Heungsun Hwang
- Department of Psychology, McGill University, Montreal, QC, H3A 1B1, Canada
| | - Inyoung Kim
- Department of Statistics, Virginia Tech, Blacksburg, Virginia, 24060, U.S.A
| | - Taesung Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-747, Korea.,Department of Statistics, Seoul National University, Seoul, 151-747, Korea
| |
Collapse
|
3
|
Ma C, Wu M, Ma S. Analysis of cancer omics data: a selective review of statistical techniques. Brief Bioinform 2022; 23:6510158. [PMID: 35039832 DOI: 10.1093/bib/bbab585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 12/19/2021] [Accepted: 12/20/2021] [Indexed: 11/13/2022] Open
Abstract
Cancer is an omics disease. The development in high-throughput profiling has fundamentally changed cancer research and clinical practice. Compared with clinical, demographic and environmental data, the analysis of omics data-which has higher dimensionality, weaker signals and more complex distributional properties-is much more challenging. Developments in the literature are often 'scattered', with individual studies focused on one or a few closely related methods. The goal of this review is to assist cancer researchers with limited statistical expertise in establishing the 'overall framework' of cancer omics data analysis. To facilitate understanding, we mainly focus on intuition, concepts and key steps, and refer readers to the original publications for mathematical details. This review broadly covers unsupervised and supervised analysis, as well as individual-gene-based, gene-set-based and gene-network-based analysis. We also briefly discuss 'special topics' including interaction analysis, multi-datasets analysis and multi-omics analysis.
Collapse
Affiliation(s)
- Chenjin Ma
- College of Statistics and Data Science, Faculty of Science, Beijing University of Technology, Beijing, China
| | - Mengyun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Shuangge Ma
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| |
Collapse
|
4
|
Ameri M, Salimi H, Eskandari S, Nezafat N. Identification of potential biomarkers in hepatocellular carcinoma: A network-based approach. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.100864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
5
|
Blood and urine biomarkers in invasive ductal breast cancer: Mass spectrometry applied to identify metabolic alterations. J Mol Struct 2022. [DOI: 10.1016/j.molstruc.2021.131369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
6
|
Nies HW, Mohamad MS, Zakaria Z, Chan WH, Remli MA, Nies YH. Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data. ENTROPY (BASEL, SWITZERLAND) 2021; 23:1232. [PMID: 34573857 PMCID: PMC8472068 DOI: 10.3390/e23091232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 09/14/2021] [Accepted: 09/16/2021] [Indexed: 12/12/2022]
Abstract
Artificial intelligence in healthcare can potentially identify the probability of contracting a particular disease more accurately. There are five common molecular subtypes of breast cancer: luminal A, luminal B, basal, ERBB2, and normal-like. Previous investigations showed that pathway-based microarray analysis could help in the identification of prognostic markers from gene expressions. For example, directed random walk (DRW) can infer a greater reproducibility power of the pathway activity between two classes of samples with a higher classification accuracy. However, most of the existing methods (including DRW) ignored the characteristics of different cancer subtypes and considered all of the pathways to contribute equally to the analysis. Therefore, an enhanced DRW (eDRW+) is proposed to identify breast cancer prognostic markers from multiclass expression data. An improved weight strategy using one-way ANOVA (F-test) and pathway selection based on the greatest reproducibility power is proposed in eDRW+. The experimental results show that the eDRW+ exceeds other methods in terms of AUC. Besides this, the eDRW+ identifies 294 gene markers and 45 pathway markers from the breast cancer datasets with better AUC. Therefore, the prognostic markers (pathway markers and gene markers) can identify drug targets and look for cancer subtypes with clinically distinct outcomes.
Collapse
Affiliation(s)
- Hui Wen Nies
- School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai 81310, Malaysia; (Z.Z.); (W.H.C.)
| | - Mohd Saberi Mohamad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain 17666, United Arab Emirates;
| | - Zalmiyah Zakaria
- School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai 81310, Malaysia; (Z.Z.); (W.H.C.)
| | - Weng Howe Chan
- School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai 81310, Malaysia; (Z.Z.); (W.H.C.)
| | - Muhammad Akmal Remli
- Institute for Artificial Intelligence and Big Data, Universiti Malaysia Kelantan, Kota Bharu 16100, Malaysia;
| | - Yong Hui Nies
- Department of Anatomy, Faculty of Medicine, Universiti Kebangsaan Malaysia Medical Centre, Cheras, Kuala Lumpur 56000, Malaysia;
| |
Collapse
|
7
|
Auslander N, Gussow AB, Koonin EV. Incorporating Machine Learning into Established Bioinformatics Frameworks. Int J Mol Sci 2021; 22:2903. [PMID: 33809353 PMCID: PMC8000113 DOI: 10.3390/ijms22062903] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/08/2021] [Accepted: 03/10/2021] [Indexed: 12/23/2022] Open
Abstract
The exponential growth of biomedical data in recent years has urged the application of numerous machine learning techniques to address emerging problems in biology and clinical research. By enabling the automatic feature extraction, selection, and generation of predictive models, these methods can be used to efficiently study complex biological systems. Machine learning techniques are frequently integrated with bioinformatic methods, as well as curated databases and biological networks, to enhance training and validation, identify the best interpretable features, and enable feature and model investigation. Here, we review recently developed methods that incorporate machine learning within the same framework with techniques from molecular evolution, protein structure analysis, systems biology, and disease genomics. We outline the challenges posed for machine learning, and, in particular, deep learning in biomedicine, and suggest unique opportunities for machine learning techniques integrated with established bioinformatics approaches to overcome some of these challenges.
Collapse
Affiliation(s)
| | | | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA;
| |
Collapse
|
8
|
Mi Z, Guo B, Yang X, Yin Z, Zheng Z. LAMP: disease classification derived from layered assessment on modules and pathways in the human gene network. BMC Bioinformatics 2020; 21:487. [PMID: 33126852 PMCID: PMC7597061 DOI: 10.1186/s12859-020-03800-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 10/05/2020] [Indexed: 11/10/2022] Open
Abstract
Background Classification of diseases based on genetic information is of great significance as the basis for precision medicine, increasing the understanding of disease etiology and revolutionizing personalized medicine. Much effort has been directed at understanding disease associations by constructing disease networks, and classifying patient samples according to gene expression data. Integrating human gene networks overcomes limited coverage of genes. Incorporating pathway information into disease classification procedure addresses the challenge of cellular heterogeneity across patients.
Results In this work, we propose a disease classification model LAMP, which concentrates on the layered assessment on modules and pathways. Directed human gene interactions are the foundation of constructing the human gene network, where the significant roles of disease and pathway genes are recognized. The fast unfolding algorithm identifies 11 modules in the largest connected component. Then layered networks are introduced to distinguish positions of genes in propagating information from sources to targets. After gene screening, hierarchical clustering and refined process, 1726 diseases from KEGG are classified into 18 categories. Also, it is expounded that diseases with overlapping genes may not belong to the same category in LAMP. Within each category, entropy is applied to measure the compositional complexity, and to evaluate the prospects for combination diagnosis and gene-targeted therapy for diseases. Conclusion In this work, by collecting data from BioGRID and KEGG, we develop a disease classification model LAMP, to support people to view diseases from the perspective of commonalities in etiology and pathology. Comprehensive research on existing diseases can help meet the challenges of unknown diseases. The results provide suggestions for combination diagnosis and gene-targeted therapy, which motivates clinicians and researchers to reposition the understanding of diseases and explore diagnosis and therapy strategies.
Collapse
Affiliation(s)
- Zhilong Mi
- Beijing Advanced Innovation Center for Big Data and Brain Computing and LMIB, Beihang University, Beijing, China.,Peng Cheng Laboratory, Shenzhen, Guangdong Province, China.,School of Mathematical Sciences and Shenyuan Honors College, Beihang University, Beijing, China
| | - Binghui Guo
- Beijing Advanced Innovation Center for Big Data and Brain Computing and LMIB, Beihang University, Beijing, China. .,Peng Cheng Laboratory, Shenzhen, Guangdong Province, China. .,School of Mathematical Sciences and Shenyuan Honors College, Beihang University, Beijing, China.
| | - Xiaobo Yang
- Beijing Advanced Innovation Center for Big Data and Brain Computing and LMIB, Beihang University, Beijing, China.,Peng Cheng Laboratory, Shenzhen, Guangdong Province, China.,School of Mathematical Sciences and Shenyuan Honors College, Beihang University, Beijing, China
| | - Ziqiao Yin
- Beijing Advanced Innovation Center for Big Data and Brain Computing and LMIB, Beihang University, Beijing, China.,Peng Cheng Laboratory, Shenzhen, Guangdong Province, China.,School of Mathematical Sciences and Shenyuan Honors College, Beihang University, Beijing, China
| | - Zhiming Zheng
- Beijing Advanced Innovation Center for Big Data and Brain Computing and LMIB, Beihang University, Beijing, China.,Peng Cheng Laboratory, Shenzhen, Guangdong Province, China.,School of Mathematical Sciences and Shenyuan Honors College, Beihang University, Beijing, China
| |
Collapse
|
9
|
Zhang W, Zeng Y, Wang L, Liu Y, Cheng YN. An Effective Graph Clustering Method to Identify Cancer Driver Modules. Front Bioeng Biotechnol 2020; 8:271. [PMID: 32318558 PMCID: PMC7154174 DOI: 10.3389/fbioe.2020.00271] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 03/16/2020] [Indexed: 12/15/2022] Open
Abstract
Identifying the molecular modules that drive cancer progression can greatly deepen the understanding of cancer mechanisms and provide useful information for targeted therapies. Most methods currently addressing this issue primarily use mutual exclusivity without making full use of the extra layer of module property. In this paper, we propose MCLCluster to identity cancer driver modules, which use somatic mutation data, Cancer Cell Fraction (CCF) data, gene functional interaction network and protein-protein interaction (PPI) network to derive the module property on mutual exclusivity, connectivity in PPI network and functionally similarity of genes. We have taken three effective measures to ensure the effectiveness of our algorithm. First, we use CCF data to choose stronger signals and more confident mutations. Second, the weighted gene functional interaction network is used to quantify the gene functional similarity in PPI. The third, graph clustering method based on Markov is exploited to extract the candidate module. MCLCluster is tested in the two TCGA datasets (GBM and BRCA), and identifies several well-known oncogenes driver modules and some modules with functionally associated driver genes. Besides, we compare it with Multi-Dendrix, FSME Cluster and RME in simulated dataset with background noise and passenger rate, MCLCluster outperforming all of these methods.
Collapse
Affiliation(s)
- Wei Zhang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, China
| | - Yifu Zeng
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, China
| | - Lei Wang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.,Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Yue Liu
- College of Computer Science and Electronics Engineering, Hunan University, Changsha, China
| | - Yi-Nan Cheng
- College of Science, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
10
|
Sadeghi M, Ordway B, Rafiei I, Borad P, Fang B, Koomen JL, Zhang C, Yoder S, Johnson J, Damaghi M. Integrative Analysis of Breast Cancer Cells Reveals an Epithelial-Mesenchymal Transition Role in Adaptation to Acidic Microenvironment. Front Oncol 2020; 10:304. [PMID: 32211331 PMCID: PMC7076123 DOI: 10.3389/fonc.2020.00304] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 02/20/2020] [Indexed: 01/06/2023] Open
Abstract
Early ducts of breast tumors are unequivocally acidic. High rates of glycolysis combined with poor perfusion lead to a congestion of acidic metabolites in the tumor microenvironment, and pre-malignant cells must adapt to this acidosis to thrive. Adaptation to acidosis selects cancer cells that can thrive in harsh conditions and are capable of outgrowing the normal or non-adapted neighbors. This selection is usually accompanied by phenotypic change. Epithelial mesenchymal transition (EMT) is one of the most important switches correlated to malignant tumor cell phenotype and has been shown to be induced by tumor acidosis. New evidence shows that the EMT switch is not a binary system and occurs on a spectrum of transition states. During confirmation of the EMT phenotype, our results demonstrated a partial EMT phenotype in our acid-adapted cell population. Using RNA sequencing and network analysis we found 10 dysregulated network motifs in acid-adapted breast cancer cells playing a role in EMT. Our further integrative analysis of RNA sequencing and SILAC proteomics resulted in recognition of S100B and S100A6 proteins at both the RNA and protein level. Higher expression of S100B and S100A6 was validated in vitro by Immunocytochemistry. We further validated our finding both in vitro and in patients' samples by IHC analysis of Tissue Microarray (TMA). Correlation analysis of S100A6 and LAMP2b as marker of acidosis in each patient from Moffitt TMA approved the acid related role of S100A6 in breast cancer patients. Also, DCIS patients with higher expression of S100A6 showed lower survival compared to lower expression. We propose essential roles of acid adaptation in cancer cells EMT process through S100 proteins such as S100A6 that can be used as therapeutic strategy targeting both acid-adapted and malignant phenotypes.
Collapse
Affiliation(s)
- Mehdi Sadeghi
- Department of Cell and Molecular Biology, Faculty of Science, Semnan University, Semnan, Iran
| | - Bryce Ordway
- Department of Cancer Physiology, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Ilyia Rafiei
- Department of Cell and Molecular Biology, Faculty of Science, Semnan University, Semnan, Iran
| | - Punit Borad
- Department of Cancer Physiology, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Bin Fang
- Proteomics Core, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - John L Koomen
- Proteomics Core, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Chaomei Zhang
- Molecular Biology Core, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Sean Yoder
- Molecular Biology Core, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Joseph Johnson
- Microscopy Core, Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Mehdi Damaghi
- Department of Cancer Physiology, Moffitt Cancer Center and Research Institute, Tampa, FL, United States.,Department of Oncologic Sciences, Morsani College of Medicine, University of South Florida, Tampa, FL, United States
| |
Collapse
|
11
|
Ullah MA, Sarkar B, Akter F. Prediction of biomarker signatures and therapeutic agents from blood sample against Pancreatic Ductal Adenocarcinoma (PDAC): A network-based study. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100346] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
12
|
Characterization and Validation of an "Acute Aerobic Exercise Load" as a Tool to Assess Antioxidative and Anti-inflammatory Nutrition in Healthy Subjects Using a Statistically Integrated Approach in a Comprehensive Clinical Trial. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2019; 2019:9526725. [PMID: 31612079 PMCID: PMC6755301 DOI: 10.1155/2019/9526725] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 06/09/2019] [Indexed: 11/17/2022]
Abstract
The homeostatic challenge may provide unique opportunities for quantitative assessment of the health-promoting effects of nutritional interventions in healthy individuals. Objective. The present study is aimed at characterizing and validating the use of acute aerobic exercise (AAE) on a treadmill at 60% of VO2max for 30 min, in assessing the antioxidative and anti-inflammatory effects of a nutritional intervention. In a controlled, randomized, parallel trial of Korean black raspberry (KBR) (n = 24/group), fasting blood and urine samples collected before and following the AAE load at either baseline or 4-week follow-up were analyzed for biochemical markers, 1H-NMR metabolomics, and transcriptomics. The AAE was characterized using the placebo data only, and either the placebo or the treatment data were used in the validation. The AAE load generated a total of 50 correlations of 44 selected markers, based on Pearson's correlation coefficient analysis of 105 differential markers. Subsequent mapping of selected markers onto the KEGG pathway dataset showed 127 pathways relevant to the AAE load. Of these, 54 pathways involving 18 key targets were annotated to be related to oxidative stress and inflammation. The biochemical responses were amplified with the AAE load as compared to those with no load, whereas, the metabolomic and transcriptomic responses were downgraded. Furthermore, target-pathway network analysis revealed that the AAE load provided more explanations on how KBR exerted antioxidant effects in healthy subjects (29 pathways involving 12 key targets with AAE vs. 12 pathways involving 2 key targets without AAE). This study provides considerable insight into the molecular changes incurred by AAE and furthers our understanding that AAE-induced homeostatic perturbation could magnify oxidative and inflammatory responses, thereby providing a unique opportunity to test functional foods for antioxidant and anti-inflammatory purposes in clinical settings with healthy subjects.
Collapse
|
13
|
Tang J, He D, Yang P, He J, Zhang Y. Genome-wide expression profiling of glioblastoma using a large combined cohort. Sci Rep 2018; 8:15104. [PMID: 30305647 PMCID: PMC6180049 DOI: 10.1038/s41598-018-33323-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 09/24/2018] [Indexed: 01/12/2023] Open
Abstract
Glioblastomas (GBMs), are the most common intrinsic brain tumors in adults and are almost universally fatal. Despite the progresses made in surgery, chemotherapy, and radiation over the past decades, the prognosis of patients with GBM remained poor and the average survival time of patients suffering from GBM was still short. Discovering robust gene signatures toward better understanding of the complex molecular mechanisms leading to GBM is an important prerequisite to the identification of novel and more effective therapeutic strategies. Herein, a comprehensive study of genome-scale mRNA expression data by combining GBM and normal tissue samples from 48 studies was performed. The 147 robust gene signatures were identified to be significantly differential expression between GBM and normal samples, among which 100 (68%) genes were reported to be closely associated with GBM in previous publications. Moreover, function annotation analysis based on these 147 robust DEGs showed certain deregulated gene expression programs (e.g., cell cycle, immune response and p53 signaling pathway) were associated with GBM development, and PPI network analysis revealed three novel hub genes (RFC4, ZWINT and TYMS) play important role in GBM development. Furthermore, survival analysis based on the TCGA GBM data demonstrated 38 robust DEGs significantly affect the prognosis of GBM in OS (p < 0.05). These findings provided new insights into molecular mechanisms underlying GBM and suggested the 38 robust DEGs could be potential targets for the diagnosis and treatment.
Collapse
Affiliation(s)
- Jing Tang
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Innovative Drug Research Centre, Chongqing University, Chongqing, 401331, China.,Materia Medica Development Group, Institute of Medicinal Chemistry, Lanzhou University School of Pharmacy, Lanzhou, 730000, China
| | - Dian He
- Materia Medica Development Group, Institute of Medicinal Chemistry, Lanzhou University School of Pharmacy, Lanzhou, 730000, China. .,Gansu Institute for Drug Control, Lanzhou, 730070, China.
| | - Pingrong Yang
- Materia Medica Development Group, Institute of Medicinal Chemistry, Lanzhou University School of Pharmacy, Lanzhou, 730000, China.,Gansu Institute for Drug Control, Lanzhou, 730070, China
| | - Junquan He
- Materia Medica Development Group, Institute of Medicinal Chemistry, Lanzhou University School of Pharmacy, Lanzhou, 730000, China.,Gansu Institute for Drug Control, Lanzhou, 730070, China
| | - Yang Zhang
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Innovative Drug Research Centre, Chongqing University, Chongqing, 401331, China. .,Materia Medica Development Group, Institute of Medicinal Chemistry, Lanzhou University School of Pharmacy, Lanzhou, 730000, China.
| |
Collapse
|
14
|
Kondratova M, Sompairac N, Barillot E, Zinovyev A, Kuperstein I. Signalling maps in cancer research: construction and data analysis. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4964960. [PMID: 29688383 PMCID: PMC5890450 DOI: 10.1093/database/bay036] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Accepted: 03/19/2018] [Indexed: 12/22/2022]
Abstract
Generation and usage of high-quality molecular signalling network maps can be augmented by standardizing notations, establishing curation workflows and application of computational biology methods to exploit the knowledge contained in the maps. In this manuscript, we summarize the major aims and challenges of assembling information in the form of comprehensive maps of molecular interactions. Mainly, we share our experience gained while creating the Atlas of Cancer Signalling Network. In the step-by-step procedure, we describe the map construction process and suggest solutions for map complexity management by introducing a hierarchical modular map structure. In addition, we describe the NaviCell platform, a computational technology using Google Maps API to explore comprehensive molecular maps similar to geographical maps and explain the advantages of semantic zooming principles for map navigation. We also provide the outline to prepare signalling network maps for navigation using the NaviCell platform. Finally, several examples of cancer high-throughput data analysis and visualization in the context of comprehensive signalling maps are presented.
Collapse
Affiliation(s)
- Maria Kondratova
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM, U900, F-75005 Paris, France.,MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Nicolas Sompairac
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM, U900, F-75005 Paris, France.,MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM, U900, F-75005 Paris, France.,MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM, U900, F-75005 Paris, France.,MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Inna Kuperstein
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM, U900, F-75005 Paris, France.,MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| |
Collapse
|
15
|
Zhang L, Kim I. Semiparametric Bayesian kernel survival model for evaluating pathway effects. Stat Methods Med Res 2018; 28:3301-3317. [PMID: 30289021 DOI: 10.1177/0962280218797360] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Massive amounts of high-dimensional data have been accumulated over the past two decades, which has cultured increasing interests in identifying gene pathways related to certain biological processes. In particular, since pathway-based analysis has the ability to detect subtle changes of differentially expressed genes that could be missed when using gene-based analysis, detecting the gene pathways that regulate certain diseases can provide new strategies for medical procedures and new targets for drug discovery. Limited work has been carried out, primarily in regression settings, to study the effects of pathways on survival outcomes. Motivated by a breast cancer gene-pathway data set, which exhibits the "small n, large p" characteristics, we propose a semiparametric Bayesian kernel survival model (s-BKSurv) to study the effects of both clinical covariates and gene expression levels within a pathway on survival time. We model the unknown high-dimensional functions of pathways via Gaussian kernel machine to consider the possibility that genes within the same pathway interact with each other. To address the multiple comparisons problem under a full Bayesian setting, we propose a similarity-dependent procedure based on Bayes factor to control the family-wise error rate. We demonstrate the outperformance of our approach under various simulation settings and pathways data.
Collapse
Affiliation(s)
- Lin Zhang
- Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Inyoung Kim
- Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| |
Collapse
|
16
|
Misra BB, Langefeld CD, Olivier M, Cox LA. Integrated Omics: Tools, Advances, and Future Approaches. J Mol Endocrinol 2018; 62:JME-18-0055. [PMID: 30006342 DOI: 10.1530/jme-18-0055] [Citation(s) in RCA: 220] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 07/02/2018] [Accepted: 07/12/2018] [Indexed: 12/13/2022]
Abstract
With the rapid adoption of high-throughput omic approaches to analyze biological samples such as genomics, transcriptomics, proteomics, and metabolomics, each analysis can generate tera- to peta-byte sized data files on a daily basis. These data file sizes, together with differences in nomenclature among these data types, make the integration of these multi-dimensional omics data into biologically meaningful context challenging. Variously named as integrated omics, multi-omics, poly-omics, trans-omics, pan-omics, or shortened to just 'omics', the challenges include differences in data cleaning, normalization, biomolecule identification, data dimensionality reduction, biological contextualization, statistical validation, data storage and handling, sharing, and data archiving. The ultimate goal is towards the holistic realization of a 'systems biology' understanding of the biological question in hand. Commonly used approaches in these efforts are currently limited by the 3 i's - integration, interpretation, and insights. Post integration, these very large datasets aim to yield unprecedented views of cellular systems at exquisite resolution for transformative insights into processes, events, and diseases through various computational and informatics frameworks. With the continued reduction in costs and processing time for sample analyses, and increasing types of omics datasets generated such as glycomics, lipidomics, microbiomics, and phenomics, an increasing number of scientists in this interdisciplinary domain of bioinformatics face these challenges. We discuss recent approaches, existing tools, and potential caveats in the integration of omics datasets for development of standardized analytical pipelines that could be adopted by the global omics research community.
Collapse
Affiliation(s)
- Biswapriya B Misra
- B Misra, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Carl D Langefeld
- C Langefeld, Biostatistical Sciences, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Michael Olivier
- M Olivier, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Laura A Cox
- L Cox, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| |
Collapse
|
17
|
Monraz Gomez LC, Kondratova M, Ravel JM, Barillot E, Zinovyev A, Kuperstein I. Application of Atlas of Cancer Signalling Network in preclinical studies. Brief Bioinform 2018; 20:701-716. [DOI: 10.1093/bib/bby031] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 03/28/2018] [Indexed: 12/27/2022] Open
Affiliation(s)
- L Cristobal Monraz Gomez
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Maria Kondratova
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Jean-Marie Ravel
- Genetic Laboratory, Nancy's Regional University Hospital, Vandœuvre-lès-Nancy and INSERM UMR 954, Lorraine University, Vandœuvre-lès-Nancy
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Inna Kuperstein
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| |
Collapse
|
18
|
Downs DM, Bazurto JV, Gupta A, Fonseca LL, Voit EO. The three-legged stool of understanding metabolism: integrating metabolomics with biochemical genetics and computational modeling. AIMS Microbiol 2018; 4:289-303. [PMID: 31294216 PMCID: PMC6604926 DOI: 10.3934/microbiol.2018.2.289] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Accepted: 04/02/2018] [Indexed: 12/23/2022] Open
Abstract
Traditional biochemical research has resulted in a good understanding of many aspects of metabolism. However, this reductionist approach is time consuming and requires substantial resources, thus raising the question whether modern metabolomics and genomics should take over and replace the targeted experiments of old. We proffer that such a replacement is neither feasible not desirable and propose instead the tight integration of modern, system-wide omics with traditional experimental bench science and dedicated computational approaches. This integration is an important prerequisite toward the optimal acquisition of knowledge regarding metabolism and physiology in health and disease. The commentary describes advantages and drawbacks of current approaches to assessing metabolism and highlights the challenges to be overcome as we strive to achieve a deeper level of metabolic understanding in the future.
Collapse
Affiliation(s)
- Diana M Downs
- Department of Microbiology, University of Georgia, Athens, GA, 30602, USA
| | - Jannell V Bazurto
- Department of Biological Sciences, University of Idaho, Moscow, ID, 83844, USA
| | - Anuj Gupta
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| | - Luis L Fonseca
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| |
Collapse
|
19
|
Grinchuk OV, Yenamandra SP, Iyer R, Singh M, Lee HK, Lim KH, Chow PK, Kuznetsov VA. Tumor-adjacent tissue co-expression profile analysis reveals pro-oncogenic ribosomal gene signature for prognosis of resectable hepatocellular carcinoma. Mol Oncol 2017; 12:89-113. [PMID: 29117471 PMCID: PMC5748488 DOI: 10.1002/1878-0261.12153] [Citation(s) in RCA: 121] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Revised: 10/03/2017] [Accepted: 10/16/2017] [Indexed: 12/18/2022] Open
Abstract
Currently, molecular markers are not used when determining the prognosis and treatment strategy for patients with hepatocellular carcinoma (HCC). In the present study, we proposed that the identification of common pro‐oncogenic pathways in primary tumors (PT) and adjacent non‐malignant tissues (AT) typically used to predict HCC patient risks may result in HCC biomarker discovery. We examined the genome‐wide mRNA expression profiles of paired PT and AT samples from 321 HCC patients. The workflow integrated differentially expressed gene selection, gene ontology enrichment, computational classification, survival predictions, image analysis and experimental validation methods. We developed a 24‐ribosomal gene‐based HCC classifier (RGC), which is prognostically significant in both PT and AT. The RGC gene overexpression in PT was associated with a poor prognosis in the training (hazard ratio = 8.2, P = 9.4 × 10−6) and cross‐cohort validation (hazard ratio = 2.63, P = 0.004) datasets. The multivariate survival analysis demonstrated the significant and independent prognostic value of the RGC. The RGC displayed a significant prognostic value in AT of the training (hazard ratio = 5.0, P = 0.03) and cross‐validation (hazard ratio = 1.9, P = 0.03) HCC groups, confirming the accuracy and robustness of the RGC. Our experimental and bioinformatics analyses suggested a key role for c‐MYC in the pro‐oncogenic pattern of ribosomal biogenesis co‐regulation in PT and AT. Microarray, quantitative RT‐PCR and quantitative immunohistochemical studies of the PT showed that DKK1 in PT is the perspective biomarker for poor HCC outcomes. The common co‐transcriptional pattern of ribosome biogenesis genes in PT and AT from HCC patients suggests a new scalable prognostic system, as supported by the model of tumor‐like metabolic redirection/assimilation in non‐malignant AT. The RGC, comprising 24 ribosomal genes, is introduced as a robust and reproducible prognostic model for stratifying HCC patient risks. The adjacent non‐malignant liver tissue alone, or in combination with HCC tissue biopsy, could be an important target for developing predictive and monitoring strategies, as well as evidence‐based therapeutic interventions, that aim to reduce the risk of post‐surgery relapse in HCC patients.
Collapse
Affiliation(s)
| | | | | | - Malay Singh
- Bioinformatics InstituteSingapore
- Department of Computer ScienceSchool of ComputingNational University of SingaporeSingapore
| | - Hwee Kuan Lee
- Bioinformatics InstituteSingapore
- Department of Computer ScienceSchool of ComputingNational University of SingaporeSingapore
| | - Kiat Hon Lim
- Division of Surgical OncologyNational Cancer CentreSingaporeSingapore
| | - Pierce Kah‐Hoe Chow
- Division of Surgical OncologyNational Cancer CentreSingaporeSingapore
- Office of Clinical SciencesDuke‐NUS Graduate Medical SchoolSingaporeSingapore
- Department of HPB and Transplantation SurgerySingapore General HospitalSingapore
| | | |
Collapse
|
20
|
Do Canto LM, Marian C, Varghese RS, Ahn J, Da Cunha PA, Willey S, Sidawy M, Rone JD, Cheema AK, Luta G, Nezami ranjbar MR, Ressom HW, Haddad BR. Metabolomic profiling of breast tumors using ductal fluid. Int J Oncol 2016; 49:2245-2254. [PMID: 27748798 PMCID: PMC5117995 DOI: 10.3892/ijo.2016.3732] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 09/26/2016] [Indexed: 12/12/2022] Open
Abstract
Identification of new biomarkers for breast cancer remains critical in order to enhance early detection of the disease and improve its prognosis. Towards this end, we performed an untargeted metabolomic analysis of breast ductal fluid using an ultra-performance liquid chromatography coupled with a quadrupole time-of-light (UPLC-QTOF) mass spectrometer. We investigated the metabolomic profiles of breast tumors using ductal fluid samples collected by ductal lavage (DL). We studied fluid from both the affected breasts and the unaffected contralateral breasts (as controls) from 43 women with confirmed unilateral breast cancer. Using this approach, we identified 1560 ions in the positive mode and 538 ions in the negative mode after preprocessing of the UPLC‑QTOF data. Paired t-tests applied on these data matrices identified 209 ions (positive and negative modes combined) with significant change in intensity level between affected and unaffected control breasts (adjusted p-values <0.05). Among these, 83 ions (39.7%) showed a fold change (FC) >1.2 and 66 ions (31.6%) were identified with putative compound names. The metabolites that we identified included endogenous metabolites such as amino acid derivatives (N-Acetyl-DL-tryptophan) or products of lipid metabolism such as N-linoleoyl taurine, trans-2-dodecenoylcarnitine, lysophosphatidylcholine LysoPC(18:2(9Z,12Z)), glycerophospholipids PG(18:0/0:0), and phosphatidylserine PS(20:4(5Z,8Z,11Z,14Z). Generalized LASSO regression further selected 21 metabolites when race, menopausal status, smoking, grade and TNM stage were adjusted for. A predictive conditional logistic regression model, using the LASSO selected 21 ions, provided diagnostic accuracy with the area under the curve of 0.956 (sensitivity/specificity of 0.907/0.884). This is the first study that shows the feasibility of conducting a comprehensive metabolomic profiling of breast tumors using breast ductal fluid to detect changes in the cellular microenvironment of the tumors and shows the potential for this approach to be used to improve detection of breast cancer.
Collapse
MESH Headings
- Biomarkers, Tumor/metabolism
- Breast Neoplasms/diagnosis
- Breast Neoplasms/pathology
- Carcinoma, Intraductal, Noninfiltrating/diagnosis
- Carcinoma, Intraductal, Noninfiltrating/pathology
- Chromatography, Liquid
- Female
- Humans
- Mammary Glands, Human/physiology
- Mass Spectrometry
- Metabolome/physiology
- Metabolomics/methods
- Middle Aged
- Receptor, ErbB-2/metabolism
- Receptors, Estrogen/metabolism
- Receptors, Progesterone/metabolism
Collapse
Affiliation(s)
- Luisa Matos Do Canto
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Catalin Marian
- Biochemistry Department, ‘Victor Babes’ University of Medicine and Pharmacy, Timisoara, Romania
- Ohio State University Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Rency S. Varghese
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Jaeil Ahn
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University Medical Center, Georgetown University, Washington DC, 20007, USA
| | - Patricia A. Da Cunha
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Shawna Willey
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
- Department of Surgery, MedStar Georgetown University Hospital, Georgetown University, Washington DC, 20007, USA
| | - Mary Sidawy
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
- Department of Pathology, MedStar Georgetown University Hospital, Georgetown University, Washington DC, 20007, USA
| | - Janice D. Rone
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Amrita K. Cheema
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
- Department of Biochemistry and Molecular and Cellular Biology, Georgetown University Medical Center, Georgetown University, Washington DC, 20007, USA
| | - George Luta
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University Medical Center, Georgetown University, Washington DC, 20007, USA
| | - Mohammad R. Nezami ranjbar
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Habtom W. Ressom
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| | - Bassem R. Haddad
- Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Georgetown University, Washington DC, USA
| |
Collapse
|
21
|
Peyser ND, Pendleton K, Gooding WE, Lui VWY, Johnson DE, Grandis JR. Genomic and Transcriptomic Alterations Associated with STAT3 Activation in Head and Neck Cancer. PLoS One 2016; 11:e0166185. [PMID: 27855189 PMCID: PMC5113908 DOI: 10.1371/journal.pone.0166185] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 10/24/2016] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Hyperactivation of STAT3 via constitutive phosphorylation of tyrosine 705 (Y705) is common in most human cancers, including head and neck squamous carcinoma (HNSCC). STAT3 is rarely mutated in cancer and the (epi)genetic alterations that lead to STAT3 activation are incompletely understood. Here we used an unbiased approach to identify genomic and epigenomic changes associated with pSTAT3(Y705) expression using data generated by The Cancer Genome Atlas (TCGA). METHODS AND FINDINGS Mutation, mRNA expression, promoter methylation, and copy number alteration data were extracted from TCGA and examined in the context of pSTAT3(Y705) protein expression. mRNA expression levels of 1279 genes were found to be associated with pSTAT3(705) expression. Association of pSTAT3(Y705) expression with caspase-8 mRNA expression was validated by immunoblot analysis in HNSCC cells. Mutation, promoter hypermethylation, and copy number alteration of any gene were not significantly associated with increased pSTAT3(Y705) protein expression. CONCLUSIONS These cumulative results suggest that unbiased approaches may be useful in identifying the molecular underpinnings of oncogenic signaling, including STAT3 activation, in HNSCC. Larger datasets will likely be necessary to elucidate signaling consequences of infrequent alterations.
Collapse
Affiliation(s)
- Noah D. Peyser
- Department of Otolaryngology–Head and Neck Surgery, University of California San Francisco, San Francisco, CA, United States of America, 94143
| | - Kelsey Pendleton
- Department of Otolaryngology, University of Pittsburgh and the University of Pittsburgh Cancer Institute, Pittsburgh, PA, United States of America, 15213
| | - William E. Gooding
- Biostatistics Facility, University of Pittsburgh Cancer Institute, Pittsburgh, PA, United States of America, 15213
| | - Vivian W. Y. Lui
- Department of Pharmacology and Pharmacy, School of Biomedical Sciences, Li-Ka Shing Faculty of Medicine, the University of Hong Kong, Hong Kong SAR, China
| | - Daniel E. Johnson
- Department of Medicine, University of Pittsburgh and the University of Pittsburgh Cancer Institute, Pittsburgh, PA, United States of America
| | - Jennifer R. Grandis
- Department of Otolaryngology–Head and Neck Surgery, University of California San Francisco, San Francisco, CA, United States of America, 94143
- * E-mail:
| |
Collapse
|
22
|
Yan W, Xue W, Chen J, Hu G. Biological Networks for Cancer Candidate Biomarkers Discovery. Cancer Inform 2016; 15:1-7. [PMID: 27625573 PMCID: PMC5012434 DOI: 10.4137/cin.s39458] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Revised: 06/06/2016] [Accepted: 06/16/2016] [Indexed: 12/16/2022] Open
Abstract
Due to its extraordinary heterogeneity and complexity, cancer is often proposed as a model case of a systems biology disease or network disease. There is a critical need of effective biomarkers for cancer diagnosis and/or outcome prediction from system level analyses. Methods based on integrating omics data into networks have the potential to revolutionize the identification of cancer biomarkers. Deciphering the biological networks underlying cancer is undoubtedly important for understanding the molecular mechanisms of the disease and identifying effective biomarkers. In this review, the networks constructed for cancer biomarker discovery based on different omics level data are described and illustrated from recent advances in the field.
Collapse
Affiliation(s)
- Wenying Yan
- Center for Systems Biology, Soochow University, Suzhou, Jiangsu, China
| | - Wenjin Xue
- Department of Electrical Engineering, Technician College of Taizhou, Taizhou, Jiangsu, China
| | - Jiajia Chen
- School of Chemistry, Biology and Material Engineering, Suzhou University of Science and Technology, Suzhou, China
| | - Guang Hu
- Center for Systems Biology, Soochow University, Suzhou, Jiangsu, China
| |
Collapse
|
23
|
A New Strategy for Analyzing Time-Series Data Using Dynamic Networks: Identifying Prospective Biomarkers of Hepatocellular Carcinoma. Sci Rep 2016; 6:32448. [PMID: 27578360 PMCID: PMC5006023 DOI: 10.1038/srep32448] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Accepted: 08/08/2016] [Indexed: 02/08/2023] Open
Abstract
Time-series metabolomics studies can provide insight into the dynamics of disease development and facilitate the discovery of prospective biomarkers. To improve the performance of early risk identification, a new strategy for analyzing time-series data based on dynamic networks (ATSD-DN) in a systematic time dimension is proposed. In ATSD-DN, the non-overlapping ratio was applied to measure the changes in feature ratios during the process of disease development and to construct dynamic networks. Dynamic concentration analysis and network topological structure analysis were performed to extract early warning information. This strategy was applied to the study of time-series lipidomics data from a stepwise hepatocarcinogenesis rat model. A ratio of lyso-phosphatidylcholine (LPC) 18:1/free fatty acid (FFA) 20:5 was identified as the potential biomarker for hepatocellular carcinoma (HCC). It can be used to classify HCC and non-HCC rats, and the area under the curve values in the discovery and external validation sets were 0.980 and 0.972, respectively. This strategy was also compared with a weighted relative difference accumulation algorithm (wRDA), multivariate empirical Bayes statistics (MEBA) and support vector machine-recursive feature elimination (SVM-RFE). The better performance of ATSD-DN suggests its potential for a more complete presentation of time-series changes and effective extraction of early warning information.
Collapse
|
24
|
Smita S, Lange F, Wolkenhauer O, Köhling R. Deciphering hallmark processes of aging from interaction networks. Biochim Biophys Acta Gen Subj 2016; 1860:2706-15. [PMID: 27456767 DOI: 10.1016/j.bbagen.2016.07.017] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Revised: 07/18/2016] [Accepted: 07/20/2016] [Indexed: 12/18/2022]
Abstract
BACKGROUND Aging is broadly considered to be a dynamic process that accumulates unfavourable structural and functional changes in a time dependent fashion, leading to a progressive loss of physiological integrity of an organism, which eventually leads to age-related diseases and finally to death. SCOPE OF REVIEW The majority of aging-related studies are based on reductionist approaches, focusing on single genes/proteins or on individual pathways without considering possible interactions between them. Over the last few decades, several such genes/proteins were independently analysed and linked to a role that is affecting the longevity of an organism. However, an isolated analysis on genes and proteins largely fails to explain the mechanistic insight of a complex phenotype due to the involvement and integration of multiple factors. MAJOR CONCLUSIONS Technological advance makes it possible to generate high-throughput temporal and spatial data that provide an opportunity to use computer-based methods. These techniques allow us to go beyond reductionist approaches to analyse large-scale networks that provide deeper understanding of the processes that drive aging. GENERAL SIGNIFICANCE In this review, we focus on systems biology approaches, based on network inference methods to understand the dynamics of hallmark processes leading to aging phenotypes. We also describe computational methods for the interpretation and identification of important molecular hubs involved in the mechanistic linkage between aging related processes. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang.
Collapse
Affiliation(s)
- Suchi Smita
- Department of Systems Biology & Bioinformatics, University of Rostock, Rostock, Germany; Oscar-Langendorff-Institute of Physiology, Rostock University Medical Center, Rostock, Germany.
| | - Falko Lange
- Oscar-Langendorff-Institute of Physiology, Rostock University Medical Center, Rostock, Germany.
| | - Olaf Wolkenhauer
- Department of Systems Biology & Bioinformatics, University of Rostock, Rostock, Germany; Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch, South Africa.
| | - Rüdiger Köhling
- Oscar-Langendorff-Institute of Physiology, Rostock University Medical Center, Rostock, Germany.
| |
Collapse
|
25
|
Vargas JE, Porto BN, Puga R, Stein RT, Pitrez PM. Identifying a biomarker network for corticosteroid resistance in asthma from bronchoalveolar lavage samples. Mol Biol Rep 2016; 43:697-710. [PMID: 27188427 DOI: 10.1007/s11033-016-4007-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 05/10/2016] [Indexed: 12/12/2022]
Abstract
Corticosteroid resistance (CR) is a major barrier to the effective treatment of severe asthma. Hence, a better understanding of the molecular mechanisms involved in this condition is a priority. Network analysis is an emerging strategy to explore this complex heterogeneous disorder at system level to identify a small own network for CR in asthma. Gene expression profile of GSE7368 from bronchoalveolar lavage (BAL) of CR in subjects with asthma was downloaded from the gene expression omnibus (GEO) database and compared to BAL of corticosteroid-sensitive (CS) patients. DEGs were identified by the Limma package in R language. In addition, DEGs were mapped to STRING to acquire protein-protein interaction (PPI) pairs. Topological properties of PPI network were calculated by Centiscape, ClusterOne and BINGO. Subsequently, text-mining tools were applied to design one own cell signalling for CR in asthma. Thirty-five PPI networks were obtained; including a major network consisted of 370 nodes, connected by 777 edges. After topological analysis, a minor PPI network composed by 48 nodes was indentified, which is composed by most relevant nodes of major PPI network. In this subnetwork, several receptors (EGFR, EGR1, ESR2, PGR), transcription factors (MYC, JAK), cytokines (IL8, IL6, IL1B), one chemokine (CXCL1), one kinase (SRC) and one cyclooxygenase (PTGS2) were described to be associated with inflammatory environment and steroid resistance in asthma. We suggest a biomarker network composed by 48 nodes that could be potentially explored with diagnostic or therapeutic use.
Collapse
Affiliation(s)
- José Eduardo Vargas
- Centro Infant - Pontifical Catholic University of Rio Grande do Sul - PUCRS, Av. Ipiranga, 6681, Porto Alegre, RS, 91501-970, Brazil.
| | - Bárbara Nery Porto
- Centro Infant - Pontifical Catholic University of Rio Grande do Sul - PUCRS, Av. Ipiranga, 6681, Porto Alegre, RS, 91501-970, Brazil
| | - Renato Puga
- Clinical Research Center, Hospital Israelita Albert Einstein- HIAE, São Paulo, Brazil
| | - Renato Tetelbom Stein
- Centro Infant - Pontifical Catholic University of Rio Grande do Sul - PUCRS, Av. Ipiranga, 6681, Porto Alegre, RS, 91501-970, Brazil
| | - Paulo Márcio Pitrez
- Centro Infant - Pontifical Catholic University of Rio Grande do Sul - PUCRS, Av. Ipiranga, 6681, Porto Alegre, RS, 91501-970, Brazil
| |
Collapse
|
26
|
Abstract
As our knowledge of the mechanisms underlying cancer development and progression has increased, so too have more effective, less toxic, and targeted therapies begun to reach the clinic. However, the full impact of these clinical advances and the practical success of the emerging field of precision medicine are dependent on the discovery and validation of sensitive and accurate biomarkers that can enable appropriate and rigorous sample type and patient selection, reliable longitudinal monitoring of therapeutic efficacy, and even risk assessment and early detection. Within the context of this review, we examine state-of-the-art approaches to the discovery and validation of noninvasive cancer biomarkers, with a specific emphasis on those that are protein or protein-associated ones. We also review sample selection strategies, currently utilized proteomic approaches for both discovery and validation requirements, and data analysis standards. Finally, we provide examples of these elements of biomarker discovery and validation from our own biomarker research.
Collapse
|
27
|
Kim M, Hwang D. Network-Based Protein Biomarker Discovery Platforms. Genomics Inform 2016; 14:2-11. [PMID: 27103885 PMCID: PMC4838525 DOI: 10.5808/gi.2016.14.1.2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 01/06/2016] [Accepted: 01/07/2016] [Indexed: 02/06/2023] Open
Abstract
The advances in mass spectrometry-based proteomics technologies have enabled the generation of global proteome data from tissue or body fluid samples collected from a broad spectrum of human diseases. Comparative proteomic analysis of global proteome data identifies and prioritizes the proteins showing altered abundances, called differentially expressed proteins (DEPs), in disease samples, compared to control samples. Protein biomarker candidates that can serve as indicators of disease states are then selected as key molecules among these proteins. Recently, it has been addressed that cellular pathways can provide better indications of disease states than individual molecules and also network analysis of the DEPs enables effective identification of cellular pathways altered in disease conditions and key molecules representing the altered cellular pathways. Accordingly, a number of network-based approaches to identify disease-related pathways and representative molecules of such pathways have been developed. In this review, we summarize analytical platforms for network-based protein biomarker discovery and key components in the platforms.
Collapse
Affiliation(s)
- Minhyung Kim
- Department of New Biology and Center for Plant Aging Research, Institute for Basic Science, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Korea
| | - Daehee Hwang
- Department of New Biology and Center for Plant Aging Research, Institute for Basic Science, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Korea
| |
Collapse
|
28
|
Hsu MK, Pan CL, Chen FC. Functional divergence and convergence between the transcript network and gene network in lung adenocarcinoma. Onco Targets Ther 2016; 9:335-47. [PMID: 26834492 PMCID: PMC4716766 DOI: 10.2147/ott.s94897] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
INTRODUCTION Alternative RNA splicing is a critical regulatory mechanism during tumorigenesis. However, previous oncological studies mainly focused on the splicing of individual genes. Whether and how transcript isoforms are coordinated to affect cellular functions remain underexplored. Also of great interest is how the splicing regulome cooperates with the transcription regulome to facilitate tumorigenesis. The answers to these questions are of fundamental importance to cancer biology. RESULTS Here, we report a comparative study between the transcript-based network (TN) and the gene-based network (GN) derived from the transcriptomes of paired tumor-normal tissues from 77 lung adenocarcinoma patients. We demonstrate that the two networks differ significantly from each other in terms of patient clustering and the number and functions of network modules. Interestingly, the majority (89.5%) of multi-transcript genes have their transcript isoforms distributed in at least two TN modules, suggesting regulatory and functional divergences between transcript isoforms. Furthermore, TN and GN modules share onlŷ50%-60% of their biological functions. TN thus appears to constitute a regulatory layer separate from GN. Nevertheless, our results indicate that functional convergence and divergence both occur between TN and GN, implying complex interactions between the two regulatory layers. Finally, we report that the expression profiles of module members in both TN and GN shift dramatically yet concordantly during tumorigenesis. The mechanisms underlying this coordinated shifting remain unclear yet are worth further explorations. CONCLUSION We show that in lung adenocarcinoma, transcript isoforms per se are coordinately regulated to conduct biological functions not conveyed by the network of genes. However, the two networks may interact closely with each other by sharing the same or related biological functions. Unraveling the effects and mechanisms of such interactions will significantly advance our understanding of this deadly disease.
Collapse
Affiliation(s)
- Min-Kung Hsu
- Department of Biological Science and Technology, National Chiao-Tung University, Hsinchu, Taiwan
| | - Chia-Lin Pan
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| | - Feng-Chi Chen
- Department of Biological Science and Technology, National Chiao-Tung University, Hsinchu, Taiwan; Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan; School of Dentistry, China Medical University, Taichung, Taiwan
| |
Collapse
|
29
|
Jeanquartier F, Jean-Quartier C, Kotlyar M, Tokar T, Hauschild AC, Jurisica I, Holzinger A. Machine Learning for In Silico Modeling of Tumor Growth. LECTURE NOTES IN COMPUTER SCIENCE 2016. [DOI: 10.1007/978-3-319-50478-0_21] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
30
|
Wanichthanarak K, Fahrmann JF, Grapov D. Genomic, Proteomic, and Metabolomic Data Integration Strategies. Biomark Insights 2015; 10:1-6. [PMID: 26396492 PMCID: PMC4562606 DOI: 10.4137/bmi.s29511] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Revised: 07/21/2015] [Accepted: 07/22/2015] [Indexed: 12/31/2022] Open
Abstract
Robust interpretation of experimental results measuring discreet biological domains remains a significant challenge in the face of complex biochemical regulation processes such as organismal versus tissue versus cellular metabolism, epigenetics, and protein post-translational modification. Integration of analyses carried out across multiple measurement or omic platforms is an emerging approach to help address these challenges. This review focuses on select methods and tools for the integration of metabolomic with genomic and proteomic data using a variety of approaches including biochemical pathway-, ontology-, network-, and empirical-correlation-based methods.
Collapse
|