1
|
The PDGF Family Is Associated with Activated Tumor Stroma and Poor Prognosis in Ovarian Cancer. DISEASE MARKERS 2022; 2022:5940049. [PMID: 36199822 PMCID: PMC9529473 DOI: 10.1155/2022/5940049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 08/01/2022] [Indexed: 12/04/2022]
Abstract
The initiation and progression of cancer depend on the genetic alterations inherent in cancer cells, coupled with the mutual interplay of cancer cells with the surrounding tumor stroma. The platelet-derived growth factor (PDGF) family, as a mesenchymal growth factor, was involved in tumor progression by affecting the surrounding tumor stroma in some cancer types. However, the association of the PDGF family with the ovarian cancer stroma remains elusive. In our study, we first explored the expression pattern of the PDGF family using RNA expression profiles from public databases. We found that the PDGF family was highly expressed in tumor stroma compared with the corresponding epithelial components of ovarian cancer. In particular, PDGF receptors were weakly expressed in ovarian cancer tissues compared with the respective normal tissues; even in tumor mass, PDGF receptors were predominantly expressed by tumor stroma rather than ovarian cancer cells. Importantly, functional enrichment analyses and correlation analyses revealed that the PDGF family was strongly associated with activated stromal scores in ovarian cancer, including higher stromal scores, enriched pathways related to the extracellular matrix (ECM) organization and remodeling, elevated cancer-associated fibroblasts (CAFs) infiltration, and increased tumor-associated macrophages (TAMs) infiltration, especially macrophage M2. Besides, the positive correlations of the PDGF family with CAFs infiltration and macrophage M2 infiltration were observed in other various cancer types. Of note, the PDGF family was also involved in tumor progression-related pathways, such as transforming growth factor β (TGF-β) signaling, epithelial-mesenchymal transition (EMT), angiogenesis, and phosphatidylinositol 3-kinase-Akt (PI3K-Akt) signaling. Higher expressions of PDGF receptors were also observed in ovarian cancer patients with venous or lymphatic invasion. Furthermore, we uncovered the prognostic prediction of the PDGF family in ovarian cancer and constructed a PDGF family-based risk prognosis model with a hazard ratio of 1.932 (95%confidence interval (CI) = 1.27–2.95) and P value < 0.01 (AUC = 0.782, 0.752 for 1 year and 2 years, respectively). Taken together, we demonstrated that ovarian cancers with high PDGF family expression biologically exhibit malignant progression behaviors as well as poor clinical survival, which is attributed to the activated tumor stroma in ovarian cancer.
Collapse
|
2
|
Wu G, Li X, Guo W, Wei Z, Hu T, Shan Y, Gu J. JEBIN: analyzing gene co-expressions across multiple datasets by joint network embedding. Brief Bioinform 2022; 23:6519533. [PMID: 35134135 DOI: 10.1093/bib/bbab603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 12/15/2021] [Accepted: 12/27/2021] [Indexed: 11/13/2022] Open
Abstract
The inference of gene co-expression associations is one of the fundamental tasks for large-scale transcriptomic data analysis. Due to the high dimensionality and high noises in transcriptomic data, it is difficult to infer stable gene co-expression associations from single dataset. Meta-analysis of multisource data can effectively tackle this problem. We proposed Joint Embedding of multiple BIpartite Networks (JEBIN) to learn the low-dimensional consensus representation for genes by integrating multiple expression datasets. JEBIN infers gene co-expression associations in a nonlinear and global similarity manner and can integrate datasets with different distributions in linear time complexity with the gene and total sample size. The effectiveness and scalability of JEBIN were verified by simulation experiments, and its superiority over the commonly used integration methods was proved by three indexes on real biological datasets. Then, JEBIN was applied to study the gene co-expression patterns of hepatocellular carcinoma (HCC) based on multiple expression datasets of HCC and adjacent normal tissues, and further on latest HCC single-cell RNA-seq data. Results show that gene co-expressions are highly different between bulk and single-cell datasets. Finally, many differentially co-expressed ligand-receptor pairs were discovered by comparing HCC with adjacent normal data, providing candidate HCC targets for abnormal cell-cell communications.
Collapse
Affiliation(s)
- Guiying Wu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Xiangyu Li
- School of Software Engineering, Beijing Jiaotong University, Beijing 100044, China
| | - Wenbo Guo
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Zheng Wei
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Tao Hu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Yiran Shan
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Jin Gu
- MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division, Department of Automation, Tsinghua University, Beijing 100084, China
| |
Collapse
|
3
|
Gondal MN, Chaudhary SU. Navigating Multi-Scale Cancer Systems Biology Towards Model-Driven Clinical Oncology and Its Applications in Personalized Therapeutics. Front Oncol 2021; 11:712505. [PMID: 34900668 PMCID: PMC8652070 DOI: 10.3389/fonc.2021.712505] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 10/26/2021] [Indexed: 12/19/2022] Open
Abstract
Rapid advancements in high-throughput omics technologies and experimental protocols have led to the generation of vast amounts of scale-specific biomolecular data on cancer that now populates several online databases and resources. Cancer systems biology models built using this data have the potential to provide specific insights into complex multifactorial aberrations underpinning tumor initiation, development, and metastasis. Furthermore, the annotation of these single- and multi-scale models with patient data can additionally assist in designing personalized therapeutic interventions as well as aid in clinical decision-making. Here, we have systematically reviewed the emergence and evolution of (i) repositories with scale-specific and multi-scale biomolecular cancer data, (ii) systems biology models developed using this data, (iii) associated simulation software for the development of personalized cancer therapeutics, and (iv) translational attempts to pipeline multi-scale panomics data for data-driven in silico clinical oncology. The review concludes that the absence of a generic, zero-code, panomics-based multi-scale modeling pipeline and associated software framework, impedes the development and seamless deployment of personalized in silico multi-scale models in clinical settings.
Collapse
Affiliation(s)
- Mahnoor Naseer Gondal
- Biomedical Informatics Research Laboratory, Department of Biology, Syed Babar Ali School of Science and Engineering, Lahore University of Management Sciences, Lahore, Pakistan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Safee Ullah Chaudhary
- Biomedical Informatics Research Laboratory, Department of Biology, Syed Babar Ali School of Science and Engineering, Lahore University of Management Sciences, Lahore, Pakistan
| |
Collapse
|
4
|
Cantini L, Kairov U, de Reyniès A, Barillot E, Radvanyi F, Zinovyev A. Assessing reproducibility of matrix factorization methods in independent transcriptomes. Bioinformatics 2020; 35:4307-4313. [PMID: 30938767 PMCID: PMC6821374 DOI: 10.1093/bioinformatics/btz225] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Revised: 03/20/2019] [Accepted: 04/01/2019] [Indexed: 12/26/2022] Open
Abstract
Motivation Matrix factorization (MF) methods are widely used in order to reduce dimensionality of transcriptomic datasets to the action of few hidden factors (metagenes). MF algorithms have never been compared based on the between-datasets reproducibility of their outputs in similar independent datasets. Lack of this knowledge might have a crucial impact when generalizing the predictions made in a study to others. Results We systematically test widely used MF methods on several transcriptomic datasets collected from the same cancer type (14 colorectal, 8 breast and 4 ovarian cancer transcriptomic datasets). Inspired by concepts of evolutionary bioinformatics, we design a novel framework based on Reciprocally Best Hit (RBH) graphs in order to benchmark the MF methods for their ability to produce generalizable components. We show that a particular protocol of application of independent component analysis (ICA), accompanied by a stabilization procedure, leads to a significant increase in the between-datasets reproducibility. Moreover, we show that the signals detected through this method are systematically more interpretable than those of other standard methods. We developed a user-friendly tool for performing the Stabilized ICA-based RBH meta-analysis. We apply this methodology to the study of colorectal cancer (CRC) for which 14 independent transcriptomic datasets can be collected. The resulting RBH graph maps the landscape of interconnected factors associated to biological processes or to technological artifacts. These factors can be used as clinical biomarkers or robust and tumor-type specific transcriptomic signatures of tumoral cells or tumoral microenvironment. Their intensities in different samples shed light on the mechanistic basis of CRC molecular subtyping. Availability and implementation The RBH construction tool is available from http://goo.gl/DzpwYp Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Laura Cantini
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM U900, F-75005 Paris, France.,CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, F-75006 Paris, France.,Computational Systems Biology Team, Institut de Biologie de l'École Normale Supérieure, CNRS UMR8197, INSERM U1024, École Normale Supérieure, PSL Research University, Paris, France
| | - Ulykbek Kairov
- Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan
| | - Aurélien de Reyniès
- Programme Cartes d'Identité des Tumeurs (CIT), Ligue Nationale Contre le Cancer, Paris, France
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM U900, F-75005 Paris, France.,CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, F-75006 Paris, France
| | - François Radvanyi
- Institut Curie, PSL Research University, CNRS, UMR144, Equipe Labellisée Ligue Contre le Cancer, Paris, France.,Sorbonne Universités, UPMC Université Paris 06, CNRS, UMR144, Paris
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France.,INSERM U900, F-75005 Paris, France.,CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, F-75006 Paris, France.,Lobachevsky University, Nizhny Novgorod, Russia
| |
Collapse
|
5
|
Robinson JM, Turkington S, Abey SA, Kenea N, Henderson WA. Differential gene expression and gene-set enrichment analysis in Caco-2 monolayers during a 30-day timeline with Dexamethasone exposure. Tissue Barriers 2019; 7:e1651597. [PMID: 31438773 PMCID: PMC6748367 DOI: 10.1080/21688370.2019.1651597] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Glucocorticoid hormones affect gene expression via activation of glucocorticoid receptor NR3C1, causing modulation of inflammation and autoimmune activation. The glucocorticoid Dexamethasone is an important pharmaceutical for the treatment of colitis and other inflammatory bowel diseases. While suppressive effects of glucocorticoids on activated immune cells is significant, their effects upon epithelial cells are less well studied. Previous research shows that the effects of Dexamethasone treatment on polarized Caco-2 cell layer permeability is delayed for >10 treatment days (as measured by transepithelial electrical resistance). In vivo intestinal epithelial cells turn over every 3–5 days; we therefore hypothesized that culture age may produce marked effects on gene expression, potentially acting as a confounding variable. To investigate this issue, we cultured polarized Caco-2 monolayers during a 30-day timecourse with ~15 days of continuous Dexamethasone exposure. We collected samples during the timecourse and tested differential expression using a 250-plex gene expression panel and Nanostring nCounter® system. Our custom panel was selectively enriched for KEGG annotations for tight-junction, actin cytoskeleton regulation, and colorectal cancer-associated genes, allowing for focused gene ontology-based pathway enrichment analyses. To test for confounding effects of time and Dexamethasone variables, we used the Nanostring nSolver differential expression data model which includes a mixturenegative binomial modelwith optimization. We identified a time-associated “EMT-like” signature with differential expression seen in important actomyosin cytoskeleton, tight junction, integrin, and cell cycle pathway genes. Dexamethasone treatment resulted in a subtle yet significant counter-signal showing suppression of actomyosin genes and differential expression of various growth factor receptors.
Collapse
Affiliation(s)
- J M Robinson
- Digestive Disorder Unit, Biobehavioral Branch, Division of Intramural Research, National Institute of Nursing Research (NINR), NIH, DHHS , Bethesda , MD , USA
| | - S Turkington
- Digestive Disorder Unit, Biobehavioral Branch, Division of Intramural Research, National Institute of Nursing Research (NINR), NIH, DHHS , Bethesda , MD , USA
| | - S A Abey
- Digestive Disorder Unit, Biobehavioral Branch, Division of Intramural Research, National Institute of Nursing Research (NINR), NIH, DHHS , Bethesda , MD , USA
| | - N Kenea
- Digestive Disorder Unit, Biobehavioral Branch, Division of Intramural Research, National Institute of Nursing Research (NINR), NIH, DHHS , Bethesda , MD , USA
| | - W A Henderson
- Digestive Disorder Unit, Biobehavioral Branch, Division of Intramural Research, National Institute of Nursing Research (NINR), NIH, DHHS , Bethesda , MD , USA
| |
Collapse
|
6
|
Dhruba SR, Rahman R, Matlock K, Ghosh S, Pal R. Application of transfer learning for cancer drug sensitivity prediction. BMC Bioinformatics 2018; 19:497. [PMID: 30591023 PMCID: PMC6309077 DOI: 10.1186/s12859-018-2465-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Background In precision medicine, scarcity of suitable biological data often hinders the design of an appropriate predictive model. In this regard, large scale pharmacogenomics studies, like CCLE and GDSC hold the promise to mitigate the issue. However, one cannot directly employ data from multiple sources together due to the existing distribution shift in data. One way to solve this problem is to utilize the transfer learning methodologies tailored to fit in this specific context. Results In this paper, we present two novel approaches for incorporating information from a secondary database for improving the prediction in a target database. The first approach is based on latent variable cost optimization and the second approach considers polynomial mapping between the two databases. Utilizing CCLE and GDSC databases, we illustrate that the proposed approaches accomplish a better prediction of drug sensitivities for different scenarios as compared to the existing approaches. Conclusion We have compared the performance of the proposed predictive models with database-specific individual models as well as existing transfer learning approaches. We note that our proposed approaches exhibit superior performance compared to the abovementioned alternative techniques for predicting sensitivity for different anti-cancer compounds, particularly the nonlinear mapping model shows the best overall performance. Electronic supplementary material The online version of this article (10.1186/s12859-018-2465-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Saugato Rahman Dhruba
- Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, 79409, TX, USA
| | - Raziur Rahman
- Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, 79409, TX, USA
| | - Kevin Matlock
- Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, 79409, TX, USA
| | - Souparno Ghosh
- Department of Mathematics and Statistics, Texas Tech University, 1108 Memorial Circle, Lubbock, 79409, TX, USA
| | - Ranadip Pal
- Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, 79409, TX, USA.
| |
Collapse
|
7
|
Dhruba SR, Rahmanl R, Matlockl K, Ghosh S, Pal R. Dimensionality Reduction based Transfer Learning applied to Pharmacogenomics Databases. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2018; 2018:1246-1249. [PMID: 30440616 DOI: 10.1109/embc.2018.8512457] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Recent years have observed a number of Pharmacogenomics databases being published that enable testing of various predictive modeling techniques for personalized therapy applications. However, the consistencies between the databases are usually limited in spite of having significant number of common cell lines and drugs. In this article, we consider the problem of whether we can use the model learned from one secondary database to improve the prediction for the other target database. We illustrate using two pharmacogenomics databases that representing the databases using common basis vectors can improve prediction performance as compared to the naive application of a model trained on one database to another. We also elucidate the robustness of using PCA based basis vectors for scenarios with low correlated input features.
Collapse
|
8
|
Lee SI, Celik S, Logsdon BA, Lundberg SM, Martins TJ, Oehler VG, Estey EH, Miller CP, Chien S, Dai J, Saxena A, Blau CA, Becker PS. A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia. Nat Commun 2018; 9:42. [PMID: 29298978 PMCID: PMC5752671 DOI: 10.1038/s41467-017-02465-5] [Citation(s) in RCA: 120] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 11/30/2017] [Indexed: 02/06/2023] Open
Abstract
Cancers that appear pathologically similar often respond differently to the same drug regimens. Methods to better match patients to drugs are in high demand. We demonstrate a promising approach to identify robust molecular markers for targeted treatment of acute myeloid leukemia (AML) by introducing: data from 30 AML patients including genome-wide gene expression profiles and in vitro sensitivity to 160 chemotherapy drugs, a computational method to identify reliable gene expression markers for drug sensitivity by incorporating multi-omic prior information relevant to each gene’s potential to drive cancer. We show that our method outperforms several state-of-the-art approaches in identifying molecular markers replicated in validation data and predicting drug sensitivity accurately. Finally, we identify SMARCA4 as a marker and driver of sensitivity to topoisomerase II inhibitors, mitoxantrone, and etoposide, in AML by showing that cell lines transduced to have high SMARCA4 expression reveal dramatically increased sensitivity to these agents. Identification of markers of drug response is essential for precision therapy. Here the authors introduce an algorithm that uses prior information about each gene’s importance in AML to identify the most predictive gene-drug associations from transcriptome and drug response data from 30 AML samples.
Collapse
Affiliation(s)
- Su-In Lee
- Paul G. Allen School of Computer Science and Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA, 98195, USA. .,Department of Genome Sciences, University of Washington, 3720 15th Ave NE, Seattle, WA, 98195, USA. .,Center for Cancer Innovation, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA.
| | - Safiye Celik
- Paul G. Allen School of Computer Science and Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA, 98195, USA
| | | | - Scott M Lundberg
- Paul G. Allen School of Computer Science and Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA, 98195, USA
| | - Timothy J Martins
- Quellos High Throughput Screening Core, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Vivian G Oehler
- Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA.,Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Elihu H Estey
- Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA.,Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Chris P Miller
- Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Sylvia Chien
- Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Jin Dai
- Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Akanksha Saxena
- Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - C Anthony Blau
- Center for Cancer Innovation, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA.,Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| | - Pamela S Becker
- Center for Cancer Innovation, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA.,Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA.,Division of Hematology, Department of Medicine and Institute for Stem Cell and Regenerative Medicine, University of Washington, 850 Republican Street, Seattle, WA, 98109, USA
| |
Collapse
|
9
|
Wang JY, Chen LL, Zhou XH. Identifying prognostic signature in ovarian cancer using DirGenerank. Oncotarget 2017; 8:46398-46413. [PMID: 28615526 PMCID: PMC5542276 DOI: 10.18632/oncotarget.18189] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 04/26/2017] [Indexed: 12/27/2022] Open
Abstract
Identifying the prognostic genes in cancer is essential not only for the treatment of cancer patients, but also for drug discovery. However, it's still a big challenge to select the prognostic genes that can distinguish the risk of cancer patients across various data sets because of tumor heterogeneity. In this situation, the selected genes whose expression levels are statistically related to prognostic risks may be passengers. In this paper, based on gene expression data and prognostic data of ovarian cancer patients, we used conditional mutual information to construct gene dependency network in which the nodes (genes) with more out-degrees have more chances to be the modulators of cancer prognosis. After that, we proposed DirGenerank (Generank in direct netowrk) algorithm, which concerns both the gene dependency network and genes' correlations to prognostic risks, to identify the gene signature that can predict the prognostic risks of ovarian cancer patients. Using ovarian cancer data set from TCGA (The Cancer Genome Atlas) as training data set, 40 genes with the highest importance were selected as prognostic signature. Survival analysis of these patients divided by the prognostic signature in testing data set and four independent data sets showed the signature can distinguish the prognostic risks of cancer patients significantly. Enrichment analysis of the signature with curated cancer genes and the drugs selected by CMAP showed the genes in the signature may be drug targets for therapy. In summary, we have proposed a useful pipeline to identify prognostic genes of cancer patients.
Collapse
Affiliation(s)
- Jian-Yong Wang
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, P.R. China
| | - Ling-Ling Chen
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, P.R. China
| | - Xiong-Hui Zhou
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, P.R. China
| |
Collapse
|
10
|
Comprehensive Cross-Population Analysis of High-Grade Serous Ovarian Cancer Supports No More Than Three Subtypes. G3-GENES GENOMES GENETICS 2016; 6:4097-4103. [PMID: 27729437 PMCID: PMC5144978 DOI: 10.1534/g3.116.033514] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Four gene expression subtypes of high-grade serous ovarian cancer (HGSC) have been previously described. In these early studies, a fraction of samples that did not fit well into the four subtype classifications were excluded. Therefore, we sought to systematically determine the concordance of transcriptomic HGSC subtypes across populations without removing any samples. We created a bioinformatics pipeline to independently cluster the five largest mRNA expression datasets using k-means and nonnegative matrix factorization (NMF). We summarized differential expression patterns to compare clusters across studies. While previous studies reported four subtypes, our cross-population comparison does not support four. Because these results contrast with previous reports, we attempted to reproduce analyses performed in those studies. Our results suggest that early results favoring four subtypes may have been driven by the inclusion of serous borderline tumors. In summary, our analysis suggests that either two or three, but not four, gene expression subtypes are most consistent across datasets.
Collapse
|