1
|
Wang X, Wang J, Raza SHA, Deng J, Ma J, Qu X, Yu S, Zhang D, Alshammari AM, Almohaimeed HM, Zan L. Identification of the hub genes related to adipose tissue metabolism of bovine. Front Vet Sci 2022; 9:1014286. [PMID: 36439361 PMCID: PMC9682410 DOI: 10.3389/fvets.2022.1014286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/16/2022] [Indexed: 11/11/2022] Open
Abstract
Due to the demand for high-quality animal protein, there has been consistent interest in how to obtain more high-quality beef. As well-known, the adipose content of beef has a close connection with the taste and quality of beef, and cattle with different energy or protein diet have corresponding effects on the lipid metabolism of beef. Thus, we performed weighted gene co-expression network analysis (WGCNA) with subcutaneous adipose genes from Norwegian red heifers fed different diets to identify hub genes regulating bovine lipid metabolism. For this purpose, the RNA sequencing data of subcutaneous adipose tissue of 12-month-old Norwegian red heifers (n = 48) with different energy or protein levels were selected from the GEO database, and 7,630 genes with the largest variation were selected for WGCNA analysis. Then, three modules were selected as hub genes candidate modules according to the correlation between modules and phenotypes, including pink, magenta and grey60 modules. GO and KEGG enrichment analysis showed that genes were related to metabolism, and participated in Rap, MAPK, AMPK, VEGF signaling pathways, and so forth. Combined gene interaction network analysis using Cytoscape software, eight hub genes of lipid metabolism were identified, including TIA1, LOC516108, SNAPC4, CPSF2, ZNF574, CLASRP, MED15 and U2AF2. Further, the expression levels of hub genes in the cattle tissue were also measured to verify the results, and we found hub genes in higher expression in muscle and adipose tissue in adult cattle. In summary, we predicted the key genes of lipid metabolism in the subcutaneous adipose tissue that were affected by the intake of various energy diets to find the hub genes that coordinate lipid metabolism, which provide a theoretical basis for regulating beef quality.
Collapse
Affiliation(s)
- Xiaohui Wang
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Jianfang Wang
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | | | - Jiahan Deng
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Jing Ma
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Xiaopeng Qu
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Shengchen Yu
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Dianqi Zhang
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | | | - Hailah M. Almohaimeed
- Department of Basic Science, College of Medicine, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Linsen Zan
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
- National Beef Cattle Improvement Center, Northwest A&F University, Xianyang, China
- *Correspondence: Linsen Zan
| |
Collapse
|
2
|
Liu Y, Ye X, Yu CY, Shao W, Hou J, Feng W, Zhang J, Huang K. TPSC: a module detection method based on topology potential and spectral clustering in weighted networks and its application in gene co-expression module discovery. BMC Bioinformatics 2021; 22:111. [PMID: 34689740 PMCID: PMC8543836 DOI: 10.1186/s12859-021-03964-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 01/08/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene co-expression networks are widely studied in the biomedical field, with algorithms such as WGCNA and lmQCM having been developed to detect co-expressed modules. However, these algorithms have limitations such as insufficient granularity and unbalanced module size, which prevent full acquisition of knowledge from data mining. In addition, it is difficult to incorporate prior knowledge in current co-expression module detection algorithms. RESULTS In this paper, we propose a novel module detection algorithm based on topology potential and spectral clustering algorithm to detect co-expressed modules in gene co-expression networks. By testing on TCGA data, our novel method can provide more complete coverage of genes, more balanced module size and finer granularity than current methods in detecting modules with significant overall survival difference. In addition, the proposed algorithm can identify modules by incorporating prior knowledge. CONCLUSION In summary, we developed a method to obtain as much as possible information from networks with increased input coverage and the ability to detect more size-balanced and granular modules. In addition, our method can integrate data from different sources. Our proposed method performs better than current methods with complete coverage of input genes and finer granularity. Moreover, this method is designed not only for gene co-expression networks but can also be applied to any general fully connected weighted network.
Collapse
Affiliation(s)
- Yusong Liu
- Collage of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.,Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Xiufen Ye
- Collage of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.
| | - Christina Y Yu
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Wei Shao
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Jie Hou
- Collage of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, Heilongjiang, China
| | - Weixing Feng
- Collage of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, Heilongjiang, China
| | - Jie Zhang
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Kun Huang
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA. .,Regenstrief Institute, Indianapolis, IN, 46202, USA.
| |
Collapse
|
3
|
Reed CJ, Hutinet G, de Crécy-Lagard V. Comparative Genomic Analysis of the DUF34 Protein Family Suggests Role as a Metal Ion Chaperone or Insertase. Biomolecules 2021; 11:1282. [PMID: 34572495 PMCID: PMC8469502 DOI: 10.3390/biom11091282] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/20/2021] [Accepted: 08/24/2021] [Indexed: 12/12/2022] Open
Abstract
Members of the DUF34 (domain of unknown function 34) family, also known as the NIF3 protein superfamily, are ubiquitous across superkingdoms. Proteins of this family have been widely annotated as "GTP cyclohydrolase I type 2" through electronic propagation based on one study. Here, the annotation status of this protein family was examined through a comprehensive literature review and integrative bioinformatic analyses that revealed varied pleiotropic associations and phenotypes. This analysis combined with functional complementation studies strongly challenges the current annotation and suggests that DUF34 family members may serve as metal ion insertases, chaperones, or metallocofactor maturases. This general molecular function could explain how DUF34 subgroups participate in highly diversified pathways such as cell differentiation, metal ion homeostasis, pathogen virulence, redox, and universal stress responses.
Collapse
Affiliation(s)
- Colbie J. Reed
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA; (C.J.R.); (G.H.)
| | - Geoffrey Hutinet
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA; (C.J.R.); (G.H.)
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA; (C.J.R.); (G.H.)
- Genetics Institute, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
4
|
Cao F, Fan Y, Yu Y, Yang G, Zhong H. Dissecting Prognosis Modules and Biomarkers in Glioblastoma Based on Weighted Gene Co-Expression Network Analysis. Cancer Manag Res 2021; 13:5477-5489. [PMID: 34267555 PMCID: PMC8276137 DOI: 10.2147/cmar.s310346] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 06/03/2021] [Indexed: 01/03/2023] Open
Abstract
Introduction As one of the most prevalent and malignant brain cancers, glioblastoma multiforme (GBM) presents a poor prognosis and the molecular mechanisms remain poorly understood. Consequently, molecular research, including various biomarkers, is essential to exploit the occurrence and development of glioma. Methods Weighted gene co-expression network analysis (WGCNA) was used to construct gene co-expression modules and networks based on the Chinese Glioma Genome Atlas (CGGA) glioblastoma specimens. Then, protein–protein interaction (PPI) and gene ontology (GO) analyses were performed to mine hub genes. RT-PCR and immunohistochemistry were employed to examine the expression level of GRPR, CXCL5, and CXCL11 in glioma patients. Results We confirmed two gene modules by protein–protein interaction networks. Functional enrichment analysis was performed to identify the significance of gene modules. Prognostic biomarkers GRPR, CXCL5, and CXCL11 related to the survival time of GBM samples were mined in The Cancer Genome Atlas (TCGA) dataset. qRT-PCR revealed that GRPR, CXCL5, and CXCL11 led to a significant increase in GBM sample compared to control. Conclusion In this study, we developed and confirmed three mRNA signatures (GRPR, CXCL5, and CXCL11) for evaluating overall survival in GBM patients. Our research assists in existing understanding of GBM diagnosis and prognosis.
Collapse
Affiliation(s)
- Fang Cao
- Department of Cerebrovascular Disease, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou, 563000, People's Republic of China
| | - Yinchun Fan
- Department of Cerebrovascular Disease, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou, 563000, People's Republic of China
| | - Yunhu Yu
- Clinical Research Center for Neurological Disease, the People's Hospital of Hong Hua Gang District of ZunYi, Zunyi, 563000, People's Republic of China
| | - Guohua Yang
- Demonstration Center for Experimental Basic Medicine Education of Wuhan University, Wuhan, Hubei, 430071, People's Republic of China
| | - Hua Zhong
- College of Life Sciences, Wuhan University, Wuhan, Hubei, 430072, People's Republic of China
| |
Collapse
|
5
|
Fan F, Yao D, Yan P, Jiang X, Hu J. MicroRNA-744-5p inhibits glioblastoma malignancy by suppressing replication factor C subunit 2. Oncol Lett 2021; 22:608. [PMID: 34188710 PMCID: PMC8227640 DOI: 10.3892/ol.2021.12869] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 04/13/2021] [Indexed: 12/23/2022] Open
Abstract
Glioblastoma (GBM) is the most common malignant primary brain tumor, accounting for ~57% of all gliomas and 48% of all malignant primary central nervous system tumors in the United States. Abnormal expression of the replication factor C subunit 2 (RFC2) gene and microRNA (miR)-744-5p is associated with tumorigenic characteristics, including cellular proliferation, migration and invasiveness. However, the mechanism underlying the interaction between miR-744-5p and RFC2 in GBM remains unknown. Reverse transcription-quantitative (RT-q) PCR analysis of RFC2 and miR-744-5p was performed using GBM tumor tissues and cells, and the association between miR-744-5p and RFC2 was determined by dual-luciferase reporter assay. Cell Counting Kit 8, 5-bromo-2-deoxyuridine (BrdU), wound-healing and cellular adhesion assays, as well as the detection of caspase-3 activity and western blotting were used to detect cellular proliferation, migration and adhesion, caspase-3 activity, and Bax and Bcl-2 protein expression, respectively, in GBM cells. The results of the present study demonstrated that RFC2 expression was increased in GBM tissues and cell lines. Overexpression of RFC2 promoted cellular proliferation, migration, adhesion and an increase in Bcl-2 protein levels, and suppressed cellular caspase-3 activity and Bax protein expression, while silencing RFC2 resulted in the opposite effect. The effects of miR-744-5p inhibition were similar to those of RFC2 overexpression. Moreover, miR-744-5p was found to target RFC2 in GBM cells, and inhibiting the expression of RFC2 suppressed GBM tumorigenesis. In conclusion, the present study demonstrated that miR-744-5p targets RFC2 and suppresses the progression of GBM by repressing cellular proliferation, migration and Bcl-2 protein expression, and effectively promoting caspase-3 activity and Bax protein expression. These findings suggest a new target for the clinical treatment and improved prognosis of patients with GBM in the future.
Collapse
Affiliation(s)
- Fei Fan
- Department of Neurosurgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430022, P.R. China
| | - Dongxiao Yao
- Department of Neurosurgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430022, P.R. China
| | - Pengfei Yan
- Department of Neurosurgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430022, P.R. China
| | - Xiaobing Jiang
- Department of Neurosurgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430022, P.R. China
| | - Jie Hu
- Department of Neurosurgery, General Hospital of the Yangtze River Shipping, Jiangan, Wuhan, Hubei 430010, P.R. China
| |
Collapse
|
6
|
Kałuzińska Ż, Kołat D, Bednarek AK, Płuciennik E. PLEK2, RRM2, GCSH: A Novel WWOX-Dependent Biomarker Triad of Glioblastoma at the Crossroads of Cytoskeleton Reorganization and Metabolism Alterations. Cancers (Basel) 2021; 13:cancers13122955. [PMID: 34204789 PMCID: PMC8231639 DOI: 10.3390/cancers13122955] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 04/30/2021] [Accepted: 06/11/2021] [Indexed: 02/07/2023] Open
Abstract
Glioblastoma is one of the deadliest human cancers. Its malignancy depends on cytoskeleton reorganization, which is related to, e.g., epithelial-to-mesenchymal transition and metastasis. The malignant phenotype of glioblastoma is also affected by the WWOX gene, which is lost in nearly a quarter of gliomas. Although the role of WWOX in the cytoskeleton rearrangement has been found in neural progenitor cells, its function as a modulator of cytoskeleton in gliomas was not investigated. Therefore, this study aimed to investigate the role of WWOX and its collaborators in cytoskeleton dynamics of glioblastoma. Methodology on RNA-seq data integrated the use of databases, bioinformatics tools, web-based platforms, and machine learning algorithm, and the obtained results were validated through microarray data. PLEK2, RRM2, and GCSH were the most relevant WWOX-dependent genes that could serve as novel biomarkers. Other genes important in the context of cytoskeleton (BMP4, CCL11, CUX2, DUSP7, FAM92B, GRIN2B, HOXA1, HOXA10, KIF20A, NF2, SPOCK1, TTR, UHRF1, and WT1), metabolism (MTHFD2), or correlation with WWOX (COL3A1, KIF20A, RNF141, and RXRG) were also discovered. For the first time, we propose that changes in WWOX expression dictate a myriad of alterations that affect both glioblastoma cytoskeleton and metabolism, rendering new therapeutic possibilities.
Collapse
|
7
|
Guo L, Mao L, Lu W, Yang J. Identification of breast cancer prognostic modules via differential module selection based on weighted gene Co-expression network analysis. Biosystems 2020; 199:104317. [PMID: 33279569 DOI: 10.1016/j.biosystems.2020.104317] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 11/30/2020] [Accepted: 11/30/2020] [Indexed: 02/06/2023]
Abstract
Breast cancer is a complex cancer which includes many different subtypes. Identifying prognostic modules, i.e., functionally related gene networks that play crucial roles in cancer development is essential in breast cancer study. Different subtypes of breast cancer correspond to different treatment methods. The purpose of this study is to use a new method to divide breast cancer into different prognostic modules, so as to provide scientific basis for improving clinical management. The method is based on comparing similarities between modules detected from different weighted gene co-expression networks. The method was applied on genomic data of breast cancer from The Cancer Genome Atlas database and was applied to select differential modules between two groups of patients with significant differences in survival times. It was compared with a previously proposed module selection method. The result shows that our method outperforms the previously proposed one. Moreover, within the identified two differential modules, the first one is highly enriched with genes involved in hormone responds, the second one is highly related with biological process engaged in M-phase. The two modules were further validated by log-rank test in the validation dataset GSE3494. Both of the two modules show significantly different with p-values less than 0.02. The identified two modules confirmed previous findings including importance of biological networks in breast cancer involved in hormone response and M-phase. Out of the top twenty hub genes in the two modules, fifteen genes were previously shown to be prognostic markers for breast cancer.
Collapse
Affiliation(s)
- Ling Guo
- Key Laboratory of China's Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, China; College of Electrical Engineering, Northwest Minzu University, Lanzhou, China
| | - Leer Mao
- Key Laboratory of China's Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, China.
| | - WenTing Lu
- College of Electrical Engineering, Northwest Minzu University, Lanzhou, China
| | - Jun Yang
- College of Electrical Engineering, Northwest Minzu University, Lanzhou, China
| |
Collapse
|
8
|
Abstract
BACKGROUND Data mining technology used in the field of medicine has been widely studied by scholars all over the world. But there is little research on medical data mining (MDM) from the perspectives of bibliometrics and visualization, and the research topics and development trends in this field are still unclear. METHODS This paper has applied bibliometric visualization software tools, VOSviewer 1.6.10 and CiteSpace V, to study the citation characteristics, international cooperation, author cooperation, and geographical distribution of the MDM. RESULTS A total of 1575 documents are obtained, and the most frequent document type is article (1376). SHAN NH is the most productive author, with the highest number of publications of 12, and the Gillies's article (750 times citation) is the most cited paper. The most productive country and institution in MDM is the USA (559) and US FDA (35), respectively. The Journal of Biomedical Informatics, Expert Systems with Applications and Journal of Medical Systems are the most productive journals, which reflected the nature of the research, and keywords "classification (790)" and "system (576)" have the strongest strength. The hot topics in MDM are drug discovery, medical imaging, vaccine safety, and so on. The 3 frontier topics are reporting system, precision medicine, and inflammation, and would be the foci of future research. CONCLUSION The present study provides a panoramic view of data mining methods applied in medicine by visualization and bibliometrics. Analysis of authors, journals, institutions, and countries could provide reference for researchers who are fresh to the field in different ways. Researchers may also consider the emerging trends when deciding the direction of their study.
Collapse
Affiliation(s)
- Yuanzhang Hu
- School of Medical Information Engineering, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan
| | - Zeyun Yu
- College of Acupuncture and TuiNa, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Xiaoen Cheng
- School of Medical Information Engineering, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan
| | - Yue Luo
- School of Medical Information Engineering, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan
| | - Chuanbiao Wen
- School of Medical Information Engineering, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan
| |
Collapse
|
9
|
Wang T, Zhang J, Huang K. Generalized gene co-expression analysis via subspace clustering using low-rank representation. BMC Bioinformatics 2019; 20:196. [PMID: 31074376 PMCID: PMC6509871 DOI: 10.1186/s12859-019-2733-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Background Gene Co-expression Network Analysis (GCNA) helps identify gene modules with potential biological functions and has become a popular method in bioinformatics and biomedical research. However, most current GCNA algorithms use correlation to build gene co-expression networks and identify modules with highly correlated genes. There is a need to look beyond correlation and identify gene modules using other similarity measures for finding novel biologically meaningful modules. Results We propose a new generalized gene co-expression analysis algorithm via subspace clustering that can identify biologically meaningful gene co-expression modules with genes that are not all highly correlated. We use low-rank representation to construct gene co-expression networks and local maximal quasi-clique merger to identify gene co-expression modules. We applied our method on three large microarray datasets and a single-cell RNA sequencing dataset. We demonstrate that our method can identify gene modules with different biological functions than current GCNA methods and find gene modules with prognostic values. Conclusions The presented method takes advantage of subspace clustering to generate gene co-expression networks rather than using correlation as the similarity measure between genes. Our generalized GCNA method can provide new insights from gene expression datasets and serve as a complement to current GCNA algorithms.
Collapse
Affiliation(s)
- Tongxin Wang
- Department of Computer Science, Indiana University Bloomington, Bloomington, 47408, IN, USA
| | - Jie Zhang
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, 46202, IN, USA
| | - Kun Huang
- Department of Medicine, Indiana University School of Medicine, Indianapolis, 46202, IN, USA. .,Regenstrief Institute, Indianapolis, 46202, IN, USA.
| |
Collapse
|
10
|
Huang Z, Zhan X, Xiang S, Johnson TS, Helm B, Yu CY, Zhang J, Salama P, Rizkalla M, Han Z, Huang K. SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer. Front Genet 2019; 10:166. [PMID: 30906311 PMCID: PMC6419526 DOI: 10.3389/fgene.2019.00166] [Citation(s) in RCA: 119] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2018] [Accepted: 02/14/2019] [Indexed: 12/22/2022] Open
Abstract
Improved cancer prognosis is a central goal for precision health medicine. Though many models can predict differential survival from data, there is a strong need for sophisticated algorithms that can aggregate and filter relevant predictors from increasingly complex data inputs. In turn, these models should provide deeper insight into which types of data are most relevant to improve prognosis. Deep Learning-based neural networks offer a potential solution for both problems because they are highly flexible and account for data complexity in a non-linear fashion. In this study, we implement Deep Learning-based networks to determine how gene expression data predicts Cox regression survival in breast cancer. We accomplish this through an algorithm called SALMON (Survival Analysis Learning with Multi-Omics Neural Networks), which aggregates and simplifies gene expression data and cancer biomarkers to enable prognosis prediction. The results revealed improved performance when more omics data were used in model construction. Rather than use raw gene expression values as model inputs, we innovatively use eigengene modules from the result of gene co-expression network analysis. The corresponding high impact co-expression modules and other omics data are identified by feature selection technique, then examined by conducting enrichment analysis and exploiting biological functions, escalated the interpretation of input feature from gene level to co-expression modules level. Our study shows the feasibility of discovering breast cancer related co-expression modules, sketch a blueprint of future endeavors on Deep Learning-based survival analysis. SALMON source code is available at https://github.com/huangzhii/SALMON/.
Collapse
Affiliation(s)
- Zhi Huang
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, United States.,Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,Department of Electrical and Computer Engineering, Indiana University-Purdue University Indianapolis, Indianapolis, IN, United States
| | - Xiaohui Zhan
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
| | - Shunian Xiang
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China.,Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, United States
| | - Travis S Johnson
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, OH, United States
| | - Bryan Helm
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States
| | - Christina Y Yu
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, OH, United States
| | - Jie Zhang
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, United States
| | - Paul Salama
- Department of Electrical and Computer Engineering, Indiana University-Purdue University Indianapolis, Indianapolis, IN, United States
| | - Maher Rizkalla
- Department of Electrical and Computer Engineering, Indiana University-Purdue University Indianapolis, Indianapolis, IN, United States
| | - Zhi Han
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,Regenstrief Institute, Indianapolis, IN, United States
| | - Kun Huang
- Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.,Department of Electrical and Computer Engineering, Indiana University-Purdue University Indianapolis, Indianapolis, IN, United States.,Regenstrief Institute, Indianapolis, IN, United States
| |
Collapse
|
11
|
Xu P, Yang J, Liu J, Yang X, Liao J, Yuan F, Xu Y, Liu B, Chen Q. Identification of glioblastoma gene prognosis modules based on weighted gene co-expression network analysis. BMC Med Genomics 2018; 11:96. [PMID: 30382873 PMCID: PMC6211550 DOI: 10.1186/s12920-018-0407-1] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 09/25/2018] [Indexed: 02/03/2023] Open
Abstract
Background Glioblastoma multiforme, the most prevalent and aggressive brain tumour, has a poor prognosis. The molecular mechanisms underlying gliomagenesis remain poorly understood. Therefore, molecular research, including various markers, is necessary to understand the occurrence and development of glioma. Method Weighted gene co-expression network analysis (WGCNA) was performed to construct a gene co-expression network in TCGA glioblastoma samples. Gene ontology (GO) and pathway-enrichment analysis were used to identify significance of gene modules. Cox proportional hazards regression model was used to predict outcome of glioblastoma patients. Results We performed weighted gene co-expression network analysis (WGCNA) and identified a gene module (yellow module) related to the survival time of TCGA glioblastoma samples. Then, 228 hub genes were calculated based on gene significance (GS) and module significance (MS). Four genes (OSMR + SOX21 + MED10 + PTPRN) were selected to construct a Cox proportional hazards regression model with high accuracy (AUC = 0.905). The prognostic value of the Cox proportional hazards regression model was also confirmed in GSE16011 dataset (GBM: n = 156). Conclusion We developed a promising mRNA signature for estimating overall survival in glioblastoma patients. Electronic supplementary material The online version of this article (10.1186/s12920-018-0407-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Pengfei Xu
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Jian Yang
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Junhui Liu
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Xue Yang
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Jianming Liao
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Fanen Yuan
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Yang Xu
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Baohui Liu
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China
| | - Qianxue Chen
- Department of Neurosurgery, Renmin Hospital of Wuhan University, 9 Zhangzhidong Road and 238 Jiefang Road, Wuchang, Wuhan, Hubei, 430060, People's Republic of China.
| |
Collapse
|
12
|
Candidate Biomarkers and Molecular Mechanism Investigation for Glioblastoma Multiforme Utilizing WGCNA. BIOMED RESEARCH INTERNATIONAL 2018; 2018:4246703. [PMID: 30356407 PMCID: PMC6178162 DOI: 10.1155/2018/4246703] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 08/13/2018] [Accepted: 08/29/2018] [Indexed: 12/05/2022]
Abstract
To reveal the potential molecular mechanism of glioblastoma multiforme (GBM) and provide the candidate biomarkers for GBM gene therapy. Microarray dataset GSE50161 was obtained from GEO database. The differentially expressed genes (DEGs) were identified between GBM samples and control samples, followed by the module partition analysis based on WGCNA. Then, the pathway and functional enrichment analyses of DEGs were performed. The hub genes were further investigated, followed by the survival analysis and data validation. A total of 1913 DEGs were investigated between two groups, followed by analysis of 5 modules using WGCNA. These DEGs were mainly enriched in functions like inflammatory response. The hub genes including upregulated N-Myc and STAT Interactor (NMI), Capping Actin Protein-Gelsolin Like (CAPG), and Proteasome Subunit Beta 8 (PSMB8) were revealed as potential liquid biopsy molecules for GBM diagnose. Moreover, Nucleolar and Spindle Associated Protein 1 (NUSAP1) and G Protein-Coupled Receptor 65 (GPR65) were outstanding genes in survival analysis. Our results suggested that CPNE6, HAPLN2, CMTM3, NMI, CAPG, and PSMB8 might be used as potential molecules for liquid biopsy of GBM. NUSAP1 and GPR65 might be novel prognostic targets for GBM gene therapy. Furthermore, the upregulated NMI might play an important role in GBM progression via inflammatory response.
Collapse
|
13
|
Zhang X, Feng H, Li Z, Li D, Liu S, Huang H, Li M. Application of weighted gene co-expression network analysis to identify key modules and hub genes in oral squamous cell carcinoma tumorigenesis. Onco Targets Ther 2018; 11:6001-6021. [PMID: 30275705 PMCID: PMC6157991 DOI: 10.2147/ott.s171791] [Citation(s) in RCA: 97] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Purpose Oral squamous cell carcinoma (OSCC) is one of the most common malignant diseases worldwide, yet its molecular mechanisms are largely unknown. We aimed to construct gene co-expression networks to identify key modules and hub genes involved in the pathogenesis of OSCC. Patients and methods We used dataset GSE30784 to construct co-expression networks by weighted gene co-expression network analysis (WGCNA). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed by Database for Annotation, Visualization and Integrated Discovery (DAVID). Hub genes were screened and validated by other datasets. Results Turquoise and brown modules were found to be the most significantly related to tumorigenesis. Functional enrichment analysis showed that the turquoise module was associated with cell–cell adhesion, extracellular matrix and collagen catabolic process. A total of 10 hub genes (MMP1, TNFRSF12A, PLAU, FSCN1, PDPN, KRT78, EVPL, GGT6, SMIM5 and CYSRT1) were identified and validated at transcriptional and translational levels. Their genetic alteration and survival analysis were also revealed. Conclusion We identified two modules and 10 hub genes, which were associated with the tumorigenesis of OSCC. The two modules provided references that will advance the understanding of mechanisms of tumorigenesis in OSCC. Moreover, the hub genes may serve as biomarkers and therapeutic targets for precise diagnosis and treatment of OSCC in the future.
Collapse
Affiliation(s)
- Xiaoqi Zhang
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Hao Feng
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Ziyu Li
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Dongfang Li
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Shanshan Liu
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Haiyun Huang
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| | - Minqi Li
- Department of Bone Metabolism, School of Stomatology, Shandong University, Shandong Provincial Key Laboratory of Oral Tissue Regeneration, Jinan, China, ;
| |
Collapse
|
14
|
Fan SMY, Tsai CF, Yen CM, Lin MH, Wang WH, Chan CC, Chen CL, Phua KKL, Pan SH, Plikus MV, Yu SL, Chen YJ, Lin SJ. Inducing hair follicle neogenesis with secreted proteins enriched in embryonic skin. Biomaterials 2018; 167:121-131. [PMID: 29567388 PMCID: PMC6050066 DOI: 10.1016/j.biomaterials.2018.03.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Revised: 02/22/2018] [Accepted: 03/02/2018] [Indexed: 12/17/2022]
Abstract
Organ development is a sophisticated process of self-organization. However, despite growing understanding of the developmental mechanisms, little is known about how to reactivate them postnatally for regeneration. We found that treatment of adult non-hair fibroblasts with cell-free extract from embryonic skin conferred upon them the competency to regenerate hair follicles. Proteomics analysis identified three secreted proteins enriched in the embryonic skin, apolipoprotein-A1, galectin-1 and lumican that together were essential and sufficient to induce new hair follicles. These 3 proteins show a stage-specific co-enrichment in the perifolliculogenetic embryonic dermis. Mechanistically, exposure to embryonic skin extract or to the combination of the 3 proteins altered the gene expression to an inductive hair follicle dermal papilla fibroblast-like profile and activated Igf and Wnt signaling, which are crucial for the regeneration process. Therefore, a cocktail of organ-specific extracellular proteins from the embryonic environment can render adult cells competent to re-engage in developmental interactions for organ neogenesis. Identification of factors that recreate the extracellular context of respective developing tissues can become an important strategy to promote regeneration in adult organs.
Collapse
Affiliation(s)
- Sabrina Mai-Yi Fan
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan
| | - Chia-Feng Tsai
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan; Department of Chemistry, National Taiwan University, Taipei, Taiwan
| | - Chien-Mei Yen
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan; Department of Dermatology, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan
| | - Miao-Hsia Lin
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
| | - Wei-Hung Wang
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan
| | - Chih-Chieh Chan
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan; Department of Dermatology, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan
| | - Chih-Lung Chen
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan; Department of Dermatology, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan
| | - Kyle K L Phua
- Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore
| | - Szu-Hua Pan
- Graduate Institute of Medical Genomics and Proteomics, College of Medicine, National Taiwan University, Taipei, Taiwan; Doctoral Degree Program of Translational Medicine, National Taiwan University, Taipei, Taiwan; Genome and Systems Biology Degree Program, National Taiwan University and Academia Sinica, Taipei, Taiwan
| | - Maksim V Plikus
- Department of Developmental and Cell Biology, Sue and Bill Gross Stem Cell Research Center, Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
| | - Sung-Liang Yu
- Department of Clinical Laboratory Sciences and Medical Biotechnology, National Taiwan University College of Medicine, Taipei, Taiwan; Department of Laboratory Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan; Department of Chemistry, National Taiwan University, Taipei, Taiwan.
| | - Sung-Jan Lin
- Institute of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan; Department of Dermatology, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan; Genome and Systems Biology Degree Program, National Taiwan University and Academia Sinica, Taipei, Taiwan; Research Center for Developmental Biology and Regenerative Medicine, National Taiwan University, Taipei, Taiwan; Graduate Institute of Clinical Medicine, College of Medicine, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
15
|
Abstract
Wnt signaling is important for breast development and remodeling during pregnancy and lactation. Epigenetic modifications change expression levels of components of the Wnt pathway, underlying oncogenic transformation. However, no clear Wnt component increasing expression universally across breast cancer (BC) or its most Wnt-dependent triple-negative BC (TNBC) subgroup has been identified, delaying development of targeted therapies. Here we perform network correlation analysis of expression of >100 Wnt pathway components in hundreds of healthy and cancerous breast tissues. Varying in expression levels among people, Wnt components remarkably coordinate their production; this coordination is dramatically decreased in BC. Clusters with coordinated gene expression exist within the healthy cohort, highlighting Wnt signaling subtypes. Different BC subgroups are identified, characterized by different remaining Wnt signaling signatures, providing the rational for patient stratification for personalizing the therapeutic applications. Key pairwise interactions within the Wnt pathway (some inherited and some established de novo) emerge as targets for future drug discovery against BC.
Collapse
|
16
|
Connelly KE, Hedrick V, Paschoal Sobreira TJ, Dykhuizen EC, Aryal UK. Analysis of Human Nuclear Protein Complexes by Quantitative Mass Spectrometry Profiling. Proteomics 2018; 18:e1700427. [PMID: 29655301 DOI: 10.1002/pmic.201700427] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Revised: 03/07/2018] [Indexed: 12/23/2022]
Abstract
Analysis of protein complexes provides insights into how the ensemble of expressed proteome is organized into functional units. While there have been advances in techniques for proteome-wide profiling of cytoplasmic protein complexes, information about human nuclear protein complexes are very limited. To close this gap, we combined native size exclusion chromatography (SEC) with label-free quantitative MS profiling to characterize hundreds of nuclear protein complexes isolated from human glioblastoma multiforme T98G cells. We identified 1794 proteins that overlapped between two biological replicates of which 1244 proteins were characterized as existing within stably associated putative complexes. co-IP experiments confirmed the interaction of PARP1 with Ku70/Ku80 proteins and HDAC1 (histone deacetylase complex 1) and CHD4. HDAC1/2 also co-migrated with various SIN3A and nucleosome remodeling and deacetylase components in SEC fractionation including SIN3A, SAP30, RBBP4, RBBP7, and NCOR1. Co-elution of HDAC1/2/3 with both the KDM1A and RCOR1 further confirmed that these proteins are integral components of human deacetylase complexes. Our approach also demonstrated the ability to identify potential moonlighting complexes and novel complexes containing uncharacterized proteins. Overall, the results demonstrated the utility of SEC fractionation and LC-MS analysis for system-wide profiling of proteins to predict the existence of distinct forms of nuclear protein complexes.
Collapse
Affiliation(s)
- Katelyn E Connelly
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, 201 S. University Street, 47907, West Lafayette, IN, USA
| | - Victoria Hedrick
- Purdue Proteomics Facility, Bindley Biosciences Center, Discovery Park, Purdue University, 1203 W. State Street, 47907, West Lafayette, IN, USA
| | - Tiago Jose Paschoal Sobreira
- Purdue Proteomics Facility, Bindley Biosciences Center, Discovery Park, Purdue University, 1203 W. State Street, 47907, West Lafayette, IN, USA
| | - Emily C Dykhuizen
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, 201 S. University Street, 47907, West Lafayette, IN, USA
| | - Uma K Aryal
- Purdue Proteomics Facility, Bindley Biosciences Center, Discovery Park, Purdue University, 1203 W. State Street, 47907, West Lafayette, IN, USA
| |
Collapse
|
17
|
Zhu XF, Zhu BS, Wu FM, Hu HB. DNA methylation biomarkers for the occurrence of lung adenocarcinoma from TCGA data mining. J Cell Physiol 2018; 233:6777-6784. [PMID: 29667778 DOI: 10.1002/jcp.26531] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 02/02/2018] [Indexed: 01/11/2023]
Abstract
The development of lung cancer is a combination of multifactor, multistage, and multiple genetic alterations processes. DNA methylation is an important factor. Currently, the study on the genome-scale epigenetic modification for studying the pathogenesis of lung cancer is still lacking. Here, we aimed to identify the epigenetic modifications of lung cancer, thus to provide scientific basis for the personalized medicine, and research of classification screening for lung adenocarcinoma patients. The DNA methylation data, and the corresponding clinical information of lung adenocarcinoma samples were extracted from the Cancer Genome Atlas (TCGA) database. We explored the association of DNA methylation and gene transcription expression of lung adenocarcinoma by identifying the differentially expressed genes, DNA methylated locis, functional gene clusters, and the relevant genes associated with the survival. We identified 17 differentially expressed genes which had differentially methylated locis, 4 functional gene clusters regulated by methylation, and 522 genes, which were relevant to the survival time of patients. Our study suggested that methylation controlled the gene expression in a variety of ways, which had high/low expression and hyper-/hypo-methylation. Genes of different methylation status showed the different survival curve. The genes and methylated locis identified in this study could be potential biomarkers and therapeutic targets for lung adenocarcinoma.
Collapse
Affiliation(s)
- Xiao-Feng Zhu
- Department of Cardiothoracic Surgery, Xianning Central Hospital, The First Affiliated Hospital of Hubei University of Science and Technology, Xianning, China
| | - Bi-Sheng Zhu
- Department of Oncology, Xianning Central Hospital, The First Affiliated Hospital of Hubei University of Science and Technology, Xianning, China
| | - Fei-Ma Wu
- Department of Cardiothoracic Surgery, Xianning Central Hospital, The First Affiliated Hospital of Hubei University of Science and Technology, Xianning, China
| | - Hai-Bo Hu
- Department of Thoracic Surgery, Huai'an Second People's Hospital, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| |
Collapse
|
18
|
Liu G, Chai B, Yang K, Yu J, Zhou X. Overlapping functional modules detection in PPI network with pair-wise constrained non-negative matrix tri-factorisation. IET Syst Biol 2018. [PMID: 29533217 PMCID: PMC8687432 DOI: 10.1049/iet-syb.2017.0084] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
A large amount of available protein–protein interaction (PPI) data has been generated by high‐throughput experimental techniques. Uncovering functional modules from PPI networks will help us better understand the underlying mechanisms of cellular functions. Numerous computational algorithms have been designed to identify functional modules automatically in the past decades. However, most community detection methods (non‐overlapping or overlapping types) are unsupervised models, which cannot incorporate the well‐known protein complexes as a priori. The authors propose a novel semi‐supervised model named pairwise constrains nonnegative matrix tri‐factorisation (PCNMTF), which takes full advantage of the well‐known protein complexes to find overlapping functional modules based on protein module indicator matrix and module correlation matrix simultaneously from PPI networks. PCNMTF determinately models and learns the mixed module memberships of each protein by considering the correlation among modules simultaneously based on the non‐negative matrix tri‐factorisation. The experiment results on both synthetic and real‐world biological networks demonstrate that PCNMTF gains more precise functional modules than that of state‐of‐the‐art methods.
Collapse
Affiliation(s)
- Guangming Liu
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Bianfang Chai
- Department of Information Engineering, Hebei GEO University, Shijiazhuang, People's Republic of China
| | - Kuo Yang
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Jian Yu
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Xuezhong Zhou
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China.
| |
Collapse
|
19
|
Guo L, Zhang K, Bing Z. Application of a co‑expression network for the analysis of aggressive and non‑aggressive breast cancer cell lines to predict the clinical outcome of patients. Mol Med Rep 2017; 16:7967-7978. [PMID: 28944917 PMCID: PMC5779881 DOI: 10.3892/mmr.2017.7608] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Accepted: 07/20/2017] [Indexed: 01/07/2023] Open
Abstract
Breast cancer metastasis is a demanding problem in clinical treatment of patients with breast cancer. It is necessary to examine the mechanisms of metastasis for developing therapies. Classification of the aggressiveness of breast cancer is an important issue in biological study and for clinical decisions. Although aggressive and non‑aggressive breast cancer cells can be easily distinguished among different cell lines, it is very difficult to distinguish in clinical practice. The aim of the current study was to use the gene expression analysis from breast cancer cell lines to predict clinical outcomes of patients with breast cancer. Weighted gene co‑expression network analysis (WGCNA) is a powerful method to account for correlations between genes and extract co‑expressed modules of genes from large expression datasets. Therefore, WGCNA was applied to explore the differences in sub‑networks between aggressive and non‑aggressive breast cancer cell lines. The greatest difference topological overlap networks in both groups include potential information to understand the mechanisms of aggressiveness. The results show that the blue and red modules were significantly associated with the biological processes of aggressiveness. The sub‑network, which consisted of TMEM47, GJC1, ANXA3, TWIST1 and C19orf33 in the blue module, was associated with an aggressive phenotype. The sub‑network of LOC100653217, CXCL12, SULF1, DOK5 and DKK3 in the red module was associated with a non‑aggressive phenotype. In order to validate the hazard ratio of these genes, the prognostic index was constructed to integrate them and examined using data from the Cancer Genomic Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with breast cancer from TCGA in the high‑risk group had a significantly shorter overall survival time compared with patients in the low‑risk group (hazard ratio=1.231, 95% confidence interval=1.058‑1.433, P=0.0071, by the Wald test). A similar result was produced from the GEO database. The findings may provide a novel strategy for measuring cancer aggressiveness in patients with breast cancer.
Collapse
Affiliation(s)
- Ling Guo
- College of Electrical Engineering, Northwest University for Nationalities, Lanzhou, Gansu 730030, P.R. China
| | - Kun Zhang
- College of Electrical Engineering, Northwest University for Nationalities, Lanzhou, Gansu 730030, P.R. China
| | - Zhitong Bing
- Evidence Based Medicine Center, School of Basic Medical Science of Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| |
Collapse
|
20
|
Golestan Hashemi FS, Razi Ismail M, Rafii Yusop M, Golestan Hashemi MS, Nadimi Shahraki MH, Rastegari H, Miah G, Aslani F. Intelligent mining of large-scale bio-data: Bioinformatics applications. BIOTECHNOL BIOTEC EQ 2017. [DOI: 10.1080/13102818.2017.1364977] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Affiliation(s)
- Farahnaz Sadat Golestan Hashemi
- Plant Genetics, AgroBioChem Department, Gembloux Agro-Bio Tech, University of Liege, Liege, Belgium
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mohd Razi Ismail
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mohd Rafii Yusop
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mahboobe Sadat Golestan Hashemi
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
- Big Data Research Center, Najafabad Branch, Islamic Azad University, Isfahan, Iran
| | - Mohammad Hossein Nadimi Shahraki
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
- Big Data Research Center, Najafabad Branch, Islamic Azad University, Isfahan, Iran
| | - Hamid Rastegari
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
| | - Gous Miah
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Farzad Aslani
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| |
Collapse
|
21
|
Han Z, Zhang J, Sun G, Liu G, Huang K. A matrix rank based concordance index for evaluating and detecting conditional specific co-expressed gene modules. BMC Genomics 2016; 17 Suppl 7:519. [PMID: 27556416 PMCID: PMC5001231 DOI: 10.1186/s12864-016-2912-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Gene co-expression network analysis (GCNA) is widely adopted in bioinformatics and biomedical research with applications such as gene function prediction, protein-protein interaction inference, disease markers identification, and copy number variance discovery. Currently there is a lack of rigorous analysis on the mathematical condition for which the co-expressed gene module should satisfy. METHODS In this paper, we present a linear algebraic based Centralized Concordance Index (CCI) for evaluating the concordance of co-expressed gene modules from gene co-expression network analysis. The CCI can be used to evaluate the performance for co-expression network analysis algorithms as well as for detecting condition specific co-expression modules. We applied CCI in detecting lung tumor specific gene modules. RESULTS AND DISCUSSION Simulation showed that CCI is a robust indicator for evaluating the concordance of a group of co-expressed genes. The application to lung cancer datasets revealed interesting potential tumor specific genetic alterations including CNVs and even hints for gene-fusion. Deeper analysis required for understanding the molecular mechanisms of all such condition specific co-expression relationships. CONCLUSION The CCI can be used to evaluate the performance for co-expression network analysis algorithms as well as for detecting condition specific co-expression modules. It is shown to be more robust to outliers and interfering modules than density based on Pearson correlation coefficients.
Collapse
Affiliation(s)
- Zhi Han
- College of Computer and Control Engineering, Nankai University, Tianjin, China
- College of Software, Nankai University, Tianjin, China
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH USA
| | - Jie Zhang
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH USA
- The CCC Biomedical Informatics Shared Resource, The Ohio State University, Columbus, OH USA
| | - Guoyuan Sun
- College of Computer and Control Engineering, Nankai University, Tianjin, China
- College of Software, Nankai University, Tianjin, China
| | - Gang Liu
- College of Computer and Control Engineering, Nankai University, Tianjin, China
- College of Software, Nankai University, Tianjin, China
| | - Kun Huang
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH USA
- The CCC Biomedical Informatics Shared Resource, The Ohio State University, Columbus, OH USA
| |
Collapse
|
22
|
Construction of co-expression network based on natural expression variation of xylogenesis-related transcripts in Eucalyptus tereticornis. Mol Biol Rep 2016; 43:1129-46. [DOI: 10.1007/s11033-016-4046-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2016] [Accepted: 07/20/2016] [Indexed: 12/23/2022]
|
23
|
Zhang J, Huang K. Normalized lmQCM: An Algorithm for Detecting Weak Quasi-Cliques in Weighted Graph with Applications in Gene Co-Expression Module Discovery in Cancers. Cancer Inform 2016; 13:137-46. [PMID: 27486298 PMCID: PMC4962959 DOI: 10.4137/cin.s14021] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Revised: 04/20/2015] [Accepted: 04/28/2015] [Indexed: 01/14/2023] Open
Abstract
In this paper, we present a new approach for mining weighted networks to identify densely connected modules such as quasi-cliques. Quasi-cliques are densely connected subnetworks in a network. Detecting quasi-cliques is an important topic in data mining, with applications such as social network study and biomedicine. Our approach has two major improvements upon previous work. The first is the use of local maximum edges to initialize the search in order to avoid excessive overlaps among the modules, thereby greatly reducing the computing time. The second is the inclusion of a weight normalization procedure to enable discovery of "subtle" modules with more balanced sizes. We carried out careful tests on multiple parameters and settings using two large cancer datasets. This approach allowed us to identify a large number of gene modules enriched in both biological functions and chromosomal bands in cancer data, suggesting potential roles of copy number variations (CNVs) involved in the cancer development. We then tested the genes in selected modules with enriched chromosomal bands using The Cancer Genome Atlas data, and the results strongly support our hypothesis that the coexpression in these modules are associated with CNVs. While gene coexpression network analyses have been widely adopted in disease studies, most of them focus on the functional relationships of coexpressed genes. The relationship between coexpression gene modules and CNVs are much less investigated despite the potential advantage that we can infer from such relationship without genotyping data. Our new approach thus provides a means to carry out deep mining of the gene coexpression network to obtain both functional and genetic information from the expression data.
Collapse
Affiliation(s)
- Jie Zhang
- Department of Biomedical Informatics and Biomedical Informatics Shared Resource, The Ohio State University, Columbus, USA
| | - Kun Huang
- Department of Biomedical Informatics and Biomedical Informatics Shared Resource, The Ohio State University, Columbus, USA
| |
Collapse
|
24
|
Shroff S, Zhang J, Huang K. Gene Co-Expression Analysis Predicts Genetic Variants Associated with Drug Responsiveness in Lung Cancer. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2016; 2016:32-41. [PMID: 27570645 PMCID: PMC5001757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Responsiveness to drugs is an important concern in designing personalized treatment for cancer patients. Currently genetic markers are often used to guide targeted therapy. However, deeper understanding of the molecular basis for drug responses and discovery of new predictive biomarkers for drug sensitivity are much needed. In this paper, we present a workflow for identifying condition-specific gene co-expression networks associated with responses to the tyrosine kinase inhibitor, Erlotinib, in lung adenocarcinoma cell lines using data from the Cancer Cell Line Encyclopedia by combining network mining and statistical analysis. Particularly, we have identified multiple gene modules specifically co-expressed in the drug responsive cell lines but not in the unresponsive group. Interestingly, most of these modules are enriched on specific cytobands, suggesting potential copy number variation events on these loci. Our results therefore imply that there are multiple genetic loci with copy number variations associated with the Erlotinib responses. The existence of CNVs in these loci is also confirmed in lung cancer tissue samples using the TCGA data. Since these structural variations are inferred from functional genomics data, these CNVs are functional variations. These results suggest the condition specific gene co- expression network mining approach is an effective approach in predicting candidate biomarkers for drug responses.
Collapse
Affiliation(s)
- Sanaya Shroff
- Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853 (USA)
| | - Jie Zhang
- Biomedical Informatics, The Ohio State University, Columbus, OH 43210 (USA)
| | - Kun Huang
- Biomedical Informatics, The Ohio State University, Columbus, OH 43210 (USA),Corresponding author:
| |
Collapse
|
25
|
Du N, Jiang K, Sawle AD, Frank MB, Wallace CA, Zhang A, Jarvis JN. Dynamic tracking of functional gene modules in treated juvenile idiopathic arthritis. Genome Med 2015; 7:109. [PMID: 26497493 PMCID: PMC4619406 DOI: 10.1186/s13073-015-0227-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Accepted: 10/01/2015] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND We have previously shown that childhood-onset rheumatic diseases show aberrant patterns of gene expression that reflect pathology-associated co-expression networks. In this study, we used novel computational approaches to examine how disease-associated networks are altered in one of the most common rheumatic diseases of childhood, juvenile idiopathic arthritis (JIA). METHODS Using whole blood gene expression profiles derived from children in a pediatric rheumatology clinical trial, we used a network approach to understanding the impact of therapy and the underlying biology of response/non-response to therapy. RESULTS We demonstrate that therapy for JIA is associated with extensive re-ordering of gene expression networks, even in children who respond inadequately to therapy. Furthermore, we observe distinct differences in the evolution of specific network properties when we compare children who have been treated successfully with those who have inadequate treatment response. CONCLUSIONS Despite the inherent noisiness of whole blood gene expression data, our findings demonstrate how therapeutic response might be mapped and understood in pathologically informative cells in a broad range of human inflammatory diseases.
Collapse
Affiliation(s)
- Nan Du
- Department of Computer Sciences and Engineering, University at Buffalo, Buffalo, NY, USA.
| | - Kaiyu Jiang
- Department of Pediatrics, Rheumatology Research, University at Buffalo School of Medicine, Buffalo, NY, USA.
| | - Ashley D Sawle
- The Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, NY, 10032, USA.
| | - Mark Barton Frank
- Oklahoma Medical Research Foundation, Clinical Immunology Program, Oklahoma City, OK, USA.
| | - Carol A Wallace
- Department of Pediatrics, University of Washington, Seattle, WA, USA.
| | - Aidong Zhang
- Department of Computer Sciences and Engineering, University at Buffalo, Buffalo, NY, USA.
| | - James N Jarvis
- Department of Pediatrics, Rheumatology Research, University at Buffalo School of Medicine, Buffalo, NY, USA.
- Genetics, Genomics, and Bioinformatics Program, University at Buffalo, Buffalo, NY, USA.
- Pediatric Rheumatology Research, University at Buffalo Clinical & Translational Research Center, 875 Ellicott St, Buffalo, NY, 14203, USA.
| |
Collapse
|
26
|
Myers EM, Bartlett CW, Machiraju R, Bohland JW. An integrative analysis of regional gene expression profiles in the human brain. Methods 2015; 73:54-70. [DOI: 10.1016/j.ymeth.2014.12.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Revised: 11/27/2014] [Accepted: 12/06/2014] [Indexed: 10/24/2022] Open
|
27
|
Chang X, Shi L, Gao F, Russin J, Zeng L, He S, Chen TC, Giannotta SL, Weisenberger DJ, Zada G, Wang K, Mack WJ. Genomic and transcriptome analysis revealing an oncogenic functional module in meningiomas. Neurosurg Focus 2014; 35:E3. [PMID: 24289128 DOI: 10.3171/2013.10.focus13326] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
OBJECT Meningiomas are among the most common primary adult brain tumors. Although typically benign, roughly 2%-5% display malignant pathological features. The key molecular pathways involved in malignant transformation remain to be determined. METHODS Illumina expression microarrays were used to assess gene expression levels, and Illumina single-nucleotide polymorphism arrays were used to identify copy number variants in benign, atypical, and malignant meningiomas (19 tumors, including 4 malignant ones). The authors also reanalyzed 2 expression data sets generated on Affymetrix microarrays (n = 68, including 6 malignant ones; n = 56, including 3 malignant ones). A weighted gene coexpression network approach was used to identify coexpression modules associated with malignancy. RESULTS At the genomic level, malignant meningiomas had more chromosomal losses than atypical and benign meningiomas, with average length of 528, 203, and 34 megabases, respectively. Monosomic loss of chromosome 22 was confirmed to be one of the primary chromosomal level abnormalities in all subtypes of meningiomas. At the transcriptome level, the authors identified 23 coexpression modules from the weighted gene coexpression network. Gene functional enrichment analysis highlighted a module with 356 genes that was highly related to tumorigenesis. Four intramodular hubs within the module (GAB2, KLF2, ID1, and CTF1) were oncogenic in other cancers such as leukemia. A putative meningioma tumor suppressor MN1 was also identified in this module with differential expression between malignant and benign meningiomas. CONCLUSIONS The authors' genomic and transcriptome analysis of meningiomas provides novel insights into the molecular pathways involved in malignant transformation of meningiomas, with implications for molecular heterogeneity of the disease.
Collapse
Affiliation(s)
- Xiao Chang
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Xiang Y, Zhang J, Huang K. Mining the tissue-tissue gene co-expression network for tumor microenvironment study and biomarker prediction. BMC Genomics 2013; 14 Suppl 5:S4. [PMID: 24564578 PMCID: PMC3852209 DOI: 10.1186/1471-2164-14-s5-s4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND Recent discovery in tumor development indicates that the tumor microenvironment (mostly stroma cells) plays an important role in cancer development. To understand how the tumor microenvironment (TME) interacts with the tumor, we explore the correlation of the gene expressions between tumor and stroma. The tumor and stroma gene expression data are modeled as a weighted bipartite network (tumor-stroma coexpression network) where the weight of an edge indicates the correlation between the expression profiles of the corresponding tumor gene and stroma gene. In order to efficiently mine this weighted bipartite network, we developed the Bipartite subnetwork Component Mining algorithm (BCM), and we show that the BCM algorithm can efficiently mine weighted bipartite networks for dense Bipartite sub-Networks (BiNets) with density guarantees. RESULTS We applied BCM to the tumor-stroma coexpression network and find 372 BiNets that demonstrate statistical significance in survival tests. A good number of these BiNets demonstrate strong prognosis powers on at least one breast cancer patient cohort, which suggests that these BiNets are potential biomarkers for breast cancer prognosis. Further study on these 372 BiNets by the network merging approach reveals that they form 10 macro bipartite networks which show orchestrated key biological processes in both tumor and stroma. In addition, by further examining the BiNets that are significant in ER-negative breast cancer patient prognosis, we discovered a ubiquitin C (UBC) gene network that demonstrates strong prognosis power in nearly all types of breast cancer subtypes we used in this study. CONCLUSIONS The results support our hypothesis that the UBC gene network plays an important role in breast cancer prognosis and therapy and it is a potential prognostic biomarker for multiple breast cancer subtypes.
Collapse
|
29
|
Abstract
Purpose Personalized medicine is predicated on the concept of identifying subgroups of a common disease for better treatment. Identifying biomarkers that predict disease subtypes has been a major focus of biomedical science. In the era of genome-wide profiling, there is controversy as to the optimal number of genes as an input of a feature selection algorithm for survival modeling. Patients and methods The expression profiles and outcomes of 544 patients were retrieved from The Cancer Genome Atlas. We compared four different survival prediction methods: (1) 1-nearest neighbor (1-NN) survival prediction method; (2) random patient selection method and a Cox-based regression method with nested cross-validation; (3) least absolute shrinkage and selection operator (LASSO) optimization using whole-genome gene expression profiles; or (4) gene expression profiles of cancer pathway genes. Results The 1-NN method performed better than the random patient selection method in terms of survival predictions, although it does not include a feature selection step. The Cox-based regression method with LASSO optimization using whole-genome gene expression data demonstrated higher survival prediction power than the 1-NN method, but was outperformed by the same method when using gene expression profiles of cancer pathway genes alone. Conclusion The 1-NN survival prediction method may require more patients for better performance, even when omitting censored data. Using preexisting biological knowledge for survival prediction is reasonable as a means to understand the biological system of a cancer, unless the analysis goal is to identify completely unknown genes relevant to cancer biology.
Collapse
Affiliation(s)
- Hyunsoo Kim
- Department of Pathology, The University of Alabama at Birmingham, Birmingham, AL, USA
| | | |
Collapse
|