1
|
Yang Z, Carrio-Cordo P, Baudis M. Copy number variation heterogeneity reveals biological inconsistency in hierarchical cancer classifications. Mol Cytogenet 2024; 17:26. [PMID: 39506842 DOI: 10.1186/s13039-024-00692-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 10/02/2024] [Indexed: 11/08/2024] Open
Abstract
Cancers are heterogeneous diseases with unifying features of abnormal and consuming cell growth, where the deregulation of normal cellular functions is initiated by the accumulation of genomic mutations in cells of - potentially - any organ. At diagnosis malignancies typically present with patterns of somatic genome variants on diverse levels of heterogeneity. Among the different types of genomic alterations, copy number variants (CNV) represent a distinct, near-ubiquitous class of structural variants. Cancer classifications are foundational for patient care and oncology research. Terminologies such as the National Cancer Institute Thesaurus provide large sets of hierarchical cancer classification vocabularies and promote data interoperability and ontology-driven computational analysis. To find out how categorical classifications correspond to genomic observations, we conducted a meta-analysis of inter-sample genomic heterogeneity for classification hierarchies on CNV profiles from 97,142 individual samples across 512 cancer entities, and evaluated recurring CNV signatures across diagnostic subsets. Our results highlight specific biological mechanisms across cancer entities with the potential for improvement of patient stratification and future enhancement of cancer classification systems and provide some indications for cooperative genomic events across distinct clinical entities.
Collapse
Affiliation(s)
- Ziying Yang
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland.
- Swiss Institute of Bioinformatics, Zurich, Switzerland.
| | - Paula Carrio-Cordo
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Zurich, Switzerland
| | - Michael Baudis
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland.
- Swiss Institute of Bioinformatics, Zurich, Switzerland.
| |
Collapse
|
2
|
Chattopadhyay A, Teoh ZH, Wu CY, Juang JMJ, Lai LC, Tsai MH, Wu CH, Lu TP, Chuang EY. CNVIntegrate: the first multi-ethnic database for identifying copy number variations associated with cancer. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6321046. [PMID: 34259866 PMCID: PMC8278790 DOI: 10.1093/database/baab044] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/29/2021] [Accepted: 07/02/2021] [Indexed: 11/14/2022]
Abstract
Human copy number variations (CNVs) and copy number alterations (CNAs) are DNA segments (>1000 base pairs) of duplications or deletions with respect to the reference genome, potentially causing genomic imbalance leading to diseases such as cancer. CNVs further cause genetic diversity in healthy populations and are predominant drivers of gene/genome evolution. Initiatives have been taken by the research community to establish large-scale databases to comprehensively characterize CNVs in humans. Exome Aggregation Consortium (ExAC) is one such endeavor that catalogs CNVs, of nearly 60 000 healthy individuals across five demographic clusters. Furthermore, large projects such as the Catalogue of Somatic Mutations in Cancer (COSMIC) and the Cancer Cell Line Encyclopedia (CCLE) combine CNA data from cancer-affected individuals and large panels of human cancer cell lines, respectively. However, we lack a structured and comprehensive CNV/CNA resource including both healthy individuals and cancer patients across large populations. CNVIntegrate is the first web-based system that hosts CNV and CNA data from both healthy populations and cancer patients, respectively, and concomitantly provides statistical comparisons between copy number frequencies of multiple ethnic populations. It further includes, for the first time, well-cataloged CNV and CNA data from Taiwanese healthy individuals and Taiwan Breast Cancer data, respectively, along with imported resources from ExAC, COSMIC and CCLE. CNVIntegrate offers a CNV/CNA-data hub for structured information retrieval for clinicians and scientists towards important drug discoveries and precision treatments. Database URL: http://cnvintegrate.cgm.ntu.edu.tw/.
Collapse
Affiliation(s)
- Amrita Chattopadhyay
- Bioinformatics and Biostatistics Core, Center of Genomic and Precision Medicine, National Taiwan University, Taipei 10055, Taiwan
| | - Zi Han Teoh
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 10617, Taiwan
| | - Chi-Yun Wu
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 10617, Taiwan
| | - Jyh-Ming Jimmy Juang
- Cardiovascular Center and Division of Cardiology, Department of Internal Medicine, National Taiwan University Hospital, Taipei 10008, Taiwan.,College of Medicine, National Taiwan University, Taipei 10051, Taiwan
| | - Liang-Chuan Lai
- Bioinformatics and Biostatistics Core, Center of Genomic and Precision Medicine, National Taiwan University, Taipei 10055, Taiwan.,Graduate Institute of Physiology, National Taiwan University, Taipei 10051, Taiwan
| | - Mong-Hsun Tsai
- Bioinformatics and Biostatistics Core, Center of Genomic and Precision Medicine, National Taiwan University, Taipei 10055, Taiwan.,Institute of Biotechnology, National Taiwan University, Taipei 10672, Taiwan.,Center for Biotechnology, National Taiwan University, Taipei 10672, Taiwan
| | - Chia-Hsin Wu
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 10617, Taiwan
| | - Tzu-Pin Lu
- Bioinformatics and Biostatistics Core, Center of Genomic and Precision Medicine, National Taiwan University, Taipei 10055, Taiwan.,Department of Public Health, Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei 10055, Taiwan
| | - Eric Y Chuang
- Bioinformatics and Biostatistics Core, Center of Genomic and Precision Medicine, National Taiwan University, Taipei 10055, Taiwan.,Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 10617, Taiwan.,Master Program for Biomedical Engineering, China Medical University, Taichung 40402, Taiwan
| |
Collapse
|
3
|
Li Q, Bian Y, Li Q. Down-Regulation of TMPO-AS1 Induces Apoptosis in Lung Carcinoma Cells by Regulating miR-143-3p/CDK1 Axis. Technol Cancer Res Treat 2021; 20:1533033820948880. [PMID: 33685293 PMCID: PMC8093611 DOI: 10.1177/1533033820948880] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Evidence has shown that long non-coding RNAs (lncRNA) play pivotal roles in cancer promotion as well as suppression. But the molecular mechanism of lncRNA TMPO antisense transcript 1 (TMPO-AS1) in lung cancer (LC) remains unclear. This study mainly investigated the effect of TMPO-AS1 in LC treatment. TMPO-AS1 was tested by Kaplan-Meier method. Quantitative real time polymerase chain reaction (qRT-PCR) was employed to assess the expressions of TMPO-AS1, miR-143-3p, and CDK1 respectively in LC tissues and cell lines. TMPO-AS1, miR-143-3p and CDK1 expressions in LC cells were regulated through cell transfection, followed by MTT for cell viability detection. Besides, dual-luciferase reporter assays were performed to verify the interrelated microRNA of TMPO-AS1 and the target of miR-143-3p. Western blot experiments were used to examine the expressions of apoptosis-related proteins, and flow cytometry tested the cell apoptosis in treated cells. TMPO-AS1 and CDK1 were overexpressed in LC tissues and cells, while miR-143-3p level was suppressed. The decrease of TMPO-AS1 led to the increase of miR-143-3p, which further resulted in the reduction of CDK1. Down-regulating TMPO-AS1 reduced LC cell viability, motivated cell apoptosis, as well as promoted the expressions of Bcl and CCND1 and restrained Caspase-3 level, but all these consequences were abrogated by miR-143-3p inhibitor. Simultaneously, siCDK1 could reverse the anti-apoptosis and pro-activity functions of miR-143-3p inhibitor in LC cells. Down-regulation of TMPO-AS1 has the effects of pro-apoptosis in LC by manipulating miR-143-3p/CDK1, which is hopeful to be a novel therapy for LC patients.
Collapse
Affiliation(s)
- Qiu Li
- Department of Respiratory, Zhuji Affiliated Hospital of Shaoxing University, Zhuji, Zhejiang Province, China
| | - Yuan Bian
- Department of Respiratory, Zhuji Affiliated Hospital of Shaoxing University, Zhuji, Zhejiang Province, China
| | - Qiaolian Li
- Department of Respiratory, Zhuji Affiliated Hospital of Shaoxing University, Zhuji, Zhejiang Province, China
| |
Collapse
|
4
|
Gao C, Zhuang J, Li H, Liu C, Zhou C, Liu L, Feng F, Sun C. Gene signatures of 6-methyladenine regulators in women with lung adenocarcinoma and development of a risk scoring system: a retrospective study using the cancer genome atlas database. Aging (Albany NY) 2021; 13:3957-3968. [PMID: 33428597 PMCID: PMC7906130 DOI: 10.18632/aging.202364] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 11/23/2020] [Indexed: 01/22/2023]
Abstract
Although the emergence of new treatments has improved the prognosis of women with lung adenocarcinoma (LUAD), the emergence of drug resistance limits their clinical efficacy. Therefore, there is an urgent need to identify new targets and develop a risk scoring system to evaluate the prognosis of patients. 6-methyladenine (M6A), as the most common methyl modification in RNA modification, its clinicopathological features, diagnosis and prognostic value in lung cancer, especially in LUAD remain to be discussed. We analyzed the clinical and sequencing data of the female LUAD cohort from The Cancer Genome Atlas (TCGA), evaluated the expression profiles of 16 M6A regulation-related genes in the cohort and the relationships between genetic changes and clinical characteristics, developed an M6A-related risk scoring system using Cox analysis. Finally, the copy number variations (CNVs) of the related genes in the samples were analyzed and verified using the cBioPortal platform. Compared with other clinical factors, this risk scoring system showed a higher predictive sensitivity and specificity. The M6A-related risk scoring system developed in this study may help to improve the screening of female patients at high risk of LUAD and provides important theoretical bioinformatics support for evaluating the prognosis of such patients.
Collapse
Affiliation(s)
- Chundi Gao
- College of First Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan 250014, Shandong, PR China
| | - Jing Zhuang
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang 261041, Shandong, PR China
| | - Huayao Li
- College of Basic Medical Sciences, Shandong University of Traditional Chinese Medicine, Jinan 250014, Shandong, PR China
| | - Cun Liu
- College of First Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan 250014, Shandong, PR China
| | - Chao Zhou
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang 261041, Shandong, PR China
| | - Lijuan Liu
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang 261041, Shandong, PR China
| | - Fubin Feng
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang 261041, Shandong, PR China
| | - Changgang Sun
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang 261041, Shandong, PR China.,Cancer and Immunology Institute, Shandong University of Traditional Chinese Medicine, Jinan, Shandong, PR China
| |
Collapse
|
5
|
Xia Q, Shu Z, Ye T, Zhang M. Identification and Analysis of the Blood lncRNA Signature for Liver Cirrhosis and Hepatocellular Carcinoma. Front Genet 2020; 11:595699. [PMID: 33365048 PMCID: PMC7750531 DOI: 10.3389/fgene.2020.595699] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 10/13/2020] [Indexed: 12/12/2022] Open
Abstract
As one of the most common malignant tumors, hepatocellular carcinoma (HCC) is the fifth major cause of cancer-associated mortality worldwide. In 90% of cases, HCC develops in the context of liver cirrhosis and chronic hepatitis B virus (HBV) infection is an important etiology for cirrhosis and HCC, accounting for 53% of all HCC cases. To understand the underlying mechanisms of the dynamic chain reactions from normal to HBV infection, from HBV infection to liver cirrhosis, from liver cirrhosis to HCC, we analyzed the blood lncRNA expression profiles from 38 healthy control samples, 45 chronic hepatitis B patients, 46 liver cirrhosis patients, and 46 HCC patients. Advanced machine-learning methods including Monte Carlo feature selection, incremental feature selection (IFS), and support vector machine (SVM) were applied to discover the signature associated with HCC progression and construct the prediction model. One hundred seventy-one key HCC progression-associated lncRNAs were identified and their overall accuracy was 0.823 as evaluated with leave-one-out cross validation (LOOCV). The accuracies of the lncRNA signature for healthy control, chronic hepatitis B, liver cirrhosis, and HCC were 0.895, 0.711, 0.870, and 0.826, respectively. The 171-lncRNA signature is not only useful for early detection and intervention of HCC, but also helpful for understanding the multistage tumorigenic processes of HCC.
Collapse
Affiliation(s)
- Qi Xia
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, Hangzhou, China.,Zhejiang University, Hangzhou, China
| | - Zheyue Shu
- Zhejiang University, Hangzhou, China.,Division of Hepatobiliary and Pancreatic Surgery, Department of Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China.,Key Laboratory of Combined Multi-Organ Transplantation, Ministry of Public Health, Hangzhou, China
| | - Ting Ye
- Zhejiang University, Hangzhou, China
| | - Min Zhang
- Zhejiang University, Hangzhou, China.,Division of Hepatobiliary and Pancreatic Surgery, Department of Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China.,Key Laboratory of Combined Multi-Organ Transplantation, Ministry of Public Health, Hangzhou, China
| |
Collapse
|
6
|
Wu Z, Shou L, Wang J, Huang T, Xu X. The Methylation Pattern for Knee and Hip Osteoarthritis. Front Cell Dev Biol 2020; 8:602024. [PMID: 33240895 PMCID: PMC7677303 DOI: 10.3389/fcell.2020.602024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 10/22/2020] [Indexed: 01/08/2023] Open
Abstract
Osteoarthritis is one of the most prevalent chronic joint diseases for middle-aged and elderly people. But in recent years, the number of young people suffering from the disease increases quickly. It is known that osteoarthritis is a common degenerative disease caused by the combination and interaction of many factors such as natural and environmental factors. DNA methylations reflect the effects of environmental factors. Several researches on DNA methylation at specific genes in OA cartilage indicated the great potential roles of DNA methylation in OA. To systematically investigate the methylation pattern in knee and hip osteoarthritis, we analyzed the methylation profiles in cartilage of 16 OA hip samples, 19 control hip samples and 62 OA knee samples. 12 discriminative methylation sites were identified using advanced minimal Redundancy Maximal Relevance (mRMR) and Incremental Feature Selection (IFS) methods. The SVM classifier of these 12 methylation sites from genes like MEIS1, GABRG3, RXRA, and EN1, can perfectly classify the OA hip samples, control hip samples and OA knee samples evaluated with LOOCV (Leave-One Out-Cross Validation). These 12 methylation sites can not only serve as biomarker, but also provide underlying mechanism of OA.
Collapse
Affiliation(s)
- Zhen Wu
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Lu Shou
- Departmemt of Pneumology, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Jian Wang
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Tao Huang
- Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Xinwei Xu
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| |
Collapse
|
7
|
Raman L, Van der Linden M, Van der Eecken K, Vermaelen K, Demedts I, Surmont V, Himpe U, Dedeurwaerdere F, Ferdinande L, Lievens Y, Claes K, Menten B, Van Dorpe J. Shallow whole-genome sequencing of plasma cell-free DNA accurately differentiates small from non-small cell lung carcinoma. Genome Med 2020; 12:35. [PMID: 32317009 PMCID: PMC7175544 DOI: 10.1186/s13073-020-00735-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 04/07/2020] [Indexed: 01/08/2023] Open
Abstract
Background Accurate lung cancer classification is crucial to guide therapeutic decisions. However, histological subtyping by pathologists requires tumor tissue—a necessity that is often intrinsically associated with procedural difficulties. The analysis of circulating tumor DNA present in minimal-invasive blood samples, referred to as liquid biopsies, could therefore emerge as an attractive alternative. Methods Concerning adenocarcinoma, squamous cell carcinoma, and small cell carcinoma, our proof of concept study investigates the potential of liquid biopsy-derived copy number alterations, derived from single-end shallow whole-genome sequencing (coverage 0.1–0.5×), across 51 advanced stage lung cancer patients. Results Genomic abnormality testing reveals anomalies in 86.3% of the liquid biopsies (16/20 for adenocarcinoma, 13/16 for squamous cell, and 15/15 for small cell carcinoma). We demonstrate that copy number profiles from formalin-fixed paraffin-embedded tumor biopsies are well represented by their liquid equivalent. This is especially valid within the small cell carcinoma group, where paired profiles have an average Pearson correlation of 0.86 (95% CI 0.79–0.93). A predictive model trained with public data, derived from 843 tissue biopsies, shows that liquid biopsies exhibit multiple deviations that reflect histological classification. Most notably, distinguishing small from non-small cell lung cancer is characterized by an area under the curve of 0.98 during receiver operating characteristic analysis. Additionally, we investigated how deeper paired-end sequencing, which will eventually become feasible for routine diagnosis, empowers tumor read enrichment by insert size filtering: for all of the 29 resequenced liquid biopsies, the tumor fraction could be increased in silico, thereby “rescuing” three out of five cases with previously undetectable alterations. Conclusions Copy number profiling of cell-free DNA enables histological classification. Since shallow whole-genome sequencing is inexpensive and often fully operational at routine molecular laboratories, this finding has current diagnostic potential, especially for patients with lesions that are difficult to reach.
Collapse
Affiliation(s)
- Lennart Raman
- Department of Pathology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium.,Center for Medical Genetics, Department of Biomolecular Medicine, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Malaïka Van der Linden
- Department of Pathology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Kim Van der Eecken
- Department of Pathology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Karim Vermaelen
- Department of Respiratory Medicine, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Ingel Demedts
- Department of Respiratory Medicine, AZ Delta, Deltalaan 1, 8800, Roeselare, Belgium
| | - Veerle Surmont
- Department of Respiratory Medicine, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Ulrike Himpe
- Department of Respiratory Medicine, AZ Delta, Deltalaan 1, 8800, Roeselare, Belgium
| | | | - Liesbeth Ferdinande
- Department of Pathology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Yolande Lievens
- Department of Radiation Oncology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Kathleen Claes
- Center for Medical Genetics, Department of Biomolecular Medicine, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Björn Menten
- Center for Medical Genetics, Department of Biomolecular Medicine, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium
| | - Jo Van Dorpe
- Department of Pathology, Ghent University Hospital, Ghent University, Corneel Heymanslaan 10, 9000, Ghent, Belgium.
| |
Collapse
|
8
|
Gao C, Zhuang J, Li H, Liu C, Zhou C, Liu L, Feng F, Sun C, Wu J. Development of a risk scoring system for evaluating the prognosis of patients with Her2-positive breast cancer. Cancer Cell Int 2020; 20:121. [PMID: 32322168 PMCID: PMC7161270 DOI: 10.1186/s12935-020-01175-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 03/13/2020] [Indexed: 12/19/2022] Open
Abstract
Background As one of the many breast cancer subtypes, human epidermal growth factor receptor 2 (Her2)-positive breast cancer has higher invasiveness and poor prognosis, although the advent of anti-Her2 drugs has brought good news to patients. However, the emergence of drug resistance still limits its clinical efficacy, so there is an urgent need to explore new targets and develop a risk scoring system to improve treatments and evaluate patient prognosis. Methods Differentially expressed mRNAs associated with Her2-positive breast cancer were screened from a TCGA cohort. The prognostic risk scoring system was constructed according to univariate and Lasso Cox regression model analyses and combined with clinical factors (such as age and TNM) for univariate and multivariate analyses to verify the specificity and sensitivity of the risk scoring system. Finally, based on correlation and CNV mutation analyses, we explored the research value of the mRNAs involved in the system as key genes of the model. Results In this study, six mRNAs were screened and identified to construct a prognostic risk scoring system, including four up-regulated mRNA (RDH16, SPC25, SPC24, and SCUBE3) and two down-regulated mRNA (DGAT2 and CCDC69). The risk scoring system can divide Her2-positive breast cancer samples into high-risk and low-risk groups to evaluate patient prognosis. In addition, whether through the time-dependent receiver operating characteristics curve or compared with clinical factors, the risk scoring system showed high predictive sensitivity and specificity. Moreover, some CNV mutations in mRNA increase patient risk by influencing expression levels. Conclusion The risk scoring system constructed in this study is helpful to improve the screening of high-risk patients with Her2-positive breast cancer and is beneficial for implementing early diagnosis and personalized treatment. It is suggested that these mRNAs may play an important role in the progression of Her2-positive breast cancer.
Collapse
Affiliation(s)
- Chundi Gao
- 1College of First Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan, 250014 Shandong People's Republic of China
| | - Jing Zhuang
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang, 261041 Shandong People's Republic of China
| | - Huayao Li
- 2College of Basic Medical, Shandong University of Traditional Chinese Medicine, Jinan, 250014 Shandong People's Republic of China
| | - Cun Liu
- 1College of First Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan, 250014 Shandong People's Republic of China
| | - Chao Zhou
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang, 261041 Shandong People's Republic of China
| | - Lijuan Liu
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang, 261041 Shandong People's Republic of China
| | - Fubin Feng
- Departmen of Oncology, Weifang Traditional Chinese Hospital, Weifang, 261041 Shandong People's Republic of China
| | - Changgang Sun
- 4Cancer and Immunology Institute, Shandong University of Traditional Chinese Medicine, Jinan, Shandong People's Republic of China
| | - Jibiao Wu
- 2College of Basic Medical, Shandong University of Traditional Chinese Medicine, Jinan, 250014 Shandong People's Republic of China
| |
Collapse
|
9
|
Cheng Q, Li J, Fan F, Cao H, Dai ZY, Wang ZY, Feng SS. Identification and Analysis of Glioblastoma Biomarkers Based on Single Cell Sequencing. Front Bioeng Biotechnol 2020; 8:167. [PMID: 32195242 PMCID: PMC7066068 DOI: 10.3389/fbioe.2020.00167] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 02/19/2020] [Indexed: 12/16/2022] Open
Abstract
Glioblastoma (GBM) is one of the most common and aggressive primary adult brain tumors. Tumor heterogeneity poses a great challenge to the treatment of GBM, which is determined by both heterogeneous GBM cells and a complex tumor microenvironment. Single-cell RNA sequencing (scRNA-seq) enables the transcriptomes of great deal of individual cells to be assayed in an unbiased manner and has been applied in head and neck cancer, breast cancer, blood disease, and so on. In this study, based on the scRNA-seq results of infiltrating neoplastic cells in GBM, computational methods were applied to screen core biomarkers that can distinguish the discrepancy between GBM tumor and pericarcinomatous environment. The gene expression profiles of GBM from 2343 tumor cells and 1246 periphery cells were analyzed by maximum relevance minimum redundancy (mRMR). Upon further analysis of the feature lists yielded by the mRMR method, 31 important genes were extracted that may be essential biomarkers for GBM tumor cells. Besides, an optimal classification model using a support vector machine (SVM) algorithm as the classifier was also built. Our results provided insights of GBM mechanisms and may be useful for GBM diagnosis and therapy.
Collapse
Affiliation(s)
- Quan Cheng
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,Department of Clinical Pharmacology, Xiangya Hospital, Central South University, Changsha, China
| | - Jing Li
- Department of Rehabilitation, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Fan Fan
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| | - Hui Cao
- Department of Psychiatry, The Second People's Hospital of Hunan University of Chinese Medicine, Changsha, China
| | - Zi-Yu Dai
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| | - Ze-Yu Wang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| | - Song-Shan Feng
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
10
|
Zhang H, Jin Z, Cheng L, Zhang B. Integrative Analysis of Methylation and Gene Expression in Lung Adenocarcinoma and Squamous Cell Lung Carcinoma. Front Bioeng Biotechnol 2020; 8:3. [PMID: 32117905 PMCID: PMC7019569 DOI: 10.3389/fbioe.2020.00003] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 01/03/2020] [Indexed: 12/18/2022] Open
Abstract
Lung cancer is a highly prevalent type of cancer with a poor 5-year survival rate of about 4-17%. Eighty percent lung cancer belongs to non-small-cell lung cancer (NSCLC). For a long time, the treatment of NSCLC has been mostly guided by tumor stage, and there has been no significant difference between the therapy strategy of lung adenocarcinoma (LUAD) and squamous cell lung carcinoma (SCLC), the two major subtypes of NSCLC. In recent years, important molecular differences between LUAD and SCLC are increasingly identified, indicating that targeted therapy will be more and more histologically specific in the future. To investigate the LUAD and SCLC difference on multi-omics scale, we analyzed the methylation and gene expression data together. With the Boruta method to remove irrelevant features and the MCFS (Monte Carlo Feature Selection) method to identify the significantly important features, we identified 113 key methylation features and 23 key gene expression features. HNF1B and TP63 were found to be dysfunctional on both methylation and gene expression levels. The experimentally determined interaction network suggested that TP63 may play an important role in connecting methylation genes and expression genes. Many of the discovered signature genes have been supported by literature. Our results may provide directions of precision diagnosis and therapy of LUAD and SCLC.
Collapse
Affiliation(s)
- Hao Zhang
- Department of Respiratory and Critical Care Medicine, Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, China
| | - Zhou Jin
- Department of Respiratory and Critical Care Medicine, Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, China.,Department of Respiration, Hospital of Traditional Chinese Medicine of Zhenhai, Ningbo, China
| | - Ling Cheng
- Shanghai Engineering Research Center of Pharmaceutical Translation, Shanghai, China
| | - Bin Zhang
- Department of Respiratory and Critical Care Medicine, Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, China
| |
Collapse
|
11
|
Li L, Barth NKH, Pilarsky C, Taher L. Cancer Is Associated with Alterations in the Three-Dimensional Organization of the Genome. Cancers (Basel) 2019; 11:cancers11121886. [PMID: 31783642 PMCID: PMC6966451 DOI: 10.3390/cancers11121886] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 11/21/2019] [Accepted: 11/23/2019] [Indexed: 12/17/2022] Open
Abstract
The human genome is organized into topologically associating domains (TADs), which represent contiguous regions with a higher frequency of intra-interactions as opposed to inter-interactions. TADs contribute to gene expression regulation by restricting the interactions between their regulatory elements, and TAD disruption has been associated with cancer. Here, we provide a proof of principle that mutations within TADs can be used to predict the survival of cancer patients. Specifically, we constructed a set of 1467 consensus TADs representing the three-dimensional organization of the human genome and used Cox regression analysis to identify a total of 35 prognostic TADs in different cancer types. Interestingly, only 46% of the 35 prognostic TADs comprised genes with known clinical relevance. Moreover, in the vast majority of such cases, the prognostic value of the TAD was not directly related to the presence/absence of mutations in the gene(s), emphasizing the importance of regulatory mutations. In addition, we found that 34% of the prognostic TADs show strong structural perturbations in the cancer genome, consistent with the widespread, global epigenetic dysregulation often observed in cancer patients. In summary, this study elucidates the mechanisms through which non-coding variants may influence cancer progression and opens new avenues for personalized medicine.
Collapse
Affiliation(s)
- Lifei Li
- Division of Bioinformatics, Department of Biology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany; (L.L.); (N.K.H.B.)
| | - Nicolai K. H. Barth
- Division of Bioinformatics, Department of Biology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany; (L.L.); (N.K.H.B.)
| | - Christian Pilarsky
- Department of Surgery, Friedrich-Alexander-Universität Erlangen-Nürnberg and Universitätsklinikum Erlangen, 91054 Erlangen, Germany;
| | - Leila Taher
- Division of Bioinformatics, Department of Biology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany; (L.L.); (N.K.H.B.)
- Institute for Biomedical Informatics, Graz University of Technology, 8010 Graz, Austria
- Correspondence:
| |
Collapse
|
12
|
Yao Y, Gu Y, Yang M, Cao D, Wu F. The Gene Expression Biomarkers for Chronic Obstructive Pulmonary Disease and Interstitial Lung Disease. Front Genet 2019; 10:1154. [PMID: 31824564 PMCID: PMC6879656 DOI: 10.3389/fgene.2019.01154] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Accepted: 10/22/2019] [Indexed: 01/01/2023] Open
Abstract
COPD (chronic obstructive pulmonary disease) and ILD (interstitial lung disease) are two common respiratory diseases. They share similar clinical traits but require different therapeutic treatments. Identifying the biomarkers that are differentially expressed between them will not only help the diagnosis of COPD and ILD, but also provide candidate drug targets that may facilitate the development of new treatment for COPD and ILD. Due to the irreversible complex pathological changes of COPD, there are very limited therapeutic options for COPD patients. In this study, we analyzed the gene expression profiles of two datasets: one training dataset that includes 144 COPD patients and 194 ILD patients, and one test dataset that includes 75 COPD patients and 61 ILD patients. Advanced feature selection methods, mRMR (minimal Redundancy Maximal Relevance) and incremental feature selection (IFS), were applied to identify the 38-gene biomarker. An SVM (support vector machine) classifier was built based on the 38-gene biomarker. Its accuracy, sensitivity, and specificity on training dataset evaluated by leave one out cross-validation were 0.905, 0.896, and 0.912, respectively. And on independent test dataset, the accuracy, sensitivity, and specificity on were as great as and were 0.904, 0.933, and 0.869, respectively. The biological function analysis of the 38 genes indicated that many of them can be potential treatment targets that may benefit COPD and ILD patients.
Collapse
Affiliation(s)
- Yangwei Yao
- Department of Pulmonary and Critical Care Medicine, The Second Hospital of Jiaxing, Jiaxing, China
| | - Yangyang Gu
- Department of Pulmonary and Critical Care Medicine, The Second Hospital of Jiaxing, Jiaxing, China
| | - Meng Yang
- Department of Pulmonary and Critical Care Medicine, The Second Hospital of Jiaxing, Jiaxing, China
| | - Dakui Cao
- Department of Pulmonary and Critical Care Medicine, The Second Hospital of Jiaxing, Jiaxing, China
| | - Fengjie Wu
- Department of Pulmonary and Critical Care Medicine, The Second Hospital of Jiaxing, Jiaxing, China
| |
Collapse
|
13
|
Yu XJ, Chen G, Yang J, Yu GC, Zhu PF, Jiang ZK, Feng K, Lu Y, Bao B, Zhong FM. Smoking alters the evolutionary trajectory of non-small cell lung cancer. Exp Ther Med 2019; 18:3315-3324. [PMID: 31602204 PMCID: PMC6777332 DOI: 10.3892/etm.2019.7958] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 05/16/2019] [Indexed: 12/14/2022] Open
Abstract
Smoking is the biggest risk factor for lung cancer. Smokers have a much higher chance of developing lung tumors with a worse survival rate; however, non-smokers also develop lung tumors. A number of questions remain including the underlying difference between smoker and non-smoker lung cancer patients and the involvement of genetic and epigenetic processes in tumor development. The present study analyzed the mutation data of 100 non-small cell lung cancer (NSCLC) patients, 12 non-smokers, 48 ex-smokers and 40 smokers, from Tracking Non-Small Cell Lung Cancer Evolution through Therapy Consortium. A total of 68 genes exhibited different mutation patterns across non-smokers, ex-smokers and smokers. A number of these 68 genes encode membrane proteins with biological regulation, metabolic process, and response to stimulus functions. For each group of patients, the top 10 most frequently mutated genes were selected and their oncogenetic tree inferred, which reflected how the genes evolve during tumor genesis. By comparing the oncogenetic trees of non-smokers and smokers, it was identified that in non-smokers, the mutation of epidermal growth factor receptor (EGFR) was an early genetic alteration event and EGFR was the key driver, but in smokers, the mutation of titin (TTN) was more important. Based on network analysis, TTN can interact with spectrin α erythrocytic 1 through calmodulin 2 and troponin C1. These genetic differences during tumorigenesis of non-smoker and smoker lung cancer patients provided novel insights into the effects of smoking on the evolutionary trajectory of non-small cell lung cancer and may prove helpful for targeted therapy of different lung cancer subtypes.
Collapse
Affiliation(s)
- Xiao-Jun Yu
- Department of Thoracic Surgery, The First People's Hospital of Fuyang Hangzhou, Hangzhou, Zhejiang 311400, P.R. China
| | - Gang Chen
- Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China
| | - Jun Yang
- Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China
| | - Guo-Can Yu
- Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China
| | - Peng-Fei Zhu
- Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China
| | - Zheng-Ke Jiang
- Department of Surgery, Hangzhou Fuyang Hospital of Traditional Chinese Medicine, Hangzhou, Zhejiang 311400, P.R. China
| | - Kan Feng
- Department of Thoracic Surgery, The First People's Hospital of Fuyang Hangzhou, Hangzhou, Zhejiang 311400, P.R. China
| | - Yong Lu
- Department of Thoracic Surgery, The First People's Hospital of Fuyang Hangzhou, Hangzhou, Zhejiang 311400, P.R. China
| | - Bin Bao
- Department of Thoracic Surgery, The First People's Hospital of Fuyang Hangzhou, Hangzhou, Zhejiang 311400, P.R. China
| | - Fang-Ming Zhong
- Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China
| |
Collapse
|
14
|
Zhang GL, Pan LL, Huang T, Wang JH. The transcriptome difference between colorectal tumor and normal tissues revealed by single-cell sequencing. J Cancer 2019; 10:5883-5890. [PMID: 31737124 PMCID: PMC6843882 DOI: 10.7150/jca.32267] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 06/17/2019] [Indexed: 12/29/2022] Open
Abstract
The previous cancer studies were difficult to reproduce since the tumor tissues were analyzed directly. But the tumor tissues were actually a mixture of different cancer cells. The transcriptome of single-cell was much robust than the transcriptome of a mixed tissue. The single-cell transcriptome had much smaller variance. In this study, we analyzed the single-cell transcriptome of 272 colorectal cancer (CRC) epithelial cells and 160 normal epithelial cells and identified 342 discriminative transcripts using advanced machine learning methods. The most discriminative transcripts were LGALS4, PHGR1, C15orf48, HEPACAM2, PERP, FABP1, FCGBP, MT1G, TSPAN1 and CKB. We further clustered the 342 transcripts into two categories. The upregulated transcripts in CRC epithelial cells were significantly enriched in Ribosome, Protein processing in endoplasmic reticulum, Antigen processing and presentation and p53 signaling pathway. The downregulated transcripts in CRC epithelial cells were significantly enriched in Mineral absorption, Aldosterone-regulated sodium reabsorption and Oxidative phosphorylation pathways. The biological analysis of the discriminative transcripts revealed the possible mechanism of colorectal cancer.
Collapse
Affiliation(s)
- Guo-Liang Zhang
- Department of Colorectal Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, Zhejiang, China
| | - Le-Lin Pan
- Department of Colorectal Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, Zhejiang, China
| | - Tao Huang
- Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Jin-Hai Wang
- Department of Colorectal Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, Zhejiang, China
| |
Collapse
|
15
|
A Shallow Convolutional Learning Network for Classification of Cancers Based on Copy Number Variations. SENSORS 2019; 19:s19194207. [PMID: 31569801 PMCID: PMC6806227 DOI: 10.3390/s19194207] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 09/18/2019] [Accepted: 09/25/2019] [Indexed: 12/29/2022]
Abstract
Genomic copy number variations (CNVs) are among the most important structural variations. They are linked to several diseases and cancer types. Cancer is a leading cause of death worldwide. Several studies were conducted to investigate the causes of cancer and its association with genomic changes to enhance its management and improve the treatment opportunities. Classification of cancer types based on the CNVs falls in this category of research. We reviewed the recent, most successful methods that used machine learning algorithms to solve this problem and obtained a dataset that was tested by some of these methods for evaluation and comparison purposes. We propose three deep learning techniques to classify cancer types based on CNVs: a six-layer convolutional net (CNN6), residual six-layer convolutional net (ResCNN6), and transfer learning of pretrained VGG16 net. The results of the experiments performed on the data of six cancer types demonstrated a high accuracy of 86% for ResCNN6 followed by 85% for CNN6 and 77% for VGG16. The results revealed a lower prediction accuracy for one of the classes (uterine corpus endometrial carcinoma (UCEC)). Repeating the experiments after excluding this class reveals improvements in the accuracies: 91% for CNN6 and 92% for Res CNN6. We observed that UCEC and ovarian serous carcinoma (OV) share a considerable subset of their features, which causes a struggle for learning in the classifiers. We repeated the experiment again by balancing the six classes through oversampling of the training dataset and the result was an enhancement in both overall and UCEC classification accuracies.
Collapse
|
16
|
Deng M, Lv XD, Fang ZX, Xie XS, Chen WY. The blood transcriptional signature for active and latent tuberculosis. Infect Drug Resist 2019; 12:321-328. [PMID: 30787624 PMCID: PMC6363485 DOI: 10.2147/idr.s184640] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Although the incidence of tuberculosis (TB) has dropped substantially, it still is a serious threat to human health. And in recent years, the emergence of resistant bacilli and inadequate disease control and prevention has led to a significant rise in the global TB epidemic. It is known that the cause of TB is Mycobacterium tuberculosis infection. But it is not clear why some infected patients are active while others are latent. METHODS We analyzed the blood gene expression profiles of 69 latent TB patients and 54 active pulmonary TB patients from GEO (Transcript Expression Omnibus) database. RESULTS By applying minimal redundancy maximal relevance and incremental feature selection, we identified 24 signature genes which can predict the TB activation. The support vector machine predictor based on these 24 genes had a sensitivity of 0.907, specificity of 0.913, and accuracy of 0.911, respectively. Although they need to be validated in a large independent dataset, the biological analysis of these 24 genes showed great promise. CONCLUSION We found that cytokine production was a key process during TB activation and genes like CYBB, TSPO, CD36, and STAT1 worth further investigation.
Collapse
Affiliation(s)
- Min Deng
- Department of Infectious Diseases, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China,
| | - Xiao-Dong Lv
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| | - Zhi-Xian Fang
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| | - Xin-Sheng Xie
- Department of Infectious Diseases, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China,
| | - Wen-Yu Chen
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| |
Collapse
|
17
|
Sheng M, Dong Z, Xie Y. Identification of tumor-educated platelet biomarkers of non-small-cell lung cancer. Onco Targets Ther 2018; 11:8143-8151. [PMID: 30532555 PMCID: PMC6241732 DOI: 10.2147/ott.s177384] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Lung cancer is a severe cancer with a high death rate. The 5-year survival rate for stage III lung cancer is much lower than stage I. Early detection and intervention of lung cancer patients can significantly increase their survival time. However, conventional lung cancer-screening methods, such as chest X-rays, sputum cytology, positron-emission tomography (PET), low-dose computed tomography (CT), magnetic resonance imaging, and gene-mutation, -methylation, and -expression biomarkers of lung tissue, are invasive, radiational, or expensive. Liquid biopsy is non-invasive and does little harm to the body. It can reflect early-stage dysfunctions of tumorigenesis and enable early detection and intervention. METHODS In this study, we analyzed RNA-sequencing data of tumor-educated platelets (TEPs) in 402 non-small-cell lung cancer (NSCLC) patients and 231 healthy controls. A total of 48 biomarker genes were selected with advanced minimal-redundancy, maximal-relevance, and incremental feature-selection (IFS) methods. RESULTS A support vector-machine (SVM) classifier based on the 48 biomarker genes accurately predicted NSCLC with leave-one-out cross-validation (LOOCV) sensitivity, specificity, accuracy, and Matthews correlation coefficients of 0.925, 0.827, 0.889, and 0.760, respectively. Network analysis of the 48 genes revealed that the WASF1 actin cytoskeleton module, PRKAB2 kinase module, RSRC1 ribosomal protein module, PDHB carbohydrate-metabolism module, and three intermodule hubs (TPM2, MYL9, and PPP1R12C) may play important roles in NSCLC tumorigenesis and progression. CONCLUSION The 48-gene TEP liquid-biopsy biomarkers will facilitate early screening of NSCLC and prolong the survival of cancer patients.
Collapse
Affiliation(s)
- Meiling Sheng
- Department of Respiration, Jinhua People's Hospital, Jinhua, Zhejiang 321000, China
| | - Zhaohui Dong
- Department of Intensive Care Unit, First Hospital of Huzhou, First Affiliated Hospital of Huzhou University, Huzhou, Zhejiang 313000, China
| | - Yanping Xie
- Department of Respiratory Medicine, First Hospital of Huzhou, First Affiliated Hospital of Huzhou University, Huzhou, Zhejiang 313000, China,
| |
Collapse
|
18
|
The early detection of asthma based on blood gene expression. Mol Biol Rep 2018; 46:217-223. [PMID: 30421126 DOI: 10.1007/s11033-018-4463-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 11/01/2018] [Indexed: 01/10/2023]
Abstract
Asthma is a complex heterogeneous disorder with hereditary tendency and the most widely used therapy is inhalation of anti-inflammatory corticosteroids. But it has systemic side effects. If the chronic inflammation can be detected in early stage, the dosage of corticosteroids will be low and the side effects can be avoided. Therefore, to discover the early stage blood biomarkers for asthma, we analyzed the gene expression profiles in the blood of 77 moderate asthma patients and 87 healthy controls. With advanced feature selection methods, minimal Redundancy Maximal Relevance and Incremental Feature Selection, we identified 31 genes, such as MYD88, ZFP36, CCR3 and CYP3A5, as the optimal asthma biomarker. The sensitivity, specificity and accuracy of the 31-gene Support Vector Machine predictor evaluated with Leave-One-Out Cross Validation were 0.870, 0.816 and 0.841, respectively. Through literature survey, many biomarker genes have asthma associated functions. Our results not only provided the easy-to-apply blood gene expression biomarkers for early detection of asthma, but also an explainable qualitative model with biological significance.
Collapse
|
19
|
Lin H, Qiu X, Zhang B, Zhang J. Identification of the predictive genes for the response of colorectal cancer patients to FOLFOX therapy. Onco Targets Ther 2018; 11:5943-5955. [PMID: 30271178 PMCID: PMC6149834 DOI: 10.2147/ott.s167656] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Background Colorectal cancer is a malignant tumor with high death rate. Chemotherapy, radiotherapy and surgery are the three common treatments of colorectal cancer. For early colorectal cancer patients, postoperative adjuvant chemotherapy can reduce the risk of recurrence. For advanced colorectal cancer patients, palliative chemotherapy can significantly improve the life quality of patients and prolong survival. FOLFOX is one of the mainstream chemotherapies in colorectal cancer, however, its response rate is only about 50%. Methods To systematically investigate why some of the colorectal cancer patients have response to FOLFOX therapy while others do not, we searched all publicly available database and combined three gene expression datasets of colorectal cancer patients with FOLFOX therapy. With advanced minimal redundancy maximal relevance and incremental feature selection method, we identified the biomarker genes. Results A Support Vector Machine-based classifier was constructed to predict the response of colorectal cancer patients to FOLFOX therapy. Its accuracy, sensitivity and specificity were 0.854, 0.845 and 0.863, respectively. Conclusion The biological analysis of representative biomarker genes suggested that apoptosis and inflammation signaling pathways were essential for the response of colorectal cancer patients to FOLFOX chemotherapy.
Collapse
Affiliation(s)
- Hengjun Lin
- Department of Tumor, Anus and Intestine, Jinhua People's Hospital, Jinhua, Zhejiang 321000, China,
| | - Xueke Qiu
- Department of Tumor, Anus and Intestine, Jinhua People's Hospital, Jinhua, Zhejiang 321000, China,
| | - Bo Zhang
- Department of Tumor, Anus and Intestine, Jinhua People's Hospital, Jinhua, Zhejiang 321000, China,
| | - Jichao Zhang
- Department of Tumor, Anus and Intestine, Jinhua People's Hospital, Jinhua, Zhejiang 321000, China,
| |
Collapse
|
20
|
Pan X, Hu X, Zhang YH, Chen L, Zhu L, Wan S, Huang T, Cai YD. Identification of the copy number variant biomarkers for breast cancer subtypes. Mol Genet Genomics 2018; 294:95-110. [PMID: 30203254 DOI: 10.1007/s00438-018-1488-4] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 09/03/2018] [Indexed: 01/07/2023]
Abstract
Breast cancer is a common and threatening malignant disease with multiple biological and clinical subtypes. It can be categorized into subtypes of luminal A, luminal B, Her2 positive, and basal-like. Copy number variants (CNVs) have been reported to be a potential and even better biomarker for cancer diagnosis than mRNA biomarkers, because it is considerably more stable and robust than gene expression. Thus, it is meaningful to detect CNVs of different cancers. To identify the CNV biomarker for breast cancer subtypes, we integrated the CNV data of more than 2000 samples from two large breast cancer databases, METABRIC and The Cancer Genome Atlas (TCGA). A Monte Carlo feature selection-based and incremental feature selection-based computational method was proposed and tested to identify the distinctive core CNVs in different breast cancer subtypes. We identified the CNV genes that may contribute to breast cancer tumorigenesis as well as built a set of quantitative distinctive rules for recognition of the breast cancer subtypes. The tenfold cross-validation Matthew's correlation coefficient (MCC) on METABRIC training set and the independent test on TCGA dataset were 0.515 and 0.492, respectively. The CNVs of PGAP3, GRB7, MIR4728, PNMT, STARD3, TCAP and ERBB2 were important for the accurate diagnosis of breast cancer subtypes. The findings reported in this study may further uncover the difference between different breast cancer subtypes and improve the diagnosis accuracy.
Collapse
Affiliation(s)
- Xiaoyong Pan
- College of Life Science, Shanghai University, Shanghai, 200444, People's Republic of China.,Department of Medical Informatics, Erasmus MC, Rotterdam, The Netherlands
| | - XiaoHua Hu
- Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China
| | - Yu-Hang Zhang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, People's Republic of China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai, 201306, People's Republic of China.,Shanghai Key Laboratory of PMMP, East China Normal University, Shanghai, 200241, People's Republic of China
| | - LiuCun Zhu
- College of Life Science, Shanghai University, Shanghai, 200444, People's Republic of China
| | - ShiBao Wan
- College of Life Science, Shanghai University, Shanghai, 200444, People's Republic of China
| | - Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, People's Republic of China.
| | - Yu-Dong Cai
- College of Life Science, Shanghai University, Shanghai, 200444, People's Republic of China.
| |
Collapse
|
21
|
Li J, Lan CN, Kong Y, Feng SS, Huang T. Identification and Analysis of Blood Gene Expression Signature for Osteoarthritis With Advanced Feature Selection Methods. Front Genet 2018; 9:246. [PMID: 30214455 PMCID: PMC6125376 DOI: 10.3389/fgene.2018.00246] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/22/2018] [Indexed: 12/15/2022] Open
Abstract
Osteoarthritis (OA) is a complex disease that affects articular joints and may cause disability. The incidence of OA is extremely high. Most elderly people have the symptoms of osteoarthritis. The physiotherapy of OA is time consuming, and the chances of full recovery from OA are very minimal. The most effective way of fighting OA is early diagnosis and early intervention. Liquid biopsy has become a popular noninvasive test. To find the blood gene expression signature for OA, we reanalyzed the publicly available blood gene expression profiles of 106 patients with OA and 33 control samples using an automatic computational pipeline based on advanced feature selection methods. Finally, a compact 23-gene set was identified. On the basis of these 23 genes, we constructed a Support Vector Machine (SVM) classifier and evaluated it with leave-one-out cross-validation. Its sensitivity (Sn), specificity (Sp), accuracy (ACC), and Mathew's correlation coefficient (MCC) were 0.991, 0.909, 0.971, and 0.920, respectively. Obviously, the performance needed to be validated in an independent large dataset, but the in-depth biological analysis of the 23 biomarkers showed great promise and suggested that mRNA surveillance pathway and multicellular organism growth played important roles in OA. Our results shed light on OA diagnosis through liquid biopsy.
Collapse
Affiliation(s)
- Jing Li
- Department of Rehabilitation, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Chun-Na Lan
- Department of Rehabilitation, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Ying Kong
- Department of Rehabilitation, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Song-Shan Feng
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| | - Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
22
|
Zhang TM, Huang T, Wang RF. Cross talk of chromosome instability, CpG island methylator phenotype and mismatch repair in colorectal cancer. Oncol Lett 2018; 16:1736-1746. [PMID: 30008861 PMCID: PMC6036478 DOI: 10.3892/ol.2018.8860] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Accepted: 05/22/2018] [Indexed: 12/20/2022] Open
Abstract
Colorectal cancer is a severe cancer associated with a high prevalence and fatality rate. There are three major mechanisms for colorectal cancer: (1) Chromosome instability (CIN), (2) CpG island methylator phenotype (CIMP) and (3) mismatch repair (MMR), of which CIN is the most common type. However, these subtypes are not exclusive and overlap. To investigate their biological mechanisms and cross talk, the gene expression profiles of 585 colorectal cancer patients with CIN, CIMP and MMR status records were collected. By comparing the CIN+ and CIN-samples, CIMP+ and CIMP-samples, MMR+ and MMR-samples with minimal redundancy maximal relevance (mRMR) and incremental feature selection (IFS) methods, the CIN, CIMP and MMR associated genes were selected. Unfortunately, there was little direct overlap among them. To investigate their indirect interactions, downstream genes of CIN, CIMP and MMR were identified using the random walk with restart (RWR) method and a greater overlap of downstream genes was indicated. The common downstream genes were involved in biosynthetic and metabolic pathways. These findings were consistent with the clinical observation of wide range metabolite aberrations in colorectal cancer. To conclude, the present study gave a gene level explanation of CIN, CIMP and MMR, but also showed the network level cross talk of CIN, CIMP and MMR. The common genes of CIN, CIMP and MMR may be useful for cross-subtype general colorectal cancer drug development.
Collapse
Affiliation(s)
- Tian-Ming Zhang
- Department of Colorectal and Anal Surgery, Jinhua Hospital of Zhejiang University, Jinhua, Zhejiang 321000, P.R. China
| | - Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, P.R. China
| | - Rong-Fei Wang
- Department of Colorectal and Anal Surgery, Jinhua People's Hospital, Jinhua, Zhejiang 321000, P.R. China
| |
Collapse
|
23
|
Proscillaridin A Promotes Oxidative Stress and ER Stress, Inhibits STAT3 Activation, and Induces Apoptosis in A549 Lung Adenocarcinoma Cells. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2018; 2018:3853409. [PMID: 29576846 PMCID: PMC5821950 DOI: 10.1155/2018/3853409] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Revised: 10/21/2017] [Accepted: 11/16/2017] [Indexed: 12/13/2022]
Abstract
Cardiac glycosides are natural compounds used for the treatment of cardiovascular disorders. Although originally prescribed for cardiovascular diseases, more recently, they have been rediscovered for their potential use in the treatment of cancer. Proscillaridin A (PSD-A), a cardiac glycoside component of Urginea maritima, has been reported to exhibit anticancer activity. However, the cellular targets and anticancer mechanism of PSD-A in various cancers including lung cancer remain largely unexplored. In the present study, we found that PSD-A inhibits growth and induces apoptosis in A549 lung adenocarcinoma cells. The anticancer activity of PSD-A was found to be associated with the activation of JNK, induction of ER stress, mitochondrial dysfunction, and inhibition of STAT3 activation. PSD-A induces oxidative stress as evidenced from ROS generation, GSH depletion, and decreased activity of TrxR1. PSD-A-mediated ER stress was verified by increased phosphorylation of eIF2α and expression of its downstream effector proteins ATF4, CHOP, and caspases-4. PSD-A triggered apoptosis by inducing JNK (1/2) activation, increasing bax/bcl-2 ratio, dissipating mitochondrial membrane potential, and inducing cleavage of caspases and PARP. Further study revealed that PSD-A inhibits both constitutive and inducible STAT3 activations and decreases STAT3 DNA-binding activity. Moreover, PSD-A-mediated inhibition of STAT3 activation was found to be associated with increased SHP-1 expression, decreased phosphorylation of Src, and binding of PSD-A with STAT3 SH2 domain. Finally, STAT3 knockdown by shRNA inhibited growth and enhanced apoptotic efficacy of PSD-A. Taken together, the data suggest that PSD-A could be developed into a potential therapeutic agent against lung adenocarcinoma.
Collapse
|
24
|
Qiu ZW, Bi JH, Gazdar AF, Song K. Genome-wide copy number variation pattern analysis and a classification signature for non-small cell lung cancer. Genes Chromosomes Cancer 2017; 56:559-569. [PMID: 28379620 DOI: 10.1002/gcc.22460] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Revised: 03/25/2017] [Accepted: 03/26/2017] [Indexed: 02/06/2023] Open
Abstract
The accurate classification of non-small cell lung carcinoma (NSCLC) into lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) is essential for both clinical practice and lung cancer research. Although the standard WHO diagnosis of NSCLC on biopsy material is rapid and economic, more than 13% of NSCLC tumors in the USA are not further classified. The purpose of this study was to analyze the genome-wide pattern differences in copy number variations (CNVs) and to develop a CNV signature as an adjunct test for the routine histopathologic classification of NSCLCs. We investigated the genome-wide CNV differences between these two tumor types using three independent patient datasets. Approximately half of the genes examined exhibited significant differences between LUAD and LUSC tumors and the corresponding non-malignant tissues. A new classifier was developed to identify signature genes out of 20 000 genes. Thirty-three genes were identified as a CNV signature of NSCLC. Using only their CNV values, the classification model separated the LUADs from the LUSCs with an accuracy of 0.88 and 0.84, respectively, in the training and validation datasets. The same signature also classified NSCLC tumors from their corresponding non-malignant samples with an accuracy of 0.96 and 0.98, respectively. We also compared the CNV patterns of NSCLC tumors with those of histologically similar tumors arising at other sites, such as the breast, head, and neck, and four additional tumors. Of greater importance, the significant differences between these tumors may offer the possibility of identifying the origin of tumors whose origin is unknown.
Collapse
Affiliation(s)
- Zhe-Wei Qiu
- School of Chemical Engineering and Technology, Tianjin University, 300072 Tianjin, People's Republic of China
| | - Jia-Hao Bi
- School of Chemical Engineering and Technology, Tianjin University, 300072 Tianjin, People's Republic of China
| | - Adi F Gazdar
- Hamon Center for Therapeutic Oncology, University of Texas Southwestern Medical Center, Dallas, Texas, 75390, USA.,Department of Pathology, University of Texas Southwestern Medical Center, Dallas, Texas, 75390, USA
| | - Kai Song
- School of Chemical Engineering and Technology, Tianjin University, 300072 Tianjin, People's Republic of China.,Hamon Center for Therapeutic Oncology, University of Texas Southwestern Medical Center, Dallas, Texas, 75390, USA
| |
Collapse
|
25
|
Huang T, Shu Y, Cai YD. Genetic differences among ethnic groups. BMC Genomics 2015; 16:1093. [PMID: 26690364 PMCID: PMC4687076 DOI: 10.1186/s12864-015-2328-0] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2015] [Accepted: 12/15/2015] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Many differences between different ethnic groups have been observed, such as skin color, eye color, height, susceptibility to some diseases, and response to certain drugs. However, the genetic bases of such differences have been under-investigated. Since the HapMap project, large-scale genotype data from Caucasian, African and Asian population samples have been available. The project found that these populations were located in different areas of the PCA (Principal Component Analysis) plot. However, as an unsupervised method, PCA does not measure the differences in each single nucleotide polymorphism (SNP) among populations. RESULTS We applied an advanced mutual information-based feature selection method to detect associations between SNP status and ethnic groups using the latest HapMap Phase 3 release version 3, which included more sub-populations. A total of 299 SNPs were identified, and they can accurately predicted the ethnicity of all HapMap populations. The 10-fold cross validation accuracy of the SMO (sequential minimal optimization) model on training dataset was 0.901, and the accuracy on independent test dataset was 0.895. CONCLUSIONS In-depth functional analysis of these SNPs and their nearby genes revealed the genetic bases of skin and eye color differences among populations.
Collapse
Affiliation(s)
- Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, P. R. China.
| | - Yang Shu
- Sate Key Laboratory of Biotherapy, Sichuan University, Sichuan, 610041, P. R. China.
| | - Yu-Dong Cai
- College of Life Science, Shanghai University, Shanghai, 200444, P. R. China.
| |
Collapse
|
26
|
Li F, Li C, Wang M, Webb GI, Zhang Y, Whisstock JC, Song J. GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome. Bioinformatics 2015; 31:1411-9. [DOI: 10.1093/bioinformatics/btu852] [Citation(s) in RCA: 129] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2014] [Accepted: 12/23/2014] [Indexed: 12/31/2022] Open
|
27
|
Cai Z, Xu D, Zhang Q, Zhang J, Ngai SM, Shao J. Classification of lung cancer using ensemble-based feature selection and machine learning methods. MOLECULAR BIOSYSTEMS 2014; 11:791-800. [PMID: 25512221 DOI: 10.1039/c4mb00659c] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Lung cancer is one of the leading causes of death worldwide. There are three major types of lung cancers, non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC) and carcinoid. NSCLC is further classified into lung adenocarcinoma (LADC), squamous cell lung cancer (SQCLC) as well as large cell lung cancer. Many previous studies demonstrated that DNA methylation has emerged as potential lung cancer-specific biomarkers. However, whether there exists a set of DNA methylation markers simultaneously distinguishing such three types of lung cancers remains elusive. In the present study, ROC (Receiving Operating Curve), RFs (Random Forests) and mRMR (Maximum Relevancy and Minimum Redundancy) were proposed to capture the unbiased, informative as well as compact molecular signatures followed by machine learning methods to classify LADC, SQCLC and SCLC. As a result, a panel of 16 DNA methylation markers exhibits an ideal classification power with an accuracy of 86.54%, 84.6% and a recall 84.37%, 85.5% in the leave-one-out cross-validation (LOOCV) and independent data set test experiments, respectively. Besides, comparison results indicate that ensemble-based feature selection methods outperform individual ones when combined with the incremental feature selection (IFS) strategy in terms of the informative and compact property of features. Taken together, results obtained suggest the effectiveness of the ensemble-based feature selection approach and the possible existence of a common panel of DNA methylation markers among such three types of lung cancer tissue, which would facilitate clinical diagnosis and treatment.
Collapse
Affiliation(s)
- Zhihua Cai
- Affiliated Cancer Hospital of Guangzhou Medical University, Guangzhou, Guangdong Province, China
| | | | | | | | | | | |
Collapse
|
28
|
Xu J, Wang L, Li J. Biological network module-based model for the analysis of differential expression in shotgun proteomics. J Proteome Res 2014; 13:5743-50. [PMID: 25327611 DOI: 10.1021/pr5007203] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Protein differential expression analysis plays an important role in the understanding of molecular mechanisms as well as the pathogenesis of complex diseases. With the rapid development of mass spectrometry, shotgun proteomics using spectral counts has become a prevailing method for the quantitative analysis of complex protein mixtures. Existing methods in differential proteomics expression typically carry out analysis at the single-protein level. However, it is well-known that proteins interact with each other when they function in biological processes. In this study, focusing on biological network modules, we proposed a negative binomial generalized linear model for differential expression analysis of spectral count data in shotgun proteomics. In order to show the efficacy of the model in protein expression analysis at the level of protein modules, we conducted two simulation studies using synthetic data sets generated from theoretical distribution of count data and a real data set with shuffled counts. Then, we applied our method to a colorectal cancer data set and a nonsmall cell lung cancer data set. When compared with single-protein analysis methods, the results showed that module-based statistical model which takes account of the interactions among proteins led to more effective identification of subtle but coordinated changes at the systems level.
Collapse
Affiliation(s)
- Jia Xu
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University , Shanghai 200240, People's Republic of China
| | | | | |
Collapse
|