1
|
Fahmy AM, Hammad MS, Mabrouk MS, Al-Atabany WI. On leveraging self-supervised learning for accurate HCV genotyping. Sci Rep 2024; 14:15463. [PMID: 38965254 PMCID: PMC11224313 DOI: 10.1038/s41598-024-64209-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 06/06/2024] [Indexed: 07/06/2024] Open
Abstract
Hepatitis C virus (HCV) is a major global health concern, affecting millions of individuals worldwide. While existing literature predominantly focuses on disease classification using clinical data, there exists a critical research gap concerning HCV genotyping based on genomic sequences. Accurate HCV genotyping is essential for patient management and treatment decisions. While the neural models excel at capturing complex patterns, they still face challenges, such as data scarcity, that exist a lot in computational genomics. To overcome this challenges, this paper introduces an advanced deep learning approach for HCV genotyping based on the graphical representation of nucleotide sequences that outperforms classical approaches. Notably, it is effective for both partial and complete HCV genomes and addresses challenges associated with imbalanced datasets. In this work, ten HCV genotypes: 1a, 1b, 2a, 2b, 2c, 3a, 3b, 4, 5, and 6 were used in the analysis. This study utilizes Chaos Game Representation for 2D mapping of genomic sequences, employing self-supervised learning using convolutional autoencoder for deep feature extraction, resulting in an outstanding performance for HCV genotyping compared to various machine learning and deep learning models. This baseline provides a benchmark against which the performance of the proposed approach and other models can be evaluated. The experimental results showcase a remarkable classification accuracy of over 99%, outperforming traditional deep learning models. This performance demonstrates the capability of the proposed model to accurately identify HCV genotypes in both partial and complete sequences and in dealing with data scarcity for certain genotypes. The results of the proposed model are compared to NCBI genotyping tool.
Collapse
Affiliation(s)
- Ahmed M Fahmy
- Computer Science program, School of Information Technology and Computer Science (ITCS), Nile University, Sheikh Zayed City, Egypt.
| | - Muhammed S Hammad
- Biomedical Engineering Department, Faculty of Engineering, Helwan University, Cairo, Egypt
| | - Mai S Mabrouk
- Biomedical informatics program, School of Information Technology and Computer Science (ITCS), Nile University, Sheikh Zayed City, Egypt
| | - Walid I Al-Atabany
- Biomedical informatics program, School of Information Technology and Computer Science (ITCS), Nile University, Sheikh Zayed City, Egypt
| |
Collapse
|
2
|
Arias PM, Butler J, Randhawa GS, Soltysiak MPM, Hill KA, Kari L. Environment and taxonomy shape the genomic signature of prokaryotic extremophiles. Sci Rep 2023; 13:16105. [PMID: 37752120 PMCID: PMC10522608 DOI: 10.1038/s41598-023-42518-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 09/11/2023] [Indexed: 09/28/2023] Open
Abstract
This study provides comprehensive quantitative evidence suggesting that adaptations to extreme temperatures and pH imprint a discernible environmental component in the genomic signature of microbial extremophiles. Both supervised and unsupervised machine learning algorithms were used to analyze genomic signatures, each computed as the k-mer frequency vector of a 500 kbp DNA fragment arbitrarily selected to represent a genome. Computational experiments classified/clustered genomic signatures extracted from a curated dataset of [Formula: see text] extremophile (temperature, pH) bacteria and archaea genomes, at multiple scales of analysis, [Formula: see text]. The supervised learning resulted in high accuracies for taxonomic classifications at [Formula: see text], and medium to medium-high accuracies for environment category classifications of the same datasets at [Formula: see text]. For [Formula: see text], our findings were largely consistent with amino acid compositional biases and codon usage patterns in coding regions, previously attributed to extreme environment adaptations. The unsupervised learning of unlabelled sequences identified several exemplars of hyperthermophilic organisms with large similarities in their genomic signatures, in spite of belonging to different domains in the Tree of Life.
Collapse
Affiliation(s)
- Pablo Millán Arias
- School of Computer Science, University of Waterloo, Waterloo, ON, Canada.
| | - Joseph Butler
- Department of Biology, University of Western Ontario, London, ON, Canada
| | - Gurjit S Randhawa
- School of Mathematical and Computational Sciences, University of Prince Edward Island, Charlottetown, PE, Canada
| | | | - Kathleen A Hill
- Department of Biology, University of Western Ontario, London, ON, Canada
| | - Lila Kari
- School of Computer Science, University of Waterloo, Waterloo, ON, Canada
| |
Collapse
|
3
|
Lichtblau D, Stoean C. Chaos game representation for authorship attribution. ARTIF INTELL 2023. [DOI: 10.1016/j.artint.2023.103858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
4
|
de la Fuente R, Díaz-Villanueva W, Arnau V, Moya A. Genomic Signature in Evolutionary Biology: A Review. BIOLOGY 2023; 12:biology12020322. [PMID: 36829597 PMCID: PMC9953303 DOI: 10.3390/biology12020322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/19/2023]
Abstract
Organisms are unique physical entities in which information is stored and continuously processed. The digital nature of DNA sequences enables the construction of a dynamic information reservoir. However, the distinction between the hardware and software components in the information flow is crucial to identify the mechanisms generating specific genomic signatures. In this work, we perform a bibliometric analysis to identify the different purposes of looking for particular patterns in DNA sequences associated with a given phenotype. This study has enabled us to make a conceptual breakdown of the genomic signature and differentiate the leading applications. On the one hand, it refers to gene expression profiling associated with a biological function, which may be shared across taxa. This signature is the focus of study in precision medicine. On the other hand, it also refers to characteristic patterns in species-specific DNA sequences. This interpretation plays a key role in comparative genomics, identifying evolutionary relationships. Looking at the relevant studies in our bibliographic database, we highlight the main factors causing heterogeneities in genome composition and how they can be quantified. All these findings lead us to reformulate some questions relevant to evolutionary biology.
Collapse
Affiliation(s)
- Rebeca de la Fuente
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Correspondence:
| | - Wladimiro Díaz-Villanueva
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
| | - Vicente Arnau
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
| | - Andrés Moya
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Foundation for the Promotion of Sanitary and Biomedical Research of the Valencian Community (FISABIO), 46020 Valencia, Spain
- CIBER in Epidemiology and Public Health (CIBEResp), 28029 Madrid, Spain
| |
Collapse
|
5
|
Ning Y, Li Y, Wang H. ANXA2 is a potential biomarker for cancer prognosis and immune infiltration: A systematic pan-cancer analysis. Front Genet 2023; 14:1108167. [PMID: 36713082 PMCID: PMC9877333 DOI: 10.3389/fgene.2023.1108167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 01/02/2023] [Indexed: 01/15/2023] Open
Abstract
Background: Annexin A2 (ANXA2) belongs to the Annexin A family and plays a role in epithelial-mesenchymal transition, fibrinolysis, and other physiological processes. Annexin A2 has been extensively implicated in tumorigenesis and development in previous studies, but its precise role in pan-cancer remains largely unknown. Methods: We adopted bioinformatics methods to explore the oncogenic role of Annexin A2 using different databases, including the Cancer Genome Atlas (TCGA), the Genotype-Tissue Expression (GTEx) biobank, the Human Protein Atlas (HPA), the Gene Expression Profiling Interaction Analysis (GEPIA) and cBioPortal. We analyzed the differential expression of Annexin A2 in different tumors and its relationship with cancer prognosis, immune cell infiltration, DNA methylation, tumor mutation burden (TMB), microsatellite instability (MSI) and mismatch repair (MMR). Furtherly, we conducted a Gene Set Enrichment Analysis (GSEA) to identify the Annexin A2-related pathways. Results: Annexin A2 expression was upregulated in most cancers, except in kidney chromophobe (KICH) and prostate adenocarcinoma (PRAD). Annexin A2 showed a good diagnostic efficacy in twelve types of cancer. The high expression of Annexin A2 was significantly associated with a reduced overall survival, disease-specific survival and progression-free interval in seven cancers. The Annexin A2 expression was variably associated with infiltration of 24 types of immune cells in 32 tumor microenvironments. In addition, Annexin A2 expression was differently associated with 47 immune checkpoints, immunoregulators, DNA methylation, tumor mutation burden, microsatellite instability and mismatch repair in pan-cancer. Gene Set Enrichment Analysis revealed that Annexin A2 was significantly correlated with immune-related pathways in fifteen cancers. Conclusion: Annexin A2 widely correlates with immune infiltration and may function as a promising prognostic biomarker in many tumors, showing its potential as a target for immunotherapy in pan-cancer.
Collapse
Affiliation(s)
- Yijie Ning
- Department of Neurosurgery, The First Hospital of Shanxi Medical University, Taiyuan, China
| | - Yufei Li
- Department of Neurosurgery, The Second Hospital of Shanxi Medical University, Taiyuan, China
| | - Hongqin Wang
- Department of Neurosurgery, The First Hospital of Shanxi Medical University, Taiyuan, China,*Correspondence: Hongqin Wang,
| |
Collapse
|
6
|
Xie XH, Huang YJ, Han GS, Yu ZG, Ma YL. Microbial characterization based on multifractal analysis of metagenomes. Front Cell Infect Microbiol 2023; 13:1117421. [PMID: 36779183 PMCID: PMC9910082 DOI: 10.3389/fcimb.2023.1117421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 01/09/2023] [Indexed: 01/28/2023] Open
Abstract
Introduction The species diversity of microbiomes is a cutting-edge concept in metagenomic research. In this study, we propose a multifractal analysis for metagenomic research. Method and Results Firstly, we visualized the chaotic game representation (CGR) of simulated metagenomes and real metagenomes. We find that metagenomes are visualized with self-similarity. Then we defined and calculated the multifractal dimension for the visualized plot of simulated and real metagenomes, respectively. By analyzing the Pearson correlation coefficients between the multifractal dimension and the traditional species diversity index, we obtain that the correlation coefficients between the multifractal dimension and the species richness index and Shannon diversity index reached the maximum value when q = 0, 1, and the correlation coefficient between the multifractal dimension and the Simpson diversity index reached the maximum value when q = 5. Finally, we apply our method to real metagenomes of the gut microbiota of 100 infants who are newborn and 4 and 12 months old. The results show that the multifractal dimensions of an infant's gut microbiomes can distinguish age differences. Conclusion and Discussion There is self-similarity among the CGRs of WGS of metagenomes, and the multifractal spectrum is an important characteristic for metagenomes. The traditional diversity indicators can be unified under the framework of multifractal analysis. These results coincided with similar results in macrobial ecology. The multifractal spectrum of infants' gut microbiomes are related to the development of the infants.
Collapse
Affiliation(s)
- Xian-hua Xie
- Key Laboratory of Jiangxi Province for Numerical Simulation and Emulation Techniques, Gannan Normal University, Ganzhoiu, China
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, China
- *Correspondence: Xian-hua Xie,
| | - Yu-jie Huang
- Key Laboratory of Jiangxi Province for Numerical Simulation and Emulation Techniques, Gannan Normal University, Ganzhoiu, China
| | - Guo-sheng Han
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, China
| | - Zu-guo Yu
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, China
| | - Yuan-lin Ma
- School of Economics, Zhengzhou University of Aeronautics, Zhengzhou, China
| |
Collapse
|
7
|
Lei K, Tan B, Liang R, Lyu Y, Wang K, Wang W, Wang K, Hu X, Wu D, Lin H, Wang M. Development and clinical validation of a necroptosis-related gene signature for prediction of prognosis and tumor immunity in lung adenocarcinoma. Am J Cancer Res 2022; 12:5160-5182. [PMID: 36504901 PMCID: PMC9729905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Accepted: 11/08/2022] [Indexed: 12/15/2022] Open
Abstract
Necroptosis is a new programmed formation of necrotizing cell death, which plays important role in tumor biological regulation, including tumorigenesis and immunity. In this study, we aimed to establish and validate a prediction model based on necroptosis-related genes (NRGs) for lung adenocarcinoma (LUAD) prognosis and tumor immunity. The training set consisted of samples from The Cancer Genome Atlas (TCGA) dataset (n = 334), and the validation sets consisted of samples from the Gene Expression Omnibus (GEO) (n = 439) and clinical (n = 20) datasets. Gene Oncology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis showed that 28 necroptosis-related differentially expressed genes (DEGs) were enriched in cell death and immune regulation. RT-qPCR and western blot results showed the low expression of necroptosis markers in LUAD cells. A prognostic gene signature based on 6 NRGs (PYGB, IL1A, IFNAR2, BIRC3, H2AFY2, and H2AFX) was constructed and the risk score was calculated. Multivariate Cox regression analysis showed that the risk score was an independent risk factor [hazard ratio (HR) = 1.220, 95% confidence interval (CI): 1.154-1.290, P<0.001]. In the TCGA cohort, a high-risk score was associated with poor prognosis, weak immune infiltration, and low expression at immune checkpoints, which was validated in the GEO and clinical cohorts. Our findings showed that the patients in the low-risk group had a better progression-free survival (PFS) [not reached vs. 8.5 months, HR = 0.18, 95% CI: 0.04-0.72, P<0.001] than those in the high-risk score group. Immunotherapy tolerance was found to be correlated with the high-risk score, and the risk score combined with PD-L1 (AUC = 0.808, 95% CI: 0.613-1.000) could better predict the immunotherapy response of LUAD. A nomogram was shown to have a strong ability to predict the individual survival rate of patients with LUAD in the TCGA and GSE68465 cohorts. We constructed and validated a potential prognostic signature consisting of 6 NRGs to predict the prognosis and tumor immunity of LUAD, which may be helpful to guide the individualized immunotherapy of LUAD.
Collapse
Affiliation(s)
- Kai Lei
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Binghua Tan
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Ruihao Liang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Yingcheng Lyu
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Kexi Wang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Wenjian Wang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Kefeng Wang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Xueting Hu
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Duoguang Wu
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Huayue Lin
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| | - Minghui Wang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China,Department of Thoracic Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen UniversityGuangzhou, Guangdong, China
| |
Collapse
|
8
|
Sun Q, Wang H, Xiao B, Xue D, Wang G. Development and Validation of a 6-Gene Hypoxia-Related Prognostic Signature For Cholangiocarcinoma. Front Oncol 2022; 12:954366. [PMID: 35924146 PMCID: PMC9339701 DOI: 10.3389/fonc.2022.954366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 06/21/2022] [Indexed: 11/22/2022] Open
Abstract
Cholangiocarcinoma (CHOL) is highly malignant and has a poor prognosis. This study is committed to creating a new prognostic model based on hypoxia related genes. Here, we established a novel tumor hypoxia-related prognostic model consisting of 6 hypoxia-related genes by univariate Cox regression and the least absolute shrinkage and selection operator (LASSO) algorithm to predict CHOL prognosis and then the risk score for each patient was calculated. The results showed that the patients with high-risk scores had poor prognosis compared with those with low-risk scores, which was verified as an independent predictor by multivariate analysis. The hypoxia-related prognostic model was validated in both TCGA and GEO cohorts and exhibited excellent performance in predicting overall survival in CHOL. The PPI results suggested that hypoxia-related genes involved in the model may play a central role in regulating the hypoxic state. In addition, the presence of IDH1 mutations in the high-risk group was high, and GSEA results showed that some metabolic pathways were upregulated, but immune response processes were generally downregulated. These factors may be potential reasons for the high-risk group with worse prognosis. The analysis of different immune regulation-related processes in the high- and low-risk groups revealed that the expression of genes related to immune checkpoints would show differences between these two groups. We further verified the expression of the oncogene PPFIA4 in the model, and found that compared with normal samples, CHOL patients were generally highly expressed, and the patients with high-expression of PPFIA4 had a poor prognosis. In summary, the present study may provide a valid prognostic model for bile duct cancer to inform better clinical management of patients.
Collapse
Affiliation(s)
- Qi Sun
- Department of General Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Huxia Wang
- Mammary Department, Shaanxi Provincial Cancer Hospital, Xi’an, China
| | - Baoan Xiao
- Department of General Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Dong Xue
- Department of General Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Guanghui Wang
- Department of General Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
- *Correspondence: Guanghui Wang,
| |
Collapse
|
9
|
Yin R, Luo Z, Kwoh CK. Exploring the Lethality of Human-Adapted Coronavirus Through Alignment-Free Machine Learning Approaches Using Genomic Sequences. Curr Genomics 2021; 22:583-595. [PMID: 35386190 PMCID: PMC8922323 DOI: 10.2174/1389202923666211221110857] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 12/02/2021] [Accepted: 12/14/2021] [Indexed: 11/29/2022] Open
Abstract
Background A newly emerging novel coronavirus appeared and rapidly spread worldwide and World Health Organization declared a pandemic on March 11, 2020. The roles and characteristics of coronavirus have captured much attention due to its power of causing a wide variety of infectious diseases, from mild to severe, on humans. The detection of the lethality of human coronavirus is key to estimate the viral toxicity and provide perspectives for treatment. Methods We developed an alignment-free framework that utilizes machine learning approaches for an ultra-fast and highly accurate prediction of the lethality of human-adapted coronavirus using genomic sequences. We performed extensive experiments through six different feature transformation and machine learning algorithms combining digital signal processing to identify the lethality of possible future novel coronaviruses using existing strains. Results The results tested on SARS-CoV, MERS-CoV and SARS-CoV-2 datasets show an average 96.7% prediction accuracy. We also provide preliminary analysis validating the effectiveness of our models through other human coronaviruses. Our framework achieves high levels of prediction performance that is alignment-free and based on RNA sequences alone without genome annotations and specialized biological knowledge. Conclusion The results demonstrate that, for any novel human coronavirus strains, this study can offer a reliable real-time estimation for its viral lethality.
Collapse
Affiliation(s)
- Rui Yin
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798, Singapore
- Department of Biomedical Informatics, Harvard University, Boston, MA 02138, USA
| | - Zihan Luo
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Chee Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798, Singapore
| |
Collapse
|
10
|
Duan L, Cao L, Zhang R, Niu L, Yang W, Feng W, Zhou W, Chen J, Wang X, Li Y, Zhang Y, Liu J, Zhao Q, Fan D, Hong L. Development and validation of a survival model for esophageal adenocarcinoma based on autophagy-associated genes. Bioengineered 2021; 12:3434-3454. [PMID: 34252349 PMCID: PMC8806464 DOI: 10.1080/21655979.2021.1946235] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 06/16/2021] [Indexed: 12/15/2022] Open
Abstract
Autophagy is a highly conserved catabolic process which has been implicated in esophageal adenocarcinoma (EAC). We sought to investigate the biological functions and prognostic value of autophagy-related genes (ARGs) in EAC. A total of 21 differentially expressed ARGs were identified between EAC and normal samples. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were then applied for the differentially expressed ARGs in EAC, and the protein-protein interaction (PPI) network was established. Cox survival analysis and Lasso regression analysis were performed to establish a prognostic prediction model based on nine overall survival (OS)-related ARGs (CAPN1, GOPC, TBK1, SIRT1, ARSA, BNIP1, ERBB2, NRG2, PINK1). The 9-gene prognostic signature significantly stratified patient outcomes in The Cancer Genome of Atlas (TCGA)-EAC cohort and was considered as an independently prognostic predictor for EAC patients. Moreover, Gene set enrichment analysis (GSEA) analyses revealed several important cellular processes and signaling pathways correlated with the high-risk group in EAC. This prognostic prediction model was confirmed in an independent validation cohort (GSE13898) from The Gene Expression Omnibus (GEO) database. We also developed a nomogram with a concordance index of 0.78 to predict the survival possibility of EAC patients by integrating the risk signature and clinicopathological features. The calibration curves substantiated favorable concordance between actual observation and nomogram prediction. Last but not least, Erb-B2 Receptor Tyrosine Kinase 2 (ERBB2), a member of the prognostic gene signature, was identified as a potential therapeutic target for EAC patients. To sum up, we established and verified a novel prognostic prediction model based on ARGs which could optimize the individualized survival prediction in EAC.
Collapse
Affiliation(s)
- Lili Duan
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Lu Cao
- Department of Biomedical Engineering, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Rui Zhang
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Liaoran Niu
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Wanli Yang
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Weibo Feng
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Wei Zhou
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Junfeng Chen
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Xiaoqian Wang
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Yiding Li
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Yujie Zhang
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Jinqiang Liu
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Qingchuan Zhao
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Daiming Fan
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| | - Liu Hong
- Division of Digestive Surgery, State Key Laboratory of Cancer Biology and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi’an, Shaanxi Province, China
| |
Collapse
|
11
|
Li Z, Du G, Zhao R, Yang W, Li C, Huang J, Wen Z, Li H, Zhang B. Identification and validation of a hypoxia-related prognostic signature in clear cell renal cell carcinoma patients. Medicine (Baltimore) 2021; 100:e27374. [PMID: 34596153 PMCID: PMC8483867 DOI: 10.1097/md.0000000000027374] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Accepted: 09/11/2021] [Indexed: 01/05/2023] Open
Abstract
Increasing evidence has shown that hypoxia is closely related to the development, progression, and prognosis of clear cell renal cell carcinoma (ccRCC). Nevertheless, reliable prognostic signatures based on hypoxia have not been well-established. This study aimed to establish a hypoxia-related prognostic signature and construct an optimized nomogram for patients with ccRCC.We accessed hallmark gene sets of hypoxia, including 200 genes, and an original RNA seq dataset of ccRCC cases with integrated clinical information obtained by mining the Cancer Genome Atlas database and the International Cancer Genome Consortium (ICGC) database. Univariate Cox regression analysis and multivariate Cox proportional hazards regression were performed to identify prognostic hub genes and further established prognostic model as well as visualized the nomogram. External validation of the optimized nomogram was performed in independent cohorts from the ICGC database.ANKZF1, ETS1, PLAUR, SERPINE1, FBP1, and PFKP were selected as prognostic hypoxia-related hub genes, and the prognostic model effectively distinguishes high-risk and low-risk patients with ccRCC. The results of receiver operating characteristic curve, risk plots, survival analysis, and independent analysis suggested that RiskScore was a useful tool and independent predictive factor. A novel prognosis nomogram optimized via RiskScore showed its promising performance in both the Cancer Genome Atlas-ccRCC cohort and an ICGC-ccRCC cohort.Our study reveals that the differential expressions of hypoxia-related genes are associated with the overall survival of patients with ccRCC. The prognostic model we established showed a good predictive and discerning ability in ccRCC patients. The novel nomogram optimized via RiskScore exhibited a promising predictive ability. It may be able to serve as a visualized tool for guiding clinical decisions and selecting effective individualized treatments.
Collapse
Affiliation(s)
| | - Gang Du
- Department of Bone and Joint Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Rong Zhao
- Guangxi Medical University, Nanning, China
| | | | - Chan Li
- Guangxi Medical University, Nanning, China
| | - Jun Huang
- Guangxi Medical University, Nanning, China
| | | | - Hening Li
- Guangxi Medical University, Nanning, China
| | - Bo Zhang
- Department of Orthopedics Trauma, The Third Affiliated Hospital of Guangxi Medical University, Nanning, China
| |
Collapse
|
12
|
Hu J, Xu J, Feng X, Li Y, Hua F, Xu G. Differential Expression of the TLR4 Gene in Pan-Cancer and Its Related Mechanism. Front Cell Dev Biol 2021; 9:700661. [PMID: 34631699 PMCID: PMC8495169 DOI: 10.3389/fcell.2021.700661] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 07/27/2021] [Indexed: 01/03/2023] Open
Abstract
Previous studies have revealed the relationship between toll-like receptor 4 (TLR4) polymorphisms and cancer susceptibility. However, the relationship between TLR4 and prognosis and immune cell infiltration in pan-cancer patients is still unclear. Through the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) databases, the distinct expression of the TLR4 gene in 24 tumors and normal tissues was analyzed. Univariate Cox proportional hazards regression analysis was used to identify the cancer types whose TLR4 gene expression was related to prognosis. The relationship between TLR4 and tumor cell immune invasion was studied. Spearman's rank correlation coefficient was used to analyze the relationship among TLR4 and immune neoantigens, tumor mutation burden (TMB), microsatellite instability (MSI), DNA repair genes, and DNA methylation. Gene Set Enrichment Analysis (GSEA) was used to identify the tumor-related pathways that the TLR4 gene was highly expressed in; the expression of the TLR4 gene was verified with the Human Protein Atlas (HPA) database. Low expression of TLR4 was associated with an inferior prognosis in kidney renal clear cell carcinoma (KIRC), skin cutaneous melanoma (SKCM), and uterine corpus endometrial carcinoma (UCEC), while high expression was related to a poor prognosis in head and neck squamous cell carcinoma (HNSC), prostate adenocarcinoma (PRAD), stomach adenocarcinoma (STAD), and testicular germ cell tumor (TGCT). The expression of TLR4 was negatively correlated with the expression of B cells in STAD. The expression of TLR4 was positively correlated with the infiltration of B cells, CD4 and CD8 T cells, neutrophils, macrophages, and dendritic cells in STAD, KIRC, UCEC, TGCT, and SKCM. The expression of the TLR4 gene in KIRC, SKCM, STAD, TGCT, and UCEC was highly correlated with inducible T-cell costimulator (ICOS), cytotoxic T lymphocyte-associated molecule 4 (CTLA4), and CD28 immune checkpoints. Spearman's rank correlation coefficient showed that the expression of TLR4 gene was significantly correlated with TMB in STAD and UCEC and was prominently correlated with MSI in TGCT, STAD, and SKCM. The expression of the TLR4 gene was highly correlated with MLH1, MSH2, and MSH6 in KIRC, SKCM, and STAD. The expression of the TLR4 gene was remarkably correlated with the methyltransferases DNA methyltransferase 2 (DNMT2) and DNA methyltransferase 3-beta (DNMT3B) in SKCM and STAD. Enrichment analysis showed that TLR4 was highly expressed in the chemokine signaling pathway and the cell adhesion molecule and cytokine receptor interaction pathway. In summary, the expression of TLR4 is linked to the prognosis of KIRC, SKCM, STAD, TGCT, and UCEC patients and the level of immune infiltration of CD4, CD8 T cells, macrophages, neutrophils, and dendritic cells.
Collapse
Affiliation(s)
- Jialing Hu
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, China
| | - Jiasheng Xu
- Department of Surgical Oncology, Zhejiang University Cancer Center, Hangzhou, China
| | - Xiaojin Feng
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, China
| | - Yiran Li
- Queen Mary College, Nanchang University, Nanchang, China
| | - Fuzhou Hua
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, China
| | - Guohai Xu
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, China
| |
Collapse
|
13
|
Shi H, Zhong F, Yi X, Shi Z, Ou F, Xu Z, Zuo Y. Application of an Autophagy-Related Gene Prognostic Risk Model Based on TCGA Database in Cervical Cancer. Front Genet 2021; 11:616998. [PMID: 33633773 PMCID: PMC7900625 DOI: 10.3389/fgene.2020.616998] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 12/22/2020] [Indexed: 12/24/2022] Open
Abstract
Background: Autophagy plays an important role in the development of cancer. However, the prognostic value of autophagy-related genes (ARGs) in cervical cancer (CC) is unclear. The purpose of this study is to construct a survival model for predicting the prognosis of CC patients based on ARG signature. Methods: ARGs were obtained from the Human Autophagy Database and Molecular Signatures Database. The expression profiles of ARGs and clinical data were downloaded from the TCGA database. Differential expression analysis of CC tissues and normal tissues was performed using R software to screen out ARGs with an aberrant expression. Univariate Cox, Lasso, and multivariate Cox regression analyses were used to construct a prognostic model which was validated by using the test set and the entire set. We also performed an independent prognostic analysis of risk score and some clinicopathological factors of CC. Finally, a clinical practical nomogram was established to predict individual survival probability. Results: Compared with normal tissues, there were 63 ARGs with an aberrant expression in CC tissues. A risk model based on 3 ARGs was finally obtained by Lasso and Cox regression analysis. Patients with high risk had significantly shorter overall survival (OS) than low-risk patients in both train set and validation set. The ROC curve validated its good performance in survival prediction, suggesting that this model has a certain extent sensitivity and specificity. Multivariate Cox analysis showed that the risk score was an independent prognostic factor. Finally, we mapped a nomogram to predict 1-, 3-, and 5-year survival for CC patients. The calibration curves indicated that the model was reliable. Conclusion: A risk prediction model based on CHMP4C, FOXO1, and RRAGB was successfully constructed, which could effectively predict the prognosis of CC patients. This model can provide a reference for CC patients to make precise treatment strategy.
Collapse
Affiliation(s)
- Huadi Shi
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Fulan Zhong
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Xiaoqiong Yi
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Zhenyi Shi
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Feiyan Ou
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Zumin Xu
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Yufang Zuo
- Cancer Center, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| |
Collapse
|
14
|
Qiu J, Sun M, Wang Y, Chen B. Identification and validation of an individualized autophagy-clinical prognostic index in gastric cancer patients. Cancer Cell Int 2020; 20:178. [PMID: 32477008 PMCID: PMC7240997 DOI: 10.1186/s12935-020-01267-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 05/14/2020] [Indexed: 12/24/2022] Open
Abstract
Background The purpose of this study is to perform bioinformatics analysis of autophagy-related genes in gastric cancer, and to construct a multi-gene joint signature for predicting the prognosis of gastric cancer. Methods GO and KEGG analysis were applied for differentially expressed autophagy-related genes in gastric cancer, and PPI network was constructed in Cytoscape software. In order to optimize the prognosis evaluation system of gastric cancer, we established a prognosis model integrating autophagy-related genes. We used single factor Cox proportional risk regression analysis to screen genes related to prognosis from 204 autophagy-related genes in The Atlas Cancer Genome (TCGA) gastric cancer cohort. Then, the generated genes were applied to the Least Absolute Shrinkage and Selection Operator (LASSO). Finally, the selected genes were further included in the multivariate Cox proportional hazard regression analysis to establish the prognosis model. According to the median risk score, patients were divided into high-risk group and low-risk group, and survival analysis was conducted to evaluate the prognostic value of risk score. Finally, by combining clinic-pathological features and prognostic gene signatures, a nomogram was established to predict individual survival probability. Results GO analysis showed that the 28 differently expressed autophagy-related genes was enriched in cell growth, neuron death, and regulation of cell growth. KEGG analysis showed that the 28 differently expressed autophagy-related genes were related to platinum drug resistance, apoptosis and p53 signaling pathway. The risk score was constructed based on 4 genes (GRID2, ATG4D,GABARAPL2, CXCR4), and gastric cancer patients were significantly divided into high-risk and low-risk groups according to overall survival. In multivariate Cox regression analysis, risk score was still an independent prognostic factor (HR = 1.922, 95% CI = 1.573–2.349, P < 0.001). Cumulative curve showed that the survival time of patients with low-risk score was significantly longer than that of patients with high-risk score (P < 0.001). The external data GSE62254 proved that nomograph had a great ability to evaluate the prognosis of individual gastric cancer patients. Conclusions This study provides a potential prognostic marker for predicting the prognosis of GC patients and the molecular biology of GC autophagy.
Collapse
Affiliation(s)
- Jieping Qiu
- 1Department of Clinical Medicine, The First Clinical College, Anhui Medical University, Hefei, China
| | - Mengyu Sun
- 1Department of Clinical Medicine, The First Clinical College, Anhui Medical University, Hefei, China
| | - Yaoqun Wang
- 1Department of Clinical Medicine, The First Clinical College, Anhui Medical University, Hefei, China
| | - Bo Chen
- 2Department of Gastrointestinal Surgery Center, The First Affiliated Hospital of Anhui Medical University, NO. 218 Jixi Road, Hefei, Anhui 230000 China
| |
Collapse
|
15
|
Lichtblau D. Alignment-free genomic sequence comparison using FCGR and signal processing. BMC Bioinformatics 2019; 20:742. [PMID: 31888438 PMCID: PMC6937637 DOI: 10.1186/s12859-019-3330-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 12/17/2019] [Indexed: 01/14/2023] Open
Abstract
Background Alignment-free methods of genomic comparison offer the possibility of scaling to large data sets of nucleotide sequences comprised of several thousand or more base pairs. Such methods can be used for purposes of deducing “nearby” species in a reference data set, or for constructing phylogenetic trees. Results We describe one such method that gives quite strong results. We use the Frequency Chaos Game Representation (FCGR) to create images from such sequences, We then reduce dimension, first using a Fourier trig transform, followed by a Singular Values Decomposition (SVD). This gives vectors of modest length. These in turn are used for fast sequence lookup, construction of phylogenetic trees, and classification of virus genomic data. We illustrate the accuracy and scalability of this approach on several benchmark test sets. Conclusions The tandem of FCGR and dimension reductions using Fourier-type transforms and SVD provides a powerful approach for alignment-free genomic comparison. Results compare favorably and often surpass best results reported in prior literature. Good scalability is also observed.
Collapse
|
16
|
An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomes. PLoS One 2018; 13:e0206409. [PMID: 30427878 PMCID: PMC6235296 DOI: 10.1371/journal.pone.0206409] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 10/14/2018] [Indexed: 01/11/2023] Open
Abstract
For many disease-causing virus species, global diversity is clustered into a taxonomy of subtypes with clinical significance. In particular, the classification of infections among the subtypes of human immunodeficiency virus type 1 (HIV-1) is a routine component of clinical management, and there are now many classification algorithms available for this purpose. Although several of these algorithms are similar in accuracy and speed, the majority are proprietary and require laboratories to transmit HIV-1 sequence data over the network to remote servers. This potentially exposes sensitive patient data to unauthorized access, and makes it impossible to determine how classifications are made and to maintain the data provenance of clinical bioinformatic workflows. We propose an open-source supervised and alignment-free subtyping method (Kameris) that operates on k-mer frequencies in HIV-1 sequences. We performed a detailed study of the accuracy and performance of subtype classification in comparison to four state-of-the-art programs. Based on our testing data set of manually curated real-world HIV-1 sequences (n = 2, 784), Kameris obtained an overall accuracy of 97%, which matches or exceeds all other tested software, with a processing rate of over 1,500 sequences per second. Furthermore, our fully standalone general-purpose software provides key advantages in terms of data security and privacy, transparency and reproducibility. Finally, we show that our method is readily adaptable to subtype classification of other viruses including dengue, influenza A, and hepatitis B and C virus.
Collapse
|
17
|
Nagahashi M, Shimada Y, Ichikawa H, Nakagawa S, Sato N, Kaneko K, Homma K, Kawasaki T, Kodama K, Lyle S, Takabe K, Wakai T. Formalin-fixed paraffin-embedded sample conditions for deep next generation sequencing. J Surg Res 2017; 220:125-132. [PMID: 29180174 PMCID: PMC5726294 DOI: 10.1016/j.jss.2017.06.077] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 06/26/2017] [Accepted: 06/28/2017] [Indexed: 12/16/2022]
Abstract
INTRODUCTION Precision medicine is only possible in oncology practice if targetable genes in fragmented DNA, such as DNA from formalin-fixed paraffin-embedded (FFPE) samples, can be sequenced using next generation sequencing (NGS). The aim of this study was to examine the quality and quantity of DNA from FFPE cancerous tissue samples from surgically resected and biopsy specimens. METHODS DNA was extracted from unstained FFPE tissue sections prepared from surgically resected specimens of breast, colorectal and gastric cancer, and biopsy specimens of breast cancer. A total quantity of DNA ≥60 ng from a sample was considered adequate for NGS. The DNA quality was assessed by Q-ratios, with a Q-ratio >0.1 considered sufficient for NGS. RESULTS The Q-ratio for DNA from FFPE tissue processed with neutral-buffered formalin was significantly better than that processed with unbuffered formalin. All Q-ratios for DNA from breast, colorectal and gastric cancer samples indicated DNA levels sufficient for NGS. DNA extracted from gastric cancer FFPE samples prepared within the last 7 years is suitable for NGS analysis, whereas those older than 7 years may not be suitable. Our data suggested that adequate amounts of DNA can be extracted from FFPE samples, not only of surgically resected tissue but also of biopsy specimens. CONCLUSIONS The type of formalin used for fixation and the time since FFPE sample preparation affect DNA quality. Sufficient amounts of DNA can be extracted from FFPE samples of both surgically resected and biopsy tissue, thus expanding the potential diagnostic uses of NGS in a clinical setting.
Collapse
Affiliation(s)
- Masayuki Nagahashi
- Division of Digestive and General Surgery, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Niigata, Japan.
| | - Yoshifumi Shimada
- Division of Digestive and General Surgery, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Niigata, Japan
| | - Hiroshi Ichikawa
- Division of Digestive and General Surgery, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Niigata, Japan
| | - Satoru Nakagawa
- Department of Surgery, Niigata Cancer Center Hospital, Niigata City, Niigata, Japan
| | - Nobuaki Sato
- Department of Breast Oncology, Niigata Cancer Center Hospital, Niigata City, Niigata, Japan
| | - Koji Kaneko
- Department of Breast Oncology, Niigata Cancer Center Hospital, Niigata City, Niigata, Japan
| | - Keiichi Homma
- Department of Pathology, Niigata Cancer Center Hospital, Niigata City, Niigata, Japan
| | - Takashi Kawasaki
- Department of Pathology, Niigata Cancer Center Hospital, Niigata City, Niigata, Japan
| | - Keisuke Kodama
- Diagnostics Research Department, Life innovation Research Institute, Denka innovation Center, Denka Co., Ltd, Machida City, Tokyo, Japan
| | - Stephen Lyle
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Medical School, Boston, Massachusetts; KEW, Inc, Boston, Massachusetts
| | - Kazuaki Takabe
- Breast Surgery, Department of Surgical Oncology, Roswell Park Cancer Institute, Buffalo, New York; Department of Surgery, University at Buffalo Jacobs School of Medicine and Biomedical Sciences, The State University of New York, Buffalo, New York
| | - Toshifumi Wakai
- Division of Digestive and General Surgery, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Niigata, Japan
| |
Collapse
|