1
|
González A, Fullaondo A, Odriozola A. Host genetics and microbiota data analysis in colorectal cancer research. ADVANCES IN GENETICS 2024; 112:31-81. [PMID: 39396840 DOI: 10.1016/bs.adgen.2024.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Colorectal cancer (CRC) is a heterogeneous disease with a complex aetiology influenced by a myriad of genetic and environmental factors. Despite advances in CRC research, it is a major burden of disease, with the second highest incidence and third leading cause of cancer deaths worldwide. To individualise diagnosis, prognosis, and treatment of CRC, developing new strategies combining precision medicine and bioinformatic procedures is promising. Precision medicine is based on omics technologies and aims to individualise the management of CRC based on patient host genetic characteristics and microbiota. Bioinformatics is central to the application of personalised medicine because it enables the analysis of large datasets generated by these technologies. At the level of host genetics, bioinformatics allows the identification of mutations, genes, molecular pathways, biomarkers and drugs relevant to colorectal carcinogenesis. At the microbiota level, bioinformatics is fundamental to analysing microbial communities' composition and functionality and developing biomarkers and personalised microbiota-based therapies. This paper explores the host and microbiota genetic data analysis in CRC research.
Collapse
Affiliation(s)
- Adriana González
- Hologenomics Research Group, Department of Genetics, Physical Anthropology, and Animal Physiology, University of the Basque Country, Spain
| | - Asier Fullaondo
- Hologenomics Research Group, Department of Genetics, Physical Anthropology, and Animal Physiology, University of the Basque Country, Spain
| | - Adrian Odriozola
- Hologenomics Research Group, Department of Genetics, Physical Anthropology, and Animal Physiology, University of the Basque Country, Spain.
| |
Collapse
|
2
|
Khurshid S, Usmani S, Ali R, Hamid S, Masoodi T, Sadida HQ, Ahmed I, Khan MS, Abeer I, Albalawi IA, Bedaiwi RI, Mir R, Al-Shabeeb Akil AS, Bhat AA, Macha MA. Integrating network analysis with differential expression to uncover therapeutic and prognostic biomarkers in esophageal squamous cell carcinoma. Front Mol Biosci 2024; 11:1425422. [PMID: 39234567 PMCID: PMC11371674 DOI: 10.3389/fmolb.2024.1425422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 07/30/2024] [Indexed: 09/06/2024] Open
Abstract
Introduction: Esophageal squamous cell carcinoma (ESCC) accounts for over 90% of all esophageal tumors. However, the molecular mechanism underlying ESCC development and prognosis remains unclear, and there are still no effective molecular biomarkers for diagnosing or predicting the clinical outcome of patients with ESCC. Here, we used bioinformatics analysis to identify potential biomarkers and therapeutic targets for ESCC. Methodology: Differentially expressed genes (DEGs) between ESCC and normal esophageal tissue samples were obtained by comprehensively analyzing publicly available RNA-seq datasets from the TCGA and GTEX. Gene Ontology (GO) annotation and Reactome pathway analysis identified the biological roles of the DEGs. Moreover, the Cytoscape 3.10.1 platform and subsidiary tools such as CytoHubba were used to visualize the DEGs' protein-protein interaction (PPI) network and identify hub genes, Furthermore our results are validated by using Single-cell RNA analysis. Results: Identification of 2524 genes exhibiting altered expression enriched in pathways including keratinization, epidermal cell differentiation, G alpha(s) signaling events, and biological process of cell proliferation and division, extracellular matrix (ECM) disassembly, and muscle function. Moreover, upregulation of hallmarks E2F targets, G2M checkpoints, and TNF signaling. CytoHubba revealed 20 hub genes that had a valuable influence on the progression of ESCC in these patients. Among these, the high expression levels of four genes, CDK1 MAD2L1, PLK1, and TOP2A, were associated with critical dependence for cell survival in ESCC cell lines, as indicated by CRISPR dependency scores, gene expression data, and cell line metadata. We also identify the molecules targeting these essential hub genes, among which GSK461364 is a promising inhibitor of PLK1, BMS265246, and Valrubicin inhibitors of CDK1 and TOP2A, respectively. Moreover, we identified that elevated expression of MMP9 is associated with worse overall survival in ESCC patients, which may serve as potential prognostic biomarker or therapeutic target for ESCC. The single-cell RNA analysis showed MMP9 is highly expressed in myeloid, fibroblast, and epithelial cells, but low in T cells, endothelial cells, and B cells. This suggests MMP9's role in tumor progression and matrix remodeling, highlighting its potential as a prognostic marker and therapeutic target. Discussion: Our study identified key hub genes in ESCC, assessing their potential as therapeutic targets and biomarkers through detailed expression and dependency analyses. Notably, MMP9 emerged as a significant prognostic marker with high expression correlating with poor survival, underscoring its potential for targeted therapy. These findings enhance our understanding of ESCC pathogenesis and highlight promising avenues for treatment.
Collapse
Affiliation(s)
- Sana Khurshid
- Watson-Crick Centre for Molecular Medicine, Islamic University of Science and Technology, Awantipora, India
| | - Shahabuddin Usmani
- Department of Human Genetics-Precision Medicine in Diabetes, Obesity and Cancer Program, Sidra Medicine, Doha, Qatar
| | - Raiyan Ali
- Council of Scientific and Industrial Research-Institute of Genomics and Integrative Biology, Delhi, India
| | - Saira Hamid
- Watson-Crick Centre for Molecular Medicine, Islamic University of Science and Technology, Awantipora, India
| | - Tariq Masoodi
- Human Immunology Department, Research Branch, Sidra Medicine, Doha, Qatar
| | - Hana Q Sadida
- Department of Human Genetics-Precision Medicine in Diabetes, Obesity and Cancer Program, Sidra Medicine, Doha, Qatar
| | - Ikhlak Ahmed
- Department of Human Genetics-Precision Medicine in Diabetes, Obesity and Cancer Program, Sidra Medicine, Doha, Qatar
| | - Mohd Shahnawaz Khan
- Department of Biochemistry, College of Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Inara Abeer
- Department of Pathology, Sker-I-Kashmir Institute of Medical Sciences, Srinagar, India
| | | | - Ruqaiah I Bedaiwi
- Faculty of Applied Medical Sciences, Medical Laboratory Technology, University of Tabuk, Tabuk, Saudi Arabia
| | - Rashid Mir
- Faculty of Applied Medical Sciences, Medical Laboratory Technology, University of Tabuk, Tabuk, Saudi Arabia
| | - Ammira S Al-Shabeeb Akil
- Department of Human Genetics-Precision Medicine in Diabetes, Obesity and Cancer Program, Sidra Medicine, Doha, Qatar
| | - Ajaz A Bhat
- Department of Human Genetics-Precision Medicine in Diabetes, Obesity and Cancer Program, Sidra Medicine, Doha, Qatar
| | - Muzafar A Macha
- Watson-Crick Centre for Molecular Medicine, Islamic University of Science and Technology, Awantipora, India
| |
Collapse
|
3
|
Álvarez VE, Quiroga MP, Centrón D. Identification of a Specific Biomarker of Acinetobacter baumannii Global Clone 1 by Machine Learning and PCR Related to Metabolic Fitness of ESKAPE Pathogens. mSystems 2023:e0073422. [PMID: 37184409 DOI: 10.1128/msystems.00734-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023] Open
Abstract
Since the emergence of high-risk clones worldwide, constant investigations have been undertaken to comprehend the molecular basis that led to their prevalent dissemination in nosocomial settings over time. So far, the complex and multifactorial genetic traits of this type of epidemic clones have allowed only the identification of biomarkers with low specificity. A machine learning algorithm was able to recognize unequivocally a biomarker for early and accurate detection of Acinetobacter baumannii global clone 1 (GC1), one of the most disseminated high-risk clones. A support vector machine model identified the U1 sequence with a length of 367 nucleotides that matched a fragment of the moaCB gene, which encodes the molybdenum cofactor biosynthesis C and B proteins. U1 differentiates specifically between A. baumannii GC1 and non-GC1 strains, becoming a suitable biomarker capable of being translated into clinical settings as a molecular typing method for early diagnosis based on PCR as shown here. Since the metabolic pathways of Mo enzymes have been recognized as putative therapeutic targets for ESKAPE (Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species) pathogens, our findings highlight that machine learning can also be useful in knowledge gaps of high-risk clones and provides noteworthy support to the literature to identify relevant nosocomial biomarkers for other multidrug-resistant high-risk clones. IMPORTANCE A. baumannii GC1 is an important high-risk clone that rapidly develops extreme drug resistance in the nosocomial niche. Furthermore, several strains have been identified worldwide in environmental samples, exacerbating the risk of human interactions. Early diagnosis is mandatory to limit its dissemination and to outline appropriate antibiotic stewardship schedules. A region with a length of 367 bp (U1) within the moaCB gene that is not subjected to lateral genetic transfer or to antibiotic pressures was successfully found by a support vector machine model that predicts A. baumannii GC1 strains. At the same time, research on the group of Mo enzymes proposed this metabolic pathway related to the superbug's metabolism as a potential future drug target site for ESKAPE pathogens due to its central role in bacterial fitness during infection. These findings confirm that machine learning used for the identification of biomarkers of high-risk lineages can also serve to identify putative novel therapeutic target sites.
Collapse
Affiliation(s)
- Verónica Elizabeth Álvarez
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - María Paula Quiroga
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
- Nodo de Bioinformática. Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Técnicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - Daniela Centrón
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| |
Collapse
|
4
|
Yang X, Wang P, Yan S, Wang G. Study on potential differentially expressed genes in stroke by bioinformatics analysis. Neurol Sci 2021; 43:1155-1166. [PMID: 34313877 PMCID: PMC8789718 DOI: 10.1007/s10072-021-05470-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Accepted: 07/07/2021] [Indexed: 11/29/2022]
Abstract
Stroke is a sudden cerebrovascular circulatory disorder with high morbidity, disability, mortality, and recurrence rate, but its pathogenesis and key genes are still unclear. In this study, bioinformatics was used to deeply analyze the pathogenesis of stroke and related key genes, so as to study the potential pathogenesis of stroke and provide guidance for clinical treatment. Gene Expression profiles of GSE58294 and GSE16561 were obtained from Gene Expression Omnibus (GEO), the differentially expressed genes (DEGs) were identified between IS and normal control group. The different expression genes (DEGs) between IS and normal control group were screened with the GEO2R online tool. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses of the DEGs were performed. Using the Database for Annotation, Visualization and Integrated Discovery (DAVID) and gene set enrichment analysis (GSEA), the function and pathway enrichment analysis of DEGS were performed. Then, a protein–protein interaction (PPI) network was constructed via the Search Tool for the Retrieval of Interacting Genes (STRING) database. Cytoscape with CytoHubba were used to identify the hub genes. Finally, NetworkAnalyst was used to construct the targeted microRNAs (miRNAs) of the hub genes. A total of 85 DEGs were screened out in this study, including 65 upward genes and 20 downward genes. In addition, 3 KEGG pathways, cytokine − cytokine receptor interaction, hematopoietic cell lineage, B cell receptor signaling pathway, were significantly enriched using a database for labeling, visualization, and synthetic discovery. In combination with the results of the PPI network and CytoHubba, 10 hub genes including CEACAM8, CD19, MMP9, ARG1, CKAP4, CCR7, MGAM, CD79A, CD79B, and CLEC4D were selected. Combined with DEG-miRNAs visualization, 5 miRNAs, including hsa-mir-146a-5p, hsa-mir-7-5p, hsa-mir-335-5p, and hsa-mir-27a- 3p, were predicted as possibly the key miRNAs. Our findings will contribute to identification of potential biomarkers and novel strategies for the treatment of ischemic stroke, and provide a new strategy for clinical therapy.
Collapse
Affiliation(s)
- Xitong Yang
- Genetic Testing Center, The First Affiliated Hospital of Dali University, Dali, 671000, Yunnan, China
| | - Pengyu Wang
- Genetic Testing Center, The First Affiliated Hospital of Dali University, Dali, 671000, Yunnan, China
| | - Shanquan Yan
- Genetic Testing Center, The First Affiliated Hospital of Dali University, Dali, 671000, Yunnan, China
| | - Guangming Wang
- Genetic Testing Center, The First Affiliated Hospital of Dali University, Dali, 671000, Yunnan, China.
| |
Collapse
|