1
|
Liu J, Zhu J, Hao H, Bi J, Hou H, Zhang G. Transcriptomic and Molecular Docking Analysis Reveal Virulence Gene Regulation-Mediated Antibacterial Effects of Benzyl Isothiocyanate Against Staphylococcus aureus. Appl Biochem Biotechnol 2024:10.1007/s12010-024-04938-y. [PMID: 38709426 DOI: 10.1007/s12010-024-04938-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/16/2024] [Indexed: 05/07/2024]
Abstract
Staphylococcus aureus (S. aureus) is a common pathogen that can cause many serious infections. Thus, efficient and practical techniques to fight S. aureus are required. In this study, transcriptomics was used to evaluate changes in S. aureus following treatment with benzyl isothiocyanate (BITC) to determine its antibacterial action. The results revealed that the BITC at subinhibitory concentrations (1/8th MIC) treated group had 94 differentially expressed genes compared to the control group, with 52 downregulated genes. Moreover, STRING analyses were used to reveal the protein interactions encoded by 36 genes. Then, we verified three significant virulence genes by qRT-PCR, including capsular polysaccharide synthesis enzyme (cp8F), capsular polysaccharide biosynthesis protein (cp5D), and thermonuclease (nuc). Furthermore, molecular docking analysis was performed to investigate the action site of BITC with the encoded proteins of cp8F, cp5D, and nuc. The results showed that the docking fraction of BITC with selected proteins ranged from - 6.00 to - 6.60 kcal/mol, predicting the stability of these complexes. BITC forms hydrophobic, hydrogen-bonded, π-π conjugated interactions with amino acids TRP (130), GLY (10), ILE (406), LYS (368), TYR (192), and ARG (114) of these proteins. These findings will aid future research into the antibacterial effects of BITC against S. aureus.
Collapse
Affiliation(s)
- Jianan Liu
- School of Food Science and Technology, Dalian Polytechnic University, Dalian, 116034, China
| | - Junya Zhu
- Jinkui Food Science and Technology (Dalian) Co., Ltd, Dalian, 116000, China
| | - Hongshun Hao
- Department of Inorganic Nonmetallic Materials Engineering, Dalian Polytechnic University, Dalian, 116034, China
| | - Jingran Bi
- School of Food Science and Technology, Dalian Polytechnic University, Dalian, 116034, China
| | - Hongman Hou
- School of Food Science and Technology, Dalian Polytechnic University, Dalian, 116034, China
| | - Gongliang Zhang
- School of Food Science and Technology, Dalian Polytechnic University, Dalian, 116034, China.
| |
Collapse
|
2
|
Tsare EPG, Klapa MI, Moschonas NK. Protein-protein interaction network-based integration of GWAS and functional data for blood pressure regulation analysis. Hum Genomics 2024; 18:15. [PMID: 38326862 DOI: 10.1186/s40246-023-00565-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/12/2023] [Indexed: 02/09/2024] Open
Abstract
BACKGROUND It is valuable to analyze the genome-wide association studies (GWAS) data for a complex disease phenotype in the context of the protein-protein interaction (PPI) network, as the related pathophysiology results from the function of interacting polyprotein pathways. The analysis may include the design and curation of a phenotype-specific GWAS meta-database incorporating genotypic and eQTL data linking to PPI and other biological datasets, and the development of systematic workflows for PPI network-based data integration toward protein and pathway prioritization. Here, we pursued this analysis for blood pressure (BP) regulation. METHODS The relational scheme of the implemented in Microsoft SQL Server BP-GWAS meta-database enabled the combined storage of: GWAS data and attributes mined from GWAS Catalog and the literature, Ensembl-defined SNP-transcript associations, and GTEx eQTL data. The BP-protein interactome was reconstructed from the PICKLE PPI meta-database, extending the GWAS-deduced network with the shortest paths connecting all GWAS-proteins into one component. The shortest-path intermediates were considered as BP-related. For protein prioritization, we combined a new integrated GWAS-based scoring scheme with two network-based criteria: one considering the protein role in the reconstructed by shortest-path (RbSP) interactome and one novel promoting the common neighbors of GWAS-prioritized proteins. Prioritized proteins were ranked by the number of satisfied criteria. RESULTS The meta-database includes 6687 variants linked with 1167 BP-associated protein-coding genes. The GWAS-deduced PPI network includes 1065 proteins, with 672 forming a connected component. The RbSP interactome contains 1443 additional, network-deduced proteins and indicated that essentially all BP-GWAS proteins are at most second neighbors. The prioritized BP-protein set was derived from the union of the most BP-significant by any of the GWAS-based or the network-based criteria. It included 335 proteins, with ~ 2/3 deduced from the BP PPI network extension and 126 prioritized by at least two criteria. ESR1 was the only protein satisfying all three criteria, followed in the top-10 by INSR, PTN11, CDK6, CSK, NOS3, SH2B3, ATP2B1, FES and FINC, satisfying two. Pathway analysis of the RbSP interactome revealed numerous bioprocesses, which are indeed functionally supported as BP-associated, extending our understanding about BP regulation. CONCLUSIONS The implemented workflow could be used for other multifactorial diseases.
Collapse
Affiliation(s)
- Evridiki-Pandora G Tsare
- Department of General Biology, School of Medicine, University of Patras, Patras, Greece
- Metabolic Engineering and Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research and Technology-Hellas (FORTH/ICE-HT), Patras, Greece
| | - Maria I Klapa
- Metabolic Engineering and Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research and Technology-Hellas (FORTH/ICE-HT), Patras, Greece.
| | - Nicholas K Moschonas
- Department of General Biology, School of Medicine, University of Patras, Patras, Greece.
- Metabolic Engineering and Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research and Technology-Hellas (FORTH/ICE-HT), Patras, Greece.
| |
Collapse
|
3
|
Yao D, Mei S, Tang W, Xu X, Lu Q, Shi Z. AAAKB: A manually curated database for tracking and predicting genes of Abdominal aortic aneurysm (AAA). PLoS One 2023; 18:e0289966. [PMID: 38100461 PMCID: PMC10723669 DOI: 10.1371/journal.pone.0289966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 07/31/2023] [Indexed: 12/17/2023] Open
Abstract
Abdominal aortic aneurysm (AAA), an extremely dangerous vascular disease with high mortality, causes massive internal bleeding due to aneurysm rupture. To boost the research on AAA, efforts should be taken to organize and link the information about AAA-related genes and their functions. Currently, most researchers screen through genetic databases manually, which is cumbersome and time-consuming. Here, we developed "AAAKB" a manually curated knowledgebase containing genes, SNPs and pathways associated with AAA. In order to facilitate researchers to further explore the mechanism network of AAA, AAAKB provides predicted genes that are potentially associated with AAA. The prediction is based on the protein interaction information of genes collected in the database, and the random forest algorithm (RF) is used to build the prediction model. Some of these predicted genes are differentially expressed in patients with AAA, and some have been reported to play a role in other cardiovascular diseases, illustrating the utility of the knowledgebase in predicting novel genes. Also, AAAKB integrates a protein interaction visualization tool to quickly determine the shortest paths between target proteins. As the first knowledgebase to provide a comprehensive catalog of AAA-related genes, AAAKB will be an ideal research platform for AAA. Database URL: http://www.lqlgroup.cn:3838/AAAKB/.
Collapse
Affiliation(s)
- Di Yao
- Institute of Industrial Internet and Internet of Things, China Academy of Information and Communications Technology (CAICT), China
| | - Shuyuan Mei
- Key Laboratory of Cardiovascular and Cerebrovascular Medicine, School of Pharmacy, Nanjing Medical University, Nanjing, China
| | - Wangyang Tang
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Xingyu Xu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Qiulun Lu
- Key Laboratory of Cardiovascular and Cerebrovascular Medicine, School of Pharmacy, Nanjing Medical University, Nanjing, China
| | - Zhiguang Shi
- Key Laboratory of Cardiovascular and Cerebrovascular Medicine, School of Pharmacy, Nanjing Medical University, Nanjing, China
| |
Collapse
|
4
|
Armaos A, Serra F, Núñez-Carpintero I, Seo JH, Baca SC, Gustincich S, Valencia A, Freedman ML, Cirillo D, Giambartolomei C, Tartaglia GG. The PENGUIN approach to reconstruct protein interactions at enhancer-promoter regions and its application to prostate cancer. Nat Commun 2023; 14:8084. [PMID: 38057321 PMCID: PMC10700545 DOI: 10.1038/s41467-023-43767-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 11/18/2023] [Indexed: 12/08/2023] Open
Abstract
We introduce Promoter-Enhancer-Guided Interaction Networks (PENGUIN), a method for studying protein-protein interaction (PPI) networks within enhancer-promoter interactions. PENGUIN integrates H3K27ac-HiChIP data with tissue-specific PPIs to define enhancer-promoter PPI networks (EPINs). We validated PENGUIN using cancer (LNCaP) and benign (LHSAR) prostate cell lines. Our analysis detected EPIN clusters enriched with the architectural protein CTCF, a regulator of enhancer-promoter interactions. CTCF presence was coupled with the prevalence of prostate cancer (PrCa) single nucleotide polymorphisms (SNPs) within the same EPIN clusters, suggesting functional implications in PrCa. Within the EPINs displaying enrichments in both CTCF and PrCa SNPs, we also show enrichment in oncogenes. We substantiated our identified SNPs through CRISPR/Cas9 knockout and RNAi screens experiments. Here we show that PENGUIN provides insights into the intricate interplay between enhancer-promoter interactions and PPI networks, which are crucial for identifying key genes and potential intervention targets. A dedicated server is available at https://penguin.life.bsc.es/ .
Collapse
Affiliation(s)
- Alexandros Armaos
- Istituto Italiano di Tecnologia, CHT@Erzelli, Via Enrico Melen 83, Building B, 7th floor, 16152, Genova, Italy
| | - François Serra
- Barcelona Supercomputing Center, Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
- Josep Carreras Leukaemia Research Institute, Ctra de Can Ruti, Camí de les Escoles, 08916, Badalona, Barcelona, Spain
| | | | - Ji-Heui Seo
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Sylvan C Baca
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Stefano Gustincich
- Istituto Italiano di Tecnologia, CHT@Erzelli, Via Enrico Melen 83, Building B, 7th floor, 16152, Genova, Italy
| | - Alfonso Valencia
- Barcelona Supercomputing Center, Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
- ICREA - Institució Catalana de Recerca I Estudis Avançats, Pg. Lluís Companys 23, 08010, Barcelona, Spain
| | - Matthew L Freedman
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA, 02215, USA
- Eli and Edythe L. Broad Institute, 415 Main St., Cambridge, MA, 02142, USA
| | - Davide Cirillo
- Barcelona Supercomputing Center, Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain.
| | - Claudia Giambartolomei
- Istituto Italiano di Tecnologia, CHT@Erzelli, Via Enrico Melen 83, Building B, 7th floor, 16152, Genova, Italy.
- Health Data Science Centre, Human Technopole, Milan, Italy.
| | - Gian Gaetano Tartaglia
- Istituto Italiano di Tecnologia, CHT@Erzelli, Via Enrico Melen 83, Building B, 7th floor, 16152, Genova, Italy.
- ICREA - Institució Catalana de Recerca I Estudis Avançats, Pg. Lluís Companys 23, 08010, Barcelona, Spain.
- Istituto Italiano di Tecnologia, CNLS@Sapienza, Viale Regina Elena, 00161, Rome, Italy.
| |
Collapse
|
5
|
Song E. Persistent homology analysis of type 2 diabetes genome-wide association studies in protein-protein interaction networks. Front Genet 2023; 14:1270185. [PMID: 37823029 PMCID: PMC10562725 DOI: 10.3389/fgene.2023.1270185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 09/12/2023] [Indexed: 10/13/2023] Open
Abstract
Genome-wide association studies (GWAS) involving increasing sample sizes have identified hundreds of genetic variants associated with complex diseases, such as type 2 diabetes (T2D); however, it is unclear how GWAS hits form unique topological structures in protein-protein interaction (PPI) networks. Using persistent homology, this study explores the evolution and persistence of the topological features of T2D GWAS hits in the PPI network with increasing p-value thresholds. We define an n-dimensional persistent disease module as a higher-order generalization of the largest connected component (LCC). The 0-dimensional persistent T2D disease module is the LCC of the T2D GWAS hits, which is significantly detected in the PPI network (196 nodes and 235 edges, P< 0.05). In the 1-dimensional homology group analysis, all 18 1-dimensional holes (loops) of the T2D GWAS hits persist over all p-value thresholds. The 1-dimensional persistent T2D disease module comprising these 18 persistent 1-dimensional holes is significantly larger than that expected by chance (59 nodes and 83 edges, P< 0.001), indicating a significant topological structure in the PPI network. Our computational topology framework potentially possesses broad applicability to other complex phenotypes in identifying topological features that play an important role in disease pathobiology.
Collapse
Affiliation(s)
- Euijun Song
- Yonsei University College of Medicine, Seoul, Republic of Korea
- Present: Independent Researcher, Gyeonggi, Republic of Korea
| |
Collapse
|
6
|
Rancelis T, Domarkiene I, Ambrozaityte L, Utkus A. Implementing Core Genes and an Omnigenic Model for Behaviour Traits Prediction in Genomics. Genes (Basel) 2023; 14:1630. [PMID: 37628681 PMCID: PMC10454355 DOI: 10.3390/genes14081630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 08/12/2023] [Accepted: 08/14/2023] [Indexed: 08/27/2023] Open
Abstract
A high number of genome variants are associated with complex traits, mainly due to genome-wide association studies (GWAS). Using polygenic risk scores (PRSs) is a widely accepted method for calculating an individual's complex trait prognosis using such data. Unlike monogenic traits, the practical implementation of complex traits by applying this method still falls behind. Calculating PRSs from all GWAS data has limited practical usability in behaviour traits due to statistical noise and the small effect size from a high number of genome variants involved. From a behaviour traits perspective, complex traits are explored using the concept of core genes from an omnigenic model, aiming to employ a simplified calculation version. Simplification may reduce the accuracy compared to a complete PRS encompassing all trait-associated variants. Integrating genome data with datasets from various disciplines, such as IT and psychology, could lead to better complex trait prediction. This review elucidates the significance of clear biological pathways in understanding behaviour traits. Specifically, it highlights the essential role of genes related to hormones, enzymes, and neurotransmitters as robust core genes in shaping these traits. Significant variations in core genes are prominently observed in behaviour traits such as stress response, impulsivity, and substance use.
Collapse
Affiliation(s)
- Tautvydas Rancelis
- Institute of Biomedical Sciences, Faculty of Medicine, Vilnius University, Santariskiu Str. 2, LT-08661 Vilnius, Lithuania; (I.D.); (L.A.); (A.U.)
| | | | | | | |
Collapse
|
7
|
Barroso GV, Lohmueller KE. Inferring the mode and strength of ongoing selection. Genome Res 2023; 33:632-643. [PMID: 37055196 PMCID: PMC10234300 DOI: 10.1101/gr.276386.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/29/2023] [Indexed: 04/15/2023]
Abstract
Genome sequence data are no longer scarce. The UK Biobank alone comprises 200,000 individual genomes, with more on the way, leading the field of human genetics toward sequencing entire populations. Within the next decades, other model organisms will follow suit, especially domesticated species such as crops and livestock. Having sequences from most individuals in a population will present new challenges for using these data to improve health and agriculture in the pursuit of a sustainable future. Existing population genetic methods are designed to model hundreds of randomly sampled sequences but are not optimized for extracting the information contained in the larger and richer data sets that are beginning to emerge, with thousands of closely related individuals. Here we develop a new method called trio-based inference of dominance and selection (TIDES) that uses data from tens of thousands of family trios to make inferences about natural selection acting in a single generation. TIDES further improves on the state of the art by making no assumptions regarding demography, linkage, or dominance. We discuss how our method paves the way for studying natural selection from new angles.
Collapse
Affiliation(s)
- Gustavo V Barroso
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| |
Collapse
|
8
|
Multi-omics peripheral and core regions of cancer. NPJ Syst Biol Appl 2022; 8:47. [PMID: 36446819 PMCID: PMC9707100 DOI: 10.1038/s41540-022-00258-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 11/07/2022] [Indexed: 11/30/2022] Open
Abstract
Thousands of genes are perturbed by cancer, and these disturbances can be seen in transcriptome, methylation, somatic mutation, and copy number variation omics studies. Understanding their connectivity patterns as an omnigenic neighbourhood in a molecular interaction network (interactome) is a key step towards advancing knowledge of the molecular mechanisms underlying cancers. Here, we introduce a unified connectivity line (CLine) to pinpoint omics-specific omnigenic patterns across 15 curated cancers. Taking advantage of the universality of CLine, we distinguish the peripheral and core genes for each omics aspect. We propose a network-based framework, multi-omics periphery and core (MOPC), to combine peripheral and core genes from different omics into a button-like structure. On the basis of network proximity, we provide evidence that core genes tend to be specifically perturbed in one omics, but the peripheral genes are diversely perturbed in multiple omics. And the core of one omics is regulated by multiple omics peripheries. Finally, we take the MOPC as an omnigenic neighbourhood, describe its characteristics, and explore its relative contribution to network-based mechanisms of cancer. We were able to present how multi-omics perturbations percolate through the human interactome and contribute to an integrated periphery and core.
Collapse
|
9
|
Chen Y, Ma Y, Zhai Y, Yang H, Zhang C, Lu Y, Wei W, Cai Q, Ding X, Lu S, Fang Z. Persistent dysregulation of genes in the development of endometriosis. ANNALS OF TRANSLATIONAL MEDICINE 2022; 10:1175. [PMID: 36467354 PMCID: PMC9708481 DOI: 10.21037/atm-22-4806] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 11/07/2022] [Indexed: 09/29/2023]
Abstract
BACKGROUND Endometriosis is a chronic condition that affects women of child-bearing age. Since the etiology and pathogenesis of endometriosis have not been fully elucidated, it is important to investigate the mechanisms that lead to the deterioration of endometriosis. METHODS In this study, the transcriptome data of patients with normal, mild, and severe endometriosis were examined using the GSE51981 dataset obtained from the Gene Expression Omnibus database. Short Time Series Expression Miner (STEM) was used to screen the genes with continuous expression disorder in the development process, and the core genes were identified by constructing a protein-protein interaction network. The molecular mechanisms of endometriosis were examined using enrichment analysis. Finally, the transcription factors that regulate the core genes were predicted and the comprehensive mechanisms involved in the development of endometriosis were examined. RESULTS A total of 3,472 differentially expressed genes were identified from the normal, mild, and severe endometriosis samples. These were allocated into 12 modules and HRAS, HSP90AA1, TGFB1, TP53, and UBC were selected as the core genes. Enrichment analysis showed that the genes in modules 6, 7, and 9 were significantly related to oxygen levels, metallic processes, and hormone levels, respectively. Transcription factor prediction analysis showed that TP53 regulates HRAS to participate in immune related signaling pathways. Drug prediction analysis identified 792 drugs that interact with the targeted core genes. CONCLUSIONS This study explored the molecular mechanisms involved in the development of endometriosis and identified potential biomarkers of endometriosis. This data may provide novel targets and research directions for the diagnosis and treatment of endometriosis.
Collapse
Affiliation(s)
- Yanli Chen
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Yanqun Ma
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Yanzhi Zhai
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Haiyan Yang
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Chunlan Zhang
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Yingxin Lu
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Wei Wei
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Qing Cai
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Xuewen Ding
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Shan Lu
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| | - Ziyu Fang
- Department of Obstetrics and Gynecology, The Fourth Affiliated Hospital of Guangxi Medical University, Liuzhou, China
| |
Collapse
|
10
|
Park Y, Heider D, Hauschild AC. Integrative Analysis of Next-Generation Sequencing for Next-Generation Cancer Research toward Artificial Intelligence. Cancers (Basel) 2021; 13:3148. [PMID: 34202427 PMCID: PMC8269018 DOI: 10.3390/cancers13133148] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 06/16/2021] [Accepted: 06/21/2021] [Indexed: 12/18/2022] Open
Abstract
The rapid improvement of next-generation sequencing (NGS) technologies and their application in large-scale cohorts in cancer research led to common challenges of big data. It opened a new research area incorporating systems biology and machine learning. As large-scale NGS data accumulated, sophisticated data analysis methods became indispensable. In addition, NGS data have been integrated with systems biology to build better predictive models to determine the characteristics of tumors and tumor subtypes. Therefore, various machine learning algorithms were introduced to identify underlying biological mechanisms. In this work, we review novel technologies developed for NGS data analysis, and we describe how these computational methodologies integrate systems biology and omics data. Subsequently, we discuss how deep neural networks outperform other approaches, the potential of graph neural networks (GNN) in systems biology, and the limitations in NGS biomedical research. To reflect on the various challenges and corresponding computational solutions, we will discuss the following three topics: (i) molecular characteristics, (ii) tumor heterogeneity, and (iii) drug discovery. We conclude that machine learning and network-based approaches can add valuable insights and build highly accurate models. However, a well-informed choice of learning algorithm and biological network information is crucial for the success of each specific research question.
Collapse
Affiliation(s)
- Youngjun Park
- Department of Mathematics and Computer Science, Philipps-University of Marburg, 35032 Marburg, Germany; (Y.P.); (D.H.)
| | - Dominik Heider
- Department of Mathematics and Computer Science, Philipps-University of Marburg, 35032 Marburg, Germany; (Y.P.); (D.H.)
| | - Anne-Christin Hauschild
- Department of Mathematics and Computer Science, Philipps-University of Marburg, 35032 Marburg, Germany; (Y.P.); (D.H.)
- Department of Medical Informatics, University Medical Center Göttingen, 37075 Göttingen, Germany
| |
Collapse
|
11
|
Altered Gene Expression in the Testis of Infertile Patients with Nonobstructive Azoospermia. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:5533483. [PMID: 34221106 PMCID: PMC8211532 DOI: 10.1155/2021/5533483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2021] [Revised: 05/14/2021] [Accepted: 05/28/2021] [Indexed: 11/18/2022]
Abstract
Background The molecular mechanism of nonobstructive azoospermia (NOA) remains unclear. The aim of this study was to identify gene expression changes in NOA patients and to explore potential biomarkers and therapeutic targets. Methods The gene expression profiles of GSE45885 and GSE145467 were collected from the Gene Expression Omnibus (GEO) database, and the differences between NOA and normal spermatogenesis were analyzed. Enrichment analysis was performed to explore biological functions for common differentially expressed genes (DEGs) in GSE45885 and GSE145467. Coexpression analysis of DEGs in GSE45885 was performed, and two modules with the highest correlation with NOA were screened. Key genes were then screened from the intersection genes of the two modules and common DEGs and PPI network. The expression of key genes was validated by quantitative real-time polymerase chain reaction (qRT-PCR) experiments. Finally, through miRTarBase, miRDB, and RAID, the miRNAs were predicted to regulate key genes, respectively. Results A total of 345 common DEGs were identified and they were mainly related to spermatogenesis, insulin signaling pathway. Coexpression analysis of DEGs in GSE45885 yielded eight modules; MEblack and MEturquoise had the highest correlation with NOA. Six genes in MEturquoise and RNF141 in MEblack were identified as key genes. qRT-PCR experiments validated the differential expression of key genes between NOA and control. Furthermore, RNF141 was regulated by the largest number of miRNAs. Conclusion Our findings suggest that the significant change expression of key genes may be potential markers and therapeutic targets of NOA and may have some impact on the development of NOA.
Collapse
|
12
|
Wang B, Dong X, Hu J, Ma X, Han C, Wang Y, Gao L. The peripheral and core regions of virus-host network of COVID-19. Brief Bioinform 2021; 22:6265188. [PMID: 33956950 PMCID: PMC8136014 DOI: 10.1093/bib/bbab169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 03/30/2021] [Accepted: 04/11/2021] [Indexed: 12/16/2022] Open
Abstract
Two thousand nineteen novel coronavirus SARS-CoV-2, the pathogen of COVID-19, has caused a catastrophic pandemic, which has a profound and widespread impact on human lives and social economy globally. However, the molecular perturbations induced by the SARS-CoV-2 infection remain unknown. In this paper, from the perspective of omnigenic, we analyze the properties of the neighborhood perturbed by SARS-CoV-2 in the human interactome and disclose the peripheral and core regions of virus-host network (VHN). We find that the virus-host proteins (VHPs) form a significantly connected VHN, among which highly perturbed proteins aggregate into an observable core region. The non-core region of VHN forms a large scale but relatively low perturbed periphery. We further validate that the periphery is non-negligible and conducive to identifying comorbidities and detecting drug repurposing candidates for COVID-19. We particularly put forward a flower model for COVID-19, SARS and H1N1 based on their peripheral regions, and the flower model shows more correlations between COVID-19 and other two similar diseases in common functional pathways and candidate drugs. Overall, our periphery-core pattern can not only offer insights into interconnectivity of SARS-CoV-2 VHPs but also facilitate the research on therapeutic drugs.
Collapse
Affiliation(s)
- Bingbo Wang
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Xianan Dong
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Jie Hu
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Xiujuan Ma
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Chao Han
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Yajun Wang
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| |
Collapse
|