Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yousef M, Kumar A, Bakir-Gungor B. Application of Biological Domain Knowledge Based Feature Selection on Gene Expression Data. Entropy (Basel) 2020;23:E2. [PMID: 33374969 PMCID: PMC7821996 DOI: 10.3390/e23010002] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Received: 11/27/2020] [Revised: 12/14/2020] [Accepted: 12/16/2020] [Indexed: 12/19/2022]

Cited by Other Article(s)

Patrício A, Costa RS, Henriques R. Pattern-centric transformation of omics data grounded on discriminative gene associations aids predictive tasks in TCGA while ensuring interpretability. Biotechnol Bioeng 2024. [PMID: 38859573 DOI: 10.1002/bit.28758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/04/2023] [Revised: 02/07/2024] [Accepted: 05/18/2024] [Indexed: 06/12/2024]

Li K, Wang Z, Zhou Y, Li S. Lung adenocarcinoma identification based on hybrid feature selections and attentional convolutional neural networks. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024;21:2991-3015. [PMID: 38454716 DOI: 10.3934/mbe.2024133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/09/2024]

Qumsiyeh E, Salah Z, Yousef M. miRGediNET: A comprehensive examination of common genes in miRNA-Target interactions and disease associations: Insights from a grouping-scoring-modeling approach. Heliyon 2023;9:e22666. [PMID: 38090011 PMCID: PMC10711121 DOI: 10.1016/j.heliyon.2023.e22666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2023] [Revised: 11/15/2023] [Accepted: 11/16/2023] [Indexed: 06/15/2024]

Ersoz NS, Bakir-Gungor B, Yousef M. GeNetOntology: identifying affected gene ontology terms via grouping, scoring, and modeling of gene expression data utilizing biological knowledge-based machine learning. Front Genet 2023;14:1139082. [PMID: 37671046 PMCID: PMC10476493 DOI: 10.3389/fgene.2023.1139082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/06/2023] [Accepted: 07/05/2023] [Indexed: 09/07/2023]

Abstract

Introduction: Identifying significant sets of genes that are up/downregulated under specific conditions is vital to understand disease development mechanisms at the molecular level. Along this line, in order to analyze transcriptomic data, several computational feature selection (i.e., gene selection) methods have been proposed. On the other hand, uncovering the core functions of the selected genes provides a deep understanding of diseases. In order to address this problem, biological domain knowledge-based feature selection methods have been proposed. Unlike computational gene selection approaches, these domain knowledge-based methods take the underlying biology into account and integrate knowledge from external biological resources. Gene Ontology (GO) is one such biological resource that provides ontology terms for defining the molecular function, cellular component, and biological process of the gene product.

Methods: In this study, we developed a tool named GeNetOntology which performs GO-based feature selection for gene expression data analysis. In the proposed approach, the process of Grouping, Scoring, and Modeling (G-S-M) is used to identify significant GO terms. GO information has been used as the grouping information, which has been embedded into a machine learning (ML) algorithm to select informative ontology terms. The genes annotated with the selected ontology terms have been used in the training part to carry out the classification task of the ML model. The output is an important set of ontologies for the two-class classification task applied to gene expression data for a given phenotype.

Results: Our approach has been tested on 11 different gene expression datasets, and the results showed that GeNetOntology successfully identified important disease-related ontology terms to be used in the classification model.

Discussion: GeNetOntology will assist geneticists and scientists to identify a range of disease-related genes and ontologies in transcriptomic data analysis, and it will also help doctors design diagnosis platforms and improve patient treatment plans.
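The Grouping, Scoring, and Modeling (G-S-M) loop described in this abstract can be sketched in a few lines. The following is only an illustrative sketch, not the GeNetOntology implementation: the nearest-centroid classifier, the leave-one-out scoring, the toy expression values, and the names `term_A`, `term_B`, `gsm`, and `top_k` are all assumptions introduced here for illustration.

```python
from statistics import mean

def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    return [mean(col) for col in zip(*rows)]

def predict(train_x, train_y, x):
    """Nearest-centroid prediction (an assumed stand-in for the ML model)."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        c = centroid([r for r, y in zip(train_x, train_y) if y == label])
        dist = sum((a - b) ** 2 for a, b in zip(x, c))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def loo_accuracy(x, y):
    """Leave-one-out accuracy of the nearest-centroid classifier."""
    hits = 0
    for i in range(len(x)):
        hits += predict(x[:i] + x[i + 1:], y[:i] + y[i + 1:], x[i]) == y[i]
    return hits / len(x)

def subset(expr, genes, cols):
    """Keep only the expression columns that belong to the given genes."""
    idx = [genes.index(g) for g in cols]
    return [[row[i] for i in idx] for row in expr]

def gsm(expr, labels, genes, groups, top_k=1):
    # Grouping: each ontology term defines a sub-dataset of its annotated genes.
    # Scoring: each group is scored by how well it separates the two classes.
    scores = {t: loo_accuracy(subset(expr, genes, g), labels)
              for t, g in groups.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    # Modeling: pool the genes of the top-ranked terms and evaluate a classifier.
    selected = sorted({g for t in ranked[:top_k] for g in groups[t]})
    return ranked, selected, loo_accuracy(subset(expr, genes, selected), labels)

# Hypothetical toy data: 6 samples x 4 genes; g1 and g2 separate the classes.
genes = ["g1", "g2", "g3", "g4"]
groups = {"term_A": ["g1", "g2"], "term_B": ["g3", "g4"]}  # made-up term names
expr = [
    [5.0, 4.8, 1.0, 2.0],
    [5.2, 5.1, 2.0, 1.0],
    [4.9, 5.0, 1.5, 1.5],
    [1.0, 1.2, 1.0, 2.0],
    [0.8, 1.1, 2.0, 1.0],
    [1.1, 0.9, 1.5, 1.5],
]
labels = [1, 1, 1, 0, 0, 0]

ranked, selected, acc = gsm(expr, labels, genes, groups)
print(ranked, selected, acc)
```

In this toy setting the informative group ranks first and its genes alone feed the final model. A real analysis would use actual GO annotations and a stronger classifier and validation scheme.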

Kuzudisli C, Bakir-Gungor B, Bulut N, Qaqish B, Yousef M. Review of feature selection approaches based on grouping of features. PeerJ 2023;11:e15666. [PMID: 37483989 PMCID: PMC10358338 DOI: 10.7717/peerj.15666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2022] [Accepted: 06/08/2023] [Indexed: 07/25/2023]

Yousef M, Ozdemir F, Jaber A, Allmer J, Bakir-Gungor B. PriPath: identifying dysregulated pathways from differential gene expression via grouping, scoring, and modeling with an embedded feature selection approach. BMC Bioinformatics 2023;24:60. [PMID: 36823571 PMCID: PMC9947447 DOI: 10.1186/s12859-023-05187-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/25/2022] [Accepted: 02/14/2023] [Indexed: 02/25/2023]

Abstract

BACKGROUND

Cell homeostasis relies on the concerted actions of genes, and dysregulated genes can lead to diseases. In living organisms, genes or their products do not act alone but within networks. Subsets of these networks can be viewed as modules that provide specific functionality to an organism. The Kyoto encyclopedia of genes and genomes (KEGG) systematically analyzes gene functions, proteins, and molecules and combines them into pathways. Measurements of gene expression (e.g., RNA-seq data) can be mapped to KEGG pathways to determine which modules are affected or dysregulated in the disease. However, genes acting in multiple pathways and other inherent issues complicate such analyses. Many current approaches may only employ gene expression data and need to pay more attention to some of the existing knowledge stored in KEGG pathways for detecting dysregulated pathways. New methods that consider more precompiled information are required for a more holistic association between gene expression and diseases.

RESULTS

PriPath is a novel approach that transfers the generic process of grouping and scoring, followed by modeling to analyze gene expression with KEGG pathways. In PriPath, KEGG pathways are utilized as the grouping function as part of a machine learning algorithm for selecting the most significant KEGG pathways. A machine learning model is trained to differentiate between diseases and controls using those groups. We have tested PriPath on 13 gene expression datasets of various cancers and other diseases. Our proposed approach successfully assigned biologically and clinically relevant KEGG terms to the samples based on the differentially expressed genes. We have comparatively evaluated the performance of PriPath against other tools, which are similar in their merit. For each dataset, we manually confirmed the top results of PriPath in the literature and found that most predictions can be supported by previous experimental research.

CONCLUSIONS

PriPath can thus aid in determining dysregulated pathways, which applies to medical diagnostics. In the future, we aim to advance this approach so that it can perform patient stratification based on gene expression and identify druggable targets. Thereby, we cover two aspects of precision medicine.

Chavan AR, Singh AK, Gupta RK, Nakhate SP, Poddar BJ, Gujar VV, Purohit HJ, Khardenavis AA. Recent trends in the biotechnology of functional non-digestible oligosaccharides with prebiotic potential. Biotechnol Genet Eng Rev 2023:1-46. [PMID: 36714949 DOI: 10.1080/02648725.2022.2152627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/20/2022] [Accepted: 11/13/2022] [Indexed: 01/31/2023]

Jabeer A, Temiz M, Bakir-Gungor B, Yousef M. miRdisNET: Discovering microRNA biomarkers that are associated with diseases utilizing biological knowledge-based machine learning. Front Genet 2023;13:1076554. [PMID: 36712859 PMCID: PMC9877296 DOI: 10.3389/fgene.2022.1076554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2022] [Accepted: 12/30/2022] [Indexed: 01/14/2023]

Abstract

During recent years, biological experiments and increasing evidence have shown that microRNAs play an important role in the diagnosis and treatment of human complex diseases. Therefore, to diagnose and treat human complex diseases, it is necessary to reveal the associations between a specific disease and related miRNAs. Although current computational models based on machine learning attempt to determine miRNA-disease associations, the accuracy of these models needs to be improved, and candidate miRNA-disease relations need to be evaluated from a biological perspective. In this paper, we propose a computational model named miRdisNET to predict potential miRNA-disease associations. Specifically, miRdisNET requires two types of data, i.e., miRNA expression profiles and known disease-miRNA associations as input files. First, we generate subsets of specific diseases by applying the grouping component. These subsets contain miRNA expressions with class labels associated with each specific disease. Then, we assign an importance score to each group by using a machine learning method for classification. Finally, we apply a modeling component and obtain outputs. One of the most important outputs of miRdisNET is the performance of miRNA-disease prediction. Compared with the existing methods, miRdisNET obtained the highest AUC value of 0.9998. Another output of miRdisNET is a list of significant miRNAs for the disease under study. The miRNAs identified by miRdisNET are validated against the gold-standard databases which hold information on experimentally verified microRNA-disease associations. miRdisNET has been developed to predict candidate miRNAs for new diseases, where the miRNA-disease relation is not yet known. In addition, miRdisNET presents candidate disease-disease associations based on shared miRNA knowledge. The miRdisNET tool and other supplementary files are publicly available at: https://github.com/malikyousef/miRdisNET.

Qumsiyeh E, Showe L, Yousef M. GediNET for discovering gene associations across diseases using knowledge based machine learning approach. Sci Rep 2022;12:19955. [PMID: 36402891 PMCID: PMC9675776 DOI: 10.1038/s41598-022-24421-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/10/2022] [Accepted: 11/15/2022] [Indexed: 11/21/2022]

Kranz A, Polen T, Kotulla C, Arndt A, Bosco G, Bussmann M, Chattopadhyay A, Cramer A, Davoudi CF, Degner U, Diesveld R, Freiherr von Boeselager R, Gärtner K, Gätgens C, Georgi T, Geraths C, Haas S, Heyer A, Hünnefeld M, Ishige T, Kabus A, Kallscheuer N, Kever L, Klaffl S, Kleine B, Kočan M, Koch-Koerfges A, Kraxner KJ, Krug A, Krüger A, Küberl A, Labib M, Lange C, Mack C, Maeda T, Mahr R, Majda S, Michel A, Morosov X, Müller O, Nanda AM, Nickel J, Pahlke J, Pfeifer E, Platzen L, Ramp P, Rittmann D, Schaffer S, Scheele S, Spelberg S, Schulte J, Schweitzer JE, Sindelar G, Sorger-Herrmann U, Spelberg M, Stansen C, Tharmasothirajan A, Ooyen JV, van Summeren-Wesenhagen P, Vogt M, Witthoff S, Zhu L, Eikmanns BJ, Oldiges M, Schaumann G, Baumgart M, Brocker M, Eggeling L, Freudl R, Frunzke J, Marienhagen J, Wendisch VF, Bott M. A manually curated compendium of expression profiles for the microbial cell factory Corynebacterium glutamicum. Sci Data 2022;9:594. [PMID: 36182956 PMCID: PMC9526701 DOI: 10.1038/s41597-022-01706-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/22/2022] [Accepted: 08/18/2022] [Indexed: 11/12/2022]

Affiliation(s)

Angela Kranz IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany; IBG-4: Bioinformatics, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Tino Polen IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Christian Kotulla IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Annette Arndt Institute of Microbiology and Biotechnology, University of Ulm, D-89069, Ulm, Germany
Graziella Bosco IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Michael Bussmann IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Ava Chattopadhyay IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Annette Cramer Institute of Microbiology and Biotechnology, University of Ulm, D-89069, Ulm, Germany
Cedric-Farhad Davoudi IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Ursula Degner IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Ramon Diesveld IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Raphael Freiherr von Boeselager IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Kim Gärtner IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Cornelia Gätgens IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Tobias Georgi IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Christian Geraths IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Sabine Haas IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Antonia Heyer IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Max Hünnefeld IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Takeru Ishige IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Armin Kabus IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Nicolai Kallscheuer IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Larissa Kever IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Simon Klaffl IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Britta Kleine IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Martina Kočan IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Abigail Koch-Koerfges IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Kim J Kraxner IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Andreas Krug IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Aileen Krüger IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Andreas Küberl IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Mohamed Labib IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Christian Lange IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Christina Mack IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Tomoya Maeda IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Regina Mahr IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Stephan Majda IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Andrea Michel IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Xenia Morosov IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Olga Müller IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Arun M Nanda IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Jens Nickel IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Jennifer Pahlke IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Eugen Pfeifer IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Laura Platzen IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Paul Ramp IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Doris Rittmann IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Steffen Schaffer IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Sandra Scheele IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Stephanie Spelberg IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Julia Schulte IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Jens-Eric Schweitzer IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Georg Sindelar IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Ulrike Sorger-Herrmann IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Markus Spelberg IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Corinna Stansen IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Apilaasha Tharmasothirajan IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Jan van Ooyen IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Philana van Summeren-Wesenhagen SenseUp GmbH, c/o Campus Forschungszentrum, Wilhelm-Johnen-Strasse, D-52425, Jülich, Germany
Michael Vogt IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Sabrina Witthoff IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Lingfeng Zhu IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Bernhard J Eikmanns Institute of Microbiology and Biotechnology, University of Ulm, D-89069, Ulm, Germany
Marco Oldiges IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Georg Schaumann SenseUp GmbH, c/o Campus Forschungszentrum, Wilhelm-Johnen-Strasse, D-52425, Jülich, Germany
Meike Baumgart IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Melanie Brocker IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Lothar Eggeling IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Roland Freudl IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Julia Frunzke IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Jan Marienhagen IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany
Volker F Wendisch Genetics of Prokaryotes, Biology & CeBiTec, Bielefeld University, Universitaetsstr. 25, D-33615, Bielefeld, Germany
Michael Bott IBG-1: Biotechnology, Institute of Bio- and Geosciences, Forschungszentrum Jülich, D-52425, Jülich, Germany.

Lee C, Lee S, Park E, Hong J, Shin DY, Byun JM, Yun H, Koh Y, Yoon SS. Transcriptional signatures of the BCL2 family for individualized acute myeloid leukaemia treatment. Genome Med 2022;14:111. [PMID: 36171613 PMCID: PMC9520894 DOI: 10.1186/s13073-022-01115-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/04/2022] [Accepted: 09/20/2022] [Indexed: 11/10/2022]

Abstract

Background

Although anti-apoptotic proteins of the B-cell lymphoma-2 (BCL2) family have been utilized as therapeutic targets in acute myeloid leukaemia (AML), their complicated regulatory networks make individualized therapy difficult. This study aimed to discover the transcriptional signatures of BCL2 family genes that reflect regulatory dynamics, which can guide individualized therapeutic strategies.

Methods

From three AML RNA-seq cohorts (BeatAML, LeuceGene, and TCGA; n = 451, 437, and 179, respectively), we constructed the BCL2 family signatures (BFSigs) by applying an innovative gene-set selection method reflecting biological knowledge followed by non-negative matrix factorization (NMF). To demonstrate the significance of the BFSigs, we conducted modelling to predict response to BCL2 family inhibitors, clustering, and functional enrichment analysis. Cross-platform validity of BFSigs was also confirmed using NanoString technology in a separate cohort of 47 patients.

Results

We established BFSigs labeled as the BCL2, MCL1/BCL2, and BFL1/MCL1 signatures that identify key anti-apoptotic proteins. Unsupervised clustering based on BFSig information consistently classified AML patients into three robust subtypes across different AML cohorts, implying the existence of biological entities revealed by the BFSig approach. Interestingly, each subtype has distinct enrichment patterns of major cancer pathways, including MAPK and mTORC1, which propose subtype-specific combination treatment with apoptosis modulating drugs. The BFSig-based classifier also predicted response to venetoclax with remarkable performance (area under the ROC curve, AUROC = 0.874), which was well-validated in an independent cohort (AUROC = 0.950). Lastly, we successfully confirmed the validity of BFSigs using NanoString technology.

Conclusions

This study proposes BFSigs as a biomarker for the effective selection of apoptosis targeting treatments and cancer pathways to co-target in AML.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13073-022-01115-w.

Affiliation(s)

Chansub Lee Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea
Sungyoung Lee Department of Genomic Medicine, Seoul National University Hospital, Seoul, Republic of Korea; Center for Precision Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Eunchae Park Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea
Junshik Hong Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea; Division of Hematology and Medical Oncology, Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Dong-Yeop Shin Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea; Division of Hematology and Medical Oncology, Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Ja Min Byun Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea; Division of Hematology and Medical Oncology, Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Hongseok Yun Department of Genomic Medicine, Seoul National University Hospital, Seoul, Republic of Korea; Center for Precision Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Youngil Koh Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea; Division of Hematology and Medical Oncology, Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Sung-Soo Yoon Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea; Center for Medical Innovation, Seoul National University Hospital, Seoul, Republic of Korea; Division of Hematology and Medical Oncology, Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea

Ensemble feature selection for multi‐label text classification: An intelligent order statistics approach. INT J INTELL SYST 2022. [DOI: 10.1002/int.23044] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/10/2023]

Domain knowledge-enhanced variable selection for biomedical data analysis. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.076] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 12/25/2022]

EGFAFS: A Novel Feature Selection Algorithm Based on Explosion Gravitation Field Algorithm. ENTROPY 2022;24:e24070873. [PMID: 35885095 PMCID: PMC9322764 DOI: 10.3390/e24070873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2022] [Revised: 06/15/2022] [Accepted: 06/22/2022] [Indexed: 02/04/2023]

Yousef M, Voskergian D. TextNetTopics: Text Classification Based Word Grouping as Topics and Topics’ Scoring. Front Genet 2022;13:893378. [PMID: 35795215 PMCID: PMC9251539 DOI: 10.3389/fgene.2022.893378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2022] [Accepted: 05/25/2022] [Indexed: 11/28/2022]

Mate Analysis of Hepatocellular Carcinoma Immune Subtypes and Their Functional Effects Based on Fuzzy Logic and Evolutionary Algorithms. CONTRAST MEDIA & MOLECULAR IMAGING 2022;2022:5787981. [PMID: 35601568 PMCID: PMC9098361 DOI: 10.1155/2022/5787981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2022] [Revised: 03/23/2022] [Accepted: 04/05/2022] [Indexed: 11/17/2022]

Integrated Bioinformatics Analysis and Verification of Gene Targets for Myocardial Ischemia-Reperfusion Injury. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2022;2022:2056630. [PMID: 35463067 PMCID: PMC9033367 DOI: 10.1155/2022/2056630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2021] [Revised: 03/11/2022] [Accepted: 03/28/2022] [Indexed: 11/18/2022]

Bakir-Gungor B, Hacılar H, Jabeer A, Nalbantoglu OU, Aran O, Yousef M. Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods. PeerJ 2022;10:e13205. [PMID: 35497193 PMCID: PMC9048649 DOI: 10.7717/peerj.13205] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/03/2021] [Accepted: 03/10/2022] [Indexed: 01/12/2023]

Abstract

The tremendous boost in next generation sequencing and in the "omics" technologies makes it possible to characterize the human gut microbiome, the collective genomes of the microbial community that resides in our gastrointestinal tract. Although some of these microorganisms are considered to be essential regulators of our immune system, the alteration of the complexity and eubiotic state of microbiota might promote autoimmune and inflammatory disorders such as diabetes, rheumatoid arthritis, inflammatory bowel diseases (IBD), obesity, and carcinogenesis. IBD, comprising Crohn's disease and ulcerative colitis, is a gut-related, multifactorial disease with an unknown etiology. IBD presents defects in the detection and control of the gut microbiota, associated with unbalanced immune reactions, genetic mutations that confer susceptibility to the disease, and complex environmental conditions such as westernized lifestyle. Although some existing studies attempt to unveil the composition and functional capacity of the gut microbiome in relation to IBD, a comprehensive picture of the gut microbiome in IBD patients is far from being complete. Due to the complexity of metagenomic studies, state-of-the-art machine learning techniques have become popular for addressing a wide range of questions in the field of metagenomic data analysis. In this regard, using an IBD-associated metagenomics dataset, this study utilizes both supervised and unsupervised machine learning algorithms (i) to generate a classification model that aids IBD diagnosis, (ii) to discover IBD-associated biomarkers, and (iii) to discover subgroups of IBD patients using k-means and hierarchical clustering approaches.
To deal with the high dimensionality of features, we applied robust feature selection algorithms such as Conditional Mutual Information Maximization (CMIM), Fast Correlation Based Filter (FCBF), min redundancy max relevance (mRMR), Select K Best (SKB), Information Gain (IG) and Extreme Gradient Boosting (XGBoost). In our experiments with 100-fold Monte Carlo cross-validation (MCCV), XGBoost, IG, and SKB methods showed a considerable effect in terms of minimizing the microbiota used for the diagnosis of IBD and thus reducing the cost and time. We observed that compared to Decision Tree, Support Vector Machine, Logitboost, Adaboost, and stacking ensemble classifiers, our Random Forest classifier resulted in better performance measures for the classification of IBD. Our findings revealed potential microbiome-mediated mechanisms of IBD and these findings might be useful for the development of microbiome-based diagnostics.

Yousef M, Goy G, Bakir-Gungor B. miRModuleNet: Detecting miRNA-mRNA Regulatory Modules. Front Genet 2022;13:767455. [PMID: 35495139 PMCID: PMC9039401 DOI: 10.3389/fgene.2022.767455] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 03/24/2022] [Indexed: 12/13/2022] Open

Prediction of Linear Cationic Antimicrobial Peptides Active against Gram-Negative and Gram-Positive Bacteria Based on Machine Learning Models. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12073631] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Long Non-Coding RNAs Might Regulate Phenotypic Switch of Vascular Smooth Muscle Cells Acting as ceRNA: Implications for In-Stent Restenosis. Int J Mol Sci 2022;23:ijms23063074. [PMID: 35328496 PMCID: PMC8952224 DOI: 10.3390/ijms23063074] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 03/07/2022] [Accepted: 03/09/2022] [Indexed: 02/01/2023] Open

Galbraith E, Convertino M. The Eco-Evo Mandala: Simplifying Bacterioplankton Complexity into Ecohealth Signatures. ENTROPY (BASEL, SWITZERLAND) 2021;23:1471. [PMID: 34828169 PMCID: PMC8625105 DOI: 10.3390/e23111471] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 10/30/2021] [Accepted: 11/05/2021] [Indexed: 12/24/2022]

Abstract

The microbiome emits informative signals of biological organization and environmental pressure that aid ecosystem monitoring and prediction. Are the many signals reducible to a habitat-specific portfolio that characterizes ecosystem health? Does an optimally structured microbiome imply a resilient microbiome? To answer these questions, we applied our novel Eco-Evo Mandala to bacterioplankton data from four habitats within the Great Barrier Reef, to explore how patterns in community structure, function and genetics signal habitat-specific organization and departures from theoretical optimality. The Mandala revealed communities departing from optimality in habitat-specific ways, mostly along structural and functional traits related to bacterioplankton abundance and interaction distributions (reflected by ϵ and λ as power law and exponential distribution parameters), which are not linearly associated with each other. River and reef communities were similar in their relatively low abundance and interaction disorganization (low ϵ and λ) due to their protective structured habitats. On the contrary, lagoon and estuarine inshore reefs appeared the most disorganized due to the ocean temperature and biogeochemical stress. Phylogenetic distances (D) were minimally informative in characterizing bacterioplankton organization. However, dominant populations, such as Proteobacteria, Bacteroidetes, and Cyanobacteria, were largely responsible for community patterns, being generalists with a large functional gene repertoire (high D) that increases resilience. The relative balance of these populations was found to be habitat-specific and likely related to systemic environmental stress. The position on the Mandala along the three fundamental traits, as well as fluctuations in this ecological state, conveys information about the microbiome's health (and likely ecosystem health considering bacteria-based multitrophic dependencies) as divergence from the expected relative optimality. 
The Eco-Evo Mandala emphasizes how habitat and the microbiome's interaction network topology are first- and second-order factors for ecosystem health evaluation over taxonomic species richness. Unhealthy microbiome communities and unbalanced microbes are identified not by macroecological indicators but by mapping their impact on the collective proportion and distribution of interactions, which regulates the microbiome's ecosystem function.

Bhosale H, Ramakrishnan V, Jayaraman VK. Support vector machine-based prediction of pore-forming toxins (PFT) using distributed representation of reduced alphabets. J Bioinform Comput Biol 2021;19:2150028. [PMID: 34693886 DOI: 10.1142/s0219720021500281] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

A Review on Recent Progress in Machine Learning and Deep Learning Methods for Cancer Classification on Gene Expression Data. Processes (Basel) 2021. [DOI: 10.3390/pr9081466] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Arora G, Joshi J, Mandal RS, Shrivastava N, Virmani R, Sethi T. Artificial Intelligence in Surveillance, Diagnosis, Drug Discovery and Vaccine Development against COVID-19. Pathogens 2021;10:1048. [PMID: 34451513 PMCID: PMC8399076 DOI: 10.3390/pathogens10081048] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 08/11/2021] [Accepted: 08/11/2021] [Indexed: 12/15/2022] Open

Arora G, Joshi J, Mandal RS, Shrivastava N, Virmani R, Sethi T. Artificial Intelligence in Surveillance, Diagnosis, Drug Discovery and Vaccine Development against COVID-19. Pathogens 2021;10:1048. [PMID: 34451513 PMCID: PMC8399076 DOI: 10.3390/pathogens10081048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Yousef M, Goy G, Mitra R, Eischen CM, Jabeer A, Bakir-Gungor B. miRcorrNet: machine learning-based integration of miRNA and mRNA expression profiles, combined with feature grouping and ranking. PeerJ 2021;9:e11458. [PMID: 34055490 PMCID: PMC8140596 DOI: 10.7717/peerj.11458] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Accepted: 04/25/2021] [Indexed: 11/20/2022] Open

Abstract

A better understanding of disease development and progression mechanisms at the molecular level is critical both for the diagnosis of a disease and for the development of therapeutic approaches. Advances in high-throughput technologies have made it possible to generate mRNA and microRNA (miRNA) expression profiles, and the integrative analysis of these profiles has made it possible to uncover the functional effects of RNA expression in complex diseases, such as cancer. Several studies attempt to integrate miRNA and mRNA expression profiles using statistical methods such as Pearson correlation, and then combine them with enrichment analysis. In this study, we developed a novel tool called miRcorrNet, which performs machine learning-based integration to analyze miRNA and mRNA gene expression profiles. miRcorrNet groups mRNAs based on their correlation to miRNA expression levels and hence it generates groups of target genes associated with each miRNA. Then, these groups are subject to a rank function for classification. We have evaluated our tool using miRNA and mRNA expression profiling data downloaded from The Cancer Genome Atlas (TCGA), and performed comparative evaluation with existing tools. In our experiments we show that miRcorrNet performs as well as other tools in terms of accuracy (reaching more than 95% AUC value). Additionally, miRcorrNet includes ranking steps to separate two classes, namely case and control, which is not available in other tools. We have also evaluated the performance of miRcorrNet using a completely independent dataset. Moreover, we conducted a comprehensive literature search to explore the biological functions of the identified miRNAs. We have validated our significantly identified miRNA groups against known databases, which yielded about 90% accuracy. Our results suggest that miRcorrNet is able to accurately prioritize pan-cancer regulating high-confidence miRNAs.
miRcorrNet tool and all other supplementary files are available at https://github.com/malikyousef/miRcorrNet.
