Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cun Y, Fröhlich H. Biomarker gene signature discovery integrating network knowledge. Biology (Basel) 2012;1:5-17. [PMID: 24832044 PMCID: PMC4011032 DOI: 10.3390/biology1010005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2012] [Revised: 02/18/2012] [Accepted: 02/21/2012] [Indexed: 12/17/2022]

For:	Cun Y, Fröhlich H. Biomarker gene signature discovery integrating network knowledge. Biology (Basel) 2012;1:5-17. [PMID: 24832044 PMCID: PMC4011032 DOI: 10.3390/biology1010005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2012] [Revised: 02/18/2012] [Accepted: 02/21/2012] [Indexed: 12/17/2022]

Number

Cited by Other Article(s)

Kwak SY, Park JH, Won HY, Jang H, Lee SB, Jang WI, Park S, Kim MJ, Shim S. CXCL10 upregulation in radiation-exposed human peripheral blood mononuclear cells as a candidate biomarker for rapid triage after radiation exposure. Int J Radiat Biol 2024;100:541-549. [PMID: 38227479 DOI: 10.1080/09553002.2023.2295300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 11/13/2023] [Indexed: 01/17/2024]

Tian L, Yu T. An integrated deep learning framework for the interpretation of untargeted metabolomics data. Brief Bioinform 2023;24:bbad244. [PMID: 37369636 DOI: 10.1093/bib/bbad244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 06/02/2023] [Accepted: 06/12/2023] [Indexed: 06/29/2023] Open

Abstract

Untargeted metabolomics is gaining widespread applications. The key aspects of the data analysis include modeling complex activities of the metabolic network, selecting metabolites associated with clinical outcome and finding critical metabolic pathways to reveal biological mechanisms. One of the key roadblocks in data analysis is not well-addressed, which is the problem of matching uncertainty between data features and known metabolites. Given the limitations of the experimental technology, the identities of data features cannot be directly revealed in the data. The predominant approach for mapping features to metabolites is to match the mass-to-charge ratio (m/z) of data features to those derived from theoretical values of known metabolites. The relationship between features and metabolites is not one-to-one since some metabolites share molecular composition, and various adduct ions can be derived from the same metabolite. This matching uncertainty causes unreliable metabolite selection and functional analysis results. Here we introduce an integrated deep learning framework for metabolomics data that take matching uncertainty into consideration. The model is devised with a gradual sparsification neural network based on the known metabolic network and the annotation relationship between features and metabolites. This architecture characterizes metabolomics data and reflects the modular structure of biological system. Three goals can be achieved simultaneously without requiring much complex inference and additional assumptions: (1) evaluate metabolite importance, (2) infer feature-metabolite matching likelihood and (3) select disease sub-networks. When applied to a COVID metabolomics dataset and an aging mouse brain dataset, our method found metabolic sub-networks that were easily interpretable.

Collapse

Mallik S, Sarkar A, Nath S, Maulik U, Das S, Pati SK, Ghosh S, Zhao Z. 3PNMF-MKL: A non-negative matrix factorization-based multiple kernel learning method for multi-modal data integration and its application to gene signature detection. Front Genet 2023;14:1095330. [PMID: 36865387 PMCID: PMC9971618 DOI: 10.3389/fgene.2023.1095330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 01/30/2023] [Indexed: 02/16/2023] Open

Abstract

In this current era, biomedical big data handling is a challenging task. Interestingly, the integration of multi-modal data, followed by significant feature mining (gene signature detection), becomes a daunting task. Remembering this, here, we proposed a novel framework, namely, three-factor penalized, non-negative matrix factorization-based multiple kernel learning with soft margin hinge loss (3PNMF-MKL) for multi-modal data integration, followed by gene signature detection. In brief, limma, employing the empirical Bayes statistics, was initially applied to each individual molecular profile, and the statistically significant features were extracted, which was followed by the three-factor penalized non-negative matrix factorization method used for data/matrix fusion using the reduced feature sets. Multiple kernel learning models with soft margin hinge loss had been deployed to estimate average accuracy scores and the area under the curve (AUC). Gene modules had been identified by the consecutive analysis of average linkage clustering and dynamic tree cut. The best module containing the highest correlation was considered the potential gene signature. We utilized an acute myeloid leukemia cancer dataset from The Cancer Genome Atlas (TCGA) repository containing five molecular profiles. Our algorithm generated a 50-gene signature that achieved a high classification AUC score (viz., 0.827). We explored the functions of signature genes using pathway and Gene Ontology (GO) databases. Our method outperformed the state-of-the-art methods in terms of computing AUC. Furthermore, we included some comparative studies with other related methods to enhance the acceptability of our method. Finally, it can be notified that our algorithm can be applied to any multi-modal dataset for data integration, followed by gene module discovery.

Collapse

Cao H, Hong X, Tost H, Meyer-Lindenberg A, Schwarz E. Advancing translational research in neuroscience through multi-task learning. Front Psychiatry 2022;13:993289. [PMID: 36465289 PMCID: PMC9714033 DOI: 10.3389/fpsyt.2022.993289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 10/24/2022] [Indexed: 11/18/2022] Open

Murphy RG, Gilmore A, Senevirathne S, O'Reilly PG, LaBonte Wilson M, Jain S, McArt DG. Particle Swarm Optimization Artificial Intelligence technique for gene signature discovery in transcriptomic cohorts. Comput Struct Biotechnol J 2022;20:5547-5563. [PMID: 36249564 PMCID: PMC9556859 DOI: 10.1016/j.csbj.2022.09.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 09/22/2022] [Accepted: 09/22/2022] [Indexed: 11/12/2022] Open

Abstract

•

EBPSO identifies unique, accurate, and succinct gene signatures.

•

Key genes within the signatures provide biological insights its associated functions.

•

A web-based micro-framework developed for ease of use and real-time visualizations.

•

A promising alternative to traditional single gene signature generation.

•

Downstream analysis will better translate these signatures towards clinical translation.

The development of gene signatures is key for delivering personalized medicine, despite only a few signatures being available for use in the clinic for cancer patients. Gene signature discovery tends to revolve around identifying a single signature. However, it has been shown that various highly predictive signatures can be produced from the same dataset. This study assumes that the presentation of top ranked signatures will allow greater efforts in the selection of gene signatures for validation on external datasets and for their clinical translation. Particle swarm optimization (PSO) is an evolutionary algorithm often used as a search strategy and largely represented as binary PSO (BPSO) in this domain. BPSO, however, fails to produce succinct feature sets for complex optimization problems, thus affecting its overall runtime and optimization performance. Enhanced BPSO (EBPSO) was developed to overcome these shortcomings. Thus, this study will validate unique candidate gene signatures for different underlying biology from EBPSO on transcriptomics cohorts. EBPSO was consistently seen to be as accurate as BPSO with substantially smaller feature signatures and significantly faster runtimes. 100% accuracy was achieved in all but two of the selected data sets. Using clinical transcriptomics cohorts, EBPSO has demonstrated the ability to identify accurate, succinct, and significantly prognostic signatures that are unique from one another. This has been proposed as a promising alternative to overcome the issues regarding traditional single gene signature generation. Interpretation of key genes within the signatures provided biological insights into the associated functions that were well correlated to their cancer type.

Collapse

Jin Z, Kang J, Yu T. Feature selection and classification over the network with missing node observations. Stat Med 2022;41:1242-1262. [PMID: 34816464 PMCID: PMC9773124 DOI: 10.1002/sim.9267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 09/14/2021] [Accepted: 10/29/2021] [Indexed: 12/25/2022]

Manjang K, Tripathi S, Yli-Harja O, Dehmer M, Emmert-Streib F. Graph-based exploitation of gene ontology using GOxploreR for scrutinizing biological significance. Sci Rep 2020;10:16672. [PMID: 33028846 PMCID: PMC7542435 DOI: 10.1038/s41598-020-73326-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 08/17/2020] [Indexed: 12/12/2022] Open

Fröhlich H, Balling R, Beerenwinkel N, Kohlbacher O, Kumar S, Lengauer T, Maathuis MH, Moreau Y, Murphy SA, Przytycka TM, Rebhan M, Röst H, Schuppert A, Schwab M, Spang R, Stekhoven D, Sun J, Weber A, Ziemek D, Zupan B. From hype to reality: data science enabling personalized medicine. BMC Med 2018;16:150. [PMID: 30145981 PMCID: PMC6109989 DOI: 10.1186/s12916-018-1122-7] [Citation(s) in RCA: 196] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/09/2018] [Indexed: 02/08/2023] Open

Affiliation(s)

Holger Fröhlich UCB Biosciences GmbH, Alfred-Nobel-Str. Str. 10, 40789 Monheim, Germany University of Bonn, Bonn-Aachen International Center for IT, Endenicher Allee 19c, 53115 Bonn, Germany
Rudi Balling University of Luxembourg, 6 avenue du Swing, 4367 Belvaux, Luxembourg
Niko Beerenwinkel Department of Biosciences and Engineering, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland
Oliver Kohlbacher University of Tübingen, WSI/ZBIT, Sand 14, 72076 Tübingen, Germany Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076 Tübingen, Germany Quantitative Biology Center, University of Tübingen, Auf der Morgenstelle 8, 72076 Tübingen, Germany Institute for Translational Bioinformatics, University Medical Center Tübingen, Sand 14, 72076 Tübingen, Germany
Santosh Kumar Department of Computer Science, University of Memphis, 2222 Dunn Hall, Memphis, TN 38152 USA
Thomas Lengauer Max-Planck-Institute for Informatics, 66123 Saarbrücken, Germany
Marloes H. Maathuis ETH Zurich, Seminar für Statistik, Rämistrasse 101, 8092 Zurich, Switzerland
Yves Moreau University of Leuven, ESAT, Kasteelpark Arenberg 10, 3001 Leuven, Belgium
Susan A. Murphy Harvard University, Science Center 400 Suite, Oxford Street, Cambridge, MA 02138-2901 USA
Teresa M. Przytycka National Center of Biotechnology Information, National Institute of Health, 8600 Rockville Pike, Bethesda, MD 20894-6075 USA
Michael Rebhan Novartis Institutes for Biomedical Research, 4056 Basel, Switzerland
Hannes Röst Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1 Canada
Andreas Schuppert RWTH Aachen, Joint Research Center for Computational Biomedicine, Pauwelsstrasse 19, 52074 Aachen, Germany
Matthias Schwab Dr. Margarete Fischer-Bosch Institute of Clinical Pharmacology, Aucherbachstrasse 112, 70376 Stuttgart, Germany University of Tübingen, Departments of Clinical Pharmacology and of Pharmacy and Biochemistry, Tübingen, Germany
Rainer Spang University of Regensburg, Institute of Functional Genomics, Am BioPark 9, 93053 Regensburg, Germany
Daniel Stekhoven ETH Zurich, NEXUS Personalized Health Technol., Otto-Stern-Weg 7, 8093 Zurich, Switzerland
Jimeng Sun Georgia Tech University, 801 Atlantic Drive, Atlanta, GA 30332-0280 USA
Andreas Weber Institute for Computer Science, University of Bonn, Endenicher Allee 19a, 53115 Bonn, Germany
Daniel Ziemek Pfizer, Worldwide Research and Development, Linkstraße 10, 10785 Berlin, Germany
Blaz Zupan Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, SI-1000 Ljubljana, Slovenia

Collapse

Patients with early-stage oropharyngeal cancer can be identified with label-free serum proteomics. Br J Cancer 2018;119:200-212. [PMID: 29961760 PMCID: PMC6048110 DOI: 10.1038/s41416-018-0162-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 05/14/2018] [Accepted: 06/04/2018] [Indexed: 01/03/2023] Open

Wu M, Zhu L, Feng X. Network-based feature screening with applications to genome data. Ann Appl Stat 2018. [DOI: 10.1214/17-aoas1097] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Zhang C, Liu J, Shi Q, Zeng T, Chen L. Comparative network stratification analysis for identifying functional interpretable network biomarkers. BMC Bioinformatics 2017;18:48. [PMID: 28361683 PMCID: PMC5374559 DOI: 10.1186/s12859-017-1462-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Abstract

BACKGROUND

A major challenge of bioinformatics in the era of precision medicine is to identify the molecular biomarkers for complex diseases. It is a general expectation that these biomarkers or signatures have not only strong discrimination ability, but also readable interpretations in a biological sense. Generally, the conventional expression-based or network-based methods mainly capture differential genes or differential networks as biomarkers, however, such biomarkers only focus on phenotypic discrimination and usually have less biological or functional interpretation. Meanwhile, the conventional function-based methods could consider the biomarkers corresponding to certain biological functions or pathways, but ignore the differential information of genes, i.e., disregard the active degree of particular genes involved in particular functions, thereby resulting in less discriminative ability on phenotypes. Hence, it is strongly demanded to develop elaborate computational methods to directly identify functional network biomarkers with both discriminative power on disease states and readable interpretation on biological functions.

RESULTS

In this paper, we present a new computational framework based on an integer programming model, named as Comparative Network Stratification (CNS), to extract functional or interpretable network biomarkers, which are of strongly discriminative power on disease states and also readable interpretation on biological functions. In addition, CNS can not only recognize the pathogen biological functions disregarded by traditional Expression-based/Network-based methods, but also uncover the active network-structures underlying such dysregulated functions underestimated by traditional Function-based methods. To validate the effectiveness, we have compared CNS with five state-of-the-art methods, i.e. GSVA, Pathifier, stSVM, frSVM and AEP on four datasets of different complex diseases. The results show that CNS can enhance the discriminative power of network biomarkers, and further provide biologically interpretable information or disease pathogenic mechanism of these biomarkers. A case study on type 1 diabetes (T1D) demonstrates that CNS can identify many dysfunctional genes and networks previously disregarded by conventional approaches.

CONCLUSION

Therefore, CNS is actually a powerful bioinformatics tool, which can identify functional or interpretable network biomarkers with both discriminative power on disease states and readable interpretation on biological functions. CNS was implemented as a Matlab package, which is available at http://www.sysbio.ac.cn/cb/chenlab/images/CNSpackage_0.1.rar .

Collapse

Zhang L, Liu H, Huang Y, Wang X, Chen Y, Meng J. Cancer Progression Prediction Using Gene Interaction Regularized Elastic Net. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017;14:145-154. [PMID: 28055897 PMCID: PMC5374042 DOI: 10.1109/tcbb.2015.2511758] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]

Network-Assisted Disease Classification and Biomarker Discovery. Methods Mol Biol 2016;1386:353-74. [PMID: 26677191 DOI: 10.1007/978-1-4939-3283-2_16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Identifying dense subgraphs in protein–protein interaction network for gene selection from microarray data. ACTA ACUST UNITED AC 2015. [DOI: 10.1007/s13721-015-0104-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Zhang X, Gao L, Liu ZP, Chen L. Identifying module biomarker in type 2 diabetes mellitus by discriminative area of functional activity. BMC Bioinformatics 2015;16:92. [PMID: 25888350 PMCID: PMC4374500 DOI: 10.1186/s12859-015-0519-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2014] [Accepted: 02/24/2015] [Indexed: 02/07/2023] Open

Schramm SJ, Jayaswal V, Goel A, Li SS, Yang YH, Mann GJ, Wilkins MR. Molecular interaction networks for the analysis of human disease: utility, limitations, and considerations. Proteomics 2014;13:3393-405. [PMID: 24166987 DOI: 10.1002/pmic.201200570] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 09/11/2013] [Accepted: 10/07/2013] [Indexed: 01/01/2023]

Cun Y, Fröhlich H. netClass: an R-package for network based, integrative biomarker signature discovery. Bioinformatics 2014;30:1325-6. [PMID: 24443376 DOI: 10.1093/bioinformatics/btu025] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Fröhlich H. Including network knowledge into Cox regression models for biomarker signature discovery. Biom J 2014;56:287-306. [DOI: 10.1002/bimj.201300035] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Revised: 10/10/2013] [Accepted: 10/25/2013] [Indexed: 12/14/2022]

Cun Y, Fröhlich H. Network and data integration for biomarker signature discovery via network smoothed T-statistics. PLoS One 2013;8:e73074. [PMID: 24019896 PMCID: PMC3760887 DOI: 10.1371/journal.pone.0073074] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 07/16/2013] [Indexed: 01/01/2023] Open