Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Vitali F, Marini S, Pala D, Demartini A, Montoli S, Zambelli A, Bellazzi R. Patient similarity by joint matrix trifactorization to identify subgroups in acute myeloid leukemia. JAMIA Open 2018;1:75-86. [PMID: 31984320 PMCID: PMC6951984 DOI: 10.1093/jamiaopen/ooy008] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 03/07/2018] [Accepted: 03/20/2018] [Indexed: 12/31/2022] Open

For:	Vitali F, Marini S, Pala D, Demartini A, Montoli S, Zambelli A, Bellazzi R. Patient similarity by joint matrix trifactorization to identify subgroups in acute myeloid leukemia. JAMIA Open 2018;1:75-86. [PMID: 31984320 PMCID: PMC6951984 DOI: 10.1093/jamiaopen/ooy008] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 03/07/2018] [Accepted: 03/20/2018] [Indexed: 12/31/2022] Open

Number

Cited by Other Article(s)

Petti M, Farina L. Network medicine for patients' stratification: From single-layer to multi-omics. WIREs Mech Dis 2023;15:e1623. [PMID: 37323106 DOI: 10.1002/wsbm.1623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 03/08/2023] [Accepted: 05/30/2023] [Indexed: 06/17/2023]

Gliozzo J, Mesiti M, Notaro M, Petrini A, Patak A, Puertas-Gallardo A, Paccanaro A, Valentini G, Casiraghi E. Heterogeneous data integration methods for patient similarity networks. Brief Bioinform 2022;23:6604996. [PMID: 35679533 PMCID: PMC9294435 DOI: 10.1093/bib/bbac207] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 04/14/2022] [Accepted: 05/04/2022] [Indexed: 12/29/2022] Open

Marini S, Oliva M, Slizovskiy IB, Das RA, Noyes NR, Kahveci T, Boucher C, Prosperi M. AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data. Gigascience 2022;11:6588116. [PMID: 35583675 PMCID: PMC9116207 DOI: 10.1093/gigascience/giac029] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 01/27/2022] [Indexed: 12/15/2022] Open

Abstract

BACKGROUND

Antimicrobial resistance (AMR) is a global health concern. High-throughput metagenomic sequencing of microbial samples enables profiling of AMR genes through comparison with curated AMR databases. However, the performance of current methods is often hampered by database incompleteness and the presence of homology/homoplasy with other non-AMR genes in sequenced samples.

RESULTS

We present AMR-meta, a database-free and alignment-free approach, based on k-mers, which combines algebraic matrix factorization into metafeatures with regularized regression. Metafeatures capture multi-level gene diversity across the main antibiotic classes. AMR-meta takes in reads from metagenomic shotgun sequencing and outputs predictions about whether those reads contribute to resistance against specific classes of antibiotics. In addition, AMR-meta uses an augmented training strategy that joins an AMR gene database with non-AMR genes (used as negative examples). We compare AMR-meta with AMRPlusPlus, DeepARG, and Meta-MARC, further testing their ensemble via a voting system. In cross-validation, AMR-meta has a median f-score of 0.7 (interquartile range, 0.2-0.9). On semi-synthetic metagenomic data-external test-on average AMR-meta yields a 1.3-fold hit rate increase over existing methods. In terms of run-time, AMR-meta is 3 times faster than DeepARG, 30 times faster than Meta-MARC, and as fast as AMRPlusPlus. Finally, we note that differences in AMR ontologies and observed variance of all tools in classification outputs call for further development on standardization of benchmarking data and protocols.

CONCLUSIONS

AMR-meta is a fast, accurate classifier that exploits non-AMR negative sets to improve sensitivity and specificity. The differences in AMR ontologies and the high variance of all tools in classification outputs call for the deployment of standard benchmarking data and protocols, to fairly compare AMR prediction tools.

Collapse

Arici MK, Tuncbag N. Performance Assessment of the Network Reconstruction Approaches on Various Interactomes. Front Mol Biosci 2021;8:666705. [PMID: 34676243 PMCID: PMC8523993 DOI: 10.3389/fmolb.2021.666705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 07/14/2021] [Indexed: 01/04/2023] Open

Salazar DA, Pržulj N, Valencia CF. Multi-project and Multi-profile joint Non-negative Matrix Factorization for cancer omic datasets. Bioinformatics 2021;37:4801-4809. [PMID: 34375392 DOI: 10.1093/bioinformatics/btab579] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 07/31/2021] [Accepted: 08/06/2021] [Indexed: 11/12/2022] Open

Oei RW, Fang HSA, Tan WY, Hsu W, Lee ML, Tan NC. Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics. J Pers Med 2021;11:jpm11080699. [PMID: 34442343 PMCID: PMC8398126 DOI: 10.3390/jpm11080699] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 12/23/2022] Open

Xenos A, Malod-Dognin N, Milinković S, Pržulj N. Linear functional organization of the omic embedding space. Bioinformatics 2021;37:3839-3847. [PMID: 34213534 PMCID: PMC8570782 DOI: 10.1093/bioinformatics/btab487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Revised: 06/21/2021] [Accepted: 06/30/2021] [Indexed: 11/21/2022] Open

Abstract

Motivation

We are increasingly accumulating complex omics data that capture different aspects of cellular functioning. A key challenge is to untangle their complexity and effectively mine them for new biomedical information. To decipher this new information, we introduce algorithms based on network embeddings. Such algorithms represent biological macromolecules as vectors in d-dimensional space, in which topologically similar molecules are embedded close in space and knowledge is extracted directly by vector operations. Recently, it has been shown that neural networks used to obtain vectorial representations (embeddings) are implicitly factorizing a mutual information matrix, called Positive Pointwise Mutual Information (PPMI) matrix. Thus, we propose the use of the PPMI matrix to represent the human protein–protein interaction (PPI) network and also introduce the graphlet degree vector PPMI matrix of the PPI network to capture different topological (structural) similarities of the nodes in the molecular network.

Results

We generate the embeddings by decomposing these matrices with Nonnegative Matrix Tri-Factorization. We demonstrate that genes that are embedded close in these spaces have similar biological functions, so we can extract new biomedical knowledge directly by doing linear operations on their embedding vector representations. We exploit this property to predict new genes participating in protein complexes and to identify new cancer-related genes based on the cosine similarities between the vector representations of the genes. We validate 80% of our novel cancer-related gene predictions in the literature and also by patient survival curves that demonstrating that 93.3% of them have a potential clinical relevance as biomarkers of cancer.

Availability and implementation

Code and data are available online at https://gitlab.bsc.es/axenos/embedded-omics-data-geometry/.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Nicora G, Moretti F, Sauta E, Della Porta M, Malcovati L, Cazzola M, Quaglini S, Bellazzi R. A continuous-time Markov model approach for modeling myelodysplastic syndromes progression from cross-sectional data. J Biomed Inform 2020;104:103398. [PMID: 32113003 DOI: 10.1016/j.jbi.2020.103398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 01/31/2020] [Accepted: 02/25/2020] [Indexed: 01/27/2023]

Abstract

The integration of both genomics and clinical data to model disease progression is now possible, thanks to the increasing availability of molecular patients' profiles. This may lead to the definition of novel decision support tools, able to tailor therapeutic interventions on the basis of a "precise" patients' risk stratification, given their health status evolution. However, longitudinal analysis requires long-term data collection and curation, which can be time demanding, expensive and sometimes unfeasible. Here we present a clinical decision support framework that combines the simulation of disease progression from cross-sectional data with a Markov model that exploits continuous-time transition probabilities derived from Cox regression. Trajectories between patients at different disease stages are stochastically built according to a measure of patient similarity, computed with a matrix tri-factorization technique. Such trajectories are seen as realizations drawn from the stochastic process driving the transitions between the disease stages. Eventually, Markov models applied to the resulting longitudinal dataset highlight potentially relevant clinical information. We applied our method to cross-sectional genomic and clinical data from a cohort of Myelodysplastic syndromes (MDS) patients. MDS are heterogeneous clonal hematopoietic disorders whose patients are characterized by different risks of Acute Myeloid Leukemia (AML) development, defined by an international score. We computed patients' trajectories across increasing and subsequent levels of risk of developing AML, and we applied a Cox model to the simulated longitudinal dataset to assess whether genomic characteristics could be associated with a higher or lower probability of disease progression. We then used the learned parameters of such Cox model to calculate the transition probabilities of a continuous-time Markov model that describes the patients' evolution across stages. Our results are in most cases confirmed by previous studies, thus demonstrating that simulated longitudinal data represent a valuable resource to investigate disease progression of MDS patients.

Collapse

Marini S, Vitali F, Rampazzi S, Demartini A, Akutsu T. Protease target prediction via matrix factorization. Bioinformatics 2019;35:923-929. [PMID: 30169576 DOI: 10.1093/bioinformatics/bty746] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 08/20/2018] [Accepted: 08/27/2018] [Indexed: 11/14/2022] Open

A patient-similarity-based model for diagnostic prediction. Int J Med Inform 2019;135:104073. [PMID: 31923816 DOI: 10.1016/j.ijmedinf.2019.104073] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 11/26/2019] [Accepted: 12/30/2019] [Indexed: 12/28/2022]

Čopar A, Zupan B, Zitnik M. Fast optimization of non-negative matrix tri-factorization. PLoS One 2019;14:e0217994. [PMID: 31185054 PMCID: PMC6559648 DOI: 10.1371/journal.pone.0217994] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 05/22/2019] [Indexed: 11/18/2022] Open

Zhang A, Li A, He J, Wang M. LSCDFS-MKL: A multiple kernel based method for lung squamous cell carcinomas disease-free survival prediction with pathological and genomic data. J Biomed Inform 2019;94:103194. [PMID: 31048071 DOI: 10.1016/j.jbi.2019.103194] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Revised: 04/14/2019] [Accepted: 04/29/2019] [Indexed: 11/18/2022]

Malod-Dognin N, Petschnigg J, Windels SFL, Povh J, Hemingway H, Ketteler R, Pržulj N. Towards a data-integrated cell. Nat Commun 2019;10:805. [PMID: 30778056 PMCID: PMC6379402 DOI: 10.1038/s41467-019-08797-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 01/18/2019] [Accepted: 01/25/2019] [Indexed: 01/01/2023] Open