Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Torun F, Virreira Winter S, Doll S, Riese FM, Vorobyev A, Mueller-Reif JB, Geyer PE, Strauss MT. Transparent Exploration of Machine Learning for Biomarker Discovery from Proteomics and Omics Data. J Proteome Res 2023;22:359-367. [PMID: 36426751 PMCID: PMC9903317 DOI: 10.1021/acs.jproteome.2c00473] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Indexed: 11/27/2022]

Cited by Other Article(s)

Son A, Kim W, Park J, Park Y, Lee W, Lee S, Kim H. Mass Spectrometry Advancements and Applications for Biomarker Discovery, Diagnostic Innovations, and Personalized Medicine. Int J Mol Sci 2024;25:9880. [PMID: 39337367 PMCID: PMC11432749 DOI: 10.3390/ijms25189880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 09/04/2024] [Accepted: 09/10/2024] [Indexed: 09/30/2024] Open

Ji J, Bi F, Zhang X, Zhang Z, Xie Y, Yang Q. Single-cell transcriptome analysis revealed heterogeneity in glycolysis and identified IGF2 as a therapeutic target for ovarian cancer subtypes. BMC Cancer 2024;24:926. [PMID: 39085784 PMCID: PMC11292870 DOI: 10.1186/s12885-024-12688-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 07/24/2024] [Indexed: 08/02/2024] Open

Soni RK. Frontiers in plasma proteome profiling platforms: innovations and applications. Clin Proteomics 2024;21:43. [PMID: 38902643 PMCID: PMC11191172 DOI: 10.1186/s12014-024-09497-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open

Abstract

Biomarkers play a crucial role in advancing precision medicine by enabling more targeted and individualized approaches to diagnosis and treatment. Various biofluids, including serum, plasma, cerebrospinal fluid (CSF), saliva, tears, pancreatic cyst fluids, and urine, have been identified as rich sources of potential biomarkers for the early detection of conditions such as cancer, cardiovascular diseases, and neurodegenerative disorders. The analysis of plasma and serum in proteomics research is challenging because of their high complexity and the wide dynamic range of protein abundance. These factors impede the sensitivity, coverage, and precision of protein detection by mass spectrometry, a widely used technology in discovery proteomics. Conventional approaches such as the Neat Plasma workflow are inefficient at accurately quantifying low-abundance proteins, including those associated with tissue leakage, immune response molecules, interleukins, cytokines, and interferons. Moreover, the manual nature of the workflow poses a significant hurdle in conducting large cohort studies. In this study, we compare workflows for plasma proteomic profiling to establish a methodology that is not only sensitive and reproducible but also applicable to large cohort studies in biomarker discovery. Our investigation revealed that the Proteograph XT workflow outperforms other workflows in terms of plasma proteome depth, quantitative accuracy, and reproducibility while offering complete automation of sample preparation. Notably, Proteograph XT is versatile in that it can be applied to various types of biofluids. Additionally, the quantified proteins broadly cover secretory proteins in peripheral blood, and the pathway analysis, enriched with relevant components such as interleukins, tissue necrosis factors, chemokines, and B and T cell receptors, provides valuable insights. These proteins, often challenging to quantify in complex biological samples, hold potential as early detection markers for various diseases, thereby contributing to improved patient care.

Mahawan T, Luckett T, Mielgo Iza A, Pornputtapong N, Caamaño Gutiérrez E. Robust and consistent biomarker candidates identification by a machine learning approach applied to pancreatic ductal adenocarcinoma metastasis. BMC Med Inform Decis Mak 2024;24:175. [PMID: 38902676 PMCID: PMC11191155 DOI: 10.1186/s12911-024-02578-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 06/14/2024] [Indexed: 06/22/2024] Open

Abstract

BACKGROUND

Machine Learning (ML) plays a crucial role in biomedical research. Nevertheless, it still has limitations in data integration and reproducibility. To address these challenges, robust methods are needed. Pancreatic ductal adenocarcinoma (PDAC), a highly aggressive cancer with low early detection and survival rates, is used as a case study. PDAC lacks reliable diagnostic biomarkers, especially metastatic biomarkers, and this remains an unmet need. In this study, we propose an ML-based approach for discovering disease biomarkers, apply it to the identification of a PDAC metastatic composite biomarker candidate, and demonstrate the advantages of harnessing data resources.

METHODS

We utilised primary tumour RNAseq data from five public repositories, pooling samples to maximise statistical power and integrating data by correcting for technical variance. Data were split into train and validation sets. The train dataset underwent variable selection via a 10-fold cross-validation process that combined three algorithms in 100 models per fold. Genes found in at least 80% of models and five folds were considered robust to build a consensus multivariate model. A random forest model was constructed using selected genes from the train dataset and tested in the validation set. We also assessed the goodness of prediction by recalibrating a model using only the validation data. The biological context and relevance of signals were explored through enrichment and pathway analyses using QIAGEN Ingenuity Pathway Analysis and GeneMANIA.
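
The consensus rule described above can be illustrated with a small counting sketch. It is not the authors' pipeline: it assumes one reading of the rule (a gene must be selected by at least 80% of the models within a fold, and must pass that bar in at least five folds), and the function name, gene names, and toy data are hypothetical.

```python
from collections import Counter


def consensus_genes(selections, model_pct=80, min_folds=5):
    """Return genes that pass the per-fold model threshold in enough folds.

    selections: list of folds; each fold is a list of gene sets,
                one set per fitted model (hypothetical input layout).
    model_pct:  minimum percentage of models in a fold that must select a gene.
    min_folds:  minimum number of folds in which the gene must pass.
    """
    folds_passed = Counter()
    for fold_models in selections:
        # Count in how many models of this fold each gene was selected.
        counts = Counter(gene for model in fold_models for gene in set(model))
        for gene, n in counts.items():
            # Integer comparison avoids floating-point threshold issues.
            if n * 100 >= model_pct * len(fold_models):
                folds_passed[gene] += 1
    return sorted(g for g, k in folds_passed.items() if k >= min_folds)


def toy_fold(b_hits, c_hits, n_models=5):
    """Build a toy fold: GENE_A is always selected, GENE_B and GENE_C
    are selected by the first b_hits / c_hits models respectively."""
    models = []
    for i in range(n_models):
        genes = {"GENE_A"}
        if i < b_hits:
            genes.add("GENE_B")
        if i < c_hits:
            genes.add("GENE_C")
        models.append(genes)
    return models


# Ten toy folds with five models each (the study used 100 models per fold).
folds = [toy_fold(4, 4)] * 3 + [toy_fold(4, 1)] * 3 + [toy_fold(1, 0)] * 4
robust = consensus_genes(folds)
print(robust)
```

In this toy setup GENE_B passes the 80% bar in six folds and is kept, while GENE_C passes in only three folds and is dropped.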

RESULTS

We developed a pipeline that can detect robust signatures to build composite biomarkers. We tested the pipeline in PDAC, exploiting transcriptomics data from different sources, and propose a composite biomarker candidate composed of fifteen consistently selected genes that showed very promising predictive capability. Biological contextualisation revealed links with cancer progression and metastasis, underscoring their potential relevance. All code is available on GitHub.

CONCLUSION

This study establishes a robust framework for identifying composite biomarkers across various disease contexts. We demonstrate its potential by proposing a plausible composite biomarker candidate for PDAC metastasis. By reusing data from public repositories, we highlight the sustainability of our research and the wider applications of our pipeline. The preliminary findings shed light on a promising validation and application path.

Yang Z, Jin K, Chen Y, Liu Q, Chen H, Hu S, Wang Y, Pan Z, Feng F, Shi M, Xie H, Ma H, Zhou H. AM-DMF-SCP: Integrated Single-Cell Proteomics Analysis on an Active Matrix Digital Microfluidic Chip. JACS Au 2024;4:1811-1823. [PMID: 38818059 PMCID: PMC11134390 DOI: 10.1021/jacsau.4c00027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 03/08/2024] [Accepted: 03/08/2024] [Indexed: 06/01/2024]

Affiliation(s)

Zhicheng Yang: Department of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of the Chinese Academy of Sciences, Beijing 100049, China
Kai Jin: CAS Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China
Yimin Chen: Department of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of the Chinese Academy of Sciences, Beijing 100049, China
Qian Liu: Department of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
Hongxu Chen: School of Chinese Materia Medica, Nanjing University of Chinese Medicine, 138 Xianlin Avenue, Nanjing, Jiangsu 210023, China
Siyi Hu: CAS Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China
Yuqiu Wang: Department of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
Zilu Pan: Division of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
Fang Feng: Division of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
Mude Shi: Guangdong ACXEL Micro & Nano Tech Co. Ltd., Foshan, Guangdong Province 528000, China
Hua Xie: University of the Chinese Academy of Sciences, Beijing 100049, China; Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; Division of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
Hanbin Ma: CAS Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; Guangdong ACXEL Micro & Nano Tech Co. Ltd., Foshan, Guangdong Province 528000, China
Hu Zhou: Department of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of the Chinese Academy of Sciences, Beijing 100049, China; Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China

Wang T, Chen H, Li N, Zhang B, Min H. Aqueous humor proteomics analyzed by bioinformatics and machine learning in PDR cases versus controls. Clin Proteomics 2024;21:36. [PMID: 38764026 PMCID: PMC11103871 DOI: 10.1186/s12014-024-09481-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 04/07/2024] [Indexed: 05/21/2024] Open

Topitsch A, Halstenbach T, Rothweiler R, Fretwurst T, Nelson K, Schilling O. Mass Spectrometry-Based Proteomics of Poly(methylmethacrylate)-Embedded Bone. J Proteome Res 2024;23:1810-1820. [PMID: 38634750 DOI: 10.1021/acs.jproteome.4c00046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]

Mukherjee A, Abraham S, Singh A, Balaji S, Mukunthan KS. From Data to Cure: A Comprehensive Exploration of Multi-omics Data Analysis for Targeted Therapies. Mol Biotechnol 2024:10.1007/s12033-024-01133-6. [PMID: 38565775 DOI: 10.1007/s12033-024-01133-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 02/27/2024] [Indexed: 04/04/2024]

Strauss MT, Bludau I, Zeng WF, Voytik E, Ammar C, Schessner JP, Ilango R, Gill M, Meier F, Willems S, Mann M. AlphaPept: a modern and open framework for MS-based proteomics. Nat Commun 2024;15:2168. [PMID: 38461149 PMCID: PMC10924963 DOI: 10.1038/s41467-024-46485-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 02/20/2024] [Indexed: 03/11/2024] Open

Govender MA, Stoychev SH, Brandenburg JT, Ramsay M, Fabian J, Govender IS. Proteomic insights into the pathophysiology of hypertension-associated albuminuria: Pilot study in a South African cohort. Clin Proteomics 2024;21:15. [PMID: 38402394 PMCID: PMC10893729 DOI: 10.1186/s12014-024-09458-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 02/06/2024] [Indexed: 02/26/2024] Open

Abstract

BACKGROUND

Hypertension is an important public health priority with a high prevalence in Africa. It is also an independent risk factor for kidney outcomes. We aimed to identify potential proteins and pathways involved in hypertension-associated albuminuria by assessing urinary proteomic profiles in black South African participants with combined hypertension and albuminuria compared to those who have neither condition.

METHODS

The study included 24 South African cases with both hypertension and albuminuria and 49 control participants who had neither condition. Protein was extracted from urine samples and analysed using ultra-high-performance liquid chromatography coupled with mass spectrometry. Data were generated using data-independent acquisition (DIA) and processed using Spectronaut™ 15. Statistical analysis and functional annotation were performed in Perseus and Cytoscape to identify and annotate differentially abundant proteins. Machine learning was applied to the dataset using the OmicLearn platform.

RESULTS

Overall, a mean of 1,225 and 915 proteins were quantified in the control and case groups, respectively. Three hundred and thirty-two differentially abundant proteins were assembled into a network. Pathways associated with these differentially abundant proteins included the immune system (q-value [false discovery rate] = 1.4 × 10⁻⁴⁵), innate immune system (q = 1.1 × 10⁻³²), extracellular matrix (ECM) organisation (q = 0.03) and activation of matrix metalloproteinases (q = 0.04). Proteins with high disease scores (76-100% confidence) for both hypertension and chronic kidney disease included angiotensinogen (AGT), albumin (ALB), apolipoprotein L1 (APOL1), and uromodulin (UMOD). A machine learning approach identified a set of 20 proteins that differentiated between cases and controls.

CONCLUSIONS

The urinary proteomic data combined with the machine learning approach was able to classify disease status and identify proteins and pathways associated with hypertension-associated albuminuria.

Dowling P, Trollet C, Negroni E, Swandulla D, Ohlendieck K. How Can Proteomics Help to Elucidate the Pathophysiological Crosstalk in Muscular Dystrophy and Associated Multi-System Dysfunction? Proteomes 2024;12:4. [PMID: 38250815 PMCID: PMC10801633 DOI: 10.3390/proteomes12010004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/09/2024] [Accepted: 01/12/2024] [Indexed: 01/23/2024] Open

Das A, Behera RN, Kapoor A, Ambatipudi K. The Potential of Meta-Proteomics and Artificial Intelligence to Establish the Next Generation of Probiotics for Personalized Healthcare. J Agric Food Chem 2023;71:17528-17542. [PMID: 37955263 DOI: 10.1021/acs.jafc.3c03834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2023]

Samadishadlou M, Rahbarghazi R, Piryaei Z, Esmaeili M, Avcı ÇB, Bani F, Kavousi K. Unlocking the potential of microRNAs: machine learning identifies key biomarkers for myocardial infarction diagnosis. Cardiovasc Diabetol 2023;22:247. [PMID: 37697288 PMCID: PMC10496209 DOI: 10.1186/s12933-023-01957-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 08/10/2023] [Indexed: 09/13/2023] Open

Abstract

BACKGROUND

MicroRNAs (miRNAs) play a crucial role in regulating adaptive and maladaptive responses in cardiovascular diseases, making them attractive targets for potential biomarkers. However, their potential as novel biomarkers for diagnosing cardiovascular diseases requires systematic evaluation.

METHODS

In this study, we aimed to identify a key set of miRNA biomarkers using integrated bioinformatics and machine learning analysis. We combined and analyzed three gene expression datasets from the Gene Expression Omnibus (GEO) database, which contains peripheral blood mononuclear cell (PBMC) samples from individuals with myocardial infarction (MI), stable coronary artery disease (CAD), and healthy individuals. Additionally, we selected a set of miRNAs based on their area under the receiver operating characteristic curve (AUC-ROC) for separating the CAD and MI samples. We designed a two-layer architecture for sample classification, in which the first layer isolates healthy samples from unhealthy samples, and the second layer classifies stable CAD and MI samples. We trained different machine learning models using both biomarker sets and evaluated their performance on a test set.
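
The two-layer architecture described above can be sketched as a simple cascade. This is an illustration only: the abstract reports an SVM as the best second-layer model, whereas the sketch below swaps in a nearest-centroid classifier so that it runs with the Python standard library alone. The class names, feature values, and labels are hypothetical.

```python
from statistics import mean


class NearestCentroid:
    """Minimal nearest-centroid classifier (a stand-in for the study's models)."""

    def fit(self, X, y):
        self.centroids = {}
        for label in set(y):
            rows = [x for x, t in zip(X, y) if t == label]
            # Column-wise mean of all samples carrying this label.
            self.centroids[label] = [mean(col) for col in zip(*rows)]
        return self

    def predict_one(self, x):
        def dist2(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))

        return min(self.centroids, key=lambda lab: dist2(self.centroids[lab]))


class TwoLayerClassifier:
    """Layer 1 separates healthy from unhealthy; layer 2 separates CAD from MI."""

    def fit(self, X, y):
        y1 = ["healthy" if t == "healthy" else "unhealthy" for t in y]
        self.layer1 = NearestCentroid().fit(X, y1)
        # Layer 2 is trained only on the unhealthy samples.
        sick = [(x, t) for x, t in zip(X, y) if t != "healthy"]
        self.layer2 = NearestCentroid().fit(
            [x for x, _ in sick], [t for _, t in sick]
        )
        return self

    def predict_one(self, x):
        if self.layer1.predict_one(x) == "healthy":
            return "healthy"
        return self.layer2.predict_one(x)


# Toy two-feature "expression" values, invented for illustration.
X = [[0.0, 0.0], [0.2, 0.1], [3.0, 0.0], [3.2, 0.2], [3.0, 3.0], [3.1, 2.9]]
y = ["healthy", "healthy", "CAD", "CAD", "MI", "MI"]
model = TwoLayerClassifier().fit(X, y)
print(model.predict_one([2.9, 0.1]))
```

Training the second layer only on unhealthy samples keeps its decision boundary focused on the CAD versus MI distinction, which mirrors the layered design described in the abstract.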

RESULTS

We identified hsa-miR-21-3p, hsa-miR-186-5p, and hsa-miR-32-3p as the differentially expressed miRNAs, and a set including hsa-miR-186-5p, hsa-miR-21-3p, hsa-miR-197-5p, hsa-miR-29a-5p, and hsa-miR-296-5p as the optimum set of miRNAs selected by their AUC-ROC. Both biomarker sets could distinguish healthy from not-healthy samples with complete accuracy. The best performance for the classification of CAD and MI was achieved with an SVM model trained using the biomarker set selected by AUC-ROC, with an AUC-ROC of 0.96 and an accuracy of 0.94 on the test data.

CONCLUSIONS

Our study demonstrated that miRNA signatures derived from PBMCs could serve as valuable novel biomarkers for cardiovascular diseases.

Hartman E, Scott AM, Karlsson C, Mohanty T, Vaara ST, Linder A, Malmström L, Malmström J. Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis. Nat Commun 2023;14:5359. [PMID: 37660105 PMCID: PMC10475049 DOI: 10.1038/s41467-023-41146-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 08/22/2023] [Indexed: 09/04/2023] Open