1
|
Son A, Kim W, Park J, Park Y, Lee W, Lee S, Kim H. Mass Spectrometry Advancements and Applications for Biomarker Discovery, Diagnostic Innovations, and Personalized Medicine. Int J Mol Sci 2024; 25:9880. [PMID: 39337367 PMCID: PMC11432749 DOI: 10.3390/ijms25189880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 09/04/2024] [Accepted: 09/10/2024] [Indexed: 09/30/2024] Open
Abstract
Mass spectrometry (MS) has revolutionized clinical chemistry, offering unparalleled capabilities for biomolecule analysis. This review explores the growing significance of mass spectrometry (MS), particularly when coupled with liquid chromatography (LC), in identifying disease biomarkers and quantifying biomolecules for diagnostic and prognostic purposes. The unique advantages of MS in accurately identifying and quantifying diverse molecules have positioned it as a cornerstone in personalized-medicine advancement. MS-based technologies have transformed precision medicine, enabling a comprehensive understanding of disease mechanisms and patient-specific treatment responses. LC-MS has shown exceptional utility in analyzing complex biological matrices, while high-resolution MS has expanded analytical capabilities, allowing the detection of low-abundance molecules and the elucidation of complex biological pathways. The integration of MS with other techniques, such as ion mobility spectrometry, has opened new avenues for biomarker discovery and validation. As we progress toward precision medicine, MS-based technologies will be crucial in addressing the challenges of individualized patient care, driving innovations in disease diagnosis, prognosis, and treatment strategies.
Collapse
Affiliation(s)
- Ahrum Son
- Department of Molecular Medicine, Scripps Research, San Diego, CA 92037, USA
| | - Woojin Kim
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Jongham Park
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Yongho Park
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Wonseok Lee
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Sangwoon Lee
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Hyunsoo Kim
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea
- Department of Convergent Bioscience and Informatics, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- Protein AI Design Institute, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- SCICS, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| |
Collapse
|
2
|
Ji J, Bi F, Zhang X, Zhang Z, Xie Y, Yang Q. Single-cell transcriptome analysis revealed heterogeneity in glycolysis and identified IGF2 as a therapeutic target for ovarian cancer subtypes. BMC Cancer 2024; 24:926. [PMID: 39085784 PMCID: PMC11292870 DOI: 10.1186/s12885-024-12688-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 07/24/2024] [Indexed: 08/02/2024] Open
Abstract
BACKGROUND As the most malignant tumor of the female reproductive system, ovarian cancer (OC) has garnered increasing attention. The Warburg effect, driven by glycolysis, accounts for tumor cell proliferation under aerobic conditions. However, the metabolic heterogeneity linked to glycolysis in OC remains elusive. METHODS We integrated single-cell data with OC to score glycolysis level in tumor cell subclusters. This led to the identification of a subcluster predominantly characterized by glycolysis, with a strong correlation to patient prognosis. Core transcription factors were pinpointed using hdWGCNA and metaVIPER. A specific transcription factor regulatory network was then constructed. A glycolysis-related prognostic model was developed and tested for estimating OC prognosis with a total of 85 machine-learning combinations, focusing on specific upregulated genes of two subtypes. We identified IGF2 as a key within the prognostic model and investigated its impact on OC progression and drug resistance through in vitro experiments, including the transwell assay, lactate production detection, and the CCK-8 assay. RESULTS Analysis showed that the Malignant 7 subcluster was primarily related to glycolysis. Two OC molecular subtypes, CS1 and CS2, were identified with distinct clinical, biological, and microenvironmental traits. A prognostic model was built, and IGF2 emerged as a key gene linked to prognosis. Experiments have proven that IGF2 can promote the glycolysis pathway and the malignant biological progression of OC cells. CONCLUSIONS We developed two novel OC subtypes based on glycolysis score, established a stable prognostic model, and identified IGF2 as the marker gene. These insights provided a new avenue for exploring OC's molecular mechanisms and personalized treatment approaches.
Collapse
Affiliation(s)
- Jinting Ji
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China
| | - Fangfang Bi
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China
| | - Xiaocui Zhang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China
| | - Zhiming Zhang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China
| | - Yichi Xie
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China
| | - Qing Yang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, Liaoning, 110004, China.
| |
Collapse
|
3
|
Soni RK. Frontiers in plasma proteome profiling platforms: innovations and applications. Clin Proteomics 2024; 21:43. [PMID: 38902643 PMCID: PMC11191172 DOI: 10.1186/s12014-024-09497-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open
Abstract
Biomarkers play a crucial role in advancing precision medicine by enabling more targeted and individualized approaches to diagnosis and treatment. Various biofluids, including serum, plasma, cerebrospinal fluid (CSF), saliva, tears, pancreatic cyst fluids, and urine, have been identified as rich sources of potential for the early detection of disease biomarkers in conditions such as cancer, cardiovascular diseases, and neurodegenerative disorders. The analysis of plasma and serum in proteomics research encounters challenges due to their high complexity and the wide dynamic range of protein abundance. These factors impede the sensitivity, coverage, and precision of protein detection when employing mass spectrometry, a widely utilized technology in discovery proteomics. Conventional approaches such as Neat Plasma workflow are inefficient in accurately quantifying low-abundant proteins, including those associated with tissue leakage, immune response molecules, interleukins, cytokines, and interferons. Moreover, the manual nature of the workflow poses a significant hurdle in conducting large cohort studies. In this study, our focus is on comparing workflows for plasma proteomic profiling to establish a methodology that is not only sensitive and reproducible but also applicable for large cohort studies in biomarker discovery. Our investigation revealed that the Proteograph XT workflow outperforms other workflows in terms of plasma proteome depth, quantitative accuracy, and reproducibility while offering complete automation of sample preparation. Notably, Proteograph XT demonstrates versatility by applying it to various types of biofluids. Additionally, the proteins quantified widely cover secretory proteins in peripheral blood, and the pathway analysis enriched with relevant components such as interleukins, tissue necrosis factors, chemokines, and B and T cell receptors provides valuable insights. These proteins, often challenging to quantify in complex biological samples, hold potential as early detection markers for various diseases, thereby contributing to the improvement of patient care quality.
Collapse
Affiliation(s)
- Rajesh Kumar Soni
- Proteomics and Macromolecular Crystallography Shared Resource, Columbia University Irving Medical Center, New York, USA.
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center, New York, USA.
| |
Collapse
|
4
|
Mahawan T, Luckett T, Mielgo Iza A, Pornputtapong N, Caamaño Gutiérrez E. Robust and consistent biomarker candidates identification by a machine learning approach applied to pancreatic ductal adenocarcinoma metastasis. BMC Med Inform Decis Mak 2024; 24:175. [PMID: 38902676 PMCID: PMC11191155 DOI: 10.1186/s12911-024-02578-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 06/14/2024] [Indexed: 06/22/2024] Open
Abstract
BACKGROUND Machine Learning (ML) plays a crucial role in biomedical research. Nevertheless, it still has limitations in data integration and irreproducibility. To address these challenges, robust methods are needed. Pancreatic ductal adenocarcinoma (PDAC), a highly aggressive cancer with low early detection rates and survival rates, is used as a case study. PDAC lacks reliable diagnostic biomarkers, especially metastatic biomarkers, which remains an unmet need. In this study, we propose an ML-based approach for discovering disease biomarkers, apply it to the identification of a PDAC metastatic composite biomarker candidate, and demonstrate the advantages of harnessing data resources. METHODS We utilised primary tumour RNAseq data from five public repositories, pooling samples to maximise statistical power and integrating data by correcting for technical variance. Data were split into train and validation sets. The train dataset underwent variable selection via a 10-fold cross-validation process that combined three algorithms in 100 models per fold. Genes found in at least 80% of models and five folds were considered robust to build a consensus multivariate model. A random forest model was constructed using selected genes from the train dataset and tested in the validation set. We also assessed the goodness of prediction by recalibrating a model using only the validation data. The biological context and relevance of signals was explored through enrichment and pathway analyses using QIAGEN Ingenuity Pathway Analysis and GeneMANIA. RESULTS We developed a pipeline that can detect robust signatures to build composite biomarkers. We tested the pipeline in PDAC, exploiting transcriptomics data from different sources, proposing a composite biomarker candidate comprised of fifteen genes consistently selected that showed very promising predictive capability. Biological contextualisation revealed links with cancer progression and metastasis, underscoring their potential relevance. All code is available in GitHub. CONCLUSION This study establishes a robust framework for identifying composite biomarkers across various disease contexts. We demonstrate its potential by proposing a plausible composite biomarker candidate for PDAC metastasis. By reusing data from public repositories, we highlight the sustainability of our research and the wider applications of our pipeline. The preliminary findings shed light on a promising validation and application path.
Collapse
Affiliation(s)
- Tanakamol Mahawan
- Program in Bioinformatics and Computational Biology, Graduate School, Chulalongkorn University, Bangkok, Thailand
- Department of Biochemistry & System Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
- Akkhraratchakumari Veterinary College, Walailak University, Nakhon Si Thammarat, Thailand
| | - Teifion Luckett
- Department of Molecular and Clinical Cancer Medicine, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Ainhoa Mielgo Iza
- Department of Molecular and Clinical Cancer Medicine, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Natapol Pornputtapong
- Department of Biochemistry and Microbiology, Faculty of Pharmaceutical Sciences, and Center of Excellence in Systems Biology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Eva Caamaño Gutiérrez
- Department of Biochemistry & System Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK.
- Computational Biology Facility, LIV-SRF, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, UK.
| |
Collapse
|
5
|
Yang Z, Jin K, Chen Y, Liu Q, Chen H, Hu S, Wang Y, Pan Z, Feng F, Shi M, Xie H, Ma H, Zhou H. AM-DMF-SCP: Integrated Single-Cell Proteomics Analysis on an Active Matrix Digital Microfluidic Chip. JACS AU 2024; 4:1811-1823. [PMID: 38818059 PMCID: PMC11134390 DOI: 10.1021/jacsau.4c00027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 03/08/2024] [Accepted: 03/08/2024] [Indexed: 06/01/2024]
Abstract
Single-cell proteomics offers unparalleled insights into cellular diversity and molecular mechanisms, enabling a deeper understanding of complex biological processes at the individual cell level. Here, we develop an integrated sample processing on an active-matrix digital microfluidic chip for single-cell proteomics (AM-DMF-SCP). Employing the AM-DMF-SCP approach and data-independent acquisition (DIA), we identify an average of 2258 protein groups in single HeLa cells within 15 min of the liquid chromatography gradient. We performed comparative analyses of three tumor cell lines: HeLa, A549, and HepG2, and machine learning was utilized to identify the unique features of these cell lines. Applying the AM-DMF-SCP to characterize the proteomes of a third-generation EGFR inhibitor, ASK120067-resistant cells (67R) and their parental NCI-H1975 cells, we observed a potential correlation between elevated VIM expression and 67R resistance, which is consistent with the findings from bulk sample analyses. These results suggest that AM-DMF-SCP is an automated, robust, and sensitive platform for single-cell proteomics and demonstrate the potential for providing valuable insights into cellular mechanisms.
Collapse
Affiliation(s)
- Zhicheng Yang
- Department
of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai
Institute of Materia Medica, Chinese Academy
of Sciences, Shanghai 201203, China
- University
of the Chinese Academy of Sciences, Beijing 100049, China
| | - Kai Jin
- CAS
Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical
Engineering and Technology, Chinese Academy
of Sciences, Suzhou 215163, China
| | - Yimin Chen
- Department
of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai
Institute of Materia Medica, Chinese Academy
of Sciences, Shanghai 201203, China
- University
of the Chinese Academy of Sciences, Beijing 100049, China
| | - Qian Liu
- Department
of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai
Institute of Materia Medica, Chinese Academy
of Sciences, Shanghai 201203, China
| | - Hongxu Chen
- School
of Chinese Materia Medica, Nanjing University
of Chinese Medicine, 138 Xianlin Avenue, Nanjing, Jiangsu 210023, China
| | - Siyi Hu
- CAS
Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical
Engineering and Technology, Chinese Academy
of Sciences, Suzhou 215163, China
| | - Yuqiu Wang
- Department
of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai
Institute of Materia Medica, Chinese Academy
of Sciences, Shanghai 201203, China
| | - Zilu Pan
- Division
of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
| | - Fang Feng
- Division
of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
| | - Mude Shi
- Guangdong
ACXEL Micro & Nano Tech Co. Ltd., Foshan, Guangdong Province 528000, China
| | - Hua Xie
- University
of the Chinese Academy of Sciences, Beijing 100049, China
- Zhongshan
Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China
- Division
of Antitumor Pharmacology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
| | - Hanbin Ma
- CAS
Key Laboratory of Bio-Medical Diagnostics, Suzhou Institute of Biomedical
Engineering and Technology, Chinese Academy
of Sciences, Suzhou 215163, China
- Guangdong
ACXEL Micro & Nano Tech Co. Ltd., Foshan, Guangdong Province 528000, China
| | - Hu Zhou
- Department
of Analytical Chemistry, State Key Laboratory of Drug Research, Shanghai
Institute of Materia Medica, Chinese Academy
of Sciences, Shanghai 201203, China
- University
of the Chinese Academy of Sciences, Beijing 100049, China
- Hangzhou
Institute for Advanced Study, University
of Chinese Academy of Sciences, Hangzhou 310024, China
| |
Collapse
|
6
|
Wang T, Chen H, Li N, Zhang B, Min H. Aqueous humor proteomics analyzed by bioinformatics and machine learning in PDR cases versus controls. Clin Proteomics 2024; 21:36. [PMID: 38764026 PMCID: PMC11103871 DOI: 10.1186/s12014-024-09481-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 04/07/2024] [Indexed: 05/21/2024] Open
Abstract
BACKGROUND To comprehend the complexities of pathophysiological mechanisms and molecular events that contribute to proliferative diabetic retinopathy (PDR) and evaluate the diagnostic value of aqueous humor (AH) in monitoring the onset of PDR. METHODS A cohort containing 16 PDR and 10 cataract patients and another validation cohort containing 8 PDR and 4 cataract patients were studied. AH was collected and subjected to proteomics analyses. Bioinformatics analysis and a machine learning-based pipeline called inference of biomolecular combinations with minimal bias were used to explore the functional relevance, hub proteins, and biomarkers. RESULTS Deep profiling of AH proteomes revealed several insights. First, the combination of SIAE, SEMA7A, GNS, and IGKV3D-15 and the combination of ATP6AP1, SPARCL1, and SERPINA7 could serve as surrogate protein biomarkers for monitoring PDR progression. Second, ALB, FN1, ACTB, SERPINA1, C3, and VTN acted as hub proteins in the profiling of AH proteomes. SERPINA1 was the protein with the highest correlation coefficient not only for BCVA but also for the duration of diabetes. Third, "Complement and coagulation cascades" was an important pathway for PDR development. CONCLUSIONS AH proteomics provides stable and accurate biomarkers for early warning and diagnosis of PDR. This study provides a deep understanding of the molecular mechanisms of PDR and a rich resource for optimizing PDR management.
Collapse
Affiliation(s)
- Tan Wang
- Department of Ophthalmology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, No.1 Shuai Fu Yuan, Dongcheng District, Beijing, 100730, China
- Key Laboratory of Ocular Fundus Diseases, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100730, China
| | - Huan Chen
- Department of Ophthalmology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, No.1 Shuai Fu Yuan, Dongcheng District, Beijing, 100730, China
- Key Laboratory of Ocular Fundus Diseases, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100730, China
| | - Ningning Li
- Operating Room, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100730, China
| | - Bao Zhang
- Animal Nutrition Institute, Sichuan Agricultural University, Chengdu, 611130, China
| | - Hanyi Min
- Department of Ophthalmology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, No.1 Shuai Fu Yuan, Dongcheng District, Beijing, 100730, China.
- Key Laboratory of Ocular Fundus Diseases, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100730, China.
- Department of Ophthalmology, Aier Eye Hospital, Tianjin University, Nankai District, Fukang Road No.102, Tianjin, China.
| |
Collapse
|
7
|
Topitsch A, Halstenbach T, Rothweiler R, Fretwurst T, Nelson K, Schilling O. Mass Spectrometry-Based Proteomics of Poly(methylmethacrylate)-Embedded Bone. J Proteome Res 2024; 23:1810-1820. [PMID: 38634750 DOI: 10.1021/acs.jproteome.4c00046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
Liquid chromatography-tandem mass spectrometry (LC-MS/MS) is a widely employed technique in proteomics research for studying the proteome biology of various clinical samples. Hard tissues, such as bone and teeth, are routinely preserved using synthetic poly(methyl methacrylate) (PMMA) embedding resins that enable histological, immunohistochemical, and morphological examination. However, the suitability of PMMA-embedded hard tissues for large-scale proteomic analysis remained unexplored. This study is the first to report on the feasibility of PMMA-embedded bone samples for LC-MS/MS analysis. Conventional workflows yielded merely limited coverage of the bone proteome. Using advanced strategies of prefractionation by high-pH reversed-phase liquid chromatography in combination with isobaric tandem mass tag labeling resulted in proteome coverage exceeding 1000 protein identifications. The quantitative comparison with cryopreserved samples revealed that each sample preparation workflow had a distinct impact on the proteomic profile. However, workflow replicates exhibited a high reproducibility for PMMA-embedded samples. Our findings further demonstrate that decalcification prior to protein extraction, along with the analysis of solubilization fractions, is not preferred for PMMA-embedded bone. The biological applicability of the proposed workflow was demonstrated using samples of human PMMA-embedded alveolar bone and the iliac crest, which revealed anatomical site-specific proteomic profiles. Overall, these results establish a crucial foundation for large-scale proteomics studies contributing to our knowledge of bone biology.
Collapse
Affiliation(s)
- Annika Topitsch
- Institute for Surgical Pathology, Faculty of Medicine, Medical Center - University of Freiburg, Breisacher Straße 115a, 79106 Freiburg, Germany
- Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Albertstraße 19a, 79104 Freiburg, Germany
- Faculty of Biology, University of Freiburg, Schänzlestraße 1, 79104 Freiburg, Germany
- Department of Oral and Maxillofacial Surgery/Translational Implantology, Faculty of Medicine, Medical Center - University of Freiburg, Hugstetter Straße 55, 79106 Freiburg, Germany
| | - Tim Halstenbach
- Department of Oral and Maxillofacial Surgery/Translational Implantology, Faculty of Medicine, Medical Center - University of Freiburg, Hugstetter Straße 55, 79106 Freiburg, Germany
| | - René Rothweiler
- Department of Oral and Maxillofacial Surgery/Translational Implantology, Faculty of Medicine, Medical Center - University of Freiburg, Hugstetter Straße 55, 79106 Freiburg, Germany
| | - Tobias Fretwurst
- Department of Oral and Maxillofacial Surgery/Translational Implantology, Faculty of Medicine, Medical Center - University of Freiburg, Hugstetter Straße 55, 79106 Freiburg, Germany
| | - Katja Nelson
- Department of Oral and Maxillofacial Surgery/Translational Implantology, Faculty of Medicine, Medical Center - University of Freiburg, Hugstetter Straße 55, 79106 Freiburg, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Faculty of Medicine, Medical Center - University of Freiburg, Breisacher Straße 115a, 79106 Freiburg, Germany
| |
Collapse
|
8
|
Mukherjee A, Abraham S, Singh A, Balaji S, Mukunthan KS. From Data to Cure: A Comprehensive Exploration of Multi-omics Data Analysis for Targeted Therapies. Mol Biotechnol 2024:10.1007/s12033-024-01133-6. [PMID: 38565775 DOI: 10.1007/s12033-024-01133-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 02/27/2024] [Indexed: 04/04/2024]
Abstract
In the dynamic landscape of targeted therapeutics, drug discovery has pivoted towards understanding underlying disease mechanisms, placing a strong emphasis on molecular perturbations and target identification. This paradigm shift, crucial for drug discovery, is underpinned by big data, a transformative force in the current era. Omics data, characterized by its heterogeneity and enormity, has ushered biological and biomedical research into the big data domain. Acknowledging the significance of integrating diverse omics data strata, known as multi-omics studies, researchers delve into the intricate interrelationships among various omics layers. This review navigates the expansive omics landscape, showcasing tailored assays for each molecular layer through genomes to metabolomes. The sheer volume of data generated necessitates sophisticated informatics techniques, with machine-learning (ML) algorithms emerging as robust tools. These datasets not only refine disease classification but also enhance diagnostics and foster the development of targeted therapeutic strategies. Through the integration of high-throughput data, the review focuses on targeting and modeling multiple disease-regulated networks, validating interactions with multiple targets, and enhancing therapeutic potential using network pharmacology approaches. Ultimately, this exploration aims to illuminate the transformative impact of multi-omics in the big data era, shaping the future of biological research.
Collapse
Affiliation(s)
- Arnab Mukherjee
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
| | - Suzanna Abraham
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
| | - Akshita Singh
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
| | - S Balaji
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
| | - K S Mukunthan
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India.
| |
Collapse
|
9
|
Strauss MT, Bludau I, Zeng WF, Voytik E, Ammar C, Schessner JP, Ilango R, Gill M, Meier F, Willems S, Mann M. AlphaPept: a modern and open framework for MS-based proteomics. Nat Commun 2024; 15:2168. [PMID: 38461149 PMCID: PMC10924963 DOI: 10.1038/s41467-024-46485-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 02/20/2024] [Indexed: 03/11/2024] Open
Abstract
In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
Collapse
Affiliation(s)
- Maximilian T Strauss
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
- NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
| | - Isabell Bludau
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Wen-Feng Zeng
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Eugenia Voytik
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Constantin Ammar
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Julia P Schessner
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | | | | | - Florian Meier
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
- Functional Proteomics, Jena University Hospital, Jena, Germany
| | - Sander Willems
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Matthias Mann
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
- NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
10
|
Govender MA, Stoychev SH, Brandenburg JT, Ramsay M, Fabian J, Govender IS. Proteomic insights into the pathophysiology of hypertension-associated albuminuria: Pilot study in a South African cohort. Clin Proteomics 2024; 21:15. [PMID: 38402394 PMCID: PMC10893729 DOI: 10.1186/s12014-024-09458-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 02/06/2024] [Indexed: 02/26/2024] Open
Abstract
BACKGROUND Hypertension is an important public health priority with a high prevalence in Africa. It is also an independent risk factor for kidney outcomes. We aimed to identify potential proteins and pathways involved in hypertension-associated albuminuria by assessing urinary proteomic profiles in black South African participants with combined hypertension and albuminuria compared to those who have neither condition. METHODS The study included 24 South African cases with both hypertension and albuminuria and 49 control participants who had neither condition. Protein was extracted from urine samples and analysed using ultra-high-performance liquid chromatography coupled with mass spectrometry. Data were generated using data-independent acquisition (DIA) and processed using Spectronaut™ 15. Statistical and functional data annotation were performed on Perseus and Cytoscape to identify and annotate differentially abundant proteins. Machine learning was applied to the dataset using the OmicLearn platform. RESULTS Overall, a mean of 1,225 and 915 proteins were quantified in the control and case groups, respectively. Three hundred and thirty-two differentially abundant proteins were constructed into a network. Pathways associated with these differentially abundant proteins included the immune system (q-value [false discovery rate] = 1.4 × 10- 45), innate immune system (q = 1.1 × 10- 32), extracellular matrix (ECM) organisation (q = 0.03) and activation of matrix metalloproteinases (q = 0.04). Proteins with high disease scores (76-100% confidence) for both hypertension and chronic kidney disease included angiotensinogen (AGT), albumin (ALB), apolipoprotein L1 (APOL1), and uromodulin (UMOD). A machine learning approach was able to identify a set of 20 proteins, differentiating between cases and controls. CONCLUSIONS The urinary proteomic data combined with the machine learning approach was able to classify disease status and identify proteins and pathways associated with hypertension-associated albuminuria.
Collapse
Affiliation(s)
- Melanie A Govender
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa.
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa.
| | - Stoyan H Stoychev
- Council for Scientific and Industrial Research, NextGen Health, Pretoria, South Africa
- ReSyn Biosciences, Edenvale, South Africa
| | - Jean-Tristan Brandenburg
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Strengthening Oncology Services, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Michèle Ramsay
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - June Fabian
- Wits Donald Gordon Medical Centre, School of Clinical Medicine, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Medical Research Council/Wits University Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Ireshyn S Govender
- Council for Scientific and Industrial Research, NextGen Health, Pretoria, South Africa.
- ReSyn Biosciences, Edenvale, South Africa.
| |
Collapse
|
11
|
Dowling P, Trollet C, Negroni E, Swandulla D, Ohlendieck K. How Can Proteomics Help to Elucidate the Pathophysiological Crosstalk in Muscular Dystrophy and Associated Multi-System Dysfunction? Proteomes 2024; 12:4. [PMID: 38250815 PMCID: PMC10801633 DOI: 10.3390/proteomes12010004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/09/2024] [Accepted: 01/12/2024] [Indexed: 01/23/2024] Open
Abstract
This perspective article is concerned with the question of how proteomics, which is a core technique of systems biology that is deeply embedded in the multi-omics field of modern bioresearch, can help us better understand the molecular pathogenesis of complex diseases. As an illustrative example of a monogenetic disorder that primarily affects the neuromuscular system but is characterized by a plethora of multi-system pathophysiological alterations, the muscle-wasting disease Duchenne muscular dystrophy was examined. Recent achievements in the field of dystrophinopathy research are described with special reference to the proteome-wide complexity of neuromuscular changes and body-wide alterations/adaptations. Based on a description of the current applications of top-down versus bottom-up proteomic approaches and their technical challenges, future systems biological approaches are outlined. The envisaged holistic and integromic bioanalysis would encompass the integration of diverse omics-type studies including inter- and intra-proteomics as the core disciplines for systematic protein evaluations, with sophisticated biomolecular analyses, including physiology, molecular biology, biochemistry and histochemistry. Integrated proteomic findings promise to be instrumental in improving our detailed knowledge of pathogenic mechanisms and multi-system dysfunction, widening the available biomarker signature of dystrophinopathy for improved diagnostic/prognostic procedures, and advancing the identification of novel therapeutic targets to treat Duchenne muscular dystrophy.
Collapse
Affiliation(s)
- Paul Dowling
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| | - Capucine Trollet
- Center for Research in Myology U974, Sorbonne Université, INSERM, Myology Institute, 75013 Paris, France; (C.T.); (E.N.)
| | - Elisa Negroni
- Center for Research in Myology U974, Sorbonne Université, INSERM, Myology Institute, 75013 Paris, France; (C.T.); (E.N.)
| | - Dieter Swandulla
- Institute of Physiology, Faculty of Medicine, University of Bonn, D53115 Bonn, Germany;
| | - Kay Ohlendieck
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| |
Collapse
|
12
|
Das A, Behera RN, Kapoor A, Ambatipudi K. The Potential of Meta-Proteomics and Artificial Intelligence to Establish the Next Generation of Probiotics for Personalized Healthcare. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:17528-17542. [PMID: 37955263 DOI: 10.1021/acs.jafc.3c03834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2023]
Abstract
The symbiosis of probiotic bacteria with humans has rendered various health benefits while providing nutrition and a suitable environment for their survival. However, the probiotics must survive unfavorable gut conditions to exert beneficial effects. The intrinsic resistance of probiotics to survive harsh conditions results from a myriad of proteins. Interaction of microbial proteins with the host is indispensable for modulating the gut microbiome, such as interaction with cell receptors and protective action against pathogens. The complex interplay of proteins should be unraveled by utilizing metaproteomic strategies. The contribution of probiotics to health is now widely accepted. However, due to the inconsistency of generalized probiotics, contemporary research toward precision probiotics has gained momentum for customized treatment. This review explores the application of metaproteomics and AI/ML algorithms in resolving multiomics data analysis and in silico prediction of microbial features for screening specific beneficial probiotic organisms. Implementing these integrative strategies could augment the potential of precision probiotics for personalized healthcare.
Collapse
Affiliation(s)
- Arpita Das
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, India
| | - Rama N Behera
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, India
| | - Ayushi Kapoor
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, India
| | - Kiran Ambatipudi
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, India
| |
Collapse
|
13
|
Samadishadlou M, Rahbarghazi R, Piryaei Z, Esmaeili M, Avcı ÇB, Bani F, Kavousi K. Unlocking the potential of microRNAs: machine learning identifies key biomarkers for myocardial infarction diagnosis. Cardiovasc Diabetol 2023; 22:247. [PMID: 37697288 PMCID: PMC10496209 DOI: 10.1186/s12933-023-01957-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 08/10/2023] [Indexed: 09/13/2023] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) play a crucial role in regulating adaptive and maladaptive responses in cardiovascular diseases, making them attractive targets for potential biomarkers. However, their potential as novel biomarkers for diagnosing cardiovascular diseases requires systematic evaluation. METHODS In this study, we aimed to identify a key set of miRNA biomarkers using integrated bioinformatics and machine learning analysis. We combined and analyzed three gene expression datasets from the Gene Expression Omnibus (GEO) database, which contains peripheral blood mononuclear cell (PBMC) samples from individuals with myocardial infarction (MI), stable coronary artery disease (CAD), and healthy individuals. Additionally, we selected a set of miRNAs based on their area under the receiver operating characteristic curve (AUC-ROC) for separating the CAD and MI samples. We designed a two-layer architecture for sample classification, in which the first layer isolates healthy samples from unhealthy samples, and the second layer classifies stable CAD and MI samples. We trained different machine learning models using both biomarker sets and evaluated their performance on a test set. RESULTS We identified hsa-miR-21-3p, hsa-miR-186-5p, and hsa-miR-32-3p as the differentially expressed miRNAs, and a set including hsa-miR-186-5p, hsa-miR-21-3p, hsa-miR-197-5p, hsa-miR-29a-5p, and hsa-miR-296-5p as the optimum set of miRNAs selected by their AUC-ROC. Both biomarker sets could distinguish healthy from not-healthy samples with complete accuracy. The best performance for the classification of CAD and MI was achieved with an SVM model trained using the biomarker set selected by AUC-ROC, with an AUC-ROC of 0.96 and an accuracy of 0.94 on the test data. CONCLUSIONS Our study demonstrated that miRNA signatures derived from PBMCs could serve as valuable novel biomarkers for cardiovascular diseases.
Collapse
Affiliation(s)
- Mehrdad Samadishadlou
- Department of Medical Nanotechnology, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Reza Rahbarghazi
- Stem Cell Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
- Department of Applied Cell Sciences, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Zeynab Piryaei
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Mahdad Esmaeili
- Medical Bioengineering Department, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Çığır Biray Avcı
- Medical Biology Department, School of Medicine, Ege University, İzmir, Türkiye
| | - Farhad Bani
- Department of Medical Nanotechnology, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran.
- Drug Applied Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
| | - Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
| |
Collapse
|
14
|
Hartman E, Scott AM, Karlsson C, Mohanty T, Vaara ST, Linder A, Malmström L, Malmström J. Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis. Nat Commun 2023; 14:5359. [PMID: 37660105 PMCID: PMC10475049 DOI: 10.1038/s41467-023-41146-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 08/22/2023] [Indexed: 09/04/2023] Open
Abstract
The incorporation of machine learning methods into proteomics workflows improves the identification of disease-relevant biomarkers and biological pathways. However, machine learning models, such as deep neural networks, typically suffer from lack of interpretability. Here, we present a deep learning approach to combine biological pathway analysis and biomarker identification to increase the interpretability of proteomics experiments. Our approach integrates a priori knowledge of the relationships between proteins and biological pathways and biological processes into sparse neural networks to create biologically informed neural networks. We employ these networks to differentiate between clinical subphenotypes of septic acute kidney injury and COVID-19, as well as acute respiratory distress syndrome of different aetiologies. To gain biological insight into the complex syndromes, we utilize feature attribution-methods to introspect the networks for the identification of proteins and pathways important for distinguishing between subtypes. The algorithms are implemented in a freely available open source Python-package ( https://github.com/InfectionMedicineProteomics/BINN ).
Collapse
Affiliation(s)
- Erik Hartman
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden.
| | - Aaron M Scott
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Christofer Karlsson
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Tirthankar Mohanty
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Suvi T Vaara
- Department of Perioperative and Intensive Care, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Adam Linder
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Lars Malmström
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Johan Malmström
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden.
| |
Collapse
|