1
|
de Crécy-Lagard V, Dias R, Friedberg I, Yuan Y, Swairjo MA. Limitations of Current Machine-Learning Models in Predicting Enzymatic Functions for Uncharacterized Proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.01.601547. [PMID: 39005379 PMCID: PMC11244979 DOI: 10.1101/2024.07.01.601547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Thirty to seventy percent of proteins in any given genome have no assigned function and have been labeled as the protein "unknownme". This large knowledge gap prevents the biological community from fully leveraging the plethora of genomic data that is now available. Machine-learning approaches are showing some promise in propagating functional knowledge from experimentally characterized proteins to the correct set of isofunctional orthologs. However, they largely fail to predict enzymatic functions unseen in the training set, as shown by dissecting the predictions made for 450 enzymes of unknown function from the model bacteria Escherichia coli using the DeepECTransformer platform. Lessons from these failures can help the community develop machine-learning methods that assist domain experts in making testable functional predictions for more members of the uncharacterized proteome.
Collapse
|
2
|
Vincent D, Bui A, Ezernieks V, Shahinfar S, Luke T, Ram D, Rigas N, Panozzo J, Rochfort S, Daetwyler H, Hayden M. A community resource to mass explore the wheat grain proteome and its application to the late-maturity alpha-amylase (LMA) problem. Gigascience 2022; 12:giad084. [PMID: 37919977 PMCID: PMC10627334 DOI: 10.1093/gigascience/giad084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 08/02/2023] [Accepted: 09/19/2023] [Indexed: 11/04/2023] Open
Abstract
BACKGROUND Late-maturity alpha-amylase (LMA) is a wheat genetic defect causing the synthesis of high isoelectric point alpha-amylase following a temperature shock during mid-grain development or prolonged cold throughout grain development, both leading to starch degradation. While the physiology is well understood, the biochemical mechanisms involved in grain LMA response remain unclear. We have applied high-throughput proteomics to 4,061 wheat flours displaying a range of LMA activities. Using an array of statistical analyses to select LMA-responsive biomarkers, we have mined them using a suite of tools applicable to wheat proteins. RESULTS We observed that LMA-affected grains activated their primary metabolisms such as glycolysis and gluconeogenesis; TCA cycle, along with DNA- and RNA- binding mechanisms; and protein translation. This logically transitioned to protein folding activities driven by chaperones and protein disulfide isomerase, as well as protein assembly via dimerisation and complexing. The secondary metabolism was also mobilized with the upregulation of phytohormones and chemical and defence responses. LMA further invoked cellular structures, including ribosomes, microtubules, and chromatin. Finally, and unsurprisingly, LMA expression greatly impacted grain storage proteins, as well as starch and other carbohydrates, with the upregulation of alpha-gliadins and starch metabolism, whereas LMW glutenin, stachyose, sucrose, UDP-galactose, and UDP-glucose were downregulated. CONCLUSIONS To our knowledge, this is not only the first proteomics study tackling the wheat LMA issue but also the largest plant-based proteomics study published to date. Logistics, technicalities, requirements, and bottlenecks of such an ambitious large-scale high-throughput proteomics experiment along with the challenges associated with big data analyses are discussed.
Collapse
Affiliation(s)
- Delphine Vincent
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - AnhDuyen Bui
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - Vilnis Ezernieks
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - Saleh Shahinfar
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - Timothy Luke
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - Doris Ram
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
| | - Nicholas Rigas
- Agriculture Victoria Research, Grains Innovation Park, Horsham, VIC 3400, Australia
| | - Joe Panozzo
- Agriculture Victoria Research, Grains Innovation Park, Horsham, VIC 3400, Australia
- Centre for Agricultural Innovation, University of Melbourne, Parkville, VIC 3010, Australia
| | - Simone Rochfort
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Hans Daetwyler
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Matthew Hayden
- Agriculture Victoria Research, AgriBio, Center Centre for AgriBioscience, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| |
Collapse
|
3
|
Soldatos TG, Kim S, Schmidt S, Lesko LJ, Jackson DB. Advancing drug safety science by integrating molecular knowledge with post-marketing adverse event reports. CPT Pharmacometrics Syst Pharmacol 2022; 11:540-555. [PMID: 35143713 PMCID: PMC9124355 DOI: 10.1002/psp4.12765] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/20/2021] [Accepted: 01/17/2022] [Indexed: 12/15/2022] Open
Abstract
Promising drug development efforts may frequently fail due to unintended adverse reactions. Several methods have been developed to analyze such data, aiming to improve pharmacovigilance and drug safety. In this work, we provide a brief review of key directions to quantitatively analyzing adverse events and explore the potential of augmenting these methods using additional molecular data descriptors. We argue that molecular expansion of adverse event data may provide a path to improving the insights gained through more traditional pharmacovigilance approaches. Examples include the ability to assess statistical relevance with respect to underlying biomolecular mechanisms, the ability to generate plausible causative hypotheses and/or confirmation where possible, the ability to computationally study potential clinical trial designs and/or results, as well as the further provision of advanced features incorporated in innovative methods, such as machine learning. In summary, molecular data expansion provides an elegant way to extend mechanistic modeling, systems pharmacology, and patient‐centered approaches for the assessment of drug safety. We anticipate that such advances in real‐world data informatics and outcome analytics will help to better inform public health, via the improved ability to prospectively understand and predict various types of drug‐induced molecular perturbations and adverse events.
Collapse
Affiliation(s)
| | - Sarah Kim
- Department of PharmaceuticsCenter for Pharmacometrics and Systems PharmacologyUniversity of FloridaOrlandoFloridaUSA
| | - Stephan Schmidt
- Department of PharmaceuticsCenter for Pharmacometrics and Systems PharmacologyUniversity of FloridaOrlandoFloridaUSA
| | - Lawrence J. Lesko
- Department of PharmaceuticsCenter for Pharmacometrics and Systems PharmacologyUniversity of FloridaOrlandoFloridaUSA
| | | |
Collapse
|
4
|
Iyyappan OR, Manoharan S. Finding Gene Associations by Text Mining and Annotating it with Gene Ontology. Methods Mol Biol 2022; 2496:71-90. [PMID: 35713859 DOI: 10.1007/978-1-0716-2305-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Digitalization of the research articles and their maintenance in a database was the first stage toward the development of biomedical research. With the large amounts of research being published daily, it has created a large gap in accessing all the articles for review to a given problem. To understand any biological process, an insight into the role of each element in the genome is essential. But with this gap in manual curation of literature, there are chances that important biological information may be lost. Hence, text mining plays an important role in bridging this gap and extracting important biological information from the text, finding associations among them and predicting annotations. An annotation may be gene, gene products, gene names, their physical and functional characteristics, and so on. The process of annotations may be classified as structural annotation, functional annotation, and relational annotation. In this chapter, a basic protocol utilizing text mining to extract biological information and predict their functional role based on Gene Ontology is provided.
Collapse
Affiliation(s)
- Oviya Ramalakshmi Iyyappan
- Department of Sciences, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Chennai, Tamilnadu, India.
| | - Sharanya Manoharan
- Department of Bioinformatics, Stella Maris College (Autonomous), Chennai, Tamilnadu, India
| |
Collapse
|
5
|
Rogozin IB, Roche-Lima A, Tyryshkin K, Carrasquillo-Carrión K, Lada AG, Poliakov LY, Schwartz E, Saura A, Yurchenko V, Cooper DN, Panchenko AR, Pavlov YI. DNA Methylation, Deamination, and Translesion Synthesis Combine to Generate Footprint Mutations in Cancer Driver Genes in B-Cell Derived Lymphomas and Other Cancers. Front Genet 2021; 12:671866. [PMID: 34093666 PMCID: PMC8170131 DOI: 10.3389/fgene.2021.671866] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 04/21/2021] [Indexed: 11/13/2022] Open
Abstract
Cancer genomes harbor numerous genomic alterations and many cancers accumulate thousands of nucleotide sequence variations. A prominent fraction of these mutations arises as a consequence of the off-target activity of DNA/RNA editing cytosine deaminases followed by the replication/repair of edited sites by DNA polymerases (pol), as deduced from the analysis of the DNA sequence context of mutations in different tumor tissues. We have used the weight matrix (sequence profile) approach to analyze mutagenesis due to Activation Induced Deaminase (AID) and two error-prone DNA polymerases. Control experiments using shuffled weight matrices and somatic mutations in immunoglobulin genes confirmed the power of this method. Analysis of somatic mutations in various cancers suggested that AID and DNA polymerases η and θ contribute to mutagenesis in contexts that almost universally correlate with the context of mutations in A:T and G:C sites during the affinity maturation of immunoglobulin genes. Previously, we demonstrated that AID contributes to mutagenesis in (de)methylated genomic DNA in various cancers. Our current analysis of methylation data from malignant lymphomas suggests that driver genes are subject to different (de)methylation processes than non-driver genes and, in addition to AID, the activity of pols η and θ contributes to the establishment of methylation-dependent mutation profiles. This may reflect the functional importance of interplay between mutagenesis in cancer and (de)methylation processes in different groups of genes. The resulting changes in CpG methylation levels and chromatin modifications are likely to cause changes in the expression levels of driver genes that may affect cancer initiation and/or progression.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Abiel Roche-Lima
- Center for Collaborative Research in Health Disparities - RCMI Program, University of Puerto Rico, San Juan, Puerto Rico
| | - Kathrin Tyryshkin
- Department of Pathology and Molecular Medicine, School of Medicine, Queen's University, Kingston, ON, Canada
| | | | - Artem G Lada
- Department Microbiology and Molecular Genetics, University of California, Davis, Davis, CA, United States
| | - Lennard Y Poliakov
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Elena Schwartz
- Coordinating Center for Clinical Trials, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States
| | - Andreu Saura
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czechia.,Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov First Moscow State Medical University, Moscow, Russia
| | - David N Cooper
- Institute of Medical Genetics, Cardiff University, Cardiff, United Kingdom
| | - Anna R Panchenko
- Department of Pathology and Molecular Medicine, School of Medicine, Queen's University, Kingston, ON, Canada
| | - Youri I Pavlov
- Eppley Institute for Research in Cancer and Allied Diseases, Omaha, NE, United States.,Department of Microbiology and Pathology, Biochemistry and Molecular Biology, Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States.,Department of Genetics and Biotechnology, Saint-Petersburg State University, Saint-Petersburg, Russia
| |
Collapse
|
6
|
Turina P, Fariselli P, Capriotti E. ThermoScan: Semi-automatic Identification of Protein Stability Data From PubMed. Front Mol Biosci 2021; 8:620475. [PMID: 33842537 PMCID: PMC8027235 DOI: 10.3389/fmolb.2021.620475] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 02/18/2021] [Indexed: 11/13/2022] Open
Abstract
During the last years, the increasing number of DNA sequencing and protein mutagenesis studies has generated a large amount of variation data published in the biomedical literature. The collection of such data has been essential for the development and assessment of tools predicting the impact of protein variants at functional and structural levels. Nevertheless, the collection of manually curated data from literature is a highly time consuming and costly process that requires domain experts. In particular, the development of methods for predicting the effect of amino acid variants on protein stability relies on the thermodynamic data extracted from literature. In the past, such data were deposited in the ProTherm database, which however is no longer maintained since 2013. For facilitating the collection of protein thermodynamic data from literature, we developed the semi-automatic tool ThermoScan. ThermoScan is a text mining approach for the identification of relevant thermodynamic data on protein stability from full-text articles. The method relies on a regular expression searching for groups of words, including the most common conceptual words appearing in experimental studies on protein stability, several thermodynamic variables, and their units of measure. ThermoScan analyzes full-text articles from the PubMed Central Open Access subset and calculates an empiric score that allows the identification of manuscripts reporting thermodynamic data on protein stability. The method was optimized on a set of publications included in the ProTherm database, and tested on a new curated set of articles, manually selected for presence of thermodynamic data. The results show that ThermoScan returns accurate predictions and outperforms recently developed text-mining algorithms based on the analysis of publication abstracts. Availability: The ThermoScan server is freely accessible online at https://folding.biofold.org/thermoscan. The ThermoScan python code and the Google Chrome extension for submitting visualized PMC web pages to the ThermoScan server are available at https://github.com/biofold/ThermoScan.
Collapse
Affiliation(s)
- Paola Turina
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| |
Collapse
|
7
|
Schaefer A, Sachpekidis C, Diella F, Doerks A, Kratz AS, Meisel C, Jackson DB, Soldatos TG. Public Adverse Event Data Insights into the Safety of Pembrolizumab in Melanoma Patients. Cancers (Basel) 2020; 12:E1008. [PMID: 32325840 PMCID: PMC7226447 DOI: 10.3390/cancers12041008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 12/18/2022] Open
Abstract
Immune checkpoint inhibition represents an important therapeutic option for advanced melanoma patients. Results from clinical studies have shown that treatment with the PD-1 inhibitors Pembrolizumab and Nivolumab provides improved response and survival rates. Moreover, combining Nivolumab with the CTLA-4 inhibitor Ipilimumab is superior to the respective monotherapies. However, use of these immunotherapies frequently associated with, sometimes life-threatening, immune-related adverse events. Thus, more evidence-based studies are required to characterize the underlying mechanisms, towards more effective clinical management and treatment monitoring. Our study examines two sets of public adverse event data coming from FAERS and VigiBase, each with more than two thousand melanoma patients treated with Pembrolizumab. Standard disproportionality metrics are utilized to characterize the safety of Pembrolizumab and its reaction profile is compared to those of the widely used Ipilimumab and Nivolumab based on melanoma cases that report only one of them. Our results confirm known toxicological considerations for their related and distinct side-effect profiles and highlight specific immune-related adverse reactions. Our retrospective computational analysis includes more patients than examined in other studies and relies on evidence coming from public pharmacovigilance data that contain safety reports from clinical and controlled studies as well as reports of suspected adverse events coming from real-world post-marketing setting. Despite these informative insights, more prospective studies are necessary to fully characterize the efficacy of these agents.
Collapse
Affiliation(s)
| | - Christos Sachpekidis
- Clinical Cooperation Unit Nuclear Medicine, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
- Department of Nuclear Medicine, Inselspital, Bern University Hospital, University of Bern, 3010 Bern, Switzerland
| | | | - Anja Doerks
- Molecular Health GmbH, 69115 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
8
|
Sachpekidis C, Jackson DB, Soldatos TG. Radioimmunotherapy in Non-Hodgkin's Lymphoma: Retrospective Adverse Event Profiling of Zevalin and Bexxar. Pharmaceuticals (Basel) 2019; 12:ph12040141. [PMID: 31546999 PMCID: PMC6958320 DOI: 10.3390/ph12040141] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 09/10/2019] [Accepted: 09/17/2019] [Indexed: 02/07/2023] Open
Abstract
The development of monoclonal antibodies has dramatically changed the outcome of patients with non-Hodgkin’s lymphoma (NHL), the most common hematological malignancy. However, despite the satisfying results of monoclonal antibody treatment, only few NHL patients are permanently cured with single-agent therapies. In this context, radioimmunotherapy, the administration of radionuclides conjugated to monoclonal antibodies, is aimed to augment the single-agent efficacy of immunotherapy in order to deliver targeted radiation to tumors, particularly CD20+ B-cell lymphomas. Based on evidence from several trials in NHL, the radiolabeled antibodies 90Y-ibritumomab tiuxetan (Zevalin, Spectrum Pharmaceuticals) and 131I-tositumomab (Bexxar, GlaxoSmithKline) received FDA approval in 2002 and 2003, respectively. However, none of the two radioimmunotherapeutic agents has been broadly applied in clinical practice. The main reason for the under-utilization of radioimmunotherapy includes economic and logistic considerations. However, concerns about potential side effects have also been raised. Driven by these developments, we performed retrospective analysis of adverse events reporting Zevalin or Bexxar, extracted from the FDA’s Adverse Event Reporting System (FAERS) and the World Health Organization’s VigiBase repository. Our results indicate that the two radioimmunotherapeutic agents have both related and distinct side effect profiles and confirm their known toxicological considerations. Our work also suggests that computational analysis of real-world post-marketing data can provide informative clinical insights. While more prospective studies are necessary to fully characterize the efficacy and safety of radioimmunotherapy, we expect that it has not yet reached its full therapeutic potential in modern hematological oncology.
Collapse
Affiliation(s)
- Christos Sachpekidis
- Department of Nuclear Medicine, Inselspital, Bern University Hospital, University of Bern, Bern 3010, Switzerland.
| | | | | |
Collapse
|
9
|
Soldatos TG, Iakovou I, Sachpekidis C. Retrospective Toxicological Profiling of Radium-223 Dichloride for the Treatment of Bone Metastases in Prostate Cancer Using Adverse Event Data. ACTA ACUST UNITED AC 2019; 55:medicina55050149. [PMID: 31100964 PMCID: PMC6572036 DOI: 10.3390/medicina55050149] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Revised: 04/12/2019] [Accepted: 05/13/2019] [Indexed: 02/06/2023]
Abstract
Background and Objective: Radium-223 dichloride (Xofigo®) is a calcium mimetic agent approved for the treatment of castration-resistant prostate cancer patients with symptomatic bone metastases and no known visceral metastatic disease. This targeted, α-particle-emitting therapy has demonstrated significant survival benefit accompanied by a favorable safety profile. Nevertheless, recent evidence suggests that its combined use with abiraterone and prednisone/prednisolone may be associated with increased risk of death and fractures. While the precise pathophysiologic mechanisms of these events are not yet clear, collecting evidence from more clinical trials and translational studies is necessary. The aim of our present study is to assess whether accessible sources of patient outcome data can help gain additional clinical insights to radium-223 dichloride’s safety profile. Materials and Methods: We performed a retrospective analysis of cases extracted from the FDA Adverse Event Reporting System and characterized side effect occurrence by using reporting ratios. Results: A total of ~1500 prostate cancer patients treated with radium-223 dichloride was identified, and side effects reported with the use of radium-223 dichloride alone or in combination with other therapeutic agents were extracted. Our analysis demonstrates that radium-223 dichloride may often come with hematological-related reactions, and that, when administered together with other drugs, its safety profile may differ. Conclusions: While more prospective studies are needed to fully characterize the toxicological profile of radium-223 dichloride, the present work constitutes perhaps the first effort to examine its safety when administered alone and in combination with other agents based on computational evidence from public real-world post marketing data.
Collapse
Affiliation(s)
| | - Ioannis Iakovou
- Department of Nuclear Medicine, Aristotle University of Thessaloniki, Papageorgiou Hospital, Thessaloniki, Greece.
| | - Christos Sachpekidis
- Department of Nuclear Medicine, Aristotle University of Thessaloniki, Papageorgiou Hospital, Thessaloniki, Greece.
- Department of Nuclear Medicine, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland.
| |
Collapse
|
10
|
Adverse Event Circumstances and the Case of Drug Interactions. Healthcare (Basel) 2019; 7:healthcare7010045. [PMID: 30893930 PMCID: PMC6473808 DOI: 10.3390/healthcare7010045] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Revised: 03/15/2019] [Accepted: 03/18/2019] [Indexed: 01/04/2023] Open
Abstract
Adverse events are a common and for the most part unavoidable consequence of therapeutic intervention. Nevertheless, available tomes of such data now provide us with an invaluable opportunity to study the relationship between human phenotype and drug-induced protein perturbations within a patient system. Deciphering the molecular basis of such adverse responses is not only paramount to the development of safer drugs but also presents a unique opportunity to dissect disease systems in search of novel response biomarkers, drug targets, and efficacious combination therapies. Inspired by the potential applications of this approach, we first examined adverse event circumstances reported in FAERS and then performed a molecular level interrogation of cancer patient adverse events to investigate the prevalence of drug-drug interactions in the context of patient responses. We discuss avoidable and/or preventable cases and how molecular analytics can help optimize therapeutic use of co-medications. While up to one out of three adverse events in this dataset might be explicable by iatrogenic, patient, and product/device related factors, almost half of the patients in FAERS received multiple drugs and one in four may have experienced effects attributable to drug interactions.
Collapse
|
11
|
Soldatou V, Soldatos A, Soldatos T. Examining Socioeconomic and Computational Aspects of Vaccine Pharmacovigilance. BIOMED RESEARCH INTERNATIONAL 2019; 2019:6576483. [PMID: 30911546 PMCID: PMC6399563 DOI: 10.1155/2019/6576483] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 11/26/2018] [Accepted: 12/24/2018] [Indexed: 12/30/2022]
Abstract
BACKGROUND Vaccine pharmacovigilance relates to the detection of adverse events, their assessment, understanding, and prevention, and communication of their risk to the public. These activities can be tedious and long lasting for regulatory authority scientists and may be affected by community practices and public health policies. To better understand underlying challenges, we examined vaccine adverse event reports, assessed whether data-driven techniques can provide additional insight in safety characterization, and wondered on the impact of socioeconomic parameters. METHODS First, we integrated VAERS content with additional sources of drug and molecular data and examined reaction and outcome occurrence by using disproportionality metrics and enrichment analysis. Second, we reviewed social and behavioral determinants that may affect vaccine pharmacovigilance aspects. RESULTS We describe our experience in processing more than 607000 vaccine adverse event reports and report on the challenges to integrate more than 95500 VAERS medication narratives with structured information about drugs and other therapeutics or supplements. We found that only 12.6% of events were serious, while 8.97% referred to polypharmacy cases. Exacerbation of serious clinical patient outcomes was observed in 8.88% VAERS cases in which drugs may interact with vaccinations or with each other, regardless of vaccine activity interference. Furthermore, we characterized the symptoms reported in those cases and summarized reaction occurrence among vaccine-types. Last, we examine socioeconomic parameters and cost-management features, explore adverse event reporting trends, and highlight perspectives relating to the use and development of digital services, especially in the context of personalized and collaborative health-care. CONCLUSIONS This work provides an informative review of VAERS, identifies challenges and limitations in the processing of vaccine adverse event data, and calls for the better understanding of the socioeconomic landscape pertaining vaccine safety concerns. We expect that adoption of computational techniques for integrated safety assessment and interpretation is key not only to pharmacovigilance practice but also to stakeholders from the entire healthcare system.
Collapse
Affiliation(s)
- Vasiliki Soldatou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Greece
| | - Anastasios Soldatos
- Department of Business Administration, School of Business, Athens University of Economics and Business, Greece
| | | |
Collapse
|
12
|
Soldatos TG, Taglang G, Jackson DB. In Silico Profiling of Clinical Phenotypes for Human Targets Using Adverse Event Data. High Throughput 2018; 7:ht7040037. [PMID: 30477159 PMCID: PMC6306940 DOI: 10.3390/ht7040037] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 11/09/2018] [Accepted: 11/19/2018] [Indexed: 12/19/2022] Open
Abstract
We present a novel approach for the molecular transformation and analysis of patient clinical phenotypes. Building on the fact that drugs perturb the function of targets/genes, we integrated data from 8.2 million clinical reports detailing drug-induced side effects with the molecular world of drug-target information. Using this dataset, we extracted 1.8 million associations of clinical phenotypes to 770 human drug-targets. This collection is perhaps the largest phenotypic profiling reference of human targets to-date, and unique in that it enables rapid development of testable molecular hypotheses directly from human-specific information. We also present validation results demonstrating analytical utilities of the approach, including drug safety prediction, and the design of novel combination therapies. Challenging the long-standing notion that molecular perturbation studies cannot be performed in humans, our data allows researchers to capitalize on the vast tomes of clinical information available throughout the healthcare system.
Collapse
Affiliation(s)
| | - Guillaume Taglang
- Molecular Health GmbH, Kurfuersten Anlage 21, 69115 Heidelberg, Germany.
| | - David B Jackson
- Molecular Health GmbH, Kurfuersten Anlage 21, 69115 Heidelberg, Germany.
| |
Collapse
|
13
|
Retrospective Side Effect Profiling of the Metastatic Melanoma Combination Therapy Ipilimumab-Nivolumab Using Adverse Event Data. Diagnostics (Basel) 2018; 8:diagnostics8040076. [PMID: 30384507 PMCID: PMC6316083 DOI: 10.3390/diagnostics8040076] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Revised: 10/29/2018] [Accepted: 10/30/2018] [Indexed: 12/15/2022] Open
Abstract
Recent studies suggest that combining nivolumab with ipilimumab is a more effective treatment for melanoma patients, compared to using ipilimumab or nivolumab alone. However, treatment with these immunotherapeutic agents is frequently associated with increased risk of toxicity, and (auto-) immune-related adverse events. The precise pathophysiologic mechanisms of these events are not yet clear, and evidence from clinical trials and translational studies remains limited. Our retrospective analysis of ~7700 metastatic melanoma patients treated with ipilimumab and/or nivolumab from the FDA Adverse Event Reporting System (FAERS) demonstrates that the identified immune-related reactions are specific to ipilimumab and/or nivolumab, and that when the two agents are administered together, their safety profile combines reactions from each drug alone. While more prospective studies are needed to characterize the safety of ipilimumab and nivolumab, the present work constitutes perhaps the first effort to examine the safety of these drugs and their combination based on computational evidence from real world post marketing data.
Collapse
|
14
|
Waldemarin RC, de Farias CRG. OBO to UML: Support for the development of conceptual models in the biomedical domain. J Biomed Inform 2018; 80:14-25. [PMID: 29496629 DOI: 10.1016/j.jbi.2018.02.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 02/23/2018] [Accepted: 02/25/2018] [Indexed: 11/18/2022]
Abstract
A conceptual model abstractly defines a number of concepts and their relationships for the purposes of understanding and communication. Once a conceptual model is available, it can also be used as a starting point for the development of a software system. The development of conceptual models using the Unified Modeling Language (UML) facilitates the representation of modeled concepts and allows software developers to directly reuse these concepts in the design of a software system. The OBO Foundry represents the most relevant collaborative effort towards the development of ontologies in the biomedical domain. The development of UML conceptual models in the biomedical domain may benefit from the use of domain-specific semantics and notation. Further, the development of these models may also benefit from the reuse of knowledge contained in OBO ontologies. This paper investigates the support for the development of conceptual models in the biomedical domain using UML as a conceptual modeling language and using the support provided by the OBO Foundry for the development of biomedical ontologies, namely entity kind and relationship types definitions provided by the Basic Formal Ontology (BFO) and the OBO Core Relations Ontology (OBO Core), respectively. Further, the paper investigates the support for the reuse of biomedical knowledge currently available in OBOFFF ontologies in the development these conceptual models. The paper describes a UML profile for the OBO Core Relations Ontology, which basically defines a number of stereotypes to represent BFO entity kinds and OBO Core relationship types definitions. The paper also presents a support toolset consisting of a graphical editor named OBO-RO Editor, which directly supports the development of UML models using the extensions defined by our profile, and a command-line tool named OBO2UML, which directly converts an OBOFFF ontology into a UML model.
Collapse
Affiliation(s)
- Ricardo C Waldemarin
- Biomedical Systems and Services Laboratory, Department of Computer Science and Mathematics (DCM), University of São Paulo (FFCLRP/USP), Brazil
| | - Cléver R G de Farias
- Biomedical Systems and Services Laboratory, Department of Computer Science and Mathematics (DCM), University of São Paulo (FFCLRP/USP), Brazil.
| |
Collapse
|
15
|
Zhou J, Fu BQ. The research on gene-disease association based on text-mining of PubMed. BMC Bioinformatics 2018; 19:37. [PMID: 29415654 PMCID: PMC5804013 DOI: 10.1186/s12859-018-2048-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 01/29/2018] [Indexed: 11/23/2022] Open
Abstract
Background The associations between genes and diseases are of critical significance in aspects of prevention, diagnosis and treatment. Although gene-disease relationships have been investigated extensively, much of the underpinnings of these associations are yet to be elucidated. Methods A novel method integrates MeSH database, term weight (TW), and co-occurrence methods to predict gene-disease associations based on the cosine similarity between gene vectors and disease vectors. Vectors are transformed from the texts of documents in the PubMed database according to the appearance and location of the gene or disease terms. The disease related text data has been optimized during the process of constructing vectors. Results The overall distribution of cosine similarity value was investigated. By using the gene-disease association data in OMIM database as golden standard, the performance of cosine similarity in predicting gene-disease linkage was evaluated. The effects of applying weight matrix, penalty weights for keywords (PWK), and normalization were also investigated. Finally, we demonstrated that our method outperforms heterogeneous network edge prediction (HNEP) in aspects of precision rate and recall rate. Conclusions Our method proposed in this paper is easy to be conducted and the results can be integrated with other models to improve the overall performance of gene-disease association predictions.
Collapse
Affiliation(s)
- Jie Zhou
- Guangdong Key Laboratory of Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China.
| | - Bo-Quan Fu
- Guangdong Key Laboratory of Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China
| |
Collapse
|
16
|
Abstract
Computational manipulation of knowledge is an important, and often under-appreciated, aspect of biomedical Data Science. The first Data Science initiative from the US National Institutes of Health was entitled "Big Data to Knowledge (BD2K)." The main emphasis of the more than $200M allocated to that program has been on "Big Data;" the "Knowledge" component has largely been the implicit assumption that the work will lead to new biomedical knowledge. However, there is long-standing and highly productive work in computational knowledge representation and reasoning, and computational processing of knowledge has a role in the world of Data Science. Knowledge-based biomedical Data Science involves the design and implementation of computer systems that act as if they knew about biomedicine. There are many ways in which a computational approach might act as if it knew something: for example, it might be able to answer a natural language question about a biomedical topic, or pass an exam; it might be able to use existing biomedical knowledge to rank or evaluate hypotheses; it might explain or interpret data in light of prior knowledge, either in a Bayesian or other sort of framework. These are all examples of automated reasoning that act on computational representations of knowledge. After a brief survey of existing approaches to knowledge-based data science, this position paper argues that such research is ripe for expansion, and expanded application.
Collapse
Affiliation(s)
- Lawrence E Hunter
- Computational Bioscience, University of Colorado School of Medicine, Aurora, CO 80045, USA ; ORCID: https://orcid.org/0000-0003-1455-3370
| |
Collapse
|
17
|
Brandi J, Manfredi M, Speziali G, Gosetti F, Marengo E, Cecconi D. Proteomic approaches to decipher cancer cell secretome. Semin Cell Dev Biol 2017; 78:93-101. [PMID: 28684183 DOI: 10.1016/j.semcdb.2017.06.030] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 06/30/2017] [Accepted: 06/30/2017] [Indexed: 01/17/2023]
Abstract
In this review, we give an overview of the actual proteomic approaches used in the study of cancer cells secretome. In particular, we describe the proteomic strategies to decipher cancer cell secretome initially focusing on the different aspects of sample preparation. We examine the issues related to the presence of low abundant proteins, the analysis of secreted proteins in the conditioned media with or without the removal of fetal bovine serum and strategies developed to reduce intracellular protein contamination. As regards the identification and quantification of secreted proteins, we described the different proteomic approaches used, i.e. gel-based, MS-based (label-based and label-free), and the antibody and array-based methods, together with some of the most recent applications in the field of cancer research. Moreover, we describe the bioinformatics tools developed for the in silico validation and characterization of cancer cells secretome. We also discuss the most important available tools for protein annotation and for prediction of classical and non-classical secreted proteins. In summary in this review advances, concerns and challenges in the field of cancer secretome analysis are discussed.
Collapse
Affiliation(s)
- Jessica Brandi
- Department of Biotechnology, Proteomics and Mass Spectrometry Lab, University of Verona, Strada le Grazie 15, 37135, Verona, Italy
| | - Marcello Manfredi
- Department of Sciences and Technological Innovation, University of Piemonte Orientale, Viale T. Michel 11, 15121, Alessandria, Italy; ISALIT S.r.l., Novara, Italy.
| | - Giulia Speziali
- Department of Biotechnology, Proteomics and Mass Spectrometry Lab, University of Verona, Strada le Grazie 15, 37135, Verona, Italy
| | - Fabio Gosetti
- Department of Sciences and Technological Innovation, University of Piemonte Orientale, Viale T. Michel 11, 15121, Alessandria, Italy
| | - Emilio Marengo
- Department of Sciences and Technological Innovation, University of Piemonte Orientale, Viale T. Michel 11, 15121, Alessandria, Italy
| | - Daniela Cecconi
- Department of Biotechnology, Proteomics and Mass Spectrometry Lab, University of Verona, Strada le Grazie 15, 37135, Verona, Italy
| |
Collapse
|
18
|
Morota G, Peñagaricano F, Petersen JL, Ciobanu DC, Tsuyuzaki K, Nikaido I. An application of MeSH enrichment analysis in livestock. Anim Genet 2015; 46:381-7. [PMID: 26036323 PMCID: PMC5032990 DOI: 10.1111/age.12307] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/01/2015] [Indexed: 01/01/2023]
Abstract
An integral part of functional genomics studies is to assess the enrichment of specific biological terms in lists of genes found to be playing an important role in biological phenomena. Contrasting the observed frequency of annotated terms with those of the background is at the core of overrepresentation analysis (ORA). Gene Ontology (GO) is a means to consistently classify and annotate gene products and has become a mainstay in ORA. Alternatively, Medical Subject Headings (MeSH) offers a comprehensive life science vocabulary including additional categories that are not covered by GO. Although MeSH is applied predominantly in human and model organism research, its full potential in livestock genetics is yet to be explored. In this study, MeSH ORA was evaluated to discern biological properties of identified genes and contrast them with the results obtained from GO enrichment analysis. Three published datasets were employed for this purpose, representing a gene expression study in dairy cattle, the use of SNPs for genome‐wide prediction in swine and the identification of genomic regions targeted by selection in horses. We found that several overrepresented MeSH annotations linked to these gene sets share similar concepts with those of GO terms. Moreover, MeSH yielded unique annotations, which are not directly provided by GO terms, suggesting that MeSH has the potential to refine and enrich the representation of biological knowledge. We demonstrated that MeSH can be regarded as another choice of annotation to draw biological inferences from genes identified via experimental analyses. When used in combination with GO terms, our results indicate that MeSH can enhance our functional interpretations for specific biological conditions or the genetic basis of complex traits in livestock species.
Collapse
Affiliation(s)
- G Morota
- Department of Animal Science, University of Nebraska, Lincoln, NE, USA
| | - F Peñagaricano
- Department of Animal Sciences, University of Florida, Gainesville, FL, USA.,University of Florida Genetics Institute, University of Florida, Gainesville, FL, USA
| | - J L Petersen
- Department of Animal Science, University of Nebraska, Lincoln, NE, USA
| | - D C Ciobanu
- Department of Animal Science, University of Nebraska, Lincoln, NE, USA
| | - K Tsuyuzaki
- Department of Medicinal and Life Science, Faculty of Pharmaceutical Sciences, Tokyo University of Science, 2641 Yamazaki, Noda, Chiba, Japan.,Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, 2-1 Hirosawa, Wako, Saitama, Japan
| | - I Nikaido
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, 2-1 Hirosawa, Wako, Saitama, Japan
| |
Collapse
|
19
|
Andrade-Navarro M, Perez-Iratxeta C. Text mining of biomedical literature: doing well, but we could be doing better. Methods 2015; 74:1-2. [PMID: 25703199 DOI: 10.1016/j.ymeth.2015.01.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
Affiliation(s)
- Miguel Andrade-Navarro
- Faculty of Biology, Johannes-Gutenberg University of Mainz, Gresemundweg 2, 55128 Mainz, Germany; Institute of Molecular Biology, Ackermannweg 4, 55128 Mainz, Germany
| | - Carol Perez-Iratxeta
- Ottawa Hospital Research Institute, 501 Smyth Road, Ottawa, Ontario K1H 8L6, Canada
| |
Collapse
|