1
|
Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. EMBO J 2024:10.1038/s44318-024-00200-7. [PMID: 39256561 DOI: 10.1038/s44318-024-00200-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 07/23/2024] [Accepted: 07/24/2024] [Indexed: 09/12/2024] Open
Abstract
The fidelity of signal transduction requires the binding of regulatory molecules to their cognate targets. However, the crowded cell interior risks off-target interactions between proteins that are functionally unrelated. How such off-target interactions impact fitness is not generally known. Here, we use Saccharomyces cerevisiae to inducibly express tyrosine kinases. Because yeast lacks bona fide tyrosine kinases, the resulting tyrosine phosphorylation is biologically spurious. We engineered 44 yeast strains each expressing a tyrosine kinase, and quantitatively analysed their phosphoproteomes. This analysis resulted in ~30,000 phosphosites mapping to ~3500 proteins. The number of spurious pY sites generated correlates strongly with decreased growth, and we predict over 1000 pY events to be deleterious. However, we also find that many of the spurious pY sites have a negligible effect on fitness, possibly because of their low stoichiometry. This result is consistent with our evolutionary analyses demonstrating a lack of phosphotyrosine counter-selection in species with tyrosine kinases. Our results suggest that, alongside the risk for toxicity, the cell can tolerate a large degree of non-functional crosstalk as interaction networks evolve.
Collapse
Affiliation(s)
- David Bradley
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexander Hogrebe
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rohan Dandage
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Ugo Dionne
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexis Chang
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Judit Villén
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| | - Christian R Landry
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada.
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada.
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada.
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada.
- Department of Biology, Université Laval, Québec, QC, Canada.
| |
Collapse
|
2
|
Gao K, Cao W, He Z, Liu L, Guo J, Dong L, Song J, Wu Y, Zhao Y. Network medicine analysis for dissecting the therapeutic mechanism of consensus TCM formulae in treating hepatocellular carcinoma with different TCM syndromes. Front Endocrinol (Lausanne) 2024; 15:1373054. [PMID: 39211446 PMCID: PMC11357915 DOI: 10.3389/fendo.2024.1373054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 07/26/2024] [Indexed: 09/04/2024] Open
Abstract
Introduction Hepatocellular carcinoma (HCC) is a major cause of cancer-related mortality worldwide. Traditional Chinese Medicine (TCM) is widely utilized as an adjunct therapy, improving patient survival and quality of life. TCM categorizes HCC into five distinct syndromes, each treated with specific herbal formulae. However, the molecular mechanisms underlying these treatments remain unclear. Methods We employed a network medicine approach to explore the therapeutic mechanisms of TCM in HCC. By constructing a protein-protein interaction (PPI) network, we integrated genes associated with TCM syndromes and their corresponding herbal formulae. This allowed for a quantitative analysis of the topological and functional relationships between TCM syndromes, HCC, and the specific formulae used for treatment. Results Our findings revealed that genes related to the five TCM syndromes were closely associated with HCC-related genes within the PPI network. The gene sets corresponding to the five TCM formulae exhibited significant proximity to HCC and its related syndromes, suggesting the efficacy of TCM syndrome differentiation and treatment. Additionally, through a random walk algorithm applied to a heterogeneous network, we prioritized active herbal ingredients, with results confirmed by literature. Discussion The identification of these key compounds underscores the potential of network medicine to unravel the complex pharmacological actions of TCM. This study provides a molecular basis for TCM's therapeutic strategies in HCC and highlights specific herbal ingredients as potential leads for drug development and precision medicine.
Collapse
Affiliation(s)
- Kai Gao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - WanChen Cao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - ZiHao He
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - Liu Liu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - JinCheng Guo
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - Lei Dong
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
| | - Jini Song
- New York Institute of Technology College of Osteopathic Medicine, Arkansas State University, Jonesboro, AR, United States
| | - Yang Wu
- The Research Center for Ubiquitous Computing Systems (CUbiCS), Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Yi Zhao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Chaoyang District, Beijing, China
- The Research Center for Ubiquitous Computing Systems (CUbiCS), Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
3
|
Zitnik M, Li MM, Wells A, Glass K, Morselli Gysi D, Krishnan A, Murali TM, Radivojac P, Roy S, Baudot A, Bozdag S, Chen DZ, Cowen L, Devkota K, Gitter A, Gosline SJC, Gu P, Guzzi PH, Huang H, Jiang M, Kesimoglu ZN, Koyuturk M, Ma J, Pico AR, Pržulj N, Przytycka TM, Raphael BJ, Ritz A, Sharan R, Shen Y, Singh M, Slonim DK, Tong H, Yang XH, Yoon BJ, Yu H, Milenković T. Current and future directions in network biology. BIOINFORMATICS ADVANCES 2024; 4:vbae099. [PMID: 39143982 PMCID: PMC11321866 DOI: 10.1093/bioadv/vbae099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 05/31/2024] [Accepted: 07/08/2024] [Indexed: 08/16/2024]
Abstract
Summary Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These stem from various factors, notably the growing complexity and volume of data together with the increased diversity of data types describing different tiers of biological organization. We discuss prevailing research directions in network biology, focusing on molecular/cellular networks but also on other biological network types such as biomedical knowledge graphs, patient similarity networks, brain networks, and social/contact networks relevant to disease spread. In more detail, we highlight areas of inference and comparison of biological networks, multimodal data integration and heterogeneous networks, higher-order network analysis, machine learning on networks, and network-based personalized medicine. Following the overview of recent breakthroughs across these five areas, we offer a perspective on future directions of network biology. Additionally, we discuss scientific communities, educational initiatives, and the importance of fostering diversity within the field. This article establishes a roadmap for an immediate and long-term vision for network biology. Availability and implementation Not applicable.
Collapse
Affiliation(s)
- Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
| | - Michelle M Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
| | - Aydin Wells
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
- Lucy Family Institute for Data and Society, University of Notre Dame, Notre Dame, IN 46556, United States
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, United States
| | - Deisy Morselli Gysi
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, United States
- Department of Statistics, Federal University of Paraná, Curitiba, Paraná 81530-015, Brazil
- Department of Physics, Northeastern University, Boston, MA 02115, United States
| | - Arjun Krishnan
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, United States
| | - T M Murali
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States
| | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, United States
| | - Sushmita Roy
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53715, United States
- Wisconsin Institute for Discovery, Madison, WI 53715, United States
| | - Anaïs Baudot
- Aix Marseille Université, INSERM, MMG, Marseille, France
| | - Serdar Bozdag
- Department of Computer Science and Engineering, University of North Texas, Denton, TX 76203, United States
- Department of Mathematics, University of North Texas, Denton, TX 76203, United States
| | - Danny Z Chen
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Lenore Cowen
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Kapil Devkota
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53715, United States
- Morgridge Institute for Research, Madison, WI 53715, United States
| | - Sara J C Gosline
- Biological Sciences Division, Pacific Northwest National Laboratory, Seattle, WA 98109, United States
| | - Pengfei Gu
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Pietro H Guzzi
- Department of Medical and Surgical Sciences, University Magna Graecia of Catanzaro, Catanzaro, 88100, Italy
| | - Heng Huang
- Department of Computer Science, University of Maryland College Park, College Park, MD 20742, United States
| | - Meng Jiang
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Ziynet Nesibe Kesimoglu
- Department of Computer Science and Engineering, University of North Texas, Denton, TX 76203, United States
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, United States
| | - Mehmet Koyuturk
- Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, United States
| | - Alexander R Pico
- Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, United States
| | - Nataša Pržulj
- Department of Computer Science, University College London, London, WC1E 6BT, England
- ICREA, Catalan Institution for Research and Advanced Studies, Barcelona, 08010, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, 08034, Spain
| | - Teresa M Przytycka
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, United States
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, Princeton, NJ 08544, United States
| | - Anna Ritz
- Department of Biology, Reed College, Portland, OR 97202, United States
| | - Roded Sharan
- School of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Yang Shen
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, United States
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, NJ 08544, United States
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, United States
| | - Donna K Slonim
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Hanghang Tong
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
| | - Xinan Holly Yang
- Department of Pediatrics, University of Chicago, Chicago, IL 60637, United States
| | - Byung-Jun Yoon
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, United States
- Computational Science Initiative, Brookhaven National Laboratory, Upton, NY 11973, United States
| | - Haiyuan Yu
- Department of Computational Biology, Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, United States
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
- Lucy Family Institute for Data and Society, University of Notre Dame, Notre Dame, IN 46556, United States
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, United States
| |
Collapse
|
4
|
Zhang Y, Leung AK, Kang JJ, Sun Y, Wu G, Li L, Sun J, Cheng L, Qiu T, Zhang J, Wierbowski S, Gupta S, Booth J, Yu H. A multiscale functional map of somatic mutations in cancer integrating protein structure and network topology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.06.531441. [PMID: 36945530 PMCID: PMC10028849 DOI: 10.1101/2023.03.06.531441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
A major goal of cancer biology is to understand the mechanisms underlying tumorigenesis driven by somatically acquired mutations. Two distinct types of computational methodologies have emerged: one focuses on analyzing clustering of mutations within protein sequences and 3D structures, while the other characterizes mutations by leveraging the topology of protein-protein interaction network. Their insights are largely non-overlapping, offering complementary strengths. Here, we established a unified, end-to-end 3D structurally-informed protein interaction network propagation framework, NetFlow3D, that systematically maps the multiscale mechanistic effects of somatic mutations in cancer. The establishment of NetFlow3D hinges upon the Human Protein Structurome, a comprehensive repository we compiled that incorporates the 3D structures of every single protein as well as the binding interfaces of all known protein interactions in humans. NetFlow3D leverages the Structurome to integrate information across atomic, residue, protein and network levels: It conducts 3D clustering of mutations across atomic and residue levels on protein structures to identify potential driver mutations. It then anisotropically propagates their impacts across the protein interaction network, with propagation guided by the specific 3D structural interfaces involved, to identify significantly interconnected network "modules", thereby uncovering key biological processes underlying disease etiology. Applied to 1,038,899 somatic protein-altering mutations in 9,946 TCGA tumors across 33 cancer types, NetFlow3D identified 1,4444 significant 3D clusters throughout the Human Protein Structurome, of which ~55% would not have been found if using only experimentally-determined structures. It then identified 26 significantly interconnected modules that encompass ~8-fold more proteins than applying standard network analyses. NetFlow3D and our pan-cancer results can be accessed from http://netflow3d.yulab.org.
Collapse
Affiliation(s)
- Yingying Zhang
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
- Department of Molecular Biology and Genetics, Cornell University; Ithaca, 14853, USA
| | - Alden K. Leung
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Jin Joo Kang
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Yu Sun
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Guanxi Wu
- College of Agriculture and Life Sciences, Cornell University; Ithaca, 14853, USA
| | - Le Li
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Jiayang Sun
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
| | - Lily Cheng
- Department of Science and Technology Studies, Cornell University; Ithaca, 14853, USA
| | - Tian Qiu
- School of Electrical and Computer Engineering, Cornell University; Ithaca, 14853, USA
| | - Junke Zhang
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Shayne Wierbowski
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - Shagun Gupta
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| | - James Booth
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Department of Statistics and Data Science, Cornell University; Ithaca, 14853, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University; Ithaca, 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University; Ithaca, 14853, USA
| |
Collapse
|
5
|
Zhao H, Petrey D, Murray D, Honig B. ZEPPI: Proteome-scale sequence-based evaluation of protein-protein interaction models. Proc Natl Acad Sci U S A 2024; 121:e2400260121. [PMID: 38743624 PMCID: PMC11127014 DOI: 10.1073/pnas.2400260121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/18/2024] [Indexed: 05/16/2024] Open
Abstract
We introduce ZEPPI (Z-score Evaluation of Protein-Protein Interfaces), a framework to evaluate structural models of a complex based on sequence coevolution and conservation involving residues in protein-protein interfaces. The ZEPPI score is calculated by comparing metrics for an interface to those obtained from randomly chosen residues. Since contacting residues are defined by the structural model, this obviates the need to account for indirect interactions. Further, although ZEPPI relies on species-paired multiple sequence alignments, its focus on interfacial residues allows it to leverage quite shallow alignments. ZEPPI can be implemented on a proteome-wide scale and is applied here to millions of structural models of dimeric complexes in the Escherichia coli and human interactomes found in the PrePPI database. PrePPI's scoring function is based primarily on the evaluation of protein-protein interfaces, and ZEPPI adds a new feature to this analysis through the incorporation of evolutionary information. ZEPPI performance is evaluated through applications to experimentally determined complexes and to decoys from the CASP-CAPRI experiment. As we discuss, the standard CAPRI scores used to evaluate docking models are based on model quality and not on the ability to give yes/no answers as to whether two proteins interact. ZEPPI is able to detect weak signals from PPI models that the CAPRI scores define as incorrect and, similarly, to identify potential PPIs defined as low confidence by the current PrePPI scoring function. A number of examples that illustrate how the combination of PrePPI and ZEPPI can yield functional hypotheses are provided.
Collapse
Affiliation(s)
- Haiqing Zhao
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY10032
| | - Donald Petrey
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY10032
| | - Diana Murray
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY10032
| | - Barry Honig
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY10032
- Department of Biochemistry and Molecular Biophysics, Columbia University Irving Medical Center, New York, NY10032
- Department of Medicine, Columbia University, New York, NY10032
- Zuckerman Institute, Columbia University, New York, NY10027
| |
Collapse
|
6
|
Liu JX, Zhang X, Huang YQ, Hao GF, Yang GF. Multi-level bioinformatics resources support drug target discovery of protein-protein interactions. Drug Discov Today 2024; 29:103979. [PMID: 38608830 DOI: 10.1016/j.drudis.2024.103979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/14/2024] [Accepted: 04/05/2024] [Indexed: 04/14/2024]
Abstract
Drug discovery often begins with a new target. Protein-protein interactions (PPIs) are crucial to multitudinous cellular processes and offer a promising avenue for drug-target discovery. PPIs are characterized by multi-level complexity: at the protein level, interaction networks can be used to identify potential targets, whereas at the residue level, the details of the interactions of individual PPIs can be used to examine a target's druggability. Much great progress has been made in target discovery through multi-level PPI-related computational approaches, but these resources have not been fully discussed. Here, we systematically survey bioinformatics tools for identifying and assessing potential drug targets, examining their characteristics, limitations and applications. This work will aid the integration of the broader protein-to-network context with the analysis of detailed binding mechanisms to support the discovery of drug targets.
Collapse
Affiliation(s)
- Jia-Xin Liu
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China
| | - Xiao Zhang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Yuan-Qin Huang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Ge-Fei Hao
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China; State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China.
| | - Guang-Fu Yang
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China.
| |
Collapse
|
7
|
Trepte P, Secker C, Olivet J, Blavier J, Kostova S, Maseko SB, Minia I, Silva Ramos E, Cassonnet P, Golusik S, Zenkner M, Beetz S, Liebich MJ, Scharek N, Schütz A, Sperling M, Lisurek M, Wang Y, Spirohn K, Hao T, Calderwood MA, Hill DE, Landthaler M, Choi SG, Twizere JC, Vidal M, Wanker EE. AI-guided pipeline for protein-protein interaction drug discovery identifies a SARS-CoV-2 inhibitor. Mol Syst Biol 2024; 20:428-457. [PMID: 38467836 PMCID: PMC10987651 DOI: 10.1038/s44320-024-00019-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 01/22/2024] [Accepted: 01/23/2024] [Indexed: 03/13/2024] Open
Abstract
Protein-protein interactions (PPIs) offer great opportunities to expand the druggable proteome and therapeutically tackle various diseases, but remain challenging targets for drug discovery. Here, we provide a comprehensive pipeline that combines experimental and computational tools to identify and validate PPI targets and perform early-stage drug discovery. We have developed a machine learning approach that prioritizes interactions by analyzing quantitative data from binary PPI assays or AlphaFold-Multimer predictions. Using the quantitative assay LuTHy together with our machine learning algorithm, we identified high-confidence interactions among SARS-CoV-2 proteins for which we predicted three-dimensional structures using AlphaFold-Multimer. We employed VirtualFlow to target the contact interface of the NSP10-NSP16 SARS-CoV-2 methyltransferase complex by ultra-large virtual drug screening. Thereby, we identified a compound that binds to NSP10 and inhibits its interaction with NSP16, while also disrupting the methyltransferase activity of the complex, and SARS-CoV-2 replication. Overall, this pipeline will help to prioritize PPI targets to accelerate the discovery of early-stage drug candidates targeting protein complexes and pathways.
Collapse
Affiliation(s)
- Philipp Trepte
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany.
- Brain Development and Disease, Institute of Molecular Biotechnology of the Austrian Academy of Sciences, 1030, Vienna, Austria.
| | - Christopher Secker
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany.
- Zuse Institute Berlin, Berlin, Germany.
| | - Julien Olivet
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Structural Biology Unit, Laboratory of Virology and Chemotherapy, Rega Institute for Medical Research, Department of Microbiology, Immunology and Transplantation, Katholieke Universiteit Leuven, 3000, Leuven, Belgium
| | - Jeremy Blavier
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
| | - Simona Kostova
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Sibusiso B Maseko
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
| | - Igor Minia
- RNA Biology and Posttranscriptional Regulation, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, 13125, Berlin, Germany
| | - Eduardo Silva Ramos
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Patricia Cassonnet
- Département de Virologie, Unité de Génétique Moléculaire des Virus à ARN (GMVR), Institut Pasteur, Centre National de la Recherche Scientifique (CNRS), Université de Paris, Paris, France
| | - Sabrina Golusik
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Martina Zenkner
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Stephanie Beetz
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Mara J Liebich
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Nadine Scharek
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Anja Schütz
- Protein Production & Characterization, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Marcel Sperling
- Multifunctional Colloids and Coating, Fraunhofer Institute for Applied Polymer Research (IAP), 14476, Potsdam-Golm, Germany
| | - Michael Lisurek
- Structural Chemistry and Computational Biophysics, Leibniz-Institut für Molekulare Pharmakologie (FMP), 13125, Berlin, Germany
| | - Yang Wang
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Kerstin Spirohn
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Tong Hao
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Michael A Calderwood
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - David E Hill
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Markus Landthaler
- RNA Biology and Posttranscriptional Regulation, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, 13125, Berlin, Germany
- Institute of Biology, Humboldt-Universität zu Berlin, 13125, Berlin, Germany
| | - Soon Gang Choi
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA.
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
| | - Jean-Claude Twizere
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium.
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, 5030, Gembloux, Belgium.
- Laboratory of Algal Synthetic and Systems Biology, Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, UAE.
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA.
| | - Erich E Wanker
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany.
| |
Collapse
|
8
|
Xiong D, Qiu Y, Zhao J, Zhou Y, Lee D, Gupta S, Torres M, Lu W, Liang S, Kang JJ, Eng C, Loscalzo J, Cheng F, Yu H. Structurally-informed human interactome reveals proteome-wide perturbations by disease mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.04.24.538110. [PMID: 37162909 PMCID: PMC10168245 DOI: 10.1101/2023.04.24.538110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Human genome sequencing studies have identified numerous loci associated with complex diseases. However, translating human genetic and genomic findings to disease pathobiology and therapeutic discovery remains a major challenge at multiscale interactome network levels. Here, we present a deep-learning-based ensemble framework, termed PIONEER (Protein-protein InteractiOn iNtErfacE pRediction), that accurately predicts protein binding partner-specific interfaces for all known protein interactions in humans and seven other common model organisms, generating comprehensive structurally-informed protein interactomes. We demonstrate that PIONEER outperforms existing state-of-the-art methods. We further systematically validated PIONEER predictions experimentally through generating 2,395 mutations and testing their impact on 6,754 mutation-interaction pairs, confirming the high quality and validity of PIONEER predictions. We show that disease-associated mutations are enriched in PIONEER-predicted protein-protein interfaces after mapping mutations from ~60,000 germline exomes and ~36,000 somatic genomes. We identify 586 significant protein-protein interactions (PPIs) enriched with PIONEER-predicted interface somatic mutations (termed oncoPPIs) from pan-cancer analysis of ~11,000 tumor whole-exomes across 33 cancer types. We show that PIONEER-predicted oncoPPIs are significantly associated with patient survival and drug responses from both cancer cell lines and patient-derived xenograft mouse models. We identify a landscape of PPI-perturbing tumor alleles upon ubiquitination by E3 ligases, and we experimentally validate the tumorigenic KEAP1-NRF2 interface mutation p.Thr80Lys in non-small cell lung cancer. We show that PIONEER-predicted PPI-perturbing alleles alter protein abundance and correlates with drug responses and patient survival in colon and uterine cancers as demonstrated by proteogenomic data from the National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium. PIONEER, implemented as both a web server platform and a software package, identifies functional consequences of disease-associated alleles and offers a deep learning tool for precision medicine at multiscale interactome network levels.
Collapse
Affiliation(s)
- Dapeng Xiong
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY 14853, USA
| | - Yunguang Qiu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Junfei Zhao
- Department of Systems Biology, Herbert Irving Comprehensive Center, Columbia University, New York, NY 10032, USA
| | - Yadi Zhou
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Dongjin Lee
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Shobhita Gupta
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY 14853, USA
- Biophysics Program, Cornell University, Ithaca, NY 14853, USA
| | - Mateo Torres
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY 14853, USA
| | - Weiqiang Lu
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Siqi Liang
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Jin Joo Kang
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY 14853, USA
| | - Charis Eng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| | - Joseph Loscalzo
- Channing Division of Network Medicine, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
9
|
Xiao H, Rosen A, Chhibbar P, Moise L, Das J. From bench to bedside via bytes: Multi-omic immunoprofiling and integration using machine learning and network approaches. Hum Vaccin Immunother 2023; 19:2282803. [PMID: 38100557 PMCID: PMC10730168 DOI: 10.1080/21645515.2023.2282803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 11/09/2023] [Indexed: 12/17/2023] Open
Abstract
A significant surge in research endeavors leverages the vast potential of high-throughput omic technology platforms for broad profiling of biological responses to vaccines and cutting-edge immunotherapies and stem-cell therapies under development. These profiles capture different aspects of core regulatory and functional processes at different scales of resolution from molecular and cellular to organismal. Systems approaches capture the complex and intricate interplay between these layers and scales. Here, we summarize experimental data modalities, for characterizing the genome, epigenome, transcriptome, proteome, metabolome, and antibody-ome, that enable us to generate large-scale immune profiles. We also discuss machine learning and network approaches that are commonly used to analyze and integrate these modalities, to gain insights into correlates and mechanisms of natural and vaccine-mediated immunity as well as therapy-induced immunomodulation.
Collapse
Affiliation(s)
- Hanxi Xiao
- Center for Systems Immunology, Departments of Immunology and Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Aaron Rosen
- Center for Systems Immunology, Departments of Immunology and Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Prabal Chhibbar
- Center for Systems Immunology, Departments of Immunology and Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Jishnu Das
- Center for Systems Immunology, Departments of Immunology and Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
10
|
Kosoglu K, Aydin Z, Tuncbag N, Gursoy A, Keskin O. Structural coverage of the human interactome. Brief Bioinform 2023; 25:bbad496. [PMID: 38180828 PMCID: PMC10768791 DOI: 10.1093/bib/bbad496] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 11/16/2023] [Accepted: 11/30/2023] [Indexed: 01/07/2024] Open
Abstract
Complex biological processes in cells are embedded in the interactome, representing the complete set of protein-protein interactions. Mapping and analyzing the protein structures are essential to fully comprehending these processes' molecular details. Therefore, knowing the structural coverage of the interactome is important to show the current limitations. Structural modeling of protein-protein interactions requires accurate protein structures. In this study, we mapped all experimental structures to the reference human proteome. Later, we found the enrichment in structural coverage when complementary methods such as homology modeling and deep learning (AlphaFold) were included. We then collected the interactions from the literature and databases to form the reference human interactome, resulting in 117 897 non-redundant interactions. When we analyzed the structural coverage of the interactome, we found that the number of experimentally determined protein complex structures is scarce, corresponding to 3.95% of all binary interactions. We also analyzed known and modeled structures to potentially construct the structural interactome with a docking method. Our analysis showed that 12.97% of the interactions from HuRI and 73.62% and 32.94% from the filtered versions of STRING and HIPPIE could potentially be modeled with high structural coverage or accuracy, respectively. Overall, this paper provides an overview of the current state of structural coverage of the human proteome and interactome.
Collapse
Affiliation(s)
- Kayra Kosoglu
- Computational Sciences and Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Zeynep Aydin
- Computational Sciences and Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Nurcan Tuncbag
- School of Medicine, Koc University, 34450 Istanbul, Turkey
- Department of Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Attila Gursoy
- Department of Computer Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
11
|
Liu Y, Yang J, Wang T, Luo M, Chen Y, Chen C, Ronai Z, Zhou Y, Ruppin E, Han L. Expanding PROTACtable genome universe of E3 ligases. Nat Commun 2023; 14:6509. [PMID: 37845222 PMCID: PMC10579327 DOI: 10.1038/s41467-023-42233-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/28/2023] [Indexed: 10/18/2023] Open
Abstract
Proteolysis-targeting chimera (PROTAC) and other targeted protein degradation (TPD) molecules that induce degradation by the ubiquitin-proteasome system (UPS) offer new opportunities to engage targets that remain challenging to be inhibited by conventional small molecules. One fundamental element in the degradation process is the E3 ligase. However, less than 2% amongst hundreds of E3 ligases in the human genome have been engaged in current studies in the TPD field, calling for the recruiting of additional ones to further enhance the therapeutic potential of TPD. To accelerate the development of PROTACs utilizing under-explored E3 ligases, we systematically characterize E3 ligases from seven different aspects, including chemical ligandability, expression patterns, protein-protein interactions (PPI), structure availability, functional essentiality, cellular location, and PPI interface by analyzing 30 large-scale data sets. Our analysis uncovers several E3 ligases as promising extant PROTACs. In total, combining confidence score, ligandability, expression pattern, and PPI, we identified 76 E3 ligases as PROTAC-interacting candidates. We develop a user-friendly and flexible web portal ( https://hanlaboratory.com/E3Atlas/ ) aimed at assisting researchers to rapidly identify E3 ligases with promising TPD activities against specifically desired targets, facilitating the development of these therapies in cancer and beyond.
Collapse
Affiliation(s)
- Yuan Liu
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
| | - Jingwen Yang
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
| | - Tianlu Wang
- Center for Translational Cancer Research, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
| | - Mei Luo
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA
| | - Yamei Chen
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
| | - Chengxuan Chen
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
| | - Ze'ev Ronai
- Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Yubin Zhou
- Center for Translational Cancer Research, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA
- Department of Translational Medical Sciences, College of Medicine, Texas A&M University, Houston, TX, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, 20892, MD, USA.
| | - Leng Han
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN, USA.
- Brown Center for Immunotherapy, School of Medicine, Indiana University, Indianapolis, IN, USA.
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, TX, USA.
- Department of Translational Medical Sciences, College of Medicine, Texas A&M University, Houston, TX, USA.
| |
Collapse
|
12
|
Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.08.561337. [PMID: 37873463 PMCID: PMC10592693 DOI: 10.1101/2023.10.08.561337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The fidelity of signal transduction requires the binding of regulatory molecules to their cognate targets. However, the crowded cell interior risks off-target interactions between proteins that are functionally unrelated. How such off-target interactions impact fitness is not generally known, but quantifying this is required to understand the constraints faced by cell systems as they evolve. Here, we use the model organism S. cerevisiae to inducibly express tyrosine kinases. Because yeast lacks bona fide tyrosine kinases, most of the resulting tyrosine phosphorylation is spurious. This provides a suitable system to measure the impact of artificial protein interactions on fitness. We engineered 44 yeast strains each expressing a tyrosine kinase, and quantitatively analysed their phosphoproteomes. This analysis resulted in ~30,000 phosphosites mapping to ~3,500 proteins. Examination of the fitness costs in each strain revealed a strong correlation between the number of spurious pY sites and decreased growth. Moreover, the analysis of pY effects on protein structure and on protein function revealed over 1000 pY events that we predict to be deleterious. However, we also find that a large number of the spurious pY sites have a negligible effect on fitness, possibly because of their low stoichiometry. This result is consistent with our evolutionary analyses demonstrating a lack of phosphotyrosine counter-selection in species with bona fide tyrosine kinases. Taken together, our results suggest that, alongside the risk for toxicity, the cell can tolerate a large degree of non-functional crosstalk as interaction networks evolve.
Collapse
Affiliation(s)
- David Bradley
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexander Hogrebe
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rohan Dandage
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Ugo Dionne
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexis Chang
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Judit Villén
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Christian R Landry
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| |
Collapse
|
13
|
Cankara F, Doğan T. ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations. Comput Struct Biotechnol J 2023; 21:4743-4758. [PMID: 37822561 PMCID: PMC10562615 DOI: 10.1016/j.csbj.2023.09.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 09/15/2023] [Accepted: 09/15/2023] [Indexed: 10/13/2023] Open
Abstract
Background Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and data-centric analysis of sequence variations. Method In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. Results We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.
Collapse
Affiliation(s)
- Fatma Cankara
- Biological Data Science Laboratory, Dept. of Computer Engineering, Hacettepe University, Ankara, Turkey
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
- Department of Computational Sciences and Engineering, Koc University, Istanbul, Turkey
| | - Tunca Doğan
- Biological Data Science Laboratory, Dept. of Computer Engineering, Hacettepe University, Ankara, Turkey
- Institute of Informatics, Hacettepe University, Ankara, Turkey
- Department of Bioinformatics, Graduate School of Health Sciences, Hacettepe University, Ankara, Turkey
| |
Collapse
|
14
|
Liu Z, Qian W, Cai W, Song W, Wang W, Maharjan DT, Cheng W, Chen J, Wang H, Xu D, Lin GN. Inferring the Effects of Protein Variants on Protein-Protein Interactions with Interpretable Transformer Representations. RESEARCH (WASHINGTON, D.C.) 2023; 6:0219. [PMID: 37701056 PMCID: PMC10494974 DOI: 10.34133/research.0219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 08/20/2023] [Indexed: 09/14/2023]
Abstract
Identifying pathogenetic variants and inferring their impact on protein-protein interactions sheds light on their functional consequences on diseases. Limited by the availability of experimental data on the consequences of protein interaction, most existing methods focus on building models to predict changes in protein binding affinity. Here, we introduced MIPPI, an end-to-end, interpretable transformer-based deep learning model that learns features directly from sequences by leveraging the interaction data from IMEx. MIPPI was specifically trained to determine the types of variant impact (increasing, decreasing, disrupting, and no effect) on protein-protein interactions. We demonstrate the accuracy of MIPPI and provide interpretation through the analysis of learned attention weights, which exhibit correlations with the amino acids interacting with the variant. Moreover, we showed the practicality of MIPPI in prioritizing de novo mutations associated with complex neurodevelopmental disorders and the potential to determine the pathogenic and driving mutations. Finally, we experimentally validated the functional impact of several variants identified in patients with such disorders. Overall, MIPPI emerges as a versatile, robust, and interpretable model, capable of effectively predicting mutation impacts on protein-protein interactions and facilitating the discovery of clinically actionable variants.
Collapse
Affiliation(s)
- Zhe Liu
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Wei Qian
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Wenxiang Cai
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Weichen Song
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Weidi Wang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai, China
| | - Dhruba Tara Maharjan
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Wenhong Cheng
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Jue Chen
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Han Wang
- School of Information Science and Technology, Institute of Computational Biology, Northeast Normal University, Changchun, China
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Guan Ning Lin
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai, China
| |
Collapse
|
15
|
Petrey D, Zhao H, Trudeau SJ, Murray D, Honig B. PrePPI: A Structure Informed Proteome-wide Database of Protein-Protein Interactions. J Mol Biol 2023; 435:168052. [PMID: 36933822 PMCID: PMC10293085 DOI: 10.1016/j.jmb.2023.168052] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 03/18/2023]
Abstract
We present an updated version of the Predicting Protein-Protein Interactions (PrePPI) webserver which predicts PPIs on a proteome-wide scale. PrePPI combines structural and non-structural evidence within a Bayesian framework to compute a likelihood ratio (LR) for essentially every possible pair of proteins in a proteome; the current database is for the human interactome. The structural modeling (SM) component is derived from template-based modeling and its application on a proteome-wide scale is enabled by a unique scoring function used to evaluate a putative complex. The updated version of PrePPI leverages AlphaFold structures that are parsed into individual domains. As has been demonstrated in earlier applications, PrePPI performs extremely well as measured by receiver operating characteristic curves derived from testing on E. coli and human protein-protein interaction (PPI) databases. A PrePPI database of ∼1.3 million human PPIs can be queried with a webserver application that comprises multiple functionalities for examining query proteins, template complexes, 3D models for predicted complexes, and related features (https://honiglab.c2b2.columbia.edu/PrePPI). PrePPI is a state-of-the-art resource that offers an unprecedented structure-informed view of the human interactome.
Collapse
Affiliation(s)
- Donald Petrey
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Haiqing Zhao
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Stephen J Trudeau
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Diana Murray
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Barry Honig
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Biochemistry and Molecular Biophysics, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Medicine, Columbia University, New York, NY 10032, USA; Zuckerman Mind Brain and Behavior Institute, Columbia University, New York, NY 10027, USA.
| |
Collapse
|
16
|
Trepte P, Secker C, Kostova S, Maseko SB, Choi SG, Blavier J, Minia I, Ramos ES, Cassonnet P, Golusik S, Zenkner M, Beetz S, Liebich MJ, Scharek N, Schütz A, Sperling M, Lisurek M, Wang Y, Spirohn K, Hao T, Calderwood MA, Hill DE, Landthaler M, Olivet J, Twizere JC, Vidal M, Wanker EE. AI-guided pipeline for protein-protein interaction drug discovery identifies a SARS-CoV-2 inhibitor. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.14.544560. [PMID: 37398436 PMCID: PMC10312674 DOI: 10.1101/2023.06.14.544560] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Protein-protein interactions (PPIs) offer great opportunities to expand the druggable proteome and therapeutically tackle various diseases, but remain challenging targets for drug discovery. Here, we provide a comprehensive pipeline that combines experimental and computational tools to identify and validate PPI targets and perform early-stage drug discovery. We have developed a machine learning approach that prioritizes interactions by analyzing quantitative data from binary PPI assays and AlphaFold-Multimer predictions. Using the quantitative assay LuTHy together with our machine learning algorithm, we identified high-confidence interactions among SARS-CoV-2 proteins for which we predicted three-dimensional structures using AlphaFold Multimer. We employed VirtualFlow to target the contact interface of the NSP10-NSP16 SARS-CoV-2 methyltransferase complex by ultra-large virtual drug screening. Thereby, we identified a compound that binds to NSP10 and inhibits its interaction with NSP16, while also disrupting the methyltransferase activity of the complex, and SARS-CoV-2 replication. Overall, this pipeline will help to prioritize PPI targets to accelerate the discovery of early-stage drug candidates targeting protein complexes and pathways.
Collapse
Affiliation(s)
- Philipp Trepte
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
- Brain Development and Disease, Institute of Molecular Biotechnology of the Austrian Academy of Sciences, 1030, Vienna, Austria
| | - Christopher Secker
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
- Zuse Institute Berlin, Berlin, Germany
| | - Simona Kostova
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Sibusiso B. Maseko
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
| | - Soon Gang Choi
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Jeremy Blavier
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
| | - Igor Minia
- RNA Biology and Posttranscriptional Regulation, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, 13125, Berlin, Germany
| | - Eduardo Silva Ramos
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Patricia Cassonnet
- Département de Virologie, Unité de Génétique Moléculaire des Virus à ARN (GMVR), Institut Pasteur, Centre National de la Recherche Scientifique (CNRS), Université de Paris, Paris, France
| | - Sabrina Golusik
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Martina Zenkner
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Stephanie Beetz
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Mara J. Liebich
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Nadine Scharek
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Anja Schütz
- Protein Production & Characterization, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Marcel Sperling
- Multifunctional Colloids and Coating, Fraunhofer Institute for Applied Polymer Research (IAP), 14476, Potsdam-Golm, Germany
| | - Michael Lisurek
- Structural Chemistry and Computational Biophysics, Leibniz-Institut für Molekulare Pharmakologie (FMP), 13125, Berlin, Germany
| | - Yang Wang
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Kerstin Spirohn
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Tong Hao
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Michael A. Calderwood
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - David E. Hill
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Markus Landthaler
- RNA Biology and Posttranscriptional Regulation, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, 13125, Berlin, Germany
- Institute of Biology, Humboldt-Universität zu Berlin, 13125, Berlin, Germany
| | - Julien Olivet
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Structural Biology Unit, Laboratory of Virology and Chemotherapy, Rega Institute for Medical Research, Department of Microbiology, Immunology and Transplantation, Katholieke Universiteit Leuven, 3000, Leuven, Belgium
| | - Jean-Claude Twizere
- Laboratory of Viral Interactomes, Interdisciplinary Cluster for Applied Genoproteomics (GIGA)-Molecular Biology of Diseases, University of Liège, 4000, Liège, Belgium
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, 5030, Gembloux, Belgium
- Laboratory of Algal Synthetic and Systems Biology, Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
| | - Erich E. Wanker
- Proteomics and Molecular Mechanisms of Neurodegenerative Diseases, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| |
Collapse
|
17
|
Mo J, Li Z, Chen H, Lu Z, Ding B, Yuan X, Liu Y, Zhu W. Network medicine framework identified drug-repurposing opportunities of pharmaco-active compounds of Angelica acutiloba (Siebold & Zucc.) Kitag. for skin aging. Aging (Albany NY) 2023; 15:5144-5163. [PMID: 37310405 PMCID: PMC10292898 DOI: 10.18632/aging.204789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 05/15/2023] [Indexed: 06/14/2023]
Abstract
Increasing incidence of skin aging has highlighted the importance of identifying effective drugs with repurposed opportunities for skin aging. We aimed to identify pharmaco-active compounds with drug-repurposing opportunities for skin aging from Angelica acutiloba (Siebold & Zucc.) Kitag. (AAK). The proximity of network medicine framework (NMF) firstly identified 8 key AAK compounds with repurposed opportunities for skin aging, which may exert by regulating 29 differentially expressed genes (DGEs) of skin aging, including 13 up-regulated targets and 16 down-regulated targets. Connectivity MAP (cMAP) analysis revealed 8 key compounds were involved in regulating the process of cell proliferation and apoptosis, mitochondrial energy metabolism and oxidative stress of skin aging. Molecular docking analysis showed that 8 key compounds had a high docked ability with AR, BCHE, HPGD and PI3, which were identified as specific biomarker for the diagnosis of skin aging. Finally, the mechanisms of these key compounds were predicted to be involved in inhibiting autophagy pathway and activating Phospholipase D signaling pathway. In conclusion, this study firstly elucidated the drug-repurposing opportunities of AAK compounds for skin aging, providing a theoretical reference for identifying repurposing drugs from Chinese medicine and new insights for our future research.
Collapse
Affiliation(s)
- Jiaxin Mo
- The Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou Province 510006, China
| | - Zunjiang Li
- The Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou Province 510006, China
| | - Hankun Chen
- Guangzhou Qinglan Biotechnology Co. Ltd., Guangzhou Province 515000, China
| | - Zhongyu Lu
- The Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou Province 510006, China
| | - Banghan Ding
- The Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou Province 510006, China
- Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou Province 510120, China
| | - Xiaohong Yuan
- Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou Province 510120, China
| | - Yuan Liu
- Guangzhou Huamiao Biotechnology Research Institute Co. Ltd., Guangzhou Province 510000, China
| | - Wei Zhu
- Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou Province 510120, China
| |
Collapse
|
18
|
Zhou X, Mitra R, Hou F, Zhou S, Wang L, Jiang W. Genomic Landscape and Potential Regulation of RNA Editing in Drug Resistance. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2207357. [PMID: 36912579 PMCID: PMC10190536 DOI: 10.1002/advs.202207357] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 01/31/2023] [Indexed: 05/18/2023]
Abstract
Adenosine-to-inosine RNA editing critically affects the response of cancer therapies. However, comprehensive identification of drug resistance-related RNA editing events and systematic understanding of how RNA editing mediates anticancer drug resistance remain unclear. Here, 7157 differential editing sites (DESs) are identified from 98 127 informative RNA editing sites in tumor tissues, many of which are validated in cancer cell lines. Diverse editing patterns of DESs are discovered in resistant samples, which could not be fully explained by adenosine deaminase acting on RNA enzymes. Some RNA-binding proteins are identified that potentially regulate these editing events. Notably, the DESs are significantly enriched in 3'-untranslated regions (3'-UTRs). The impact of DESs in 3'-UTR on the microRNA (miRNA) regulations is explored, and some triplets (DES, miRNA, and gene) that may contribute to drug resistance are identified. In addition, it is determined that the functions of genes enriched with DESs are associated with drug resistance, such as apoptosis, drug metabolism, and DNA synthesis involved in DNA repair. An online resource (http://www.jianglab.cn/REDR/) to support convenient retrieval of DESs is also built. The findings reveal the landscape and potential regulatory mechanism of RNA editing in drug resistance, providing new therapeutic targets for reversing drug resistance.
Collapse
Affiliation(s)
- Xu Zhou
- Department of Biomedical EngineeringNanjing University of Aeronautics and AstronauticsNanjing211106P. R. China
| | - Ramkrishna Mitra
- Department of PharmacologyPhysiology, and Cancer BiologySidney Kimmel Cancer CenterThomas Jefferson UniversityPhiladelphiaPennsylvania19107USA
| | - Fei Hou
- Department of Biomedical EngineeringNanjing University of Aeronautics and AstronauticsNanjing211106P. R. China
| | - Shunheng Zhou
- Department of Biomedical EngineeringNanjing University of Aeronautics and AstronauticsNanjing211106P. R. China
| | - Lihong Wang
- Department of PathophysiologySchool of MedicineSoutheast UniversityNanjing210009P. R. China
| | - Wei Jiang
- Department of Biomedical EngineeringNanjing University of Aeronautics and AstronauticsNanjing211106P. R. China
| |
Collapse
|
19
|
Durham J, Zhang J, Humphreys IR, Pei J, Cong Q. Recent advances in predicting and modeling protein-protein interactions. Trends Biochem Sci 2023; 48:527-538. [PMID: 37061423 DOI: 10.1016/j.tibs.2023.03.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 03/03/2023] [Accepted: 03/17/2023] [Indexed: 04/17/2023]
Abstract
Protein-protein interactions (PPIs) drive biological processes, and disruption of PPIs can cause disease. With recent breakthroughs in structure prediction and a deluge of genomic sequence data, computational methods to predict PPIs and model spatial structures of protein complexes are now approaching the accuracy of experimental approaches for permanent interactions and show promise for elucidating transient interactions. As we describe here, the key to this success is rich evolutionary information deciphered from thousands of homologous sequences that coevolve in interacting partners. This covariation signal, revealed by sophisticated statistical and machine learning (ML) algorithms, predicts physiological interactions. Accurate artificial intelligence (AI)-based modeling of protein structures promises to provide accurate 3D models of PPIs at a proteome-wide scale.
Collapse
Affiliation(s)
- Jesse Durham
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Jing Zhang
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Ian R Humphreys
- Department of Biochemistry, University of Washington, Seattle, WA, USA; Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jimin Pei
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Qian Cong
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|
20
|
Petrey D, Zhao H, Trudeau S, Murray D, Honig B. PrePPI: A structure informed proteome-wide database of protein-protein interactions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.27.530276. [PMID: 36909476 PMCID: PMC10002632 DOI: 10.1101/2023.02.27.530276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/04/2023]
Abstract
We present an updated version of the Predicting Protein-Protein Interactions (PrePPI) webserver which predicts PPIs on a proteome-wide scale. PrePPI combines structural and non-structural clues within a Bayesian framework to compute a likelihood ratio (LR) for essentially every possible pair of proteins in a proteome; the current database is for the human interactome. The structural modeling (SM) clue is derived from templatebased modeling and its application on a proteome-wide scale is enabled by a unique scoring function used to evaluate a putative complex. The updated version of PrePPI leverages AlphaFold structures that are parsed into individual domains. As has been demonstrated in earlier applications, PrePPI performs extremely well as measured by receiver operating characteristic curves derived from testing on E. coli and human protein-protein interaction (PPI) databases. A PrePPI database of ~1.3 million human PPIs can be queried with a webserver application that comprises multiple functionalities for examining query proteins, template complexes, 3D models for predicted complexes, and related features ( https://honiglab.c2b2.columbia.edu/PrePPI ). PrePPI is a state-of- the-art resource that offers an unprecedented structure-informed view of the human interactome. Graphic Abstract
Collapse
|
21
|
Kim D, Ha D, Lee K, Lee H, Kim I, Kim S. An evolution-based machine learning to identify cancer type-specific driver mutations. Brief Bioinform 2023; 24:6961611. [PMID: 36575568 DOI: 10.1093/bib/bbac593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 11/18/2022] [Accepted: 12/03/2022] [Indexed: 12/29/2022] Open
Abstract
Identifying cancer type-specific driver mutations is crucial for illuminating distinct pathologic mechanisms across various tumors and providing opportunities of patient-specific treatment. However, although many computational methods were developed to predict driver mutations in a type-specific manner, the methods still have room to improve. Here, we devise a novel feature based on sequence co-evolution analysis to identify cancer type-specific driver mutations and construct a machine learning (ML) model with state-of-the-art performance. Specifically, relying on 28 000 tumor samples across 66 cancer types, our ML framework outperformed current leading methods of detecting cancer driver mutations. Interestingly, the cancer mutations identified by sequence co-evolution feature are frequently observed in interfaces mediating tissue-specific protein-protein interactions that are known to associate with shaping tissue-specific oncogenesis. Moreover, we provide pre-calculated potential oncogenicity on available human proteins with prediction scores of all possible residue alterations through user-friendly website (http://sbi.postech.ac.kr/w/cancerCE). This work will facilitate the identification of cancer type-specific driver mutations in newly sequenced tumor samples.
Collapse
Affiliation(s)
| | | | | | | | - Inhae Kim
- ImmunoBiome Inc., Pohang, South Korea
| | - Sanguk Kim
- Department of Life Sciences.,Artificial Intelligence Graduate Program, Pohang University of Science and Technology, Pohang 790-784, South Korea.,Institute of Convergence Research and Education in Advanced Technology, Yonsei University, Seoul 120-149, South Korea
| |
Collapse
|
22
|
He Z, Gao K, Dong L, Liu L, Qu X, Zou Z, Wu Y, Bu D, Guo JC, Zhao Y. Drug screening and biomarker gene investigation in cancer therapy through the human transcriptional regulatory network. Comput Struct Biotechnol J 2023; 21:1557-1572. [PMID: 36879883 PMCID: PMC9984461 DOI: 10.1016/j.csbj.2023.02.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 01/19/2023] [Accepted: 02/04/2023] [Indexed: 02/10/2023] Open
Abstract
A complex and vast biological network regulates all biological functions in the human body in a sophisticated manner, and abnormalities in this network can lead to disease and even cancer. The construction of a high-quality human molecular interaction network is possible with the development of experimental techniques that facilitate the interpretation of the mechanisms of drug treatment for cancer. We collected 11 molecular interaction databases based on experimental sources and constructed a human protein-protein interaction (PPI) network and a human transcriptional regulatory network (HTRN). A random walk-based graph embedding method was used to calculate the diffusion profiles of drugs and cancers, and a pipeline was constructed by using five similarity comparison metrics combined with a rank aggregation algorithm, which can be implemented for drug screening and biomarker gene prediction. Taking NSCLC as an example, curcumin was identified as a potentially promising anticancer drug from 5450 natural small molecules, and combined with differentially expressed genes, survival analysis, and topological ranking, we obtained BIRC5 (survivin), which is both a biomarker for NSCLC and a key target for curcumin. Finally, the binding mode of curcumin and survivin was explored using molecular docking. This work has a guiding significance for antitumor drug screening and the identification of tumor markers.
Collapse
Affiliation(s)
- Zihao He
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Kai Gao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Lei Dong
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Liu Liu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Xinchi Qu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Zhengkai Zou
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Yang Wu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Dechao Bu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Jin-Cheng Guo
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Yi Zhao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China.,Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|
23
|
Luo Y, Liu L, He Z, Zhang S, Huo P, Wang Z, Jiaxin Q, Zhao L, Wu Y, Zhang D, Bu D, Chen R, Zhao Y. TREAT: Therapeutic RNAs exploration inspired by artificial intelligence technology. Comput Struct Biotechnol J 2022; 20:5680-5689. [PMID: 36320935 PMCID: PMC9589171 DOI: 10.1016/j.csbj.2022.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 11/08/2022] Open
Abstract
Recent advances in RNA engineering have enabled the development of RNA-based therapeutics for a broad spectrum of applications. Developing RNA therapeutics start with targeted RNA screening and move to the drug design and optimization. However, existing target screening tools ignore noncoding RNAs and their disease-relevant regulatory relationships. And designing therapeutic RNAs encounters high computational complexity of multi-objective optimization to overcome the immunogenicity, instability and inefficient translational production. To unlock the therapeutic potential of noncoding RNAs and enable one-stop screening and design of therapeutic RNAs, we have built the platform TREAT. It incorporates 43,087,953 regulatory relationships between coding and noncoding genes from 81 biological networks under different physiological conditions. TREAT introduces graph representation learning with Random Walk Diffusions to perform disease-relevant target screening, in addition to the commonly utilized Topological Degree and PageRank algorithms. Design and optimization of large RNAs or interfering RNAs are both available. To reduce the computational complexity of multi-objective optimization for large RNA, we stratified the features into local and global features. The local features are evaluated on the fixed-length or dynamic-length local bins, whereas the latter are inspired by AI language models of protein sequence. Then the global assessment is performed on refined candidates, thus reducing the enormous search space. Overall, TREAT is a one-stop platform for the screening and designing of therapeutic RNAs, with particular attention to noncoding RNAs and cutting-edge AI technology embedded, leading the progress of innovative therapeutics for challenging diseases. TREAT is freely accessible at https://rna.org.cn/treat.
Collapse
Affiliation(s)
- Yufan Luo
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liu Liu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Zihao He
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Shanshan Zhang
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Peipei Huo
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Zhihao Wang
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Qin Jiaxin
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Lianhe Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Yang Wu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Dongdong Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Dechao Bu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,Hwa Mei Hospital, University of Chinese Academy of Sciences, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| | - Runsheng Chen
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China,Shenzhen Institute of Nucleic Acid Drug Research, Shenzhen Bay Laboratory Pingshan Translational Medicine Center, Shenzhen 510800, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| | - Yi Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| |
Collapse
|
24
|
Li B, Roden DM, Capra JA. The 3D mutational constraint on amino acid sites in the human proteome. Nat Commun 2022; 13:3273. [PMID: 35672414 PMCID: PMC9174330 DOI: 10.1038/s41467-022-30936-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 05/19/2022] [Indexed: 12/16/2022] Open
Abstract
Quantification of the tolerance of protein sites to genetic variation has become a cornerstone of variant interpretation. We hypothesize that the constraint on missense variation at individual amino acid sites is largely shaped by direct interactions with 3D neighboring sites. To quantify this constraint, we introduce a framework called COntact Set MISsense tolerance (or COSMIS) and comprehensively map the landscape of 3D mutational constraint on 6.1 million amino acid sites covering 16,533 human proteins. We show that 3D mutational constraint is pervasive and that the level of constraint is strongly associated with disease relevance both at the site and the protein level. We demonstrate that COSMIS performs significantly better at variant interpretation tasks than other population-based constraint metrics while also providing structural insight into the functional roles of constrained sites. We anticipate that COSMIS will facilitate the interpretation of protein-coding variation in evolution and prioritization of sites for mechanistic investigation.
Collapse
Affiliation(s)
- Bian Li
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37203, USA.
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
| | - Dan M Roden
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Departments of Pharmacology and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - John A Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37203, USA.
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, 94143, USA.
| |
Collapse
|
25
|
Ozturk K, Carter H. Predicting functional consequences of mutations using molecular interaction network features. Hum Genet 2022; 141:1195-1210. [PMID: 34432150 PMCID: PMC8873243 DOI: 10.1007/s00439-021-02329-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Accepted: 07/31/2021] [Indexed: 12/13/2022]
Abstract
Variant interpretation remains a central challenge for precision medicine. Missense variants are particularly difficult to understand as they change only a single amino acid in a protein sequence yet can have large and varied effects on protein activity. Numerous tools have been developed to identify missense variants with putative disease consequences from protein sequence and structure. However, biological function arises through higher order interactions among proteins and molecules within cells. We therefore sought to capture information about the potential of missense mutations to perturb protein interaction networks by integrating protein structure and interaction data. We developed 16 network-based annotations for missense mutations that provide orthogonal information to features classically used to prioritize variants. We then evaluated them in the context of a proven machine-learning framework for variant effect prediction across multiple benchmark datasets to demonstrate their potential to improve variant classification. Interestingly, network features resulted in larger performance gains for classifying somatic mutations than for germline variants, possibly due to different constraints on what mutations are tolerated at the cellular versus organismal level. Our results suggest that modeling variant potential to perturb context-specific interactome networks is a fruitful strategy to advance in silico variant effect prediction.
Collapse
Affiliation(s)
- Kivilcim Ozturk
- Division of Medical Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - Hannah Carter
- Division of Medical Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA.
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA.
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
26
|
Paiva VDA, Gomes IDS, Monteiro CR, Mendonça MV, Martins PM, Santana CA, Gonçalves-Almeida V, Izidoro SC, Melo-Minardi RCD, Silveira SDA. Protein structural bioinformatics: An overview. Comput Biol Med 2022; 147:105695. [DOI: 10.1016/j.compbiomed.2022.105695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 06/01/2022] [Accepted: 06/02/2022] [Indexed: 11/27/2022]
|
27
|
Lee WY, Lee CY, Lee JS, Kim CE. Identifying Candidate Flavonoids for Non-Alcoholic Fatty Liver Disease by Network-Based Strategy. Front Pharmacol 2022; 13:892559. [PMID: 35721123 PMCID: PMC9204489 DOI: 10.3389/fphar.2022.892559] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 04/22/2022] [Indexed: 11/16/2022] Open
Abstract
Nonalcoholic fatty liver disease (NAFLD) is the most common type of chronic liver disease and lacks guaranteed pharmacological therapeutic options. In this study, we applied a network-based framework for comprehensively identifying candidate flavonoids for the prevention and/or treatment of NAFLD. Flavonoid-target interaction information was obtained from combining experimentally validated data and results obtained using a recently developed machine-learning model, AI-DTI. Flavonoids were then prioritized by calculating the network proximity between flavonoid targets and NAFLD-associated proteins. The preventive effects of the candidate flavonoids were evaluated using FFA-induced hepatic steatosis in HepG2 and AML12 cells. We reconstructed the flavonoid-target network and found that the number of re-covered compound-target interactions was significantly higher than the chance level. Proximity scores have successfully rediscovered flavonoids and their potential mechanisms that are reported to have therapeutic effects on NAFLD. Finally, we revealed that discovered candidates, particularly glycitin, significantly attenuated lipid accumulation and moderately inhibited intracellular reactive oxygen species production. We further confirmed the affinity of glycitin with the predicted target using molecular docking and found that glycitin targets are closely related to several proteins involved in lipid metabolism, inflammatory responses, and oxidative stress. The predicted network-level effects were validated at the levels of mRNA. In summary, our study offers and validates network-based methods for the identification of candidate flavonoids for NAFLD.
Collapse
Affiliation(s)
- Won-Yung Lee
- Department of Physiology, College of Korean Medicine, Gachon University, Seongnam, South Korea
- Department of Herbal Formula, College of Korean Medicine, Dongguk University, Goyang-si, South Korea
| | - Choong-Yeol Lee
- Department of Physiology, College of Korean Medicine, Gachon University, Seongnam, South Korea
| | - Jin-Seok Lee
- Institute of Bioscience and Integrative Medicine, Daejeon Oriental Hospital of Daejeon University, Daejeon, South Korea
| | - Chang-Eop Kim
- Department of Physiology, College of Korean Medicine, Gachon University, Seongnam, South Korea
| |
Collapse
|
28
|
Abstract
Three-dimensional protein structural data at the molecular level are pivotal for successful precision medicine. Such data are crucial not only for discovering drugs that act to block the active site of the target mutant protein but also for clarifying to the patient and the clinician how the mutations harbored by the patient work. The relative paucity of structural data reflects their cost, challenges in their interpretation, and lack of clinical guidelines for their utilization. Rapid technological advancements in experimental high-resolution structural determination increasingly generate structures. Computationally, modeling algorithms, including molecular dynamics simulations, are becoming more powerful, as are compute-intensive hardware, particularly graphics processing units, overlapping with the inception of the exascale era. Accessible, freely available, and detailed structural and dynamical data can be merged with big data to powerfully transform personalized pharmacology. Here we review protein and emerging genome high-resolution data, along with means, applications, and examples underscoring their usefulness in precision medicine. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, Maryland, USA; .,Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, Maryland, USA;
| | - Guy Nir
- Department of Biochemistry and Molecular Biology, Department of Neuroscience, Cell Biology and Anatomy, and Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, Texas, USA
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, Maryland, USA;
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA.,Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio, USA.,Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| |
Collapse
|
29
|
Lim H, Cankara F, Tsai CJ, Keskin O, Nussinov R, Gursoy A. Artificial intelligence approaches to human-microbiome protein–protein interactions. Curr Opin Struct Biol 2022; 73:102328. [DOI: 10.1016/j.sbi.2022.102328] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 12/01/2021] [Accepted: 12/31/2021] [Indexed: 02/08/2023]
|
30
|
Lee D, Xiong D, Wierbowski S, Li L, Liang S, Yu H. Deep learning methods for 3D structural proteome and interactome modeling. Curr Opin Struct Biol 2022; 73:102329. [PMID: 35139457 PMCID: PMC8957610 DOI: 10.1016/j.sbi.2022.102329] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Revised: 12/05/2021] [Accepted: 12/31/2021] [Indexed: 12/19/2022]
Abstract
Bolstered by recent methodological and hardware advances, deep learning has increasingly been applied to biological problems and structural proteomics. Such approaches have achieved remarkable improvements over traditional machine learning methods in tasks ranging from protein contact map prediction to protein folding, prediction of protein-protein interaction interfaces, and characterization of protein-drug binding pockets. In particular, emergence of ab initio protein structure prediction methods including AlphaFold2 has revolutionized protein structural modeling. From a protein function perspective, numerous deep learning methods have facilitated deconvolution of the exact amino acid residues and protein surface regions responsible for binding other proteins or small molecule drugs. In this review, we provide a comprehensive overview of recent deep learning methods applied in structural proteomics.
Collapse
|
31
|
Duan D, Hanson M, Holland DO, Johnson ME. Integrating protein copy numbers with interaction networks to quantify stoichiometry in clathrin-mediated endocytosis. Sci Rep 2022; 12:5413. [PMID: 35354856 PMCID: PMC8967901 DOI: 10.1038/s41598-022-09259-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 03/21/2022] [Indexed: 11/25/2022] Open
Abstract
Proteins that drive processes like clathrin-mediated endocytosis (CME) are expressed at copy numbers within a cell and across cell types varying from hundreds (e.g. auxilin) to millions (e.g. clathrin). These variations contain important information about function, but without integration with the interaction network, they cannot capture how supply and demand for each protein depends on binding to shared and distinct partners. Here we construct the interface-resolved network of 82 proteins involved in CME and establish a metric, a stoichiometric balance ratio (SBR), that quantifies whether each protein in the network has an abundance that is sub- or super-stoichiometric dependent on the global competition for binding. We find that highly abundant proteins (like clathrin) are super-stoichiometric, but that not all super-stoichiometric proteins are highly abundant, across three cell populations (HeLa, fibroblast, and neuronal synaptosomes). Most strikingly, within all cells there is significant competition to bind shared sites on clathrin and the central AP-2 adaptor by other adaptor proteins, resulting in most being in excess supply. Our network and systematic analysis, including response to perturbations of network components, show how competition for shared binding sites results in functionally similar proteins having widely varying stoichiometries, due to variations in both abundance and their unique network of binding partners.
Collapse
Affiliation(s)
- Daisy Duan
- TC Jenkins Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore, MD, 21218, USA
| | - Meretta Hanson
- TC Jenkins Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore, MD, 21218, USA
| | | | - Margaret E Johnson
- TC Jenkins Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore, MD, 21218, USA.
| |
Collapse
|
32
|
Tiberti M, Terkelsen T, Degn K, Beltrame L, Cremers TC, da Piedade I, Di Marco M, Maiani E, Papaleo E. MutateX: an automated pipeline for in silico saturation mutagenesis of protein structures and structural ensembles. Brief Bioinform 2022; 23:6552273. [PMID: 35323860 DOI: 10.1093/bib/bbac074] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/28/2022] [Accepted: 02/16/2022] [Indexed: 12/26/2022] Open
Abstract
Mutations, which result in amino acid substitutions, influence the stability of proteins and their binding to biomolecules. A molecular understanding of the effects of protein mutations is both of biotechnological and medical relevance. Empirical free energy functions that quickly estimate the free energy change upon mutation (ΔΔG) can be exploited for systematic screenings of proteins and protein complexes. In silico saturation mutagenesis can guide the design of new experiments or rationalize the consequences of known mutations. Often software such as FoldX, while fast and reliable, lack the necessary automation features to apply them in a high-throughput manner. We introduce MutateX, a software to automate the prediction of ΔΔGs associated with the systematic mutation of each residue within a protein, or protein complex to all other possible residue types, using the FoldX energy function. MutateX also supports ΔΔG calculations over protein ensembles, upon post-translational modifications and in multimeric assemblies. At the heart of MutateX lies an automated pipeline engine that handles input preparation, parallelization and outputs publication-ready figures. We illustrate the MutateX protocol applied to different case studies. The results of the high-throughput scan provided by our tools can help in different applications, such as the analysis of disease-associated mutations, to complement experimental deep mutational scans, or assist the design of variants for industrial applications. MutateX is a collection of Python tools that relies on open-source libraries. It is available free of charge under the GNU General Public License from https://github.com/ELELAB/mutatex.
Collapse
Affiliation(s)
- Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Thilde Terkelsen
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Kristine Degn
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Ludovica Beltrame
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Tycho Canter Cremers
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Isabelle da Piedade
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Miriam Di Marco
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Emiliano Maiani
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark.,Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark.,Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
33
|
Delaunay M, Ha-Duong T. Computational Tools and Strategies to Develop Peptide-Based Inhibitors of Protein-Protein Interactions. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2405:205-230. [PMID: 35298816 DOI: 10.1007/978-1-0716-1855-4_11] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein-protein interactions play crucial and subtle roles in many biological processes and modifications of their fine mechanisms generally result in severe diseases. Peptide derivatives are very promising therapeutic agents for modulating protein-protein associations with sizes and specificities between those of small compounds and antibodies. For the same reasons, rational design of peptide-based inhibitors naturally borrows and combines computational methods from both protein-ligand and protein-protein research fields. In this chapter, we aim to provide an overview of computational tools and approaches used for identifying and optimizing peptides that target protein-protein interfaces with high affinity and specificity. We hope that this review will help to implement appropriate in silico strategies for peptide-based drug design that builds on available information for the systems of interest.
Collapse
Affiliation(s)
| | - Tâp Ha-Duong
- Université Paris-Saclay, CNRS, BioCIS, Châtenay-Malabry, France.
| |
Collapse
|
34
|
BIPSPI+: Mining Type-Specific Datasets of Protein Complexes to Improve Protein Binding Site Prediction. J Mol Biol 2022; 434:167556. [DOI: 10.1016/j.jmb.2022.167556] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Revised: 03/12/2022] [Accepted: 03/16/2022] [Indexed: 11/20/2022]
|
35
|
Xiong D, Lee D, Li L, Zhao Q, Yu H. Implications of disease-related mutations at protein-protein interfaces. Curr Opin Struct Biol 2022; 72:219-225. [PMID: 34959033 PMCID: PMC8863207 DOI: 10.1016/j.sbi.2021.11.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 11/01/2021] [Accepted: 11/18/2021] [Indexed: 02/03/2023]
Abstract
Protein-protein interfaces have been attracting great attention owing to their critical roles in protein-protein interactions and the fact that human disease-related mutations are generally enriched in them. Recently, substantial research progress has been made in this field, which has significantly promoted the understanding and treatment of various human diseases. For example, many studies have discovered the properties of disease-related mutations. Besides, as more large-scale experimental data become available, various computational approaches have been proposed to advance our understanding of disease mutations from the data. Here, we overview recent advances in characteristics of disease-related mutations at protein-protein interfaces, mutation effects on protein interactions, and investigation of mutations on specific diseases.
Collapse
Affiliation(s)
- Dapeng Xiong
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Dongjin Lee
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Le Li
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Qiuye Zhao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
36
|
Chen S, Liu Y, Zhang Y, Wierbowski SD, Lipkin SM, Wei X, Yu H. A full-proteome, interaction-specific characterization of mutational hotspots across human cancers. Genome Res 2022; 32:135-149. [PMID: 34963661 PMCID: PMC8744679 DOI: 10.1101/gr.275437.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 11/22/2021] [Indexed: 11/24/2022]
Abstract
Rapid accumulation of cancer genomic data has led to the identification of an increasing number of mutational hotspots with uncharacterized significance. Here we present a biologically informed computational framework that characterizes the functional relevance of all 1107 published mutational hotspots identified in approximately 25,000 tumor samples across 41 cancer types in the context of a human 3D interactome network, in which the interface of each interaction is mapped at residue resolution. Hotspots reside in network hub proteins and are enriched on protein interaction interfaces, suggesting that alteration of specific protein-protein interactions is critical for the oncogenicity of many hotspot mutations. Our framework enables, for the first time, systematic identification of specific protein interactions affected by hotspot mutations at the full proteome scale. Furthermore, by constructing a hotspot-affected network that connects all hotspot-affected interactions throughout the whole-human interactome, we uncover genome-wide relationships among hotspots and implicate novel cancer proteins that do not harbor hotspot mutations themselves. Moreover, applying our network-based framework to specific cancer types identifies clinically significant hotspots that can be used for prognosis and therapy targets. Overall, we show that our framework bridges the gap between the statistical significance of mutational hotspots and their biological and clinical significance in human cancers.
Collapse
Affiliation(s)
- Siwei Chen
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Yuan Liu
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Yingying Zhang
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Shayne D Wierbowski
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Steven M Lipkin
- Department of Medicine, Weill Cornell Medicine, New York, New York 10021, USA
| | - Xiaomu Wei
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Medicine, Weill Cornell Medicine, New York, New York 10021, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
37
|
Ovek D, Abali Z, Zeylan ME, Keskin O, Gursoy A, Tuncbag N. Artificial intelligence based methods for hot spot prediction. Curr Opin Struct Biol 2021; 72:209-218. [PMID: 34954608 DOI: 10.1016/j.sbi.2021.11.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/29/2022]
Abstract
Proteins interact through their interfaces to fulfill essential functions in the cell. They bind to their partners in a highly specific manner and form complexes that have a profound effect on understanding the biological pathways they are involved in. Any abnormal interactions may cause diseases. Therefore, the identification of small molecules which modulate protein interactions through their interfaces has high therapeutic potential. However, discovering such molecules is challenging. Most protein-protein binding affinity is attributed to a small set of amino acids found in protein interfaces known as hot spots. Recent studies demonstrate that drug-like small molecules specifically may bind to hot spots. Therefore, hot spot prediction is crucial. As experimental data accumulates, artificial intelligence begins to be used for computational hot spot prediction. First, we review machine learning and deep learning for computational hot spot prediction and then explain the significance of hot spots toward drug design.
Collapse
Affiliation(s)
- Damla Ovek
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Zeynep Abali
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | | | - Ozlem Keskin
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Attila Gursoy
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Nurcan Tuncbag
- College of Engineering, Koc University, 34450 Istanbul, Turkey; School of Medicine, Koc University, 34450 Istanbul, Turkey.
| |
Collapse
|
38
|
A 3D structural SARS-CoV-2-human interactome to explore genetic and drug perturbations. Nat Methods 2021; 18:1477-1488. [PMID: 34845387 PMCID: PMC8665054 DOI: 10.1038/s41592-021-01318-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 10/05/2021] [Indexed: 01/08/2023]
Abstract
Emergence of new viral agents is driven by evolution of interactions between viral proteins and host targets. For instance, increased infectivity of SARS-CoV-2 compared to SARS-CoV-1 arose in part through rapid evolution along the interface between the spike protein and its human receptor ACE2, leading to increased binding affinity. To facilitate broader exploration of how pathogen-host interactions might impact transmission and virulence in the ongoing COVID-19 pandemic, we performed state-of-the-art interface prediction followed by molecular docking to construct a three-dimensional structural interactome between SARS-CoV-2 and human. We additionally carried out downstream meta-analyses to investigate enrichment of sequence divergence between SARS-CoV-1 and SARS-CoV-2 or human population variants along viral-human protein-interaction interfaces, predict changes in binding affinity by these mutations/variants and further prioritize drug repurposing candidates predicted to competitively bind human targets. We believe this resource ( http://3D-SARS2.yulab.org ) will aid in development and testing of informed hypotheses for SARS-CoV-2 etiology and treatments.
Collapse
|
39
|
Torres PHM, Rossi AD, Blundell TL. ProtCHOIR: a tool for proteome-scale generation of homo-oligomers. Brief Bioinform 2021; 22:bbab182. [PMID: 34015821 PMCID: PMC8574958 DOI: 10.1093/bib/bbab182] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 04/04/2021] [Accepted: 04/20/2021] [Indexed: 01/10/2023] Open
Abstract
The rapid developments in gene sequencing technologies achieved in the recent decades, along with the expansion of knowledge on the three-dimensional structures of proteins, have enabled the construction of proteome-scale databases of protein models such as the Genome3D and ModBase. Nevertheless, although gene products are usually expressed as individual polypeptide chains, most biological processes are associated with either transient or stable oligomerisation. In the PDB databank, for example, ~40% of the deposited structures contain at least one homo-oligomeric interface. Unfortunately, databases of protein models are generally devoid of multimeric structures. To tackle this particular issue, we have developed ProtCHOIR, a tool that is able to generate homo-oligomeric structures in an automated fashion, providing detailed information for the input protein and output complex. ProtCHOIR requires input of either a sequence or a protomeric structure that is queried against a pre-constructed local database of homo-oligomeric structures, then extensively analyzed using well-established tools such as PSI-Blast, MAFFT, PISA and Molprobity. Finally, MODELLER is employed to achieve the construction of the homo-oligomers. The output complex is thoroughly analyzed taking into account its stereochemical quality, interfacial stabilities, hydrophobicity and conservation profile. All these data are then summarized in a user-friendly HTML report that can be saved or printed as a PDF file. The software is easily parallelizable and also outputs a comma-separated file with summary statistics that can straightforwardly be concatenated as a spreadsheet-like document for large-scale data analyses. As a proof-of-concept, we built oligomeric models for the Mabellini Mycobacterium abscessus structural proteome database. ProtCHOIR can be run as a web-service and the code can be obtained free-of-charge at http://lmdm.biof.ufrj.br/protchoir.
Collapse
|
40
|
Raimondi F, Burkhart JG, Betts MJ, Russell RB, Wu G. Leveraging biochemical reactions to unravel functional impacts of cancer somatic variants affecting protein interaction interfaces. F1000Res 2021; 10:1111. [PMID: 36569594 PMCID: PMC9755755 DOI: 10.12688/f1000research.74395.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/27/2021] [Indexed: 07/26/2023] Open
Abstract
Background: Considering protein mutations in their biological context is essential for understanding their functional impact, interpretation of high-dimensional datasets and development of effective targeted therapies in personalized medicine. Methods: We combined the curated knowledge of biochemical reactions from Reactome with the analysis of interaction-mediating 3D interfaces from Mechismo. In addition, we provided a software tool for users to explore and browse the analysis results in a multi-scale perspective starting from pathways and reactions to protein-protein interactions and protein 3D structures. Results: We analyzed somatic mutations from TCGA, revealing several significantly impacted reactions and pathways in specific cancer types. We found examples of genes not yet listed as oncodrivers, whose rare mutations were predicted to affect cancer processes similarly to known oncodrivers. Some identified processes lack any known oncodrivers, which suggests potentially new cancer-related processes (e.g. complement cascade reactions). Furthermore, we found that mutations perturbing certain processes are significantly associated with distinct phenotypes (i.e. survival time) in specific cancer types (e.g. PIK3CA centered pathways in LGG and UCEC cancer types), suggesting the translational potential of our approach for patient stratification. Our analysis also uncovered several druggable processes (e.g. GPCR signalling pathways) containing enriched reactions, providing support for new off-label therapeutic options. Conclusions: In summary, we have established a multi-scale approach to study genetic variants based on protein-protein interaction 3D structures. Our approach is different from previously published studies in its focus on biochemical reactions and can be applied to other data types (e.g. post-translational modifications) collected for many types of disease.
Collapse
Affiliation(s)
| | - Joshua G. Burkhart
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Matthew J. Betts
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Robert B. Russell
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Guanming Wu
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| |
Collapse
|
41
|
Raimondi F, Burkhart JG, Betts MJ, Russell RB, Wu G. Leveraging biochemical reactions to unravel functional impacts of cancer somatic variants affecting protein interaction interfaces. F1000Res 2021; 10:1111. [PMID: 36569594 PMCID: PMC9755755 DOI: 10.12688/f1000research.74395.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/01/2022] [Indexed: 12/14/2022] Open
Abstract
Background: Considering protein mutations in their biological context is essential for understanding their functional impact, interpretation of high-dimensional datasets and development of effective targeted therapies in personalized medicine. Methods: We combined the curated knowledge of biochemical reactions from Reactome with the analysis of interaction-mediating 3D interfaces from Mechismo. In addition, we provided a software tool for users to explore and browse the analysis results in a multi-scale perspective starting from pathways and reactions to protein-protein interactions and protein 3D structures. Results: We analyzed somatic mutations from TCGA, revealing several significantly impacted reactions and pathways in specific cancer types. We found examples of genes not yet listed as oncodrivers, whose rare mutations were predicted to affect cancer processes similarly to known oncodrivers. Some identified processes lack any known oncodrivers, which suggests potentially new cancer-related processes (e.g. complement cascade reactions). Furthermore, we found that mutations perturbing certain processes are significantly associated with distinct phenotypes (i.e. survival time) in specific cancer types (e.g. PIK3CA centered pathways in LGG and UCEC cancer types), suggesting the translational potential of our approach for patient stratification. Our analysis also uncovered several druggable processes (e.g. GPCR signalling pathways) containing enriched reactions, providing support for new off-label therapeutic options. Conclusions: In summary, we have established a multi-scale approach to study genetic variants based on protein-protein interaction 3D structures. Our approach is different from previously published studies in its focus on biochemical reactions and can be applied to other data types (e.g. post-translational modifications) collected for many types of disease.
Collapse
Affiliation(s)
| | - Joshua G. Burkhart
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Matthew J. Betts
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Robert B. Russell
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Guanming Wu
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| |
Collapse
|
42
|
Raimondi F, Burkhart JG, Betts MJ, Russell RB, Wu G. Leveraging biochemical reactions to unravel functional impacts of cancer somatic variants affecting protein interaction interfaces. F1000Res 2021; 10:1111. [PMID: 36569594 PMCID: PMC9755755 DOI: 10.12688/f1000research.74395.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/25/2022] [Indexed: 07/26/2023] Open
Abstract
Background: Considering protein mutations in their biological context is essential for understanding their functional impact, interpretation of high-dimensional datasets and development of effective targeted therapies in personalized medicine. Methods: We combined the curated knowledge of biochemical reactions from Reactome with the analysis of interaction-mediating 3D interfaces from Mechismo. In addition, we provided a software tool for users to explore and browse the analysis results in a multi-scale perspective starting from pathways and reactions to protein-protein interactions and protein 3D structures. Results: We analyzed somatic mutations from TCGA, revealing several significantly impacted reactions and pathways in specific cancer types. We found examples of genes not yet listed as oncodrivers, whose rare mutations were predicted to affect cancer processes similarly to known oncodrivers. Some identified processes lack any known oncodrivers, which suggests potentially new cancer-related processes (e.g. complement cascade reactions). Furthermore, we found that mutations perturbing certain processes are significantly associated with distinct phenotypes (i.e. survival time) in specific cancer types (e.g. PIK3CA centered pathways in LGG and UCEC cancer types), suggesting the translational potential of our approach for patient stratification. Our analysis also uncovered several druggable processes (e.g. GPCR signalling pathways) containing enriched reactions, providing support for new off-label therapeutic options. Conclusions: In summary, we have established a multi-scale approach to study genetic variants based on protein-protein interaction 3D structures. Our approach is different from previously published studies in its focus on biochemical reactions and can be applied to other data types (e.g. post-translational modifications) collected for many types of disease.
Collapse
Affiliation(s)
| | - Joshua G. Burkhart
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Matthew J. Betts
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Robert B. Russell
- Heidelberg University Biochemistry Center, University of Heidelberg, Heidelberg, Germany
- BioQuant, University of Heidelberg, Heidelberg, Germany
| | - Guanming Wu
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| |
Collapse
|
43
|
Rouhani M, Hadi-Alijanvand H. Effect of Lithium Drug on Binding Affinities of Glycogen Synthase Kinase-3 β to Its Network Partners: A New Computational Approach. J Chem Inf Model 2021; 61:5280-5292. [PMID: 34533953 DOI: 10.1021/acs.jcim.1c00952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Finding new methods to study the effect of small molecules on protein interaction networks provides us with invaluable tools in the fields of pharmacodynamics and drug design. Lithium is an antimanic drug that has been used for the treatment of bipolar disorder for more than 60 years. Here, we utilized a new approach to study the effect of lithium as a drug on the protein interaction network of GSK-3β as a hub protein and computed the affinities of GSK-3β to its partners in the presence of lithium or sodium ions. For this purpose, ensembles of GSK-3β protein structures were created in the presence of either lithium or sodium ions using adaptive tempering molecular dynamics simulations. The protein binding patches of GSK-3β for its partners were determined, and finally, the affinity of each binding patch to the related partner was computed for structures of ensembles using a monomer-based approach. Besides, by comparing structural dynamics of GSK-3β during MD simulations in the presence of LiCl and NaCl, we suggested a new mechanism for the inhibitory effect of lithium on GSK-3β.
Collapse
Affiliation(s)
- Maryam Rouhani
- Department of Biological Sciences, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan 45137-66731, Iran
| | - Hamid Hadi-Alijanvand
- Department of Biological Sciences, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan 45137-66731, Iran
| |
Collapse
|
44
|
Arici MK, Tuncbag N. Performance Assessment of the Network Reconstruction Approaches on Various Interactomes. Front Mol Biosci 2021; 8:666705. [PMID: 34676243 PMCID: PMC8523993 DOI: 10.3389/fmolb.2021.666705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 07/14/2021] [Indexed: 01/04/2023] Open
Abstract
Beyond the list of molecules, there is a necessity to collectively consider multiple sets of omic data and to reconstruct the connections between the molecules. Especially, pathway reconstruction is crucial to understanding disease biology because abnormal cellular signaling may be pathological. The main challenge is how to integrate the data together in an accurate way. In this study, we aim to comparatively analyze the performance of a set of network reconstruction algorithms on multiple reference interactomes. We first explored several human protein interactomes, including PathwayCommons, OmniPath, HIPPIE, iRefWeb, STRING, and ConsensusPathDB. The comparison is based on the coverage of each interactome in terms of cancer driver proteins, structural information of protein interactions, and the bias toward well-studied proteins. We next used these interactomes to evaluate the performance of network reconstruction algorithms including all-pair shortest path, heat diffusion with flux, personalized PageRank with flux, and prize-collecting Steiner forest (PCSF) approaches. Each approach has its own merits and weaknesses. Among them, PCSF had the most balanced performance in terms of precision and recall scores when 28 pathways from NetPath were reconstructed using the listed algorithms. Additionally, the reference interactome affects the performance of the network reconstruction approaches. The coverage and disease- or tissue-specificity of each interactome may vary, which may result in differences in the reconstructed networks.
Collapse
Affiliation(s)
- M Kaan Arici
- Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.,Foot and Mouth Diseases Institute, Ministry of Agriculture and Forestry, Ankara, Turkey
| | - Nurcan Tuncbag
- Chemical and Biological Engineering, College of Engineering, Koc University, Istanbul, Turkey.,School of Medicine, Koc University, Istanbul, Turkey
| |
Collapse
|
45
|
Bim LV, Carneiro TNR, Buzatto VC, Colozza-Gama GA, Koyama FC, Thomaz DMD, de Jesus Paniza AC, Lee EA, Galante PAF, Cerutti JM. Molecular Signature Expands the Landscape of Driver Negative Thyroid Cancers. Cancers (Basel) 2021; 13:5184. [PMID: 34680332 PMCID: PMC8534197 DOI: 10.3390/cancers13205184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 10/12/2021] [Indexed: 12/04/2022] Open
Abstract
Thyroid cancer is the most common endocrine malignancy. However, the cytological diagnosis of follicular thyroid carcinoma (FTC), Hürthle cell carcinoma (HCC), and follicular variant of papillary thyroid carcinoma (FVPTC) and their benign counterparts is a challenge for preoperative diagnosis. Nearly 20-30% of biopsied thyroid nodules are classified as having indeterminate risk of malignancy and incur costs to the health care system. Based on that, 120 patients were screened for the main driver mutations previously described in thyroid cancer. Subsequently, 14 mutation-negative cases that are the main source of diagnostic errors (FTC, HCC, or FVPTC) underwent RNA-Sequencing analysis. Somatic variants in candidate driver genes (ECD, NUP98,LRP1B, NCOR1, ATM, SOS1, and SPOP) and fusions were described. NCOR1 and SPOP variants underwent validation. Moreover, expression profiling of driver-negative samples was compared to 16 BRAF V600E, RAS, or PAX8-PPARg positive samples. Negative samples were separated in two clusters, following the expression pattern of the RAS/PAX8-PPARg or BRAF V600E positive samples. Both negative groups showed distinct BRS, ERK, and TDS scores, tumor mutation burden, signaling pathways and immune cell profile. Altogether, here we report novel gene variants and describe cancer-related pathways that might impact preoperative diagnosis and provide insights into thyroid tumor biology.
Collapse
Affiliation(s)
- Larissa Valdemarin Bim
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| | - Thaise Nayane Ribeiro Carneiro
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| | - Vanessa Candiotti Buzatto
- Centro de Oncologia Molecular, Hospital Sírio-Libanês, Rua Professor Daher Cutait 69, Bela Vista, São Paulo 01308-060, SP, Brazil; (V.C.B.); (F.C.K.); (P.A.F.G.)
| | - Gabriel Avelar Colozza-Gama
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| | - Fernanda C. Koyama
- Centro de Oncologia Molecular, Hospital Sírio-Libanês, Rua Professor Daher Cutait 69, Bela Vista, São Paulo 01308-060, SP, Brazil; (V.C.B.); (F.C.K.); (P.A.F.G.)
| | - Debora Mota Dias Thomaz
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| | - Ana Carolina de Jesus Paniza
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Boston Children’s Hospital and Harvard Medical School, 3 Blackfan Circle, Boston, MA 02115, USA;
| | - Pedro Alexandre Favoretto Galante
- Centro de Oncologia Molecular, Hospital Sírio-Libanês, Rua Professor Daher Cutait 69, Bela Vista, São Paulo 01308-060, SP, Brazil; (V.C.B.); (F.C.K.); (P.A.F.G.)
| | - Janete Maria Cerutti
- Genetic Bases of Thyroid Tumors Laboratory, Division of Genetics, Department of Morphology and Genetics, Escola Paulista de Medicina, Universidade Federal de São Paulo, Pedro de Toledo 669, 11 Andar, São Paulo 04039-032, SP, Brazil; (L.V.B.); (T.N.R.C.); (G.A.C.-G.); (D.M.D.T.); (A.C.d.J.P.)
| |
Collapse
|
46
|
Senger G, Schaefer MH. Protein Complex Organization Imposes Constraints on Proteome Dysregulation in Cancer. FRONTIERS IN BIOINFORMATICS 2021; 1:723482. [PMID: 36303728 PMCID: PMC9580999 DOI: 10.3389/fbinf.2021.723482] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 08/16/2021] [Indexed: 01/07/2023] Open
Abstract
Protein assembly is a highly dynamic process and proteins can interact in different ways and stoichiometries within a complex. The importance of maintaining protein stoichiometry for complex function and avoiding aggregation of orphan subunits has been demonstrated. However, how exactly the organization of proteins into complexes constrains differential protein abundance in extreme cellular conditions like cancer, where a lot of protein abundance changes occur, has not been systematically investigated. To study this, we collected proteomic data made available by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) to quantify proteomic changes during carcinogenesis and systematically tested five interaction types in complexes to investigate which of these features impact on protein abundance correlation patterns in cancer. We found that higher than expected fraction of protein complex subunits does not show changes in their abundances compared to those in the normal samples. Furthermore, we found that the way proteins interact in complexes indeed constrains their co-abundance patterns. Our results highlight the role of the interactions between the proteins and the need of cancer cells to deal with aberrant changes in protein abundance.
Collapse
|
47
|
Protein interaction landscapes revealed by advanced in vivo cross-linking-mass spectrometry. Proc Natl Acad Sci U S A 2021; 118:2023360118. [PMID: 34349018 DOI: 10.1073/pnas.2023360118] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Defining protein-protein interactions (PPIs) in their native environment is crucial to understanding protein structure and function. Cross-linking-mass spectrometry (XL-MS) has proven effective in capturing PPIs in living cells; however, the proteome coverage remains limited. Here, we have developed a robust in vivo XL-MS platform to facilitate in-depth PPI mapping by integrating a multifunctional MS-cleavable cross-linker with sample preparation strategies and high-resolution MS. The advancement of click chemistry-based enrichment significantly enhanced the detection of cross-linked peptides for proteome-wide analyses. This platform enabled the identification of 13,904 unique lysine-lysine linkages from in vivo cross-linked HEK 293 cells, permitting construction of the largest in vivo PPI network to date, comprising 6,439 interactions among 2,484 proteins. These results allowed us to generate a highly detailed yet panoramic portrait of human interactomes associated with diverse cellular pathways. The strategy presented here signifies a technological advancement for in vivo PPI mapping at the systems level and can be generalized for charting protein interaction landscapes in any organisms.
Collapse
|
48
|
Moesslacher CS, Kohlmayr JM, Stelzl U. Exploring absent protein function in yeast: assaying post translational modification and human genetic variation. MICROBIAL CELL (GRAZ, AUSTRIA) 2021; 8:164-183. [PMID: 34395585 PMCID: PMC8329848 DOI: 10.15698/mic2021.08.756] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 06/13/2021] [Accepted: 06/18/2021] [Indexed: 01/08/2023]
Abstract
Yeast is a valuable eukaryotic model organism that has evolved many processes conserved up to humans, yet many protein functions, including certain DNA and protein modifications, are absent. It is this absence of protein function that is fundamental to approaches using yeast as an in vivo test system to investigate human proteins. Functionality of the heterologous expressed proteins is connected to a quantitative, selectable phenotype, enabling the systematic analyses of mechanisms and specificity of DNA modification, post-translational protein modifications as well as the impact of annotated cancer mutations and coding variation on protein activity and interaction. Through continuous improvements of yeast screening systems, this is increasingly carried out on a global scale using deep mutational scanning approaches. Here we discuss the applicability of yeast systems to investigate absent human protein function with a specific focus on the impact of protein variation on protein-protein interaction modulation.
Collapse
Affiliation(s)
- Christina S Moesslacher
- Institute of Pharmaceutical Sciences and BioTechMed-Graz, University of Graz, Graz, Austria
- Contributed equally to the writing of this review
| | - Johanna M Kohlmayr
- Institute of Pharmaceutical Sciences and BioTechMed-Graz, University of Graz, Graz, Austria
- Contributed equally to the writing of this review
| | - Ulrich Stelzl
- Institute of Pharmaceutical Sciences and BioTechMed-Graz, University of Graz, Graz, Austria
- Contributed equally to the writing of this review
| |
Collapse
|
49
|
Ford KM, Panwala R, Chen DH, Portell A, Palmer N, Mali P. Peptide-tiling screens of cancer drivers reveal oncogenic protein domains and associated peptide inhibitors. Cell Syst 2021; 12:716-732.e7. [PMID: 34051140 PMCID: PMC8298269 DOI: 10.1016/j.cels.2021.05.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 02/09/2021] [Accepted: 03/30/2021] [Indexed: 02/07/2023]
Abstract
Gene fragments derived from structural domains mediating physical interactions can modulate biological functions. Utilizing this, we developed lentiviral overexpression libraries of peptides comprehensively tiling high-confidence cancer driver genes. Toward inhibiting cancer growth, we assayed ~66,000 peptides, tiling 65 cancer drivers and 579 mutant alleles. Pooled fitness screens in two breast cancer cell lines revealed peptides, which selectively reduced cellular proliferation, implicating oncogenic protein domains important for cell fitness. Coupling of cell-penetrating motifs to these peptides enabled drug-like function, with peptides derived from EGFR and RAF1 inhibiting cell growth at IC50s of 27-63 μM. We anticipate that this peptide-tiling (PepTile) approach will enable rapid de novo mapping of bioactive protein domains and associated interfering peptides.
Collapse
Affiliation(s)
- Kyle M Ford
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA
| | - Rebecca Panwala
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA
| | - Dai-Hua Chen
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA
| | - Andrew Portell
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA
| | - Nathan Palmer
- Division of Biological Sciences, University of California, San Diego, San Diego, CA 92093, USA
| | - Prashant Mali
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA.
| |
Collapse
|
50
|
Chavez JD, Wippel HH, Tang X, Keller A, Bruce JE. In-Cell Labeling and Mass Spectrometry for Systems-Level Structural Biology. Chem Rev 2021; 122:7647-7689. [PMID: 34232610 PMCID: PMC8966414 DOI: 10.1021/acs.chemrev.1c00223] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Biological systems have evolved to utilize proteins to accomplish nearly all functional roles needed to sustain life. A majority of biological functions occur within the crowded environment inside cells and subcellular compartments where proteins exist in a densely packed complex network of protein-protein interactions. The structural biology field has experienced a renaissance with recent advances in crystallography, NMR, and CryoEM that now produce stunning models of large and complex structures previously unimaginable. Nevertheless, measurements of such structural detail within cellular environments remain elusive. This review will highlight how advances in mass spectrometry, chemical labeling, and informatics capabilities are merging to provide structural insights on proteins, complexes, and networks that exist inside cells. Because of the molecular detection specificity provided by mass spectrometry and proteomics, these approaches provide systems-level information that not only benefits from conventional structural analysis, but also is highly complementary. Although far from comprehensive in their current form, these approaches are currently providing systems structural biology information that can uniquely reveal how conformations and interactions involving many proteins change inside cells with perturbations such as disease, drug treatment, or phenotypic differences. With continued advancements and more widespread adaptation, systems structural biology based on in-cell labeling and mass spectrometry will provide an even greater wealth of structural knowledge.
Collapse
Affiliation(s)
- Juan D Chavez
- Department of Genome Sciences, University of Washington, Seattle, Washington 98109, United States
| | - Helisa H Wippel
- Department of Genome Sciences, University of Washington, Seattle, Washington 98109, United States
| | - Xiaoting Tang
- Department of Genome Sciences, University of Washington, Seattle, Washington 98109, United States
| | - Andrew Keller
- Department of Genome Sciences, University of Washington, Seattle, Washington 98109, United States
| | - James E Bruce
- Department of Genome Sciences, University of Washington, Seattle, Washington 98109, United States
| |
Collapse
|