Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Laine E, Karami Y, Carbone A. GEMME: a simple and fast global epistatic model predicting mutational effects. Mol Biol Evol 2019;36:2604-2619. [PMID: 31406981 PMCID: PMC6805226 DOI: 10.1093/molbev/msz179] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Received: 02/14/2019] [Revised: 06/03/2019] [Accepted: 08/02/2019] [Indexed: 12/15/2022] Open

Cited by Other Article(s)

Marsili G, Pallotto C, Fortuna C, Amendola A, Fiorentini C, Esperti S, Blanc P, Suardi LR, Giulietta V, Argentini C. Fifty years after the first identification of Toscana virus in Italy: Genomic characterization of viral isolates within lineage A and aminoacidic markers of evolution. Infect Genet Evol 2024;122:105601. [PMID: 38830443 DOI: 10.1016/j.meegid.2024.105601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/04/2024] [Revised: 04/18/2024] [Accepted: 05/03/2024] [Indexed: 06/05/2024]

Ozkan S, Padilla N, de la Cruz X. QAFI: a novel method for quantitative estimation of missense variant impact using protein-specific predictors and ensemble learning. Hum Genet 2024:10.1007/s00439-024-02692-z. [PMID: 39048855 DOI: 10.1007/s00439-024-02692-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/30/2024] [Accepted: 07/14/2024] [Indexed: 07/27/2024]

Li P, Liu ZP. MuToN Quantifies Binding Affinity Changes upon Protein Mutations by Geometric Deep Learning. Adv Sci (Weinh) 2024:e2402918. [PMID: 38995072 DOI: 10.1002/advs.202402918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/20/2024] [Revised: 06/04/2024] [Indexed: 07/13/2024]

Cheng P, Mao C, Tang J, Yang S, Cheng Y, Wang W, Gu Q, Han W, Chen H, Li S, Chen Y, Zhou J, Li W, Pan A, Zhao S, Huang X, Zhu S, Zhang J, Shu W, Wang S. Zero-shot prediction of mutation effects with multimodal deep representation learning guides protein engineering. Cell Res 2024:10.1038/s41422-024-00989-2. [PMID: 38969803 DOI: 10.1038/s41422-024-00989-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2024] [Accepted: 06/03/2024] [Indexed: 07/07/2024] Open

Abstract

Mutations in amino acid sequences can provoke changes in protein function. Accurate and unsupervised prediction of mutation effects is critical in biotechnology and biomedicine, but remains a fundamental challenge. To resolve this challenge, here we present Protein Mutational Effect Predictor (ProMEP), a general and multiple sequence alignment-free method that enables zero-shot prediction of mutation effects. A multimodal deep representation learning model embedded in ProMEP was developed to comprehensively learn both sequence and structure contexts from ~160 million proteins. ProMEP achieves state-of-the-art performance in mutational effect prediction and accomplishes a tremendous improvement in speed, enabling efficient and intelligent protein engineering. Specifically, ProMEP accurately forecasts mutational consequences on the gene-editing enzymes TnpB and TadA, and successfully guides the development of high-performance gene-editing tools with their engineered variants. The gene-editing efficiency of a 5-site mutant of TnpB reaches up to 74.04% (vs 24.66% for the wild type); and the base editing tool developed on the basis of a TadA 15-site mutant (in addition to the A106V/D108N double mutation that renders deoxyadenosine deaminase activity to TadA) exhibits an A-to-G conversion frequency of up to 77.27% (vs 69.80% for ABE8e, a previous TadA-based adenine base editor) with significantly reduced bystander and off-target effects compared to ABE8e. ProMEP not only showcases superior performance in predicting mutational effects on proteins but also demonstrates a great capability to guide protein engineering. Therefore, ProMEP enables efficient exploration of the gigantic protein space and facilitates practical design of proteins, thereby advancing studies in biomedicine and synthetic biology.

Affiliation(s)

Peng Cheng Bioinformatics Center of AMMS, Beijing, China
Cong Mao State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Jin Tang Zhejiang Lab, Hangzhou, Zhejiang, China
Sen Yang Bioinformatics Center of AMMS, Beijing, China
Yu Cheng State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Wuke Wang Zhejiang Lab, Hangzhou, Zhejiang, China
Qiuxi Gu State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Wei Han Zhejiang Lab, Hangzhou, Zhejiang, China
Hao Chen State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Sihan Li State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Yaofeng Chen Bioinformatics Center of AMMS, Beijing, China
Jianglin Zhou Bioinformatics Center of AMMS, Beijing, China
Wuju Li Bioinformatics Center of AMMS, Beijing, China
Aimin Pan Zhejiang Lab, Hangzhou, Zhejiang, China
Suwen Zhao iHuman Institute, ShanghaiTech University, Shanghai, China School of Life Science and Technology, ShanghaiTech University, Shanghai, China
Xingxu Huang Zhejiang Lab, Hangzhou, Zhejiang, China School of Life Science and Technology, ShanghaiTech University, Shanghai, China
Shiqiang Zhu Zhejiang Lab, Hangzhou, Zhejiang, China
Jun Zhang State Key Laboratory of Reproductive Medicine and Offspring Health, Women's Hospital of Nanjing Medical University, Nanjing Maternity and Child Health Care Hospital, Nanjing Medical University, Nanjing, Jiangsu, China
Wenjie Shu Bioinformatics Center of AMMS, Beijing, China
Shengqi Wang Bioinformatics Center of AMMS, Beijing, China

Dereli O, Kuru N, Akkoyun E, Bircan A, Tastan O, Adebali O. PHACTboost: A Phylogeny-Aware Pathogenicity Predictor for Missense Mutations via Boosting. Mol Biol Evol 2024;41:msae136. [PMID: 38934805 PMCID: PMC11251492 DOI: 10.1093/molbev/msae136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2023] [Revised: 05/30/2024] [Accepted: 06/24/2024] [Indexed: 06/28/2024] Open

Zhou Z, Zhang L, Yu Y, Wu B, Li M, Hong L, Tan P. Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning. Nat Commun 2024;15:5566. [PMID: 38956442 PMCID: PMC11219809 DOI: 10.1038/s41467-024-49798-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/02/2024] [Accepted: 06/11/2024] [Indexed: 07/04/2024] Open

Cocco S, Posani L, Monasson R. Functional effects of mutations in proteins can be predicted and interpreted by guided selection of sequence covariation information. Proc Natl Acad Sci U S A 2024;121:e2312335121. [PMID: 38889151 PMCID: PMC11214004 DOI: 10.1073/pnas.2312335121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Accepted: 04/21/2024] [Indexed: 06/20/2024] Open

Gouliaev F, Jonsson N, Gersing S, Lisby M, Lindorff-Larsen K, Hartmann-Petersen R. Destabilization and Degradation of a Disease-Linked PGM1 Protein Variant. Biochemistry 2024;63:1423-1433. [PMID: 38743592 DOI: 10.1021/acs.biochem.4c00042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/16/2024]

Yee SW, Ferrández-Peral L, Alentorn-Moron P, Fontsere C, Ceylan M, Koleske ML, Handin N, Artegoitia VM, Lara G, Chien HC, Zhou X, Dainat J, Zalevsky A, Sali A, Brand CM, Wolfreys FD, Yang J, Gestwicki JE, Capra JA, Artursson P, Newman JW, Marquès-Bonet T, Giacomini KM. Illuminating the function of the orphan transporter, SLC22A10, in humans and other primates. Nat Commun 2024;15:4380. [PMID: 38782905 PMCID: PMC11116522 DOI: 10.1038/s41467-024-48569-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2023] [Accepted: 05/06/2024] [Indexed: 05/25/2024] Open

Affiliation(s)

Sook Wah Yee Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Luis Ferrández-Peral Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
Pol Alentorn-Moron Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
Claudia Fontsere Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Øster Farimagsgade 5A, 1352, Copenhagen, Denmark
Merve Ceylan Department of Pharmacy, Uppsala University, Uppsala, Sweden
Megan L Koleske Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Niklas Handin Department of Pharmacy, Uppsala University, Uppsala, Sweden
Virginia M Artegoitia United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA, 95616, USA
Giovanni Lara Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Huan-Chieh Chien Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Xujia Zhou Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Jacques Dainat Joint Research Unit for Infectious Diseases and Vectors Ecology Genetics Evolution and Control (MIVEGEC), University of Montpellier, French National Center for Scientific Research (CNRS 5290), French National Research Institute for Sustainable Development (IRD 224), 911 Avenue Agropolis, BP 64501, 34394, Montpellier Cedex 5, France
Arthur Zalevsky Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Andrej Sali Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA Quantitative Biosciences Institute (QBI), University of California, San Francisco, San Francisco, CA, US
Colin M Brand Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
Finn D Wolfreys Department of Ophthalmology, University of California, San Francisco, CA, USA
Jia Yang Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Jason E Gestwicki Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA Institute for Neurodegenerative Diseases, University of California, San Francisco, CA, USA
John A Capra Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
Per Artursson Department of Pharmacy, Uppsala University, Uppsala, Sweden Science for Life Laboratories, Uppsala University, Uppsala, Sweden
John W Newman United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA, 95616, USA Department of Nutrition, University of California, Davis, Davis, CA, 95616, USA
Tomàs Marquès-Bonet Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain CNAG, Centro Nacional de Analisis Genomico, Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193, Cerdanyola del Vallès, Barcelona, Spain
Kathleen M Giacomini Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA. Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA.

Grønbæk-Thygesen M, Voutsinos V, Johansson KE, Schulze TK, Cagiada M, Pedersen L, Clausen L, Nariya S, Powell RL, Stein A, Fowler DM, Lindorff-Larsen K, Hartmann-Petersen R. Deep mutational scanning reveals a correlation between degradation and toxicity of thousands of aspartoacylase variants. Nat Commun 2024;15:4026. [PMID: 38740822 DOI: 10.1038/s41467-024-48481-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Accepted: 05/02/2024] [Indexed: 05/16/2024] Open

Klink GV, Kalinina OV, Bazykin GA. Changing selection on amino acid substitutions in Gag protein between major HIV-1 subtypes. Virus Evol 2024;10:veae036. [PMID: 38808036 PMCID: PMC11131029 DOI: 10.1093/ve/veae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/02/2023] [Revised: 12/27/2023] [Accepted: 04/28/2024] [Indexed: 05/30/2024] Open

Livesey BJ, Badonyi M, Dias M, Frazer J, Kumar S, Lindorff-Larsen K, McCandlish DM, Orenbuch R, Shearer CA, Muffley L, Foreman J, Glazer AM, Lehner B, Marks DS, Roth FP, Rubin AF, Starita LM, Marsh JA. Guidelines for releasing a variant effect predictor. arXiv 2024:arXiv:2404.10807v1. [PMID: 38699161 PMCID: PMC11065047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/05/2024]

Affiliation(s)

Benjamin J. Livesey MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
Mihaly Badonyi MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
Mafalda Dias Centre for Genomic Regulation (CRG),The Barcelona Institute of Science and Technology, Barcelona, Spain
Jonathan Frazer Centre for Genomic Regulation (CRG),The Barcelona Institute of Science and Technology, Barcelona, Spain
Sushant Kumar Department of Medical Biophysics, University of Toronto; Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
Kresten Lindorff-Larsen Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
David M. McCandlish Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
Rose Orenbuch Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Courtney A. Shearer Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Lara Muffley Department of Genome Sciences, University of Washington and the Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Julia Foreman European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Andrew M. Glazer Vanderbilt University Medical Center, Nashville, TN, USA
Ben Lehner Wellcome Sanger Institute, Cambridge, UK; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Debora S. Marks Department of Systems Biology, Harvard Medical School, Boston, MA, USA Broad Institute of MIT and Harvard, Boston, MA, USA
Frederick P. Roth Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Alan F. Rubin Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research; Department of Medical Biology, University of Melbourne, Parkville, Australia
Lea M. Starita Department of Genome Sciences, University of Washington and the Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Joseph A. Marsh MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK

Grønbæk-Thygesen M, Hartmann-Petersen R. Cellular and molecular mechanisms of aspartoacylase and its role in Canavan disease. Cell Biosci 2024;14:45. [PMID: 38582917 PMCID: PMC10998430 DOI: 10.1186/s13578-024-01224-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/15/2023] [Accepted: 03/24/2024] [Indexed: 04/08/2024] Open

Dibyachintan S, Dube AK, Bradley D, Lemieux P, Dionne U, Landry CR. Cryptic genetic variation shapes the fate of gene duplicates in a protein interaction network. bioRxiv 2024:2024.02.23.581840. [PMID: 38464075 PMCID: PMC10925128 DOI: 10.1101/2024.02.23.581840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/12/2024]

Affiliation(s)

Soham Dibyachintan PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
Alexandre K Dube PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada Département de Biologie, Université Laval, Québec, QC, Canada
David Bradley PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada Département de Biologie, Université Laval, Québec, QC, Canada
Pascale Lemieux PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
Ugo Dionne PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Current affiliation: Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
Christian R Landry PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada Département de Biologie, Université Laval, Québec, QC, Canada

Wang X, Li A, Li X, Cui H. Empowering Protein Engineering through Recombination of Beneficial Substitutions. Chemistry 2024;30:e202303889. [PMID: 38288640 DOI: 10.1002/chem.202303889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/04/2024] [Indexed: 02/24/2024]

Saez-Matia A, Ibarluzea MG, M-Alicante S, Muguruza-Montero A, Nuñez E, Ramis R, Ballesteros OR, Lasa-Goicuria D, Fons C, Gallego M, Casis O, Leonardo A, Bergara A, Villarroel A. MLe-KCNQ2: An Artificial Intelligence Model for the Prognosis of Missense KCNQ2 Gene Variants. Int J Mol Sci 2024;25:2910. [PMID: 38474157 DOI: 10.3390/ijms25052910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Revised: 02/27/2024] [Accepted: 02/29/2024] [Indexed: 03/14/2024] Open

Abstract

Despite the increasing availability of genomic data and enhanced data analysis procedures, predicting the severity of associated diseases remains elusive in the absence of clinical descriptors. To address this challenge, we have focused on the KV7.2 voltage-gated potassium channel gene (KCNQ2), known for its link to developmental delays and various epilepsies, including self-limited benign familial neonatal epilepsy and epileptic encephalopathy. Genome-wide tools often exhibit a tendency to overestimate deleterious mutations, frequently overlooking tolerated variants, and lack the capacity to discriminate variant severity. This study introduces a novel approach by evaluating multiple machine learning (ML) protocols and descriptors. The combination of genomic information with a novel Variant Frequency Index (VFI) builds a robust foundation for constructing reliable gene-specific ML models. The ensemble model, MLe-KCNQ2, formed through logistic regression, support vector machine, random forest and gradient boosting algorithms, achieves specificity and sensitivity values surpassing 0.95 (AUC-ROC > 0.98). The ensemble MLe-KCNQ2 model also categorizes pathogenic mutations as benign or severe, with an area under the receiver operating characteristic curve (AUC-ROC) above 0.67. This study not only presents a transferable methodology for accurately classifying KCNQ2 missense variants, but also provides valuable insights for clinical counseling and aids in the determination of variant severity. The research context emphasizes the necessity of precise variant classification, especially for genes like KCNQ2, contributing to the broader understanding of gene-specific challenges in the field of genomic research. The MLe-KCNQ2 model stands as a promising tool for enhancing clinical decision making and prognosis in the realm of KCNQ2-related pathologies.

Clausen L, Voutsinos V, Cagiada M, Johansson KE, Grønbæk-Thygesen M, Nariya S, Powell RL, Have MKN, Oestergaard VH, Stein A, Fowler DM, Lindorff-Larsen K, Hartmann-Petersen R. A mutational atlas for Parkin proteostasis. Nat Commun 2024;15:1541. [PMID: 38378758 PMCID: PMC10879094 DOI: 10.1038/s41467-024-45829-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/05/2023] [Accepted: 02/01/2024] [Indexed: 02/22/2024] Open

Andorf CM, Haley OC, Hayford RK, Portwood JL, Harding S, Sen S, Cannon EK, Gardiner JM, Kim HS, Woodhouse MR. PanEffect: a pan-genome visualization tool for variant effects in maize. Bioinformatics 2024;40:btae073. [PMID: 38337024 PMCID: PMC10881103 DOI: 10.1093/bioinformatics/btae073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/16/2023] [Revised: 01/30/2024] [Accepted: 02/06/2024] [Indexed: 02/12/2024] Open

Fannjiang C, Listgarten J. Is Novelty Predictable? Cold Spring Harb Perspect Biol 2024;16:a041469. [PMID: 38052497 PMCID: PMC10835614 DOI: 10.1101/cshperspect.a041469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/07/2023]

Nourbakhsh M, Degn K, Saksager A, Tiberti M, Papaleo E. Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks. Brief Bioinform 2024;25:bbad519. [PMID: 38261338 PMCID: PMC10805075 DOI: 10.1093/bib/bbad519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2023] [Revised: 11/27/2023] [Accepted: 12/11/2023] [Indexed: 01/24/2024] Open

Weissenow K, Rost B. Rendering protein mutation movies with MutAmore. BMC Bioinformatics 2023;24:469. [PMID: 38087198 PMCID: PMC10714560 DOI: 10.1186/s12859-023-05610-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/17/2023] [Accepted: 12/08/2023] [Indexed: 12/18/2023] Open

Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. bioRxiv 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Indexed: 12/19/2023]

Notin P, Marks DS, Weitzman R, Gal Y. ProteinNPT: Improving Protein Property Prediction and Design with Non-Parametric Transformers. bioRxiv 2023:2023.12.06.570473. [PMID: 38106034 PMCID: PMC10723423 DOI: 10.1101/2023.12.06.570473] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 12/19/2023]

Abakarova M, Marquet C, Rera M, Rost B, Laine E. Alignment-based Protein Mutational Landscape Prediction: Doing More with Less. Genome Biol Evol 2023;15:evad201. [PMID: 37936309 PMCID: PMC10653582 DOI: 10.1093/gbe/evad201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/20/2023] [Revised: 10/27/2023] [Accepted: 11/01/2023] [Indexed: 11/09/2023] Open

Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. bioRxiv 2023:2023.10.08.561337. [PMID: 37873463 PMCID: PMC10592693 DOI: 10.1101/2023.10.08.561337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/25/2023]

Affiliation(s)

David Bradley Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada; Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada; Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada; Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada; Department of Biology, Université Laval, Québec, QC, Canada
Alexander Hogrebe Department of Genome Sciences, University of Washington, Seattle, WA, USA
Rohan Dandage Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada; Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada; Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada; Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada; Department of Biology, Université Laval, Québec, QC, Canada
Alexandre K Dubé Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada; Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada; Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada; Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada; Department of Biology, Université Laval, Québec, QC, Canada
Mario Leutert Department of Genome Sciences, University of Washington, Seattle, WA, USA; Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
Ugo Dionne Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada; Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada; Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada; Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada; Department of Biology, Université Laval, Québec, QC, Canada
Alexis Chang Department of Genome Sciences, University of Washington, Seattle, WA, USA
Judit Villén Department of Genome Sciences, University of Washington, Seattle, WA, USA
Christian R Landry Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada; Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada; Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada; Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada; Department of Biology, Université Laval, Québec, QC, Canada

Posani L, Rizzato F, Monasson R, Cocco S. Infer global, predict local: Quantity-relevance trade-off in protein fitness predictions from sequence data. PLoS Comput Biol 2023;19:e1011521. [PMID: 37883593 PMCID: PMC10645369 DOI: 10.1371/journal.pcbi.1011521] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 11/14/2023] [Accepted: 09/15/2023] [Indexed: 10/28/2023] Open

Cheng J, Novati G, Pan J, Bycroft C, Žemgulytė A, Applebaum T, Pritzel A, Wong LH, Zielinski M, Sargeant T, Schneider RG, Senior AW, Jumper J, Hassabis D, Kohli P, Avsec Ž. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 2023;381:eadg7492. [PMID: 37733863 DOI: 10.1126/science.adg7492] [Citation(s) in RCA: 225] [Impact Index Per Article: 225.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 08/23/2023] [Indexed: 09/23/2023]

Wang H, Zang Y, Kang Y, Zhang J, Zhang L, Zhang S. ETLD: an encoder-transformation layer-decoder architecture for protein contact and mutation effects prediction. Brief Bioinform 2023;24:bbad290. [PMID: 37598423 DOI: 10.1093/bib/bbad290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/21/2023] [Accepted: 07/26/2023] [Indexed: 08/22/2023] Open

Yee SW, Ferrández-Peral L, Alentorn P, Fontsere C, Ceylan M, Koleske ML, Handin N, Artegoitia VM, Lara G, Chien HC, Zhou X, Dainat J, Zalevsky A, Sali A, Brand CM, Capra JA, Artursson P, Newman JW, Marques-Bonet T, Giacomini KM. Illuminating the Function of the Orphan Transporter, SLC22A10 in Humans and Other Primates. RESEARCH SQUARE 2023:rs.3.rs-3263845. [PMID: 37790518 PMCID: PMC10543398 DOI: 10.21203/rs.3.rs-3263845/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]

Affiliation(s)

Sook Wah Yee Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Luis Ferrández-Peral Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
Pol Alentorn Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
Claudia Fontsere Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain; Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Øster Farimagsgade 5A, 1352 Copenhagen, Denmark
Merve Ceylan Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
Megan L. Koleske Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Niklas Handin Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
Virginia M. Artegoitia United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA
Giovanni Lara Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Huan-Chieh Chien Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Xujia Zhou Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Jacques Dainat Joint Research Unit for Infectious Diseases and Vectors Ecology Genetics Evolution and Control (MIVEGEC), University of Montpellier, French National Center for Scientific Research (CNRS 5290), French National Research Institute for Sustainable Development (IRD 224), 911 Avenue Agropolis, BP 64501, 34394 Montpellier Cedex 5, France
Arthur Zalevsky Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Andrej Sali Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF Box 2880 600 16th St, San Francisco, CA 94143, United States; Quantitative Biosciences Institute (QBI), University of California, San Francisco, 1700 4th St, San Francisco, CA, United States
Colin M. Brand Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
John A. Capra Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
Per Artursson Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
John W. Newman United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA; Department of Nutrition, University of California, Davis, Davis, CA 95616, USA; UC Davis West Coast Metabolomics Center, Davis, CA 95616, USA
Tomas Marques-Bonet Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain; CNAG, Centro Nacional de Analisis Genomico, Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
Kathleen M. Giacomini Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA

Yee SW, Ferrández-Peral L, Alentorn P, Fontsere C, Ceylan M, Koleske ML, Handin N, Artegoitia VM, Lara G, Chien HC, Zhou X, Dainat J, Zalevsky A, Sali A, Brand CM, Capra JA, Artursson P, Newman JW, Marques-Bonet T, Giacomini KM. Illuminating the Function of the Orphan Transporter, SLC22A10 in Humans and Other Primates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.08.552553. [PMID: 37609337 PMCID: PMC10441401 DOI: 10.1101/2023.08.08.552553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]

Affiliation(s)

Sook Wah Yee Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Luis Ferrández-Peral Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
Pol Alentorn Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
Claudia Fontsere Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain; Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Øster Farimagsgade 5A, 1352 Copenhagen, Denmark
Merve Ceylan Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
Megan L. Koleske Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Niklas Handin Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
Virginia M. Artegoitia United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA
Giovanni Lara Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Huan-Chieh Chien Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Xujia Zhou Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Jacques Dainat Joint Research Unit for Infectious Diseases and Vectors Ecology Genetics Evolution and Control (MIVEGEC), University of Montpellier, French National Center for Scientific Research (CNRS 5290), French National Research Institute for Sustainable Development (IRD 224), 911 Avenue Agropolis, BP 64501, 34394 Montpellier Cedex 5, France
Arthur Zalevsky Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
Andrej Sali Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF Box 2880 600 16th St, San Francisco, CA 94143, United States; Quantitative Biosciences Institute (QBI), University of California, San Francisco, 1700 4th St, San Francisco, CA, United States
Colin M. Brand Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
John A. Capra Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
Per Artursson Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
John W. Newman United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA; Department of Nutrition, University of California, Davis, Davis, CA 95616, USA; UC Davis West Coast Metabolomics Center, Davis, CA 95616, USA
Tomas Marques-Bonet Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain; CNAG, Centro Nacional de Analisis Genomico, Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
Kathleen M. Giacomini Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA

Jagota M, Ye C, Albors C, Rastogi R, Koehl A, Ioannidis N, Song YS. Cross-protein transfer learning substantially improves disease variant prediction. Genome Biol 2023;24:182. [PMID: 37550700 PMCID: PMC10408151 DOI: 10.1186/s13059-023-03024-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 07/27/2023] [Indexed: 08/09/2023] Open

Tsuboyama K, Dauparas J, Chen J, Laine E, Mohseni Behbahani Y, Weinstein JJ, Mangan NM, Ovchinnikov S, Rocklin GJ. Mega-scale experimental analysis of protein folding stability in biology and design. Nature 2023;620:434-444. [PMID: 37468638 PMCID: PMC10412457 DOI: 10.1038/s41586-023-06328-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 06/14/2023] [Indexed: 07/21/2023]

Nagar N, Tubiana J, Loewenthal G, Wolfson HJ, Ben Tal N, Pupko T. EvoRator2: Predicting Site-specific Amino Acid Substitutions Based on Protein Structural Information Using Deep Learning. J Mol Biol 2023;435:168155. [PMID: 37356902 DOI: 10.1016/j.jmb.2023.168155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 05/13/2023] [Accepted: 05/17/2023] [Indexed: 06/27/2023]

Cagiada M, Bottaro S, Lindemose S, Schenstrøm SM, Stein A, Hartmann-Petersen R, Lindorff-Larsen K. Discovering functionally important sites in proteins. Nat Commun 2023;14:4175. [PMID: 37443362 PMCID: PMC10345196 DOI: 10.1038/s41467-023-39909-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 07/02/2023] [Indexed: 07/15/2023] Open

Mohseni Behbahani Y, Laine E, Carbone A. Deep Local Analysis deconstructs protein-protein interfaces and accurately estimates binding affinity changes upon mutation. Bioinformatics 2023;39:i544-i552. [PMID: 37387162 DOI: 10.1093/bioinformatics/btad231] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open

Kampmeyer C, Grønbæk-Thygesen M, Oelerich N, Tatham MH, Cagiada M, Lindorff-Larsen K, Boomsma W, Hofmann K, Hartmann-Petersen R. Lysine deserts prevent adventitious ubiquitylation of ubiquitin-proteasome components. Cell Mol Life Sci 2023;80:143. [PMID: 37160462 PMCID: PMC10169902 DOI: 10.1007/s00018-023-04782-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/15/2023] [Accepted: 04/17/2023] [Indexed: 05/11/2023]

Gersing S, Cagiada M, Gebbia M, Gjesing AP, Coté AG, Seesankar G, Li R, Tabet D, Weile J, Stein A, Gloyn AL, Hansen T, Roth FP, Lindorff-Larsen K, Hartmann-Petersen R. A comprehensive map of human glucokinase variant activity. Genome Biol 2023;24:97. [PMID: 37101203 PMCID: PMC10131484 DOI: 10.1186/s13059-023-02935-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 04/10/2023] [Indexed: 04/28/2023] Open

Affiliation(s)

Sarah Gersing The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
Matteo Cagiada The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
Marinella Gebbia Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
Anette P Gjesing Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Atina G Coté Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
Gireesh Seesankar Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
Roujia Li Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
Daniel Tabet Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
Jochen Weile Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
Amelie Stein The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
Anna L Gloyn Division of Endocrinology, Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA; Stanford Diabetes Research Center, Stanford University, Stanford, CA, USA
Torben Hansen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Frederick P Roth Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
Kresten Lindorff-Larsen The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
Rasmus Hartmann-Petersen The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark

Abildgaard AB, Nielsen SV, Bernstein I, Stein A, Lindorff-Larsen K, Hartmann-Petersen R. Lynch syndrome, molecular mechanisms and variant classification. Br J Cancer 2023;128:726-734. [PMID: 36434153 PMCID: PMC9978028 DOI: 10.1038/s41416-022-02059-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 10/31/2022] [Accepted: 11/02/2022] [Indexed: 11/27/2022] Open

Cisneros AF, Gagnon-Arsenault I, Dubé AK, Després PC, Kumar P, Lafontaine K, Pelletier JN, Landry CR. Epistasis between promoter activity and coding mutations shapes gene evolvability. SCIENCE ADVANCES 2023;9:eadd9109. [PMID: 36735790 PMCID: PMC9897669 DOI: 10.1126/sciadv.add9109] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 12/22/2022] [Indexed: 06/01/2023]

Affiliation(s)

Angel F. Cisneros Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada
Isabelle Gagnon-Arsenault Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada; Département de biologie, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada
Alexandre K. Dubé Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada; Département de biologie, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada
Philippe C. Després Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada
Pradum Kumar Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
Kiana Lafontaine PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Département de biochimie et de médecine moléculaire, Faculté de médecine, Université de Montréal, H3C 3J7, Montréal, Canada
Joelle N. Pelletier PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Département de biochimie et de médecine moléculaire, Faculté de médecine, Université de Montréal, H3C 3J7, Montréal, Canada; Département de chimie, Faculté des arts et des sciences, Université de Montréal, H3C 3J7, Montréal, Canada
Christian R. Landry Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada; Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada; Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada; Département de biologie, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada

Yildirim A, Tekpinar M. Building Quantitative Bridges between Dynamics and Sequences of SARS-CoV-2 Main Protease and a Diverse Set of Thirty-Two Proteins. J Chem Inf Model 2023;63:9-19. [PMID: 36513349 DOI: 10.1021/acs.jcim.2c01206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Tiemann JKS, Zschach H, Lindorff-Larsen K, Stein A. Interpreting the molecular mechanisms of disease variants in human transmembrane proteins. Biophys J 2023:S0006-3495(22)03941-8. [PMID: 36600598 DOI: 10.1016/j.bpj.2022.12.031] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/19/2022] [Accepted: 12/21/2022] [Indexed: 01/06/2023] Open

Olenyi T, Marquet C, Heinzinger M, Kröger B, Nikolova T, Bernhofer M, Sändig P, Schütze K, Littmann M, Mirdita M, Steinegger M, Dallago C, Rost B. LambdaPP: Fast and accessible protein-specific phenotype predictions. Protein Sci 2023;32:e4524. [PMID: 36454227 PMCID: PMC9793974 DOI: 10.1002/pro.4524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 11/09/2022] [Accepted: 11/21/2022] [Indexed: 12/04/2022]

Affiliation(s)

Tobias Olenyi TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Garching, Germany
Céline Marquet TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Garching, Germany
Michael Heinzinger TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Garching, Germany
Benjamin Kröger TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany
Tiha Nikolova TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany
Michael Bernhofer TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Garching, Germany
Philip Sändig TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany
Konstantin Schütze TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany
Maria Littmann TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany
Milot Mirdita School of Biological Sciences, Seoul National University, Seoul, South Korea
Martin Steinegger School of Biological Sciences, Seoul National University, Seoul, South Korea; Korea Artificial Intelligence Institute, Seoul National University, Seoul, South Korea; Korea Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea
Christian Dallago TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany; VantAI, New York, USA
Burkhard Rost TUM (Technical University of Munich), Department of Informatics, Bioinformatics & Computational Biology - i12, Garching, Germany; Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, 85748 Garching/Munich, Germany; TUM School of Life Sciences Weihenstephan (WZW), Freising, Germany

Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022;12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open

Gjesing AP, Engelbrechtsen L, Thuesen ACB, Have CT, Hollensted M, Grarup N, Linneberg A, Nielsen JS, Christensen LB, Thomsen RW, Johansson KE, Cagiada M, Gersing S, Hartmann-Petersen R, Lindorff-Larsen K, Vaag A, Sørensen HT, Brandslund I, Beck-Nielsen H, Pedersen O, Rungby J, Hansen T. 14-fold increased prevalence of rare glucokinase gene variant carriers in unselected Danish patients with newly diagnosed type 2 diabetes. Diabetes Res Clin Pract 2022;194:110159. [PMID: 36400171 DOI: 10.1016/j.diabres.2022.110159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 11/08/2022] [Accepted: 11/11/2022] [Indexed: 11/17/2022]

Affiliation(s)

Anette P Gjesing Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Line Engelbrechtsen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Department of Gynecology and Obstetrics, Herlev Hospital, Denmark
Anne Cathrine B Thuesen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Steno Diabetes Center Copenhagen, Gentofte, Denmark
Christian T Have Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Mette Hollensted Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Niels Grarup Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Allan Linneberg Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Copenhagen, Denmark; Department of Clinical Experimental Research, Rigshospitalet, Glostrup, Denmark; Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Jens Steen Nielsen The Danish Centre for Strategic Research in Type 2 Diabetes (DD2), Steno Diabetes Center Odense, Odense University Hospital, Odense, Denmark
Lotte B Christensen Department of Clinical Epidemiology, Aarhus University Hospital and Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
Reimar W Thomsen Department of Clinical Epidemiology, Aarhus University Hospital and Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
Kristoffer E Johansson The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark
Matteo Cagiada The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark
Sarah Gersing The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark
Rasmus Hartmann-Petersen The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark
Kresten Lindorff-Larsen The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark
Allan Vaag Steno Diabetes Center Copenhagen, Gentofte, Denmark
Henrik T Sørensen Department of Clinical Epidemiology, Aarhus University Hospital and Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
Ivan Brandslund Department of Clinical Biochemistry, Hospital Lillebaelt, Vejle, Denmark; Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
Henning Beck-Nielsen Diabetes Research Centre, Department of Endocrinology, Centre for Individualized Medicine in Arterial Diseases, Odense University Hospital, Odense, Denmark
Oluf Pedersen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Jørgen Rungby The Danish Centre for Strategic Research in Type 2 Diabetes (DD2), Steno Diabetes Center Odense, Odense University Hospital, Odense, Denmark; Department of Endocrinology and Copenhagen Center for Translational Research, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark
Torben Hansen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.

Pacheco-Garcia JL, Cagiada M, Tienne-Matos K, Salido E, Lindorff-Larsen K, Pey AL. Effect of naturally-occurring mutations on the stability and function of cancer-associated NQO1: Comparison of experiments and computation. Front Mol Biosci 2022;9:1063620. [PMID: 36504709 PMCID: PMC9730889 DOI: 10.3389/fmolb.2022.1063620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2022] [Accepted: 11/03/2022] [Indexed: 11/25/2022]

The Cancermuts software package for the prioritization of missense cancer variants: a case study of AMBRA1 in melanoma. Cell Death Dis 2022;13:872. [PMID: 36243772 PMCID: PMC9569343 DOI: 10.1038/s41419-022-05318-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 05/25/2022] [Revised: 09/27/2022] [Accepted: 10/03/2022] [Indexed: 11/07/2022]

Azbukina N, Zharikova A, Ramensky V. Intragenic compensation through the lens of deep mutational scanning. Biophys Rev 2022;14:1161-1182. [PMID: 36345285 PMCID: PMC9636336 DOI: 10.1007/s12551-022-01005-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 08/14/2022] [Accepted: 09/26/2022] [Indexed: 12/20/2022]

Marquet C, Heinzinger M, Olenyi T, Dallago C, Erckert K, Bernhofer M, Nechaev D, Rost B. Embeddings from protein language models predict conservation and variant effects. Hum Genet 2022;141:1629-1647. [PMID: 34967936 PMCID: PMC8716573 DOI: 10.1007/s00439-021-02411-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Received: 06/01/2021] [Accepted: 12/06/2021] [Indexed: 12/13/2022]

Abstract

The emergence of SARS-CoV-2 variants stressed the demand for tools allowing to interpret the effect of single amino acid variants (SAVs) on protein function. While Deep Mutational Scanning (DMS) sets continue to expand our understanding of the mutational landscape of single proteins, the results continue to challenge analyses. Protein Language Models (pLMs) use the latest deep learning (DL) algorithms to leverage growing databases of protein sequences. These methods learn to predict missing or masked amino acids from the context of entire sequence regions. Here, we used pLM representations (embeddings) to predict sequence conservation and SAV effects without multiple sequence alignments (MSAs). Embeddings alone predicted residue conservation almost as accurately from single sequences as ConSeq using MSAs (two-state Matthews Correlation Coefficient, MCC, for ProtT5 embeddings of 0.596 ± 0.006 vs. 0.608 ± 0.006 for ConSeq). Inputting the conservation prediction along with BLOSUM62 substitution scores and pLM mask reconstruction probabilities into a simplistic logistic regression (LR) ensemble for Variant Effect Score Prediction without Alignments (VESPA) predicted SAV effect magnitude without any optimization on DMS data. Comparing predictions for a standard set of 39 DMS experiments to other methods (incl. ESM-1v, DeepSequence, and GEMME) revealed our approach as competitive with the state-of-the-art (SOTA) methods using MSA input. No method outperformed all others, neither consistently nor statistically significantly, independently of the performance measure applied (Spearman and Pearson correlation). Finally, we investigated binary effect predictions on DMS experiments for four human proteins. Overall, embedding-based methods have become competitive with methods relying on MSAs for SAV effect prediction at a fraction of the costs in computing/energy. Our method predicted SAV effects for the entire human proteome (~20k proteins) within 40 min on one Nvidia Quadro RTX 8000. All methods and data sets are freely available for local and online execution through bioembeddings.com, https://github.com/Rostlab/VESPA, and PredictProtein.

Affiliation(s)

Céline Marquet Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Michael Heinzinger Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Tobias Olenyi Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Christian Dallago Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Kyra Erckert Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Michael Bernhofer Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Dmitrii Nechaev Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
Burkhard Rost Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany; Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, Garching, 85748, Munich, Germany; TUM School of Life Sciences Weihenstephan (TUM-WZW), Alte Akademie 8, Freising, Germany

Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes. Nat Commun 2022;13:4030. [PMID: 35821377 PMCID: PMC9276797 DOI: 10.1038/s41467-022-31643-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/19/2021] [Accepted: 06/27/2022] [Indexed: 12/05/2022]

Kuru N, Dereli O, Akkoyun E, Bircan A, Tastan O, Adebali O. PHACT: Phylogeny-aware computing of tolerance for missense mutations. Mol Biol Evol 2022;39:6593375. [PMID: 35639618 PMCID: PMC9178230 DOI: 10.1093/molbev/msac114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]