Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Andreeva A, Kulesha E, Gough J, Murzin AG. The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures. Nucleic Acids Res 2020;48:D376-D382. [PMID: 31724711 PMCID: PMC7139981 DOI: 10.1093/nar/gkz1064] [Citation(s) in RCA: 186] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 10/17/2019] [Accepted: 10/30/2019] [Indexed: 12/13/2022] Open

For:	Andreeva A, Kulesha E, Gough J, Murzin AG. The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures. Nucleic Acids Res 2020;48:D376-D382. [PMID: 31724711 PMCID: PMC7139981 DOI: 10.1093/nar/gkz1064] [Citation(s) in RCA: 186] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 10/17/2019] [Accepted: 10/30/2019] [Indexed: 12/13/2022] Open

Number

Cited by Other Article(s)

Li W, Almirantis Y, Provata A. Range-limited Heaps' law for functional DNA words in the human genome. J Theor Biol 2024;592:111878. [PMID: 38901778 DOI: 10.1016/j.jtbi.2024.111878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 05/31/2024] [Accepted: 06/10/2024] [Indexed: 06/22/2024]

Ahdritz G, Bouatta N, Floristean C, Kadyan S, Xia Q, Gerecke W, O'Donnell TJ, Berenberg D, Fisk I, Zanichelli N, Zhang B, Nowaczynski A, Wang B, Stepniewska-Dziubinska MM, Zhang S, Ojewole A, Guney ME, Biderman S, Watkins AM, Ra S, Lorenzo PR, Nivon L, Weitzner B, Ban YEA, Chen S, Zhang M, Li C, Song SL, He Y, Sorger PK, Mostaque E, Zhang Z, Bonneau R, AlQuraishi M. OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization. Nat Methods 2024;21:1514-1524. [PMID: 38744917 DOI: 10.1038/s41592-024-02272-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 04/03/2024] [Indexed: 05/16/2024]

Affiliation(s)

Gustaf Ahdritz Department of Systems Biology, Columbia University, New York, NY, USA Harvard University, Cambridge, MA, USA
Nazim Bouatta Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.
Christina Floristean Department of Systems Biology, Columbia University, New York, NY, USA
Sachin Kadyan Department of Systems Biology, Columbia University, New York, NY, USA
Qinghui Xia Department of Systems Biology, Columbia University, New York, NY, USA
William Gerecke Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA
Timothy J O'Donnell Icahn School of Medicine at Mount Sinai, New York, NY, USA
Daniel Berenberg Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
Ian Fisk Flatiron Institute, New York, NY, USA
Niccolò Zanichelli OpenBioML, Cambridge, MA, USA
Bo Zhang Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, UT, USA
Arkadiusz Nowaczynski NVIDIA, Santa Clara, CA, USA
Bei Wang NVIDIA, Santa Clara, CA, USA
Marta M Stepniewska-Dziubinska NVIDIA, Santa Clara, CA, USA
Shang Zhang NVIDIA, Santa Clara, CA, USA
Adegoke Ojewole NVIDIA, Santa Clara, CA, USA
Murat Efe Guney NVIDIA, Santa Clara, CA, USA
Stella Biderman EleutherAI, New York, NY, USA Booz Allen Hamilton, McLean, VA, USA
Andrew M Watkins Prescient Design, Genentech, New York, NY, USA
Stephen Ra Prescient Design, Genentech, New York, NY, USA
Pablo Ribalta Lorenzo NVIDIA, Santa Clara, CA, USA
Lucas Nivon Cyrus Bio, Seattle, WA, USA
Brian Weitzner Outpace Bio, Seattle, WA, USA
Yih-En Andrew Ban Arzeda, Seattle, WA, USA
Shiyang Chen Rutgers University, New Brunswick, NJ, USA
Minjia Zhang University of Illinois at Urbana-Champaign, Champaign, IL, USA
Conglong Li Microsoft, Redmond, WA, USA
Shuaiwen Leon Song Microsoft, Redmond, WA, USA
Yuxiong He Microsoft, Redmond, WA, USA
Peter K Sorger Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA
Emad Mostaque Stability AI, Los Altos, CA, USA
Zhao Zhang Rutgers University, New Brunswick, NJ, USA
Richard Bonneau Prescient Design, Genentech, New York, NY, USA
Mohammed AlQuraishi Department of Systems Biology, Columbia University, New York, NY, USA.

Collapse

Medvedev KE, Schaeffer RD, Grishin NV. DrugDomain: The evolutionary context of drugs and small molecules bound to domains. Protein Sci 2024;33:e5116. [PMID: 38979784 PMCID: PMC11231930 DOI: 10.1002/pro.5116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 06/27/2024] [Accepted: 06/29/2024] [Indexed: 07/10/2024]

Umuhire Juru A, Ghirlando R, Zhang J. Structural basis of tRNA recognition by the widespread OB fold. Nat Commun 2024;15:6385. [PMID: 39075051 PMCID: PMC11286949 DOI: 10.1038/s41467-024-50730-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 07/18/2024] [Indexed: 07/31/2024] Open

Schiffrin B, Calabrese AN. Chaperones in concert: Orchestrating co-translational protein folding in the cell. Mol Cell 2024;84:2403-2404. [PMID: 38996455 DOI: 10.1016/j.molcel.2024.06.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 06/18/2024] [Accepted: 06/19/2024] [Indexed: 07/14/2024]

Wang X, Zhang Y, Li Z, Duan Z, Guo M, Wang Z, Zhu F, Xue W. PROSCA: an online platform for humanized scaffold mining facilitating rational protein engineering. Nucleic Acids Res 2024;52:W272-W279. [PMID: 38738624 PMCID: PMC11223824 DOI: 10.1093/nar/gkae384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/23/2024] [Accepted: 04/29/2024] [Indexed: 05/14/2024] Open

Goverde CA, Pacesa M, Goldbach N, Dornfeld LJ, Balbi PEM, Georgeon S, Rosset S, Kapoor S, Choudhury J, Dauparas J, Schellhaas C, Kozlov S, Baker D, Ovchinnikov S, Vecchio AJ, Correia BE. Computational design of soluble and functional membrane protein analogues. Nature 2024;631:449-458. [PMID: 38898281 PMCID: PMC11236705 DOI: 10.1038/s41586-024-07601-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 05/23/2024] [Indexed: 06/21/2024]

Affiliation(s)

Casper A Goverde Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Martin Pacesa Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Nicolas Goldbach Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Lars J Dornfeld Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Petra E M Balbi Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Sandrine Georgeon Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Stéphane Rosset Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Srajan Kapoor Department of Structural Biology, University at Buffalo, Buffalo, NY, USA
Jagrity Choudhury Department of Structural Biology, University at Buffalo, Buffalo, NY, USA
Justas Dauparas Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Christian Schellhaas Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
Simon Kozlov Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
David Baker Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
Sergey Ovchinnikov Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
Alex J Vecchio Department of Structural Biology, University at Buffalo, Buffalo, NY, USA
Bruno E Correia Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland.

Collapse

Grossman AS, Gell DA, Wu DG, Carper DL, Hettich RL, Goodrich-Blair H. Bacterial hemophilin homologs and their specific type eleven secretor proteins have conserved roles in heme capture and are diversifying as a family. J Bacteriol 2024;206:e0044423. [PMID: 38506530 DOI: 10.1128/jb.00444-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 02/18/2024] [Indexed: 03/21/2024] Open

Abstract

Cellular life relies on enzymes that require metals, which must be acquired from extracellular sources. Bacteria utilize surface and secreted proteins to acquire such valuable nutrients from their environment. These include the cargo proteins of the type eleven secretion system (T11SS), which have been connected to host specificity, metal homeostasis, and nutritional immunity evasion. This Sec-dependent, Gram-negative secretion system is encoded by organisms throughout the phylum Proteobacteria, including human pathogens Neisseria meningitidis, Proteus mirabilis, Acinetobacter baumannii, and Haemophilus influenzae. Experimentally verified T11SS-dependent cargo include transferrin-binding protein B (TbpB), the hemophilin homologs heme receptor protein C (HrpC), hemophilin A (HphA), the immune evasion protein factor-H binding protein (fHbp), and the host symbiosis factor nematode intestinal localization protein C (NilC). Here, we examined the specificity of T11SS systems for their cognate cargo proteins using taxonomically distributed homolog pairs of T11SS and hemophilin cargo and explored the ligand binding ability of those hemophilin cargo homologs. In vivo expression in Escherichia coli of hemophilin homologs revealed that each is secreted in a specific manner by its cognate T11SS protein. Sequence analysis and structural modeling suggest that all hemophilin homologs share an N-terminal ligand-binding domain with the same topology as the ligand-binding domains of the Haemophilus haemolyticus heme binding protein (Hpl) and HphA. We term this signature feature of this group of proteins the hemophilin ligand-binding domain. Network analysis of hemophilin homologs revealed five subclusters and representatives from four of these showed variable heme-binding activities, which, combined with sequence-structure variation, suggests that hemophilins are diversifying in function.IMPORTANCEThe secreted protein hemophilin and its homologs contribute to the survival of several bacterial symbionts within their respective host environments. Here, we compared taxonomically diverse hemophilin homologs and their paired Type 11 secretion systems (T11SS) to determine if heme binding and T11SS secretion are conserved characteristics of this family. We establish the existence of divergent hemophilin sub-families and describe structural features that contribute to distinct ligand-binding behaviors. Furthermore, we demonstrate that T11SS are specific for their cognate hemophilin family cargo proteins. Our work establishes that hemophilin homolog-T11SS pairs are diverging from each other, potentially evolving into novel ligand acquisition systems that provide competitive benefits in host niches.

Collapse

Hamamsy T, Morton JT, Blackwell R, Berenberg D, Carriero N, Gligorijevic V, Strauss CEM, Leman JK, Cho K, Bonneau R. Protein remote homology detection and structural alignment using deep learning. Nat Biotechnol 2024;42:975-985. [PMID: 37679542 PMCID: PMC11180608 DOI: 10.1038/s41587-023-01917-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 07/26/2023] [Indexed: 09/09/2023]

Toledo-Patiño S, Goetz SK, Shanmugaratnam S, Höcker B, Farías-Rico JA. Molecular handcraft of a well-folded protein chimera. FEBS Lett 2024;598:1375-1386. [PMID: 38508768 DOI: 10.1002/1873-3468.14856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 02/11/2024] [Accepted: 02/12/2024] [Indexed: 03/22/2024]

Xia Y, Pan X, Shen HB. A comprehensive survey on protein-ligand binding site prediction. Curr Opin Struct Biol 2024;86:102793. [PMID: 38447285 DOI: 10.1016/j.sbi.2024.102793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/18/2024] [Accepted: 02/18/2024] [Indexed: 03/08/2024]

Rajasekaran N, Kaiser CM. Navigating the complexities of multi-domain protein folding. Curr Opin Struct Biol 2024;86:102790. [PMID: 38432063 DOI: 10.1016/j.sbi.2024.102790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 02/11/2024] [Accepted: 02/12/2024] [Indexed: 03/05/2024]

Choudhary P, Feng Z, Berrisford J, Chao H, Ikegawa Y, Peisach E, Piehl DW, Smith J, Tanweer A, Varadi M, Westbrook JD, Young JY, Patwardhan A, Morris KL, Hoch JC, Kurisu G, Velankar S, Burley SK. PDB NextGen Archive: centralizing access to integrated annotations and enriched structural information by the Worldwide Protein Data Bank. Database (Oxford) 2024;2024:baae041. [PMID: 38803272 PMCID: PMC11130521 DOI: 10.1093/database/baae041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/29/2024] [Accepted: 05/14/2024] [Indexed: 05/29/2024]

Affiliation(s)

Preeti Choudhary Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Zukang Feng Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
John Berrisford Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Henry Chao Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
Yasuyo Ikegawa Protein Data Bank Japan, Protein Research Foundation, 3-2, Yamadaoka, Minoh, Osaka 562-8686, Japan
Ezra Peisach Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
Dennis W Piehl Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
James Smith Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
Ahsan Tanweer Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Mihaly Varadi Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
John D Westbrook Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
Jasmine Y Young Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA
Ardan Patwardhan The Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Kyle L Morris The Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Jeffrey C Hoch Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, UConn Health, 263 Farmington Avenue, Farmington, CT 06030-3305, USA
Genji Kurisu Protein Data Bank Japan, Protein Research Foundation, 3-2, Yamadaoka, Minoh, Osaka 562-8686, Japan Protein Data Bank Japan, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita-shi, Osaka 565-0871, Japan
Sameer Velankar Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Stephen K Burley Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Rd., Piscataway, NJ 08854, USA Rutgers Cancer Institute of New Jersey, 195 Little Albany St., New Brunswick, NJ 08901, USA Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 123 Bevier Rd., Piscataway, NJ 08854, USA

Collapse

Kazakov AS, Rastrygina VA, Vologzhannikova AA, Zemskova MY, Bobrova LA, Deryusheva EI, Permyakova ME, Sokolov AS, Litus EA, Shevelyova MP, Uversky VN, Permyakov EA, Permyakov SE. Recognition of granulocyte-macrophage colony-stimulating factor by specific S100 proteins. Cell Calcium 2024;119:102869. [PMID: 38484433 DOI: 10.1016/j.ceca.2024.102869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 03/01/2024] [Accepted: 03/03/2024] [Indexed: 04/05/2024]

Abstract

Granulocyte-macrophage colony-stimulating factor (GM-CSF) is a pleiotropic myelopoietic growth factor and proinflammatory cytokine, clinically used for multiple indications and serving as a promising target for treatment of many disorders, including cancer, multiple sclerosis, rheumatoid arthritis, psoriasis, asthma, COVID-19. We have previously shown that dimeric Ca2+-bound forms of S100A6 and S100P proteins, members of the multifunctional S100 protein family, are specific to GM-CSF. To probe selectivity of these interactions, the affinity of recombinant human GM-CSF to dimeric Ca2+-loaded forms of 18 recombinant human S100 proteins was studied by surface plasmon resonance spectroscopy. Of them, only S100A4 protein specifically binds to GM-CSF with equilibrium dissociation constant, Kd, values of 0.3-2 μM, as confirmed by intrinsic fluorescence and chemical crosslinking data. Calcium removal prevents S100A4 binding to GM-CSF, whereas monomerization of S100A4/A6/P proteins disrupts S100A4/A6 interaction with GM-CSF and induces a slight decrease in S100P affinity for GM-CSF. Structural modelling indicates the presence in the GM-CSF molecule of a conserved S100A4/A6/P-binding site, consisting of the residues from its termini, helices I and III, some of which are involved in the interaction with GM-CSF receptors. The predicted involvement of the 'hinge' region and F89 residue of S100P in GM-CSF recognition was confirmed by mutagenesis. Examination of S100A4/A6/P ability to affect GM-CSF signaling showed that S100A4/A6 inhibit GM-CSF-induced suppression of viability of monocytic THP-1 cells. The ability of the S100 proteins to modulate GM-CSF activity is relevant to progression of various neoplasms and other diseases, according to bioinformatics analysis. The direct regulation of GM-CSF signaling by extracellular forms of the S100 proteins should be taken into account in the clinical use of GM-CSF and development of the therapeutic interventions targeting GM-CSF or its receptors.

Collapse

Affiliation(s)

Alexey S Kazakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia.
Victoria A Rastrygina Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Alisa A Vologzhannikova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Marina Y Zemskova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, G.K. Skryabin Institute of Biochemistry and Physiology of Microorganisms, pr. Nauki, 5, Pushchino, Moscow Region 142290, Russia
Lolita A Bobrova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Evgenia I Deryusheva Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia.
Maria E Permyakova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Andrey S Sokolov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Ekaterina A Litus Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Marina P Shevelyova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Vladimir N Uversky Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
Eugene A Permyakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia
Sergei E Permyakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia.

Collapse

Ellaway JIJ, Anyango S, Nair S, Zaki HA, Nadzirin N, Powell HR, Gutmanas A, Varadi M, Velankar S. Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2024;11:034701. [PMID: 38774441 PMCID: PMC11106648 DOI: 10.1063/4.0000251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 05/08/2024] [Indexed: 05/24/2024]

Liu R, Clayton J, Shen M, Bhatnagar S, Shen J. Machine Learning Models to Interrogate Proteome-Wide Covalent Ligandabilities Directed at Cysteines. JACS AU 2024;4:1374-1384. [PMID: 38665640 PMCID: PMC11040703 DOI: 10.1021/jacsau.3c00749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/22/2024] [Accepted: 02/23/2024] [Indexed: 04/28/2024]

Pan Z, Zhuo L, Wan TY, Chen RY, Li YZ. DnaK duplication and specialization in bacteria correlates with increased proteome complexity. mSystems 2024;9:e0115423. [PMID: 38530057 PMCID: PMC11019930 DOI: 10.1128/msystems.01154-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2023] [Accepted: 03/10/2024] [Indexed: 03/27/2024] Open

Abstract

The chaperone 70 kDa heat shock protein (Hsp70) is important for cells from bacteria to humans to maintain proteostasis, and all eukaryotes and several prokaryotes encode Hsp70 paralogs. Although the mechanisms of Hsp70 function have been clearly illuminated, the function and evolution of Hsp70 paralogs is not well studied. DnaK is a highly conserved bacterial Hsp70 family. Here, we show that dnaK is present in 98.9% of bacterial genomes, and 6.4% of them possess two or more DnaK paralogs. We found that the duplication of dnaK is positively correlated with an increase in proteomic complexity (proteome size, number of domains). We identified the interactomes of the two DnaK paralogs of Myxococcus xanthus DK1622 (MxDnaKs), which revealed that they are mostly nonoverlapping, although both prefer α and β domain proteins. Consistent with the entire M. xanthus proteome, MxDnaK substrates have both significantly more multi-domain proteins and a higher isoelectric point than that of Escherichia coli, which encodes a single DnaK homolog. MxDnaK1 is transcriptionally upregulated in response to heat shock and prefers to bind cytosolic proteins, while MxDnaK2 is downregulated by heat shock and is more associated with membrane proteins. Using domain swapping, we show that the nucleotide-binding domain and the substrate-binding β domain are responsible for the significant differences in DnaK interactomes, and the nucleotide binding domain also determines the dimerization of MxDnaK2, but not MxDnaK1. Our work suggests that bacterial DnaK has been duplicated in order to deal with a more complex proteome, and that this allows evolution of distinct domains to deal with different subsets of target proteins.IMPORTANCEAll eukaryotic and ~40% of prokaryotic species encode multiple 70 kDa heat shock protein (Hsp70) homologs with similar but diversified functions. Here, we show that duplication of canonical Hsp70 (DnaK in prokaryotes) correlates with increasing proteomic complexity and evolution of particular regions of the protein. Using the Myxococcus xanthus DnaK duplicates as a case, we found that their substrate spectrums are mostly nonoverlapping, and are both consistent to that of Escherichia coli DnaK in structural and molecular characteristics, but show differential enrichment of membrane proteins. Domain/region swapping demonstrated that the nucleotide-binding domain and the β substrate-binding domain (SBDβ), but not the SBDα or disordered C-terminal tail region, are responsible for this functional divergence. This work provides the first direct evidence for regional evolution of DnaK paralogs.

Collapse

Glidden-Handgis G, Wheeler TJ. WAS IT A MATch I SAW? Approximate palindromes lead to overstated false match rates in benchmarks using reversed sequences. BIOINFORMATICS ADVANCES 2024;4:vbae052. [PMID: 38764475 PMCID: PMC11099658 DOI: 10.1093/bioadv/vbae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 03/31/2024] [Accepted: 04/04/2024] [Indexed: 05/21/2024]

Abstract

Background

Software for labeling biological sequences typically produces a theory-based statistic for each match (the E-value) that indicates the likelihood of seeing that match's score by chance. E-values accurately predict false match rate for comparisons of random (shuffled) sequences, and thus provide a reasoned mechanism for setting score thresholds that enable high sensitivity with low expected false match rate. This threshold-setting strategy is challenged by real biological sequences, which contain regions of local repetition and low sequence complexity that cause excess matches between non-homologous sequences. Knowing this, tool developers often develop benchmarks that use realistic-seeming decoy sequences to explore empirical tradeoffs between sensitivity and false match rate. A recent trend has been to employ reversed biological sequences as realistic decoys, because these preserve the distribution of letters and the existence of local repeats, while disrupting the original sequence's functional properties. However, we and others have observed that sequences appear to produce high scoring alignments to their reversals with surprising frequency, leading to overstatement of false match risk that may negatively affect downstream analysis.

Results

We demonstrate that an alignment between a sequence S and its (possibly mutated) reversal tends to produce higher scores than alignment between truly unrelated sequences, even when S is a shuffled string with no notable repetitive or low-complexity regions. This phenomenon is due to the unintuitive fact that (even randomly shuffled) sequences contain palindromes that are on average longer than the longest common substrings (LCS) shared between permuted variants of the same sequence. Though the expected palindrome length is only slightly larger than the expected LCS, the distribution of alignment scores involving reversed sequences is strongly right-shifted, leading to greatly increased frequency of high-scoring alignments to reversed sequences.

Impact

Overestimates of false match risk can motivate unnecessarily high score thresholds, leading to potentially reduced true match sensitivity. Also, when tool sensitivity is only reported up to the score of the first matched decoy sequence, a large decoy set consisting of reversed sequences can obscure sensitivity differences between tools. As a result of these observations, we advise that reversed biological sequences be used as decoys only when care is taken to remove positive matches in the original (un-reversed) sequences, or when overstatement of false labeling is not a concern. Though the primary focus of the analysis is on sequence annotation, we also demonstrate that the prevalence of internal palindromes may lead to an overstatement of the rate of false labels in protein identification with mass spectrometry.

Collapse

Penteado RF, Iulek J. Crystal structure of Methionyl-tRNA Synthetase from Rickettsia typhi in complex with its cognate amino acid. Biochimie 2024;219:63-73. [PMID: 37673171 DOI: 10.1016/j.biochi.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 08/08/2023] [Accepted: 09/02/2023] [Indexed: 09/08/2023]

Dutta A, Kanaujia SP. The Structural Features of MlaD Illuminate its Unique Ligand-Transporting Mechanism and Ancestry. Protein J 2024;43:298-315. [PMID: 38347327 DOI: 10.1007/s10930-023-10179-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/22/2023] [Indexed: 05/01/2024]

Roel‐Touris J, Carcelén L, Marcos E. The structural landscape of the immunoglobulin fold by large-scale de novo design. Protein Sci 2024;33:e4936. [PMID: 38501461 PMCID: PMC10949314 DOI: 10.1002/pro.4936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 02/02/2024] [Accepted: 02/06/2024] [Indexed: 03/20/2024]

Rozano L, Jones DAB, Hane JK, Mancera RL. Template-Based Modelling of the Structure of Fungal Effector Proteins. Mol Biotechnol 2024;66:784-813. [PMID: 36940017 PMCID: PMC11043172 DOI: 10.1007/s12033-023-00703-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 02/14/2023] [Indexed: 03/21/2023]

Singleton MD, Eisen MB. Evolutionary analyses of intrinsically disordered regions reveal widespread signals of conservation. PLoS Comput Biol 2024;20:e1012028. [PMID: 38662765 PMCID: PMC11075841 DOI: 10.1371/journal.pcbi.1012028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 05/07/2024] [Accepted: 03/28/2024] [Indexed: 05/08/2024] Open

Gupta MN, Uversky VN. Protein structure-function continuum model: Emerging nexuses between specificity, evolution, and structure. Protein Sci 2024;33:e4968. [PMID: 38532700 DOI: 10.1002/pro.4968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 02/18/2024] [Accepted: 03/05/2024] [Indexed: 03/28/2024]

Dotan E, Jaschek G, Pupko T, Belinkov Y. Effect of tokenization on transformers for biological sequences. Bioinformatics 2024;40:btae196. [PMID: 38608190 PMCID: PMC11055402 DOI: 10.1093/bioinformatics/btae196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 02/20/2024] [Accepted: 04/11/2024] [Indexed: 04/14/2024] Open

Abstract

MOTIVATION

Deep-learning models are transforming biological research, including many bioinformatics and comparative genomics algorithms, such as sequence alignments, phylogenetic tree inference, and automatic classification of protein functions. Among these deep-learning algorithms, models for processing natural languages, developed in the natural language processing (NLP) community, were recently applied to biological sequences. However, biological sequences are different from natural languages, such as English, and French, in which segmentation of the text to separate words is relatively straightforward. Moreover, biological sequences are characterized by extremely long sentences, which hamper their processing by current machine-learning models, notably the transformer architecture. In NLP, one of the first processing steps is to transform the raw text to a list of tokens. Deep-learning applications to biological sequence data mostly segment proteins and DNA to single characters. In this work, we study the effect of alternative tokenization algorithms on eight different tasks in biology, from predicting the function of proteins and their stability, through nucleotide sequence alignment, to classifying proteins to specific families.

RESULTS

We demonstrate that applying alternative tokenization algorithms can increase accuracy and at the same time, substantially reduce the input length compared to the trivial tokenizer in which each character is a token. Furthermore, applying these tokenization algorithms allows interpreting trained models, taking into account dependencies among positions. Finally, we trained these tokenizers on a large dataset of protein sequences containing more than 400 billion amino acids, which resulted in over a 3-fold decrease in the number of tokens. We then tested these tokenizers trained on large-scale data on the above specific tasks and showed that for some tasks it is highly beneficial to train database-specific tokenizers. Our study suggests that tokenizers are likely to be a critical component in future deep-network analysis of biological sequence data.

AVAILABILITY AND IMPLEMENTATION

Code, data, and trained tokenizers are available on https://github.com/technion-cs-nlp/BiologicalTokenizers.

Collapse

Abbass J, Parisi C. Machine learning-based prediction of proteins' architecture using sequences of amino acids and structural alphabets. J Biomol Struct Dyn 2024:1-16. [PMID: 38505995 DOI: 10.1080/07391102.2024.2328736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 03/05/2024] [Indexed: 03/21/2024]

Goverde CA, Pacesa M, Goldbach N, Dornfeld LJ, Balbi PEM, Georgeon S, Rosset S, Kapoor S, Choudhury J, Dauparas J, Schellhaas C, Kozlov S, Baker D, Ovchinnikov S, Vecchio AJ, Correia BE. Computational design of soluble functional analogues of integral membrane proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.09.540044. [PMID: 38496615 PMCID: PMC10942269 DOI: 10.1101/2023.05.09.540044] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]

Makarova KS, Zhang C, Wolf YI, Karamycheva S, Whitaker RJ, Koonin EV. Computational analysis of genes with lethal knockout phenotype and prediction of essential genes in archaea. mBio 2024;15:e0309223. [PMID: 38189270 PMCID: PMC10865827 DOI: 10.1128/mbio.03092-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 11/27/2023] [Indexed: 01/09/2024] Open

Abstract

The identification of microbial genes essential for survival as those with lethal knockout phenotype (LKP) is a common strategy for functional interrogation of genomes. However, interpretation of the LKP is complicated because a substantial fraction of the genes with this phenotype remains poorly functionally characterized. Furthermore, many genes can exhibit LKP not because their products perform essential cellular functions but because their knockout activates the toxicity of other genes (conditionally essential genes). We analyzed the sets of LKP genes for two archaea, Methanococcus maripaludis and Sulfolobus islandicus, using a variety of computational approaches aiming to differentiate between essential and conditionally essential genes and to predict at least a general function for as many of the proteins encoded by these genes as possible. This analysis allowed us to predict the functions of several LKP genes including previously uncharacterized subunit of the GINS protein complex with an essential function in genome replication and of the KEOPS complex that is responsible for an essential tRNA modification as well as GRP protease implicated in protein quality control. Additionally, several novel antitoxins (conditionally essential genes) were predicted, and this prediction was experimentally validated by showing that the deletion of these genes together with the adjacent genes apparently encoding the cognate toxins caused no growth defect. We applied principal component analysis based on sequence and comparative genomic features showing that this approach can separate essential genes from conditionally essential ones and used it to predict essential genes in other archaeal genomes.IMPORTANCEOnly a relatively small fraction of the genes in any bacterium or archaeon is essential for survival as demonstrated by the lethal effect of their disruption. The identification of essential genes and their functions is crucial for understanding fundamental cell biology. However, many of the genes with a lethal knockout phenotype remain poorly functionally characterized, and furthermore, many genes can exhibit this phenotype not because their products perform essential cellular functions but because their knockout activates the toxicity of other genes. We applied state-of-the-art computational methods to predict the functions of a number of uncharacterized genes with the lethal knockout phenotype in two archaeal species and developed a computational approach to predict genes involved in essential functions. These findings advance the current understanding of key functionalities of archaeal cells.

Collapse

Doni D, Cavallari E, Noguera ME, Gentili HG, Cavion F, Parisi G, Fornasari MS, Sartori G, Santos J, Bellanda M, Carbonera D, Costantini P, Bortolus M. Searching for Frataxin Function: Exploring the Analogy with Nqo15, the Frataxin-like Protein of Respiratory Complex I from Thermus thermophilus. Int J Mol Sci 2024;25:1912. [PMID: 38339189 PMCID: PMC10855754 DOI: 10.3390/ijms25031912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 01/26/2024] [Accepted: 02/02/2024] [Indexed: 02/12/2024] Open

Affiliation(s)

Davide Doni Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
Eva Cavallari Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.) Grenoble Alpes University, CNRS, CEA, INRAE, IRIG-LPCV, 38000 Grenoble, France
Martin Ezequiel Noguera Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.) Institute of Biological Chemistry and Physical Chemistry, Dr Alejandro Paladini (UBA-CONICET), University of Buenos Aires, Junín 956, Buenos Aires 1113AAD, Argentina Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
Hernan Gustavo Gentili Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.)
Federica Cavion Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
Gustavo Parisi Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
Maria Silvina Fornasari Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
Geppo Sartori Department of Biomedical Sciences, University of Padova, 35121 Padova, Italy;
Javier Santos Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.)
Massimo Bellanda Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.) Consiglio Nazionale delle Ricerche Institute of Biomolecular Chemistry, 35131 Padova, Italy
Donatella Carbonera Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.)
Paola Costantini Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
Marco Bortolus Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.)

Collapse

Satalkar V, Degaga GD, Li W, Pang YT, McShan AC, Gumbart JC, Mitchell JC, Torres MP. Generative β-hairpin design using a residue-based physicochemical property landscape. Biophys J 2024:S0006-3495(24)00070-5. [PMID: 38297834 DOI: 10.1016/j.bpj.2024.01.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 12/20/2023] [Accepted: 01/25/2024] [Indexed: 02/02/2024] Open

Sayin AZ, Abali Z, Senyuz S, Cankara F, Gursoy A, Keskin O. Conformational diversity and protein-protein interfaces in drug repurposing in Ras signaling pathway. Sci Rep 2024;14:1239. [PMID: 38216592 PMCID: PMC10786864 DOI: 10.1038/s41598-023-50913-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 12/27/2023] [Indexed: 01/14/2024] Open

Liu R, Clayton J, Shen M, Bhatnagar S, Shen J. Machine Learning Models to Interrogate Proteomewide Covalent Ligandabilities Directed at Cysteines. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.17.553742. [PMID: 37662346 PMCID: PMC10473668 DOI: 10.1101/2023.08.17.553742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]

Schierholz L, Brown CR, Helena-Bueno K, Uversky VN, Hirt RP, Barandun J, Melnikov SV. A Conserved Ribosomal Protein Has Entirely Dissimilar Structures in Different Organisms. Mol Biol Evol 2024;41:msad254. [PMID: 37987564 PMCID: PMC10764239 DOI: 10.1093/molbev/msad254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 10/23/2023] [Accepted: 11/16/2023] [Indexed: 11/22/2023] Open

Pantolini L, Studer G, Pereira J, Durairaj J, Tauriello G, Schwede T. Embedding-based alignment: combining protein language models with dynamic programming alignment to detect structural similarities in the twilight-zone. Bioinformatics 2024;40:btad786. [PMID: 38175775 PMCID: PMC10792726 DOI: 10.1093/bioinformatics/btad786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 10/27/2023] [Accepted: 12/29/2023] [Indexed: 01/06/2024] Open

Denessiouk K, Denesyuk AI, Permyakov SE, Permyakov EA, Johnson MS, Uversky VN. The active site of the SGNH hydrolase-like fold proteins: Nucleophile-oxyanion (Nuc-Oxy) and Acid-Base zones. Curr Res Struct Biol 2023;7:100123. [PMID: 38235349 PMCID: PMC10792757 DOI: 10.1016/j.crstbi.2023.100123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 12/25/2023] [Accepted: 12/27/2023] [Indexed: 01/19/2024] Open

Affiliation(s)

Konstantin Denessiouk Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290, Russia Structural Bioinformatics Laboratory, Biochemistry, InFLAMES Research Flagship Center, Faculty of Science and Engineering, Biochemistry, Åbo Akademi University, Turku, 20520, Finland
Alexander I. Denesyuk Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290, Russia Structural Bioinformatics Laboratory, Biochemistry, InFLAMES Research Flagship Center, Faculty of Science and Engineering, Biochemistry, Åbo Akademi University, Turku, 20520, Finland
Sergei E. Permyakov Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290, Russia
Eugene A. Permyakov Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290, Russia
Mark S. Johnson Structural Bioinformatics Laboratory, Biochemistry, InFLAMES Research Flagship Center, Faculty of Science and Engineering, Biochemistry, Åbo Akademi University, Turku, 20520, Finland
Vladimir N. Uversky Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290, Russia Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA

Collapse

Subramanian AM, Thomson M. Unexplored regions of the protein sequence-structure map revealed at scale by a library of foldtuned language models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.22.573145. [PMID: 38187750 PMCID: PMC10769378 DOI: 10.1101/2023.12.22.573145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]

Abstract

Nature has likely sampled only a fraction of all protein sequences and structures allowed by the laws of biophysics. However, the combinatorial scale of amino-acid sequence-space has traditionally precluded substantive study of the full protein sequence-structure map. In particular, it remains unknown how much of the vast uncharted landscape of far-from-natural sequences consists of alternate ways to encode the familiar ensemble of natural folds; proteins in this category also represent an opportunity to diversify candidates for downstream applications. Here, we characterize sequence-structure mapping in far-from-natural regions of sequence-space guided by the capacity of protein language models (pLMs) to explore sequences outside their natural training data through generation. We demonstrate that pretrained generative pLMs sample a limited structural snapshot of the natural protein universe, including >350 common (sub)domain elements. Incorporating pLM, structure prediction, and structure-based search techniques, we surpass this limitation by developing a novel "foldtuning" strategy that pushes a pretrained pLM into a generative regime that maintains structural similarity to a target protein fold (e.g. TIM barrel, thioredoxin, etc) while maximizing dissimilarity to natural amino-acid sequences. We apply "foldtuning" to build a library of pLMs for >700 naturally-abundant folds in the SCOP database, accessing swaths of proteins that take familiar structures yet lie far from known sequences, spanning targets that include enzymes, immune ligands, and signaling proteins. By revealing protein sequence-structure information at scale outside of the context of evolution, we anticipate that this work will enable future systematic searches for wholly novel folds and facilitate more immediate protein design goals in catalysis and medicine.

Collapse

Lau AM, Kandathil SM, Jones DT. Merizo: a rapid and accurate protein domain segmentation method using invariant point attention. Nat Commun 2023;14:8445. [PMID: 38114456 PMCID: PMC10730818 DOI: 10.1038/s41467-023-43934-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 11/24/2023] [Indexed: 12/21/2023] Open

Tsuchiya Y, Yonezawa T, Yamamori Y, Inoura H, Osawa M, Ikeda K, Tomii K. PoSSuM v.3: A Major Expansion of the PoSSuM Database for Finding Similar Binding Sites of Proteins. J Chem Inf Model 2023;63:7578-7587. [PMID: 38016694 PMCID: PMC10716853 DOI: 10.1021/acs.jcim.3c01405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/28/2023] [Accepted: 11/01/2023] [Indexed: 11/30/2023]

Segura J, Rose Y, Bi C, Duarte J, Burley SK, Bittrich S. RCSB Protein Data Bank: visualizing groups of experimentally determined PDB structures alongside computed structure models of proteins. FRONTIERS IN BIOINFORMATICS 2023;3:1311287. [PMID: 38111685 PMCID: PMC10726007 DOI: 10.3389/fbinf.2023.1311287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 11/17/2023] [Indexed: 12/20/2023] Open

Midlik A, Nair S, Anyango S, Deshpande M, Sehnal D, Varadi M, Velankar S. PDBImages: a command-line tool for automated macromolecular structure visualization. Bioinformatics 2023;39:btad744. [PMID: 38085238 PMCID: PMC10746859 DOI: 10.1093/bioinformatics/btad744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 10/20/2023] [Accepted: 12/11/2023] [Indexed: 12/24/2023] Open

Hamamsy T, Barot M, Morton JT, Steinegger M, Bonneau R, Cho K. Learning sequence, structure, and function representations of proteins with language models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.26.568742. [PMID: 38045331 PMCID: PMC10690258 DOI: 10.1101/2023.11.26.568742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]

Wang T, Wang L, Zhang X, Shen C, Zhang O, Wang J, Wu J, Jin R, Zhou D, Chen S, Liu L, Wang X, Hsieh CY, Chen G, Pan P, Kang Y, Hou T. Comprehensive assessment of protein loop modeling programs on large-scale datasets: prediction accuracy and efficiency. Brief Bioinform 2023;25:bbad486. [PMID: 38171930 PMCID: PMC10764206 DOI: 10.1093/bib/bbad486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/04/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024] Open

Cao W, Wu LY, Xia XY, Chen X, Wang ZX, Pan XM. A sequence-based evolutionary distance method for Phylogenetic analysis of highly divergent proteins. Sci Rep 2023;13:20304. [PMID: 37985846 PMCID: PMC10662474 DOI: 10.1038/s41598-023-47496-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 11/14/2023] [Indexed: 11/22/2023] Open

Bale A, Rambo R, Prior C. The SKMT Algorithm: A method for assessing and comparing underlying protein entanglement. PLoS Comput Biol 2023;19:e1011248. [PMID: 38011290 PMCID: PMC10703313 DOI: 10.1371/journal.pcbi.1011248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 12/07/2023] [Accepted: 11/06/2023] [Indexed: 11/29/2023] Open

Casier R, Duhamel J. Appraisal of blob-Based Approaches in the Prediction of Protein Folding Times. J Phys Chem B 2023;127:8852-8859. [PMID: 37793094 DOI: 10.1021/acs.jpcb.3c04958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]

Abstract

A series of reports published in the last 3 years has illustrated that a blob-based model (BBM) can predict the folding time of proteins from their primary amino acid (aa) sequence based on three simple rules established to characterize the long-range backbone dynamics (LRBD) of racemic polypeptides. The sole use of LRBD to predict protein folding times with the BBM represents a radical departure from all other prediction methods currently applied to determine protein folding times, which rely instead on parameters such as the structure content, folding kinetics, chain length, amino acid properties, or contact topography of proteins. Furthermore, the built-in modularity of the BBM enables the parametrization and inclusion of new phenomena affecting the LRBD of polypeptides, while its conceptual simplicity makes it an interesting new mathematical tool for studying protein folding. However, its novelty implies that its relationship with many other methods used to predict protein folding times has not been well researched. Consequently, the purpose of this report is to uncover the physical phenomena encountered during protein folding that are best described by the BBM through the identification of parameters that have been recognized over the years as being strong predictors for protein folding, such as protein size, topology, structural class, and folding kinetics. This was accomplished by determining the parameters most strongly correlated with the folding times predicted by the BBM. While the BBM in its present form appears to be a good indicator of the folding times of the vast majority of the 195 proteins considered so far, this report finds that it excels for moderately large proteins that are primarily composed of locally formed structural motifs such as α-helices or for proteins that fold in multiple steps. Altogether, these observations based on the use of the BBM support the notion that proteins fold the way they do because the LRBD of polypeptides is mostly driven by the local interactions experienced between aa's within reach of one another.

Collapse

Ooka K, Arai M. Accurate prediction of protein folding mechanisms by simple structure-based statistical mechanical models. Nat Commun 2023;14:6338. [PMID: 37857633 PMCID: PMC10587348 DOI: 10.1038/s41467-023-41664-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 09/10/2023] [Indexed: 10/21/2023] Open

Bae DW, Lee SH, Park JH, Son SY, Lin Y, Lee J, Jang BR, Lee KH, Lee YH, Lee H, Kang S, Kim B, Cha SS. An archaeal transcription factor EnfR with a novel 'eighth note' fold controls hydrogen production of a hyperthermophilic archaeon Thermococcus onnurineus NA1. Nucleic Acids Res 2023;51:10026-10040. [PMID: 37650645 PMCID: PMC10570040 DOI: 10.1093/nar/gkad699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 07/13/2023] [Accepted: 08/14/2023] [Indexed: 09/01/2023] Open

Affiliation(s)

Da-Woon Bae Department of Chemistry & Nanoscience, Ewha Womans University, Seoul 03760, Republic of Korea
Seong Hyuk Lee Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Busan, South Korea
Ji Hye Park Department of Food Science and Biotechnology, Ewha Womans University, Seoul 03760, Republic of Korea
Se-Young Son Department of Chemistry & Nanoscience, Ewha Womans University, Seoul 03760, Republic of Korea
Yuxi Lin Research Center for Bioconvergence Analysis, Korea Basic Science Institute (KBSI), Cheongju, Chungbuk 28119, Republic of Korea
Jung Hyen Lee Department of Food Science and Biotechnology, Ewha Womans University, Seoul 03760, Republic of Korea
Bo-Ram Jang Department of Life Science, Sogang University, 35 Baekbeom-Ro, Mapo-Gu, Seoul, South Korea
Kyu-Ho Lee Department of Life Science, Sogang University, 35 Baekbeom-Ro, Mapo-Gu, Seoul, South Korea
Young-Ho Lee Research Center for Bioconvergence Analysis, Korea Basic Science Institute (KBSI), Cheongju, Chungbuk 28119, Republic of Korea Bio-Analytical Science, University of Science and Technology, Daejeon 34113, Republic of Korea Department of Systems Biotechnology, Chung-Ang University, Anseong, Gyeonggi 17546, Republic of Korea Frontier Research Institute for Interdisciplinary Sciences, Tohoku University, Sendai, Miyagi 980-8578, Japan
Hyun Sook Lee Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Busan, South Korea Department of Marine Biotechnology, KIOST School, University of Science and Technology, Daejeon, South Korea
Sung Gyun Kang Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Busan, South Korea Department of Marine Biotechnology, KIOST School, University of Science and Technology, Daejeon, South Korea
Byoung Sik Kim Department of Food Science and Biotechnology, Ewha Womans University, Seoul 03760, Republic of Korea
Sun-Shin Cha Department of Chemistry & Nanoscience, Ewha Womans University, Seoul 03760, Republic of Korea

Collapse

Pavlopoulos GA, Baltoumas FA, Liu S, Selvitopi O, Camargo AP, Nayfach S, Azad A, Roux S, Call L, Ivanova NN, Chen IM, Paez-Espino D, Karatzas E, Iliopoulos I, Konstantinidis K, Tiedje JM, Pett-Ridge J, Baker D, Visel A, Ouzounis CA, Ovchinnikov S, Buluç A, Kyrpides NC. Unraveling the functional dark matter through global metagenomics. Nature 2023;622:594-602. [PMID: 37821698 PMCID: PMC10584684 DOI: 10.1038/s41586-023-06583-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 08/30/2023] [Indexed: 10/13/2023]

Affiliation(s)

Georgios A Pavlopoulos Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece. DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA. Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece.
Fotis A Baltoumas Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece
Sirui Liu John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
Oguz Selvitopi Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Antonio Pedro Camargo DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Stephen Nayfach DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Ariful Azad Luddy School of Informatics, Computing and Engineering, Indiana University Bloomington, Bloomington, IN, USA
Simon Roux DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Lee Call DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Natalia N Ivanova DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
I Min Chen DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
David Paez-Espino DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Evangelos Karatzas Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece
Ioannis Iliopoulos Department of Basic Sciences, School of Medicine, University of Crete, Heraklion, Greece
Konstantinos Konstantinidis School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA, USA
James M Tiedje Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
Jennifer Pett-Ridge Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, USA
David Baker Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
Axel Visel DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Christos A Ouzounis DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece Biological Computation & Computational Biology Group, Artificial Intelligence & Information Analysis Lab, School of Informatics, Aristotle University of Thessalonica, Thessalonica, Greece
Sergey Ovchinnikov John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
Aydin Buluç Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
Nikos C Kyrpides DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Collapse

Kazakov AS, Deryusheva EI, Rastrygina VA, Sokolov AS, Permyakova ME, Litus EA, Uversky VN, Permyakov EA, Permyakov SE. Interaction of S100A6 Protein with the Four-Helical Cytokines. Biomolecules 2023;13:1345. [PMID: 37759746 PMCID: PMC10526228 DOI: 10.3390/biom13091345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/19/2023] [Accepted: 08/31/2023] [Indexed: 09/29/2023] Open

Abstract

S100 is a family of over 20 structurally homologous, but functionally diverse regulatory (calcium/zinc)-binding proteins of vertebrates. The involvement of S100 proteins in numerous vital (patho)physiological processes is mediated by their interaction with various (intra/extra)cellular protein partners, including cell surface receptors. Furthermore, recent studies have revealed the ability of specific S100 proteins to modulate cell signaling via direct interaction with cytokines. Previously, we revealed the binding of ca. 71% of the four-helical cytokines via the S100P protein, due to the presence in its molecule of a cytokine-binding site overlapping with the binding site for the S100P receptor. Here, we show that another S100 protein, S100A6 (that has a pairwise sequence identity with S100P of 35%), specifically binds numerous four-helical cytokines. We have studied the affinity of the recombinant forms of 35 human four-helical cytokines from all structural families of this fold to Ca2+-loaded recombinant human S100A6, using surface plasmon resonance spectroscopy. S100A6 recognizes 26 of the cytokines from all families of this fold, with equilibrium dissociation constants from 0.3 nM to 12 µM. Overall, S100A6 interacts with ca. 73% of the four-helical cytokines studied to date, with a selectivity equivalent to that for the S100P protein, with the differences limited to the binding of interleukin-2 and oncostatin M. The molecular docking study evidences the presence in the S100A6 molecule of a cytokine-binding site, analogous to that found in S100P. The findings argue the presence in some of the promiscuous members of the S100 family of a site specific to a wide range of four-helical cytokines. This unique feature of the S100 proteins potentially allows them to modulate the activity of the numerous four-helical cytokines in the disorders accompanied by an excessive release of the cytokines.

Collapse

Affiliation(s)

Alexey S. Kazakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Evgenia I. Deryusheva Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Victoria A. Rastrygina Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Andrey S. Sokolov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Maria E. Permyakova Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Ekaterina A. Litus Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Vladimir N. Uversky Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.) Department of Molecular, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
Eugene A. Permyakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)
Sergei E. Permyakov Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Institute for Biological Instrumentation, Institutskaya str., 7, Pushchino, Moscow Region 142290, Russia; (A.S.K.); (E.I.D.); (V.A.R.); (A.S.S.); (M.E.P.); (E.A.L.); (E.A.P.)

Collapse

Xie L, Xie L. Elucidation of genome-wide understudied proteins targeted by PROTAC-induced degradation using interpretable machine learning. PLoS Comput Biol 2023;19:e1010974. [PMID: 37590332 PMCID: PMC10464998 DOI: 10.1371/journal.pcbi.1010974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 08/29/2023] [Accepted: 07/27/2023] [Indexed: 08/19/2023] Open