1
|
Takács K, Varga B, Farkas V, Perczel A, Grolmusz V. Opening Amyloid-Windows to the secondary structure of proteins: The amyloidogenecity increases tenfold inside beta-sheets. Comput Biol Med 2024; 179:108863. [PMID: 39024903 DOI: 10.1016/j.compbiomed.2024.108863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 06/29/2024] [Accepted: 07/06/2024] [Indexed: 07/20/2024]
Abstract
Methods from artificial intelligence (AI), in general, and machine learning, in particular, have kept conquering new territories in numerous areas of science. Most of the applications of these techniques are restricted to the classification of large data sets, but new scientific knowledge can seldom be inferred from these tools. Here we show that an AI-based amyloidogenecity predictor can strongly differentiate the border- and the internal hexamers of β-pleated sheets when screening all the Protein Data Bank-deposited homology-filtered protein structures. Our main result shows that more than 30% of internal hexamers of β sheets are predicted to be amyloidogenic, while just outside the border regions, only 3% are predicted as such. This result may elucidate a general protection mechanism of proteins against turning into amyloids: if the borders of β-sheets were amyloidogenic, then the whole β sheet could turn more easily into an insoluble amyloid-structure, characterized by periodically repeated parallel β-sheets. We also present that no analogous phenomenon exists on the borders of α-helices or randomly chosen subsequences of the studied protein structures.
Collapse
Affiliation(s)
- Kristóf Takács
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary.
| | - Bálint Varga
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary.
| | - Viktor Farkas
- HUN-REN -ELTE Protein Modeling Research Group, H-1117 Budapest, Hungary.
| | - András Perczel
- HUN-REN -ELTE Protein Modeling Research Group, H-1117 Budapest, Hungary; Laboratory of Structural Chemistry and Biology, Eötvös University, H-1117, Budapest, Hungary.
| | - Vince Grolmusz
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; Uratim Ltd., H-1118 Budapest, Hungary.
| |
Collapse
|
2
|
Keresztes L, Szögi E, Varga B, Farkas V, Perczel A, Grolmusz V. Succinct Amyloid and Nonamyloid Patterns in Hexapeptides. ACS OMEGA 2022; 7:35532-35537. [PMID: 36249386 PMCID: PMC9558248 DOI: 10.1021/acsomega.2c02513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 08/16/2022] [Indexed: 06/16/2023]
Abstract
Hexapeptides are widely applied as a model system for studying the amyloid-forming properties of polypeptides, including proteins. Recently, large experimental databases have become publicly available with amyloidogenic labels. Using these data sets for training and testing purposes, one may build artificial intelligence (AI)-based classifiers for predicting the amyloid state of peptides. In our previous work (Biomolecules 2021, 11, 500), we described the Support Vector Machine (SVM)-based Budapest Amyloid Predictor (https://pitgroup.org/bap). Here, we apply the Budapest Amyloid Predictor for discovering numerous amyloidogenic and nonamyloidogenic hexapeptide patterns with accuracy between 80% and 84%, as surprising and succinct novel rules for further understanding the amyloid state of peptides. For example, we have shown that for any independently mutated residue (position marked by "x"), the patterns CxFLWx, FxFLFx, or xxIVIV are predicted to be amyloidogenic, while those of PxDxxx, xxKxEx, and xxPQxx are nonamyloidogenic. We note that each amyloidogenic pattern with two x's (e.g.,CxFLWx) describes succinctly 202 = 400 hexapeptides, while the nonamyloidogenic patterns comprising four point mutations (e.g.,PxDxxx) give 204 = 160 000 hexapeptides in total. We also examine the restricted substitutions for positions "x" from subclasses of proteinogenic amino acid residues; for example, if "x" is substituted with hydrophobic amino acids, then there exist patterns containing three x's, like MxVVxx, predicted to be amyloidogenic. If we can choose for the x positions any hydrophobic amino acids, except the "structure breaker" proline, then we get amyloid patterns with five x positions, for example, xxxFxx, each corresponding to 32 768 hexapeptides. To our knowledge, no similar applications of artificial intelligence tools or succinct amyloid patterns were described before the present work.
Collapse
Affiliation(s)
- László Keresztes
- PIT
Bioinformatics Group, Eötvös
University, Budapest H-1117, Hungary
| | - Evelin Szögi
- PIT
Bioinformatics Group, Eötvös
University, Budapest H-1117, Hungary
| | - Bálint Varga
- PIT
Bioinformatics Group, Eötvös
University, Budapest H-1117, Hungary
| | - Viktor Farkas
- MTA-ELTE
Protein Modeling Research Group, Budapest H-1117, Hungary
| | - András Perczel
- MTA-ELTE
Protein Modeling Research Group, Budapest H-1117, Hungary
- Laboratory
of Structural Chemistry and Biology, Eötvös
University, Budapest H-1117, Hungary
| | - Vince Grolmusz
- PIT
Bioinformatics Group, Eötvös
University, Budapest H-1117, Hungary
- Uratim
Ltd., Budapest H-1118, Hungary
| |
Collapse
|
3
|
Ryan SM, Almassey M, Burch AM, Ngo G, Martin JM, Myers D, Compton D, Archie S, Cross M, Naeger L, Salzman A, Virola‐Iarussi A, Barbee SA, Mortimer NT, Sanyal S, Vrailas‐Mortimer AD. Drosophila p38 MAPK interacts with BAG-3/starvin to regulate age-dependent protein homeostasis. Aging Cell 2021; 20:e13481. [PMID: 34674371 PMCID: PMC8590102 DOI: 10.1111/acel.13481] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 08/23/2021] [Accepted: 09/09/2021] [Indexed: 12/25/2022] Open
Abstract
As organisms age, they often accumulate protein aggregates that are thought to be toxic, potentially leading to age‐related diseases. This accumulation of protein aggregates is partially attributed to a failure to maintain protein homeostasis. A variety of genetic factors have been linked to longevity, but how these factors also contribute to protein homeostasis is not completely understood. In order to understand the relationship between aging and protein aggregation, we tested how a gene that regulates lifespan and age‐dependent locomotor behaviors, p38 MAPK (p38Kb), influences protein homeostasis as an organism ages. We find that p38Kb regulates age‐dependent protein aggregation through an interaction with starvin, a regulator of muscle protein homeostasis. Furthermore, we have identified Lamin as an age‐dependent target of p38Kb and starvin.
Collapse
Affiliation(s)
- Sarah M. Ryan
- Department of Biological Sciences University of Denver Denver CO USA
| | - Michael Almassey
- School of Biological Sciences Illinois State University Normal IL USA
| | | | - Gia Ngo
- Department of Biological Sciences University of Denver Denver CO USA
| | - Julia M. Martin
- School of Biological Sciences Illinois State University Normal IL USA
| | - David Myers
- School of Biological Sciences Illinois State University Normal IL USA
| | - Devin Compton
- School of Biological Sciences Illinois State University Normal IL USA
| | - Shira Archie
- School of Biological Sciences Illinois State University Normal IL USA
| | - Megan Cross
- School of Biological Sciences Illinois State University Normal IL USA
| | - Lauren Naeger
- School of Biological Sciences Illinois State University Normal IL USA
| | - Ashley Salzman
- School of Biological Sciences Illinois State University Normal IL USA
| | | | - Scott A. Barbee
- Department of Biological Sciences University of Denver Denver CO USA
| | | | - Subhabrata Sanyal
- Department of Cell Biology Emory University Atlanta GA USA
- Calico San Francisco CA USA
| | - Alysia D. Vrailas‐Mortimer
- Department of Biological Sciences University of Denver Denver CO USA
- School of Biological Sciences Illinois State University Normal IL USA
- Department of Cell Biology Emory University Atlanta GA USA
| |
Collapse
|
4
|
Takács K, Grolmusz V. On the border of the amyloidogenic sequences: prefix analysis of the parallel beta sheets in the PDB_Amyloid collection. J Integr Bioinform 2021; 19:jib-2020-0043. [PMID: 34303324 PMCID: PMC9069647 DOI: 10.1515/jib-2020-0043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 06/28/2021] [Indexed: 01/04/2023] Open
Abstract
The Protein Data Bank (PDB) today contains more than 174,000 entries with the 3-dimensional structures of biological macromolecules. Using the rich resources of this repository, it is possible identifying subsets with specific, interesting properties for different applications. Our research group prepared an automatically updated list of amyloid- and probably amyloidogenic molecules, the PDB_Amyloid collection, which is freely available at the address http://pitgroup.org/amyloid. This resource applies exclusively the geometric properties of the steric structures for identifying amyloids. In the present contribution, we analyze the starting (i.e., prefix) subsequences of the characteristic, parallel beta-sheets of the structures in the PDB_Amyloid collection, and identify further appearances of these length-5 prefix subsequences in the whole PDB data set. We have identified this way numerous proteins, whose normal or irregular functions involve amyloid formation, structural misfolding, or anti-coagulant properties, simply by containing these prefixes: including the T-cell receptor (TCR), bound with the major histocompatibility complexes MHC-1 and MHC-2; the p53 tumor suppressor protein; a mycobacterial RNA polymerase transcription initialization complex; the human bridging integrator protein BIN-1; and the tick anti-coagulant peptide TAP.
Collapse
Affiliation(s)
- Kristóf Takács
- PIT Bioinformatics Group, Eötvös University, BudapestH-1117, Hungary
| | - Vince Grolmusz
- PIT Bioinformatics Group, Eötvös University, BudapestH-1117, Hungary.,Uratim Ltd., BudapestH-1118, Hungary
| |
Collapse
|
5
|
Keresztes L, Szögi E, Varga B, Farkas V, Perczel A, Grolmusz V. The Budapest Amyloid Predictor and Its Applications. Biomolecules 2021; 11:500. [PMID: 33810341 PMCID: PMC8067080 DOI: 10.3390/biom11040500] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/18/2021] [Accepted: 03/23/2021] [Indexed: 12/13/2022] Open
Abstract
The amyloid state of proteins is widely studied with relevance to neurology, biochemistry, and biotechnology. In contrast with nearly amorphous aggregation, the amyloid state has a well-defined structure, consisting of parallel and antiparallel β-sheets in a periodically repeated formation. The understanding of the amyloid state is growing with the development of novel molecular imaging tools, like cryogenic electron microscopy. Sequence-based amyloid predictors were developed, mainly using artificial neural networks (ANNs) as the underlying computational technique. From a good neural-network-based predictor, it is a very difficult task to identify the attributes of the input amino acid sequence, which imply the decision of the network. Here, we present a linear Support Vector Machine (SVM)-based predictor for hexapeptides with correctness higher than 84%, i.e., it is at least as good as the best published ANN-based tools. Unlike artificial neural networks, the decisions of the linear SVMs are much easier to analyze and, from a good predictor, we can infer rich biochemical knowledge. In the Budapest Amyloid Predictor webserver the user needs to input a hexapeptide, and the server outputs a prediction for the input plus the 6 × 19 = 114 distance-1 neighbors of the input hexapeptide.
Collapse
Affiliation(s)
- László Keresztes
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; (L.K.); (E.S.); (B.V.)
| | - Evelin Szögi
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; (L.K.); (E.S.); (B.V.)
| | - Bálint Varga
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; (L.K.); (E.S.); (B.V.)
| | - Viktor Farkas
- MTA-ELTE Protein Modeling Research Group, H-1117 Budapest, Hungary; (V.F.); (A.P.)
| | - András Perczel
- MTA-ELTE Protein Modeling Research Group, H-1117 Budapest, Hungary; (V.F.); (A.P.)
- Laboratory of Structural Chemistry and Biology, Eötvös University, H-1117 Budapest, Hungary
| | - Vince Grolmusz
- PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; (L.K.); (E.S.); (B.V.)
- Uratim Ltd., H-1118 Budapest, Hungary
| |
Collapse
|
6
|
Kapros A, Balázs A, Harmat V, Háló A, Budai L, Pintér I, Menyhárd DK, Perczel A. Configuration-Controlled Crystal and/or Gel Formation of Protected d-Glucosamines Supported by Promiscuous Interaction Surfaces and a Conformationally Heterogeneous Solution State. Chemistry 2020; 26:11643-11655. [PMID: 32333713 DOI: 10.1002/chem.202000882] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 04/17/2020] [Indexed: 11/10/2022]
Abstract
The configuration-dependent self-association mode of the two anomers of O-Ac,N-Fmoc-d-glucosamine, a foldamer building block, leading to gel and/or single crystal formation is described. The β-anomer of the sugar amino acid (2) forms a gel from various solvents (confirmed by SEM, rheology measurements, NMR, and ECD spectroscopy), whereas the α-anomer (1) does not form a gel with any solvent tested. Transition from the solution state to a gel is coupled to a concurrent shift of the Fmoc-groups: from a freely rotating (almost symmetrical) to a specific, asymmetric orientation. Whereas the crystal structure of the α-anomer is built as an evenly packed 3D system, the β-anomer forms a looser superstructure of well-packed 2D layers. Modeling indicates that in the lowest energy, but scarcely sampled conformer of the β-anomer, the Fmoc-group bends above the sugar moiety, stabilized by intramolecular CH↔π interactions between the aromatic rings. It is concluded that possessing an extended and promiscuous interaction surface and a conformationally heterogeneous solution state are among the basic requirements of gel formation for a candidate molecule.
Collapse
Affiliation(s)
- Anita Kapros
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - Attila Balázs
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - Veronika Harmat
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary.,MTA-ELTE Protein Modelling Research Group, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - Adrienn Háló
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - Lívia Budai
- Department of Pharmaceutics, Semmelweis University, Hőgyes Endre utca 7, Budapest, 1092, Hungary
| | - István Pintér
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - Dóra K Menyhárd
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary.,MTA-ELTE Protein Modelling Research Group, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| | - András Perczel
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, Eötvös Loránd University, Pázmány P. stny. 1/A, Budapest, 1117, Hungary.,MTA-ELTE Protein Modelling Research Group, Pázmány P. stny. 1/A, Budapest, 1117, Hungary
| |
Collapse
|
7
|
Náray-Szabó G. Biomolecules as Potential Drugs. Curr Protein Pept Sci 2019; 20:1038-1039. [PMID: 31674892 DOI: 10.2174/138920372011191024104424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Gábor Náray-Szabó
- MTA-ELTE Protein Modelling Research Group Pazmany Peter st. 1A, 1117 Budapest, Hungary
| |
Collapse
|