1
|
Breimann S, Kamp F, Steiner H, Frishman D. AAontology: An Ontology of Amino Acid Scales for Interpretable Machine Learning. J Mol Biol 2024; 436:168717. [PMID: 39053689 DOI: 10.1016/j.jmb.2024.168717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 07/15/2024] [Accepted: 07/19/2024] [Indexed: 07/27/2024]
Abstract
Amino acid scales are crucial for protein prediction tasks, many of them being curated in the AAindex database. Despite various clustering attempts to organize them and to better understand their relationships, these approaches lack the fine-grained classification necessary for satisfactory interpretability in many protein prediction problems. To address this issue, we developed AAontology-a two-level classification for 586 amino acid scales (mainly from AAindex) together with an in-depth analysis of their relations-using bag-of-word-based classification, clustering, and manual refinement over multiple iterations. AAontology organizes physicochemical scales into 8 categories and 67 subcategories, enhancing the interpretability of scale-based machine learning methods in protein bioinformatics. Thereby it enables researchers to gain a deeper biological insight. We anticipate that AAontology will be a building block to link amino acid properties with protein function and dysfunctions as well as aid informed decision-making in mutation analysis or protein drug design.
Collapse
Affiliation(s)
- Stephan Breimann
- Department of Bioinformatics, School of Life Sciences, Technical University of Munich, Freising, Germany; Ludwig-Maximilians-University Munich, Biomedical Center, Division of Metabolic Biochemistry, Munich, Germany; German Center for Neurodegenerative Diseases (DZNE), Munich, Germany
| | - Frits Kamp
- Ludwig-Maximilians-University Munich, Biomedical Center, Division of Metabolic Biochemistry, Munich, Germany
| | - Harald Steiner
- Ludwig-Maximilians-University Munich, Biomedical Center, Division of Metabolic Biochemistry, Munich, Germany; German Center for Neurodegenerative Diseases (DZNE), Munich, Germany
| | - Dmitrij Frishman
- Department of Bioinformatics, School of Life Sciences, Technical University of Munich, Freising, Germany.
| |
Collapse
|
2
|
Hendershot LM, Buck TM, Brodsky JL. The Essential Functions of Molecular Chaperones and Folding Enzymes in Maintaining Endoplasmic Reticulum Homeostasis. J Mol Biol 2024; 436:168418. [PMID: 38143019 DOI: 10.1016/j.jmb.2023.168418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/18/2023] [Accepted: 12/19/2023] [Indexed: 12/26/2023]
Abstract
It has been estimated that up to one-third of the proteins encoded by the human genome enter the endoplasmic reticulum (ER) as extended polypeptide chains where they undergo covalent modifications, fold into their native structures, and assemble into oligomeric protein complexes. The fidelity of these processes is critical to support organellar, cellular, and organismal health, and is perhaps best underscored by the growing number of disease-causing mutations that reduce the fidelity of protein biogenesis in the ER. To meet demands encountered by the diverse protein clientele that mature in the ER, this organelle is populated with a cadre of molecular chaperones that prevent protein aggregation, facilitate protein disulfide isomerization, and lower the activation energy barrier of cis-trans prolyl isomerization. Components of the lectin (glycan-binding) chaperone system also reside within the ER and play numerous roles during protein biogenesis. In addition, the ER houses multiple homologs of select chaperones that can recognize and act upon diverse peptide signatures. Moreover, redundancy helps ensure that folding-compromised substrates are unable to overwhelm essential ER-resident chaperones and enzymes. In contrast, the ER in higher eukaryotic cells possesses a single member of the Hsp70, Hsp90, and Hsp110 chaperone families, even though several homologs of these molecules reside in the cytoplasm. In this review, we discuss specific functions of the many factors that maintain ER quality control, highlight some of their interactions, and describe the vulnerabilities that arise from the absence of multiple members of some chaperone families.
Collapse
Affiliation(s)
- Linda M Hendershot
- Department of Tumor Cell Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, United States.
| | - Teresa M Buck
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, United States
| | - Jeffrey L Brodsky
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, United States
| |
Collapse
|
3
|
van der Sluijs P, Hoelen H, Schmidt A, Braakman I. The Folding Pathway of ABC Transporter CFTR: Effective and Robust. J Mol Biol 2024; 436:168591. [PMID: 38677493 DOI: 10.1016/j.jmb.2024.168591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 04/16/2024] [Accepted: 04/23/2024] [Indexed: 04/29/2024]
Abstract
De novo protein folding into a native three-dimensional structure is indispensable for biological function, is instructed by its amino acid sequence, and occurs along a vectorial trajectory. The human proteome contains thousands of membrane-spanning proteins, whose biosynthesis begins on endoplasmic reticulum-associated ribosomes. Nearly half of all membrane proteins traverse the membrane more than once, including therapeutically important protein families such as solute carriers, G-protein-coupled receptors, and ABC transporters. These mediate a variety of functions like signal transduction and solute transport and are often of vital importance for cell function and tissue homeostasis. Missense mutations in multispan membrane proteins can lead to misfolding and cause disease; an example is the ABC transporter Cystic Fibrosis Transmembrane Conductance Regulator (CFTR). Even though our understanding of multispan membrane-protein folding still is rather rudimental, the cumulative knowledge of 20 years of basic research on CFTR folding has led to development of drugs that modulate the misfolded protein. This has provided the prospect of a life without CF to the vast majority of patients. In this review we describe our understanding of the folding pathway of CFTR in cells, which is modular and tolerates many defects, making it effective and robust. We address how modulator drugs affect folding and function of CFTR, and distinguish protein stability from its folding process. Since the domain architecture of (mammalian) ABC transporters are highly conserved, we anticipate that the insights we discuss here for folding of CFTR may lay the groundwork for understanding the general rules of ABC-transporter folding.
Collapse
Affiliation(s)
- Peter van der Sluijs
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, the Netherlands.
| | - Hanneke Hoelen
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, the Netherlands; Present address: GenDx, Yalelaan 48, 3584 CM Utrecht, The Netherlands
| | - Andre Schmidt
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, the Netherlands; 3D-Pharmxchange, Tilburg, the Netherlands
| | - Ineke Braakman
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, the Netherlands
| |
Collapse
|
4
|
Zhao H, Liu X, Amantai X, Bi J, Cao X, Yue X. Characterization and Comparison Analysis of Milk Fat Globule Membrane Proteins between Human and Porcine Milk. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:3210-3217. [PMID: 38291649 DOI: 10.1021/acs.jafc.3c06298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
This study aimed to explore the differences in milk fat globule membrane (MFGM) proteins between human milk (HM) and porcine milk (PM) using a label-free quantitative proteomic approach. A total of 3920 and 4001 MFGM proteins were identified between PM and HM, respectively. Among them, 3520 common MFGM proteins were detected, including 956 significant differentially expressed MFGM proteins (DEPs). Gene ontology (GO) enrichment analysis showed that the DEPs were highly enriched in the lipid metabolic process and intrinsic component of membrane. Kyoto Encyclopedia of Genes and Genomes pathways suggested that protein processing in the endoplasmic reticulum was the most highly enriched pathway, followed by peroxisome, complement, and coagulation cascades. This study reflects the difference in the composition of MFGM proteins between HM and PM and provides a scientific and systematic reference for the development of MFGM protein nutrition.
Collapse
Affiliation(s)
- Huiwen Zhao
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, China
| | - Xiaoyu Liu
- Department of Obstetrics and Gynaecology, General Hospital of Northern Theater Command, Shenyang 110016, China
| | - Xiakouna Amantai
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, China
| | - Jiayang Bi
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, China
| | - Xueyan Cao
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, China
| | - Xiqing Yue
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, China
| |
Collapse
|
5
|
Kleizen B, de Mattos E, Papaioannou O, Monti M, Tartaglia GG, van der Sluijs P, Braakman I. Transmembrane Helices 7 and 8 Confer Aggregation Sensitivity to the Cystic Fibrosis Transmembrane Conductance Regulator. Int J Mol Sci 2023; 24:15741. [PMID: 37958724 PMCID: PMC10648718 DOI: 10.3390/ijms242115741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 10/18/2023] [Accepted: 10/19/2023] [Indexed: 11/15/2023] Open
Abstract
The Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) is a large multi-spanning membrane protein that is susceptible to misfolding and aggregation. We have identified here the region responsible for this instability. Temperature-induced aggregation of C-terminally truncated versions of CFTR demonstrated that all truncations up to the second transmembrane domain (TMD2), including the R region, largely resisted aggregation. Limited proteolysis identified a folded structure that was prone to aggregation and consisted of TMD2 and at least part of the Regulatory Region R. Only when both TM7 (TransMembrane helix 7) and TM8 were present, TMD2 fragments became as aggregation-sensitive as wild-type CFTR, in line with increased thermo-instability of late CFTR nascent chains and in silico prediction of aggregation propensity. In accord, isolated TMD2 was degraded faster in cells than isolated TMD1. We conclude that TMD2 extended at its N-terminus with part of the R region forms a protease-resistant structure that induces heat instability in CFTR and may be responsible for its limited intracellular stability.
Collapse
Affiliation(s)
- Bertrand Kleizen
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; (B.K.); (E.d.M.); (O.P.); (P.v.d.S.)
| | - Eduardo de Mattos
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; (B.K.); (E.d.M.); (O.P.); (P.v.d.S.)
| | - Olga Papaioannou
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; (B.K.); (E.d.M.); (O.P.); (P.v.d.S.)
| | - Michele Monti
- Center for Life Nano- & Neuro-Science, Fondazione Istituto Italiano di Tecnologia (IIT), 00161 Rome, Italy; (M.M.); (G.G.T.)
- Centre for Human Technologies (CHT), Istituto Italiano di Tecnologia (IIT), 16152 Genoa, Italy
| | - Gian Gaetano Tartaglia
- Center for Life Nano- & Neuro-Science, Fondazione Istituto Italiano di Tecnologia (IIT), 00161 Rome, Italy; (M.M.); (G.G.T.)
- Centre for Human Technologies (CHT), Istituto Italiano di Tecnologia (IIT), 16152 Genoa, Italy
| | - Peter van der Sluijs
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; (B.K.); (E.d.M.); (O.P.); (P.v.d.S.)
| | - Ineke Braakman
- Cellular Protein Chemistry, Bijvoet Centre for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; (B.K.); (E.d.M.); (O.P.); (P.v.d.S.)
| |
Collapse
|
6
|
Montpetit J, Clúa J, Hsieh YF, Vogiatzaki E, Müller J, Abel S, Strasser R, Poirier Y. Endoplasmic reticulum calnexins participate in the primary root growth response to phosphate deficiency. PLANT PHYSIOLOGY 2023; 191:1719-1733. [PMID: 36567484 PMCID: PMC10022610 DOI: 10.1093/plphys/kiac595] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 12/08/2022] [Indexed: 06/17/2023]
Abstract
Accumulation of incompletely folded proteins in the endoplasmic reticulum (ER) leads to ER stress, activates ER protein degradation pathways, and upregulates genes involved in protein folding. This process is known as the unfolded protein response (UPR). The role of ER protein folding in plant responses to nutrient deficiencies is unclear. We analyzed Arabidopsis (Arabidopsis thaliana) mutants affected in ER protein quality control and established that both CALNEXIN (CNX) genes function in the primary root response to phosphate (Pi) deficiency. CNX1 and CNX2 are homologous ER lectins promoting protein folding of N-glycosylated proteins via the recognition of the GlcMan9GlcNAc2 glycan. Growth of cnx1-1 and cnx2-2 single mutants was similar to that of the wild type under high and low Pi conditions, but the cnx1-1 cnx2-2 double mutant showed decreased primary root growth under low Pi conditions due to reduced meristematic cell division. This phenotype was specific to Pi deficiency; the double mutant responded normally to osmotic and salt stress. Expression of CNX2 mutated in amino acids involved in binding the GlcMan9GlcNAc2 glycan failed to complement the cnx1-1 cnx2-2 mutant. The root growth phenotype was Fe-dependent and was associated with root apoplastic Fe accumulation. Two genes involved in Fe-dependent inhibition of primary root growth under Pi deficiency, the ferroxidase LOW PHOSPHATE 1 (LPR1) and P5-type ATPase PLEIOTROPIC DRUG RESISTANCE 2 (PDR2) were epistatic to CNX1/CNX2. Overexpressing PDR2 failed to complement the cnx1-1 cnx2-2 root phenotype. The cnx1-1 cnx2-2 mutant showed no evidence of UPR activation, indicating a limited effect on ER protein folding. CNX might process a set of N-glycosylated proteins specifically involved in the response to Pi deficiency.
Collapse
Affiliation(s)
- Jonatan Montpetit
- Department of Plant Molecular Biology, Biophore Building, University of Lausanne, 1015 Lausanne, Switzerland
| | - Joaquín Clúa
- Department of Plant Molecular Biology, Biophore Building, University of Lausanne, 1015 Lausanne, Switzerland
| | - Yi-Fang Hsieh
- Department of Plant Molecular Biology, Biophore Building, University of Lausanne, 1015 Lausanne, Switzerland
| | - Evangelia Vogiatzaki
- Department of Plant Molecular Biology, Biophore Building, University of Lausanne, 1015 Lausanne, Switzerland
| | - Jens Müller
- Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, 06120 Halle, Germany
| | - Steffen Abel
- Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, 06120 Halle, Germany
| | - Richard Strasser
- Department of Applied Genetics and Cell Biology, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, A-1190 Vienna, Austria
| | - Yves Poirier
- Department of Plant Molecular Biology, Biophore Building, University of Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|