1
Hernández Berthet AS, Aptekmann AA, Tejero J, Sánchez IE, Noguera ME, Roman EA. Associating protein sequence positions with the modulation of quantitative phenotypes. Arch Biochem Biophys 2024; 755:109979. [PMID: 38583654 DOI: 10.1016/j.abb.2024.109979] [Received: 12/13/2023] [Revised: 03/11/2024] [Accepted: 03/27/2024] [Indexed: 04/09/2024]
Abstract
Although protein sequences encode the information for folding and function, understanding the link between sequence and these properties is not an easy task. Unfortunately, predicting how specific amino acids contribute to these features remains difficult. Here, we developed a simple algorithm that finds positions in a protein sequence with the potential to modulate the studied quantitative phenotypes. From a few hundred protein sequences, we perform multiple sequence alignments, obtain the per-position pairwise differences for both the sequence and the observed phenotypes, and calculate the correlation between these two quantities. We tested our methodology on four cases: archaeal adenylate kinases and the organisms' optimal growth temperatures, microbial rhodopsins and their maximal absorption wavelengths, mammalian myoglobins and their muscular concentration, and inhibition of HIV protease clinical isolates by two different molecules. We found from 3 to 10 positions tightly associated with those phenotypes, depending on the studied case. We showed that these correlations appear using individual positions, but an improvement is achieved when the most correlated positions are jointly analyzed. Notably, we performed phenotype predictions using a simple linear model that links per-position divergences and differences in the observed phenotypes. Predictions are comparable to state-of-the-art methodologies which, in most cases, are far more complex. All of the calculations are obtained at a very low information cost, since the only input needed is a multiple sequence alignment of protein sequences with their associated quantitative phenotypes. The diversity of the explored systems makes our work a valuable tool to find sequence determinants of biological activity modulation and to predict various functional features for uncharacterized members of a protein family.
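The correlation step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name, the toy alignment, and the phenotype values are hypothetical, and the per-position difference is assumed to be a simple mismatch indicator (1 if two residues differ, 0 otherwise), whereas the paper may use a different distance.

```python
from itertools import combinations

import numpy as np


def position_phenotype_correlations(alignment, phenotypes):
    """Return one Pearson correlation per alignment column.

    alignment  : list of equal-length aligned sequences (strings)
    phenotypes : one numeric phenotype value per sequence
    """
    pairs = list(combinations(range(len(alignment)), 2))
    # Absolute phenotype difference for every pair of sequences.
    pheno_diff = np.array([abs(phenotypes[i] - phenotypes[j]) for i, j in pairs])
    n_cols = len(alignment[0])
    scores = np.full(n_cols, np.nan)
    for col in range(n_cols):
        # Assumed per-position difference: 1 if the residues differ, else 0.
        seq_diff = np.array(
            [float(alignment[i][col] != alignment[j][col]) for i, j in pairs]
        )
        # Correlation is undefined for constant columns; leave those as NaN.
        if seq_diff.std() > 0 and pheno_diff.std() > 0:
            scores[col] = np.corrcoef(seq_diff, pheno_diff)[0, 1]
    return scores


# Toy data (hypothetical): column 1 tracks the phenotype, column 2 does not.
toy_alignment = ["AKL", "AKV", "ARL", "ARV"]
toy_phenotypes = [10.0, 11.0, 30.0, 31.0]
scores = position_phenotype_correlations(toy_alignment, toy_phenotypes)
best_column = int(np.nanargmax(scores))
```

In this toy case the fully conserved column 0 yields no correlation, and the column whose substitutions coincide with large phenotype differences ranks highest. A real analysis would start from a multiple sequence alignment of a few hundred sequences, as the abstract states.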
Affiliation(s)
- Ayelén S Hernández Berthet
  - Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina.
- Ariel A Aptekmann
  - Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina; Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08873, USA; Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ, 08901, USA.
- Jesús Tejero
  - Heart, Lung, Blood and Vascular Medicine Institute, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA; Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
- Ignacio E Sánchez
  - Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina.
- Martín E Noguera
  - Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina; Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina.
- Ernesto A Roman
  - Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina.
2
Sackerson C, Garcia V, Medina N, Maldonado J, Daly J, Cartwright R. Comparative analysis of the myoglobin gene in whales and humans reveals evolutionary changes in regulatory elements and expression levels. PLoS One 2023; 18:e0284834. [PMID: 37643191 PMCID: PMC10464968 DOI: 10.1371/journal.pone.0284834] [Received: 04/07/2023] [Accepted: 08/15/2023] [Indexed: 08/31/2023] Open
Abstract
Cetacea and other diving mammals have undergone numerous adaptations to their aquatic environment, among them high levels of the oxygen-carrying intracellular hemoprotein myoglobin in skeletal muscles. Hypotheses regarding the mechanisms leading to these high myoglobin levels often invoke the induction of gene expression by exercise, hypoxia, and other physiological gene regulatory pathways. Here we explore an alternative hypothesis: that cetacean myoglobin genes have evolved high levels of transcription driven by the intrinsic developmental mechanisms that drive muscle cell differentiation. We have used luciferase assays in differentiated C2C12 cells to test this hypothesis. Contrary to our hypothesis, we find that the myoglobin gene from the minke whale, Balaenoptera acutorostrata, shows a low level of expression, only about 8% that of humans. This low expression level is broadly shared among cetaceans and artiodactylans. Previous work on regulation of the human gene has identified a core muscle-specific enhancer comprised of two regions, the "AT element" and a C-rich sequence 5' of the AT element termed the "CCAC-box". Analysis of the minke whale gene supports the importance of the AT element, but the minke whale CCAC-box ortholog has little effect. Instead, critical positive input has been identified in a G-rich region 3' of the AT element. Also, a conserved E-box in exon 1 positively affects expression, despite having been assigned a repressive role in the human gene. Last, a novel region 5' of the core enhancer has been identified, which we hypothesize may function as a boundary element. These results illustrate regulatory flexibility during evolution. We discuss the possibility that low transcription levels are actually beneficial, and that evolution of the myoglobin protein toward enhanced stability is a critical factor in the accumulation of high myoglobin levels in adult cetacean muscle tissue.
Affiliation(s)
- Charles Sackerson
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- Vivian Garcia
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- Nicole Medina
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- Jessica Maldonado
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- John Daly
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- Rachel Cartwright
  - Biology Department, California State University Channel Islands, Camarillo, California, United States of America
  - The Keiki Kohola Project, Lahaina, Hawaii, United States of America
3
AlResaini S, Malik A, Alonazi M, Alhomida A, Khan JM. SDS induces amorphous, amyloid-fibril, and alpha-helical structures in the myoglobin in a concentration-dependent manner. Int J Biol Macromol 2023; 231:123237. [PMID: 36639087 DOI: 10.1016/j.ijbiomac.2023.123237] [Received: 11/14/2022] [Revised: 01/05/2023] [Accepted: 01/09/2023] [Indexed: 01/12/2023]
Abstract
Amyloid fibrils have been linked to a number of diseases. Surfactants imitate plasma membrane lipids and induce amyloid fibrils. This study examined the effects of the anionic surfactant sodium dodecyl sulfate (SDS) at pH 4.5 on equine skeletal muscle myoglobin (E-Mb). To analyze the effect of SDS on aggregation and amyloid-fibril formation of E-Mb, we used various spectroscopic (turbidity, light scattering, intrinsic fluorescence, ThT fluorescence, and circular dichroism (CD)), electrophoretic, and microscopic techniques. Turbidity, SDS-PAGE, and light scattering all indicated the formation of E-Mb aggregates at SDS concentrations ranging from 0.2 mM to 1.0 mM. In the presence of 0.4 mM SDS, far-UV CD and TEM data indicate that E-Mb forms amorphous aggregates. ThT binding, far-UV CD, and TEM findings indicate that E-Mb forms amyloid-like structures in the presence of 0.6-1.0 mM SDS. However, no aggregation was seen at SDS concentrations above 1 mM. In the presence of high SDS concentrations (> 1 mM), E-Mb exhibited native-like α-helical structure. As a result, SDS exhibited three distinct behaviors: inducing amorphous aggregates, inducing amyloid fibrils, and inducing helical structure. These findings also shed light on how amyloid fibrils form when anionic surfactants are introduced.
Affiliation(s)
- Sundus AlResaini
  - Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
- Ajamaluddin Malik
  - Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia.
- Mona Alonazi
  - Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
- Abdullah Alhomida
  - Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
- Javed Masood Khan
  - Department of Food and Nutrition, Facility of Food and Agriculture Science, King Saud University, Riyadh, Saudi Arabia
4
Queiroz JPF, Lourenzoni MR, Rocha BAM. Structural evolution of an amphibian-specific globin: A computational evolutionary biochemistry approach. Comparative Biochemistry and Physiology. Part D, Genomics & Proteomics 2023; 45:101055. [PMID: 36566682 DOI: 10.1016/j.cbd.2022.101055] [Received: 07/29/2022] [Revised: 12/14/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022]
Abstract
Studies on the globin family are continuously revealing insights into the mechanisms of gene and protein evolution. The rise of a new globin gene type in Pelobatoidea and Neobatrachia (Amphibia:Anura) from an α-globin precursor provides the opportunity to investigate the genetic and physical mechanisms underlying the origin of new protein structural and functional properties. This amphibian-specific globin (globin A/GbA), discovered in the heart of Rana catesbeiana, is a monomer. As the ancestral oligomeric state of α-globins is a homodimer, we inferred that the ancestral state was lost somewhere in the GbA lineage. Here, we combined computational molecular evolution with structural bioinformatics to determine the extent to which the loss of the homodimeric state is pervasive in the GbA clade. We also characterized the loci of GbA genes in Bufo bufo. We found two GbA clades in Neobatrachia. One was deleted in Ranidae, but retained and expanded to yield a new globin cluster in Bufonidae species. Loss of the ancestral oligomeric state seems to be pervasive in the GbA clade. However, a taxonomic sampling that includes more Pelobatoidea, as well as early Neobatrachia, lineages would be necessary to determine the oligomeric state of the last common ancestor of all GbA. The evidence presented here points to a possible loss of oligomerization in Pelobatoidea GbA as a result of amino acid substitutions that weaken the homodimeric state. In contrast, the loss of oligomerization in both Neobatrachia GbA clades was linked to independent deletions that disrupted many packing contacts at the homodimer interface.
Affiliation(s)
- João Pedro Fernandes Queiroz
  - Laboratorio de Biocristalografia - LABIC, Departamento de Bioquimica e Biologia Molecular, Universidade Federal do Ceara, Campus do Pici s.n., bloco 907, Av. Mister Hull, Fortaleza, Ceara, 60440-970, Brazil.
- Marcos Roberto Lourenzoni
  - Protein Engineering and Health Solutions Group - GEPeSS Fundacao Oswaldo Cruz - Ceara, Eusébio, Ceara, 60175-047, Brazil.
- Bruno Anderson Matias Rocha
  - Laboratorio de Biocristalografia - LABIC, Departamento de Bioquimica e Biologia Molecular, Universidade Federal do Ceara, Campus do Pici s.n., bloco 907, Av. Mister Hull, Fortaleza, Ceara, 60440-970, Brazil.
5
Role of Nitric Oxide-Derived Metabolites in Reactions of Methylglyoxal with Lysine and Lysine-Rich Protein Leghemoglobin. Int J Mol Sci 2022; 24:ijms24010168. [PMID: 36613614 PMCID: PMC9820652 DOI: 10.3390/ijms24010168] [Received: 11/16/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 12/24/2022] Open
Abstract
Carbonyl stress occurs when reactive carbonyl compounds (RCC), such as reducing sugars, dicarbonyls, etc., accumulate in the organism. The interaction of RCC carbonyl groups with amino groups of molecules is called the Maillard reaction. One of the most active RCCs is the α-dicarbonyl methylglyoxal (MG), which modifies biomolecules, forming non-enzymatic glycation products. Organic free radicals are formed in the reaction between MG and lysine or Nα-acetyllysine. S-nitrosothiols and the nitric oxide (•NO) donor PAPA NONOate increased the yield of organic free radical intermediates, while other •NO-derived metabolites, namely, nitroxyl anion and dinitrosyl iron complexes (DNICs), decreased it. At the late stages of the Maillard reaction, S-nitrosoglutathione (GSNO) also inhibited the formation of glycation end products (AGEs). The formation of a new type of DNICs, bound with Maillard reaction products, was found. The results obtained were used to explain the glycation features of the legume hemoglobin leghemoglobin (Lb), which is a lysine-rich protein. In Lb, lysine residues can form fluorescent cross-linked AGEs, and •NO-derived metabolites slow down their formation. The knowledge of these processes can be used to increase the stability of Lb. It can help in better understanding the impact of stress factors on legume plants and contribute to the production of recombinant Lb for biotechnology.
6
Isogai Y, Imamura H, Sumi T, Shirai T. Improvement of Protein Solubility in Macromolecular Crowding during Myoglobin Evolution. Biochemistry 2022; 61:1543-1547. [PMID: 35674519 DOI: 10.1021/acs.biochem.2c00166] [Indexed: 11/30/2022]
Abstract
The inside of living cells is crowded by extremely high concentrations of biomolecules, and thus globular proteins should have evolved to increase their solubility under such crowding conditions during organic evolution. The O₂-storage protein myoglobin (Mb) is known to be expressed in myocytes of diving mammals in much larger quantities than those of land mammals. We have previously resurrected ancient whale and pinniped Mbs and experimentally demonstrated that the diving animal Mbs have evolved to maintain high solubility under the crowding conditions or to increase their tolerance against macromolecular precipitants, rather than solubility in a dilute buffer solution. However, the detailed chemical mechanisms of the precipitant tolerance remain unclear. Here, we investigated pH dependence of the precipitant tolerance ($\beta$, slope of the solubility against precipitant concentration) of extant Mbs and plotted the $\beta$ values, as well as those of ancestral Mbs, against their surface net charges ($Z_{\mathrm{Mb}}$). The results demonstrated that the precipitant tolerance was approximated by the square of $Z_{\mathrm{Mb}}$, that is, $\beta = aZ_{\mathrm{Mb}}^{2} + b$, in which $a$ and $b$ are constants. This effect of $Z_{\mathrm{Mb}}$ against the precipitation is not predicted by a classical excluded volume theory that gives constant $\beta$ for Mbs but can be explained by electrostatic repulsion between Mb molecules. The present study elucidates how Mb molecules have evolved to increase their in vivo solubility and shows the physiological significance of either neutral or basic isoelectric points (pI) of the natural Mbs, rather than acidic pI.
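Because the relation reported in this abstract is linear in the squared net charge, the constants a and b can in principle be estimated by ordinary least squares. The sketch below is only an illustration of that idea: the function name is hypothetical, the numbers are synthetic values generated from assumed constants (a = 0.5, b = -2.0), and none of it is data or code from the study.

```python
import numpy as np


def fit_quadratic_charge_model(net_charges, tolerances):
    """Least-squares estimate of a and b in beta = a * Z**2 + b."""
    z_squared = np.asarray(net_charges, dtype=float) ** 2
    # The model is linear in Z**2, so a degree-1 polynomial fit is enough.
    a, b = np.polyfit(z_squared, np.asarray(tolerances, dtype=float), 1)
    return a, b


# Synthetic values generated from a = 0.5, b = -2.0 (not data from the study).
z_values = [-3.0, -1.0, 0.0, 2.0, 4.0]
beta_values = [0.5 * z**2 - 2.0 for z in z_values]
a_fit, b_fit = fit_quadratic_charge_model(z_values, beta_values)
```

With noise-free synthetic input the fit recovers the assumed constants. Note that the model is symmetric in the sign of the net charge, so positive and negative charges of equal magnitude give the same predicted tolerance.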
Affiliation(s)
- Yasuhiro Isogai
  - Department of Pharmaceutical Engineering, Toyama Prefectural University, Imizu, Toyama 939-0398, Japan
- Hiroshi Imamura
  - Department of Applied Chemistry, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan
- Tomonari Sumi
  - Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-Naka, Kita-ku, Okayama 700-8530, Japan
- Tsuyoshi Shirai
  - Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga 526-0829, Japan