1
|
Caetano-Anollés G, Aziz MF, Mughal F, Caetano-Anollés D. Tracing protein and proteome history with chronologies and networks: folding recapitulates evolution. Expert Rev Proteomics 2021; 18:863-880. [PMID: 34628994 DOI: 10.1080/14789450.2021.1992277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
INTRODUCTION While the origin and evolution of proteins remain mysterious, advances in evolutionary genomics and systems biology are facilitating the historical exploration of the structure, function and organization of proteins and proteomes. Molecular chronologies are series of time events describing the history of biological systems and subsystems and the rise of biological innovations. Together with time-varying networks, these chronologies provide a window into the past. AREAS COVERED Here, we review molecular chronologies and networks built with modern methods of phylogeny reconstruction. We discuss how chronologies of structural domain families uncover the explosive emergence of metabolism, the late rise of translation, the co-evolution of ribosomal proteins and rRNA, and the late development of the ribosomal exit tunnel; events that coincided with a tendency to shorten folding time. Evolving networks described the early emergence of domains and a late 'big bang' of domain combinations. EXPERT OPINION Two processes, folding and recruitment appear central to the evolutionary progression. The former increases protein persistence. The later fosters diversity. Chronologically, protein evolution mirrors folding by combining supersecondary structures into domains, developing translation machinery to facilitate folding speed and stability, and enhancing structural complexity by establishing long-distance interactions in novel structural and architectural designs.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA.,C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Derek Caetano-Anollés
- Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| |
Collapse
|
2
|
Chu XY, Zhang HY. Cofactors as Molecular Fossils To Trace the Origin and Evolution of Proteins. Chembiochem 2020; 21:3161-3168. [PMID: 32515532 DOI: 10.1002/cbic.202000027] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 06/03/2020] [Indexed: 12/16/2022]
Abstract
Due to their early origin and extreme conservation, cofactors are valuable molecular fossils for tracing the origin and evolution of proteins. First, as the order of protein folds binding with cofactors roughly coincides with protein-fold chronology, cofactors are considered to have facilitated the origin of primitive proteins by selecting them from pools of random amino acid sequences. Second, in the subsequent evolution of proteins, cofactors still played an important role. More interestingly, as metallic cofactors evolved with geochemical variations, some geochemical events left imprints in the chronology of protein architecture; this provides further evidence supporting the coevolution of biochemistry and geochemistry. In this paper, we attempt to review the molecular fossils used in tracing the origin and evolution of proteins, with a special focus on cofactors.
Collapse
Affiliation(s)
- Xin-Yi Chu
- Hubei Key Laboratory of Agricultural Bioinformatics College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Hong-Yu Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
3
|
An evolutionary path to altered cofactor specificity in a metalloenzyme. Nat Commun 2020; 11:2738. [PMID: 32483131 PMCID: PMC7264356 DOI: 10.1038/s41467-020-16478-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 04/29/2020] [Indexed: 11/28/2022] Open
Abstract
Almost half of all enzymes utilize a metal cofactor. However, the features that dictate the metal utilized by metalloenzymes are poorly understood, limiting our ability to manipulate these enzymes for industrial and health-associated applications. The ubiquitous iron/manganese superoxide dismutase (SOD) family exemplifies this deficit, as the specific metal used by any family member cannot be predicted. Biochemical, structural and paramagnetic analysis of two evolutionarily related SODs with different metal specificity produced by the pathogenic bacterium Staphylococcus aureus identifies two positions that control metal specificity. These residues make no direct contacts with the metal-coordinating ligands but control the metal’s redox properties, demonstrating that subtle architectural changes can dramatically alter metal utilization. Introducing these mutations into S. aureus alters the ability of the bacterium to resist superoxide stress when metal starved by the host, revealing that small changes in metal-dependent activity can drive the evolution of metalloenzymes with new cofactor specificity. Many metalloenzymes are highly specific for their cognate metal ion but the molecular principles underlying this specificity often remain unclear. Here, the authors characterize the structural and biochemical basis for the different metal specificity of two evolutionarily related superoxide dismutases.
Collapse
|
4
|
Abstract
The rise of animal life is temporally related to the increased availability of oxygen in the hydrosphere and atmosphere during the Neoproterozoic. However, the earliest metazoans probably needed relatively low oxygen concentrations, suggesting additional environmental and/or biochemical developments were involved. Copper was required in the exploitation of oxygen by the evolving animals, through the development of respiratory proteins and the extracellular matrix required for structural support. We synthesize global data demonstrating a marked enrichment of copper in the Earth’s crust that coincided with the biological use of oxygen, and this new biological use of copper. The copper enrichment was likely recycled into the surface environment by weathering of basalt and other magmatic rocks, at copper liberation rates up to 300 times that of typical granitic terrain. The weathering of basalts also triggered the Sturtian glaciation, which accelerated erosion. We postulate that the coincidence of a high availability of copper, along with increased oxygen levels, for the first time during the Neoproterozoic supported the critical advances of respiration and structural support in evolving animals.
Collapse
|
5
|
Ahmad A, Burns CS, Fink AL, Uversky VN. Peculiarities of copper binding to alpha-synuclein. J Biomol Struct Dyn 2016; 29:825-42. [PMID: 22208282 DOI: 10.1080/073911012010525023] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Heavy metals have been implicated as the causative agents for the pathogenesis of the most prevalent neurodegenerative disease. Various mechanisms have been proposed to explain the toxic effects of metals ranging from metal-induced oxidation of protein to metal-induced changes in the protein conformation. Aggregation of a-synuclein is implicated in Parkinson's disease (PD), and various metals, including copper, constitute a prominent group of alpha-synuclein aggregation enhancers. In this study, we have systematically characterized the a-synuclein-Cu21 binding sites and analyzed the possible role of metal binding in a-synuclein fibrillation using a set of biophysical techniques, such as electron paramagnetic resonance (EPR), electron spin-echo envelope modulation (ESEEM), circular dichroism (CD), and size exclusion chromatography (SEC). Our analyses indicated that a-synuclein possesses at least two binding sites for Cu21. We have been able to locate one of the binding sites in the N-terminal region. Furthermore, based on the EPR studies of model peptides and Beta-synuclein, we concluded that the suspected His residue did not appear to participate in strong Cu21 binding.
Collapse
Affiliation(s)
- Atta Ahmad
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, California, USA.
| | | | | | | |
Collapse
|
6
|
Mannige RV. Origination of the Protein Fold Repertoire from Oily Pluripotent Peptides. Proteomes 2014; 2:154-168. [PMID: 28250375 PMCID: PMC5302733 DOI: 10.3390/proteomes2020154] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 02/27/2014] [Accepted: 03/20/2014] [Indexed: 11/16/2022] Open
Abstract
While the repertoire of protein folds that exists today underlies most of life’s capabilities, our mechanistic picture of protein fold origination is incomplete. This paper discusses a hypothetical mechanism for the emergence of the protein fold repertoire from highly dynamic and collapsed peptides, exemplified by peptides with high oil content or hydrophobicity. These peptides are called pluripotent to emphasize their capacity to evolve into numerous folds transiently available to them. As evidence, the paper will discuss previous simulation work on the superior fold evolvability of oily peptides, trace (“fossil”) evidence within proteomes seen today, and a general relationship between protein dynamism and evolvability. Aside from implications on the origination of protein folds, the hypothesis implies that the vanishing utility of a random peptide in protein origination may be relatively exaggerated, as some random peptides with a certain composition (e.g., oily) may fare better than others. In later sections, the hypothesis is discussed in the context of existing discussions regarding the spontaneous origination of biomolecules.
Collapse
Affiliation(s)
- Ranjan V Mannige
- Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720,USA.
| |
Collapse
|
7
|
Abrusán G, Zhang Y, Szilágyi A. Structure prediction and analysis of DNA transposon and LINE retrotransposon proteins. J Biol Chem 2013; 288:16127-38. [PMID: 23530042 PMCID: PMC3668768 DOI: 10.1074/jbc.m113.451500] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 03/21/2013] [Indexed: 01/15/2023] Open
Abstract
Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology.
Collapse
Affiliation(s)
- György Abrusán
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, 6701 Szeged, Hungary.
| | | | | |
Collapse
|
8
|
Nitschke W, McGlynn SE, Milner-White EJ, Russell MJ. On the antiquity of metalloenzymes and their substrates in bioenergetics. BIOCHIMICA ET BIOPHYSICA ACTA-BIOENERGETICS 2013; 1827:871-81. [PMID: 23454059 DOI: 10.1016/j.bbabio.2013.02.008] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2012] [Revised: 02/15/2013] [Accepted: 02/18/2013] [Indexed: 02/07/2023]
Abstract
Many metalloenzymes that inject and extract reducing equivalents at the beginning and the end of electron transport chains involved in chemiosmosis are suggested, through phylogenetic analysis, to have been present in the Last Universal Common Ancestor (LUCA). Their active centres are affine with the structures of minerals presumed to contribute to precipitate membranes produced on the mixing of hydrothermal solutions with the Hadean Ocean ~4 billion years ago. These mineral precipitates consist of transition element sulphides and oxides such as nickelian mackinawite ([Fe>Ni]2S2), a nickel-bearing greigite (~FeSS[Fe3NiS4]SSFe), violarite (~NiSS[Fe2Ni2S4]SSNi), a molybdenum bearing complex (~Mo(IV/VI)2Fe3S(0/2-)9) and green rust or fougerite (~[Fe(II)Fe(III)(OH)4](+)[OH](-)). They may be respectively compared with the active centres of Ni-Fe hydrogenase, carbon monoxide dehydrogenase (CODH), acetyl coenzyme-A synthase (ACS), the complex iron-sulphur molybdoenzyme (CISM) superfamily and methane monooxygenase (MMO). With the look of good catalysts - a suggestion that gathers some support from prebiotic hydrothermal experimentation - and sequestered by short peptides, they could be thought of as the original building blocks of proto-enzyme active centres. This convergence of the makeup of the LUCA-metalloenzymes with mineral structure and composition of hydrothermal precipitates adds credence to the alkaline hydrothermal (chemiosmotic) theory for the emergence of life, specifically to the possibility that the first metabolic pathway - the acetyl CoA pathway - was initially driven from either end, reductively from CO2 to CO and oxidatively and reductively from CH4 through to a methane thiol group, the two entities assembled with the help of a further thiol on a violarite cluster sequestered by peptides. By contrast, the organic coenzymes were entirely a product of the first metabolic pathways. This article is part of a Special Issue entitled: Metals in Bioenergetics and Biomimetics Systems.
Collapse
Affiliation(s)
- Wolfgang Nitschke
- Laboratoire de Bioénergétique et Ingénierie des Protéines, Marseille Cedex 20, France
| | | | | | | |
Collapse
|
9
|
Ilbert M, Bonnefoy V. Insight into the evolution of the iron oxidation pathways. BIOCHIMICA ET BIOPHYSICA ACTA-BIOENERGETICS 2012; 1827:161-75. [PMID: 23044392 DOI: 10.1016/j.bbabio.2012.10.001] [Citation(s) in RCA: 192] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2012] [Revised: 09/27/2012] [Accepted: 10/01/2012] [Indexed: 01/01/2023]
Abstract
Iron is a ubiquitous element in the universe. Ferrous iron (Fe(II)) was abundant in the primordial ocean until the oxygenation of the Earth's atmosphere led to its widespread oxidation and precipitation. This change of iron bioavailability likely put selective pressure on the evolution of life. This element is essential to most extant life forms and is an important cofactor in many redox-active proteins involved in a number of vital pathways. In addition, iron plays a central role in many environments as an energy source for some microorganisms. This review is focused on Fe(II) oxidation. The fact that the ability to oxidize Fe(II) is widely distributed in Bacteria and Archaea and in a number of quite different biotopes suggests that the dissimilatory Fe(II) oxidation is an ancient energy metabolism. Based on what is known today about Fe(II) oxidation pathways, we propose that they arose independently more than once in evolution and evolved convergently. The iron paleochemistry, the phylogeny, the physiology of the iron oxidizers, and the nature of the cofactors of the redox proteins involved in these pathways suggest a possible scenario for the timescale in which each type of Fe(II) oxidation pathways evolved. The nitrate dependent anoxic iron oxidizers are likely the most ancient iron oxidizers. We suggest that the phototrophic anoxic iron oxidizers arose in surface waters after the Archaea/Bacteria-split but before the Great Oxidation Event. The neutrophilic oxic iron oxidizers possibly appeared in microaerobic marine environments prior to the Great Oxidation Event while the acidophilic ones emerged likely after the advent of atmospheric O(2). This article is part of a Special Issue entitled: The evolutionary aspects of bioenergetic systems.
Collapse
Affiliation(s)
- Marianne Ilbert
- Aix-Marseille Université, CNRS, BIP UMR7281,13009, Marseille, France.
| | | |
Collapse
|
10
|
Kim KM, Qin T, Jiang YY, Chen LL, Xiong M, Caetano-Anollés D, Zhang HY, Caetano-Anollés G. Protein domain structure uncovers the origin of aerobic metabolism and the rise of planetary oxygen. Structure 2012; 20:67-76. [PMID: 22244756 DOI: 10.1016/j.str.2011.11.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Revised: 11/06/2011] [Accepted: 11/08/2011] [Indexed: 10/14/2022]
Abstract
The origin and evolution of modern biochemistry remain a mystery despite advances in evolutionary bioinformatics. Here, we use a structural census in nearly 1,000 genomes and a molecular clock of folds to define a timeline of appearance of protein families linked to single-domain enzymes. The timeline sorts out enzymatic recruitment, validates patterns in metabolic history, and reveals that the most ancient reaction of aerobic metabolism involved the synthesis of pyridoxal 5'-phosphate or pyridoxal and appeared 2.9 Gyr ago. The oxygen source for this primordial reaction was probably Mn catalase, which appeared at the same time and could have generated oxygen as a side product of hydrogen peroxide detoxification. Finally, evolutionary analysis of transferred groups and metabolite fragments revealed that oxidized sulfur did not participate in metabolism until the rise of oxygen. The evolutionary patterns we uncover in molecules and chemistries provide strong support for the coevolution of biochemistry and geochemistry.
Collapse
Affiliation(s)
- Kyung Mo Kim
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Wang M, Jiang YY, Kim KM, Qu G, Ji HF, Mittenthal JE, Zhang HY, Caetano-Anollés G. A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 2010; 28:567-82. [PMID: 20805191 DOI: 10.1093/molbev/msq232] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The standard molecular clock describes a constant rate of molecular evolution and provides a powerful framework for evolutionary timescales. Here, we describe the existence and implications of a molecular clock of folds, a universal recurrence in the discovery of new structures in the world of proteins. Using a phylogenomic structural census in hundreds of proteomes, we build phylogenies and time lines of domains at fold and fold superfamily levels of structural complexity. These time lines correlate approximately linearly with geological timescales and were here used to date two crucial events in life history, planet oxygenation and organism diversification. We first dissected the structures and functions of enzymes in simulated metabolic networks. The placement of anaerobic and aerobic enzymes in the time line revealed that aerobic metabolism emerged about 2.9 billion years (giga-annum; Ga) ago and expanded during a period of about 400 My, reaching what is known as the Great Oxidation Event. During this period, enzymes recruited old and new folds for oxygen-mediated enzymatic activities. Remarkably, the first fold lost by a superkingdom disappeared in Archaea 2.6 Ga ago, within the span of oxygen rise, suggesting that oxygen also triggered diversification of life. The implications of a molecular clock of folds are many and important for the neutral theory of molecular evolution and for understanding the growth and diversity of the protein world. The clock also extends the standard concept that was specific to molecules and their timescales and turns it into a universal timescale-generating tool.
Collapse
Affiliation(s)
- Minglei Wang
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana-Champaign, USA
| | | | | | | | | | | | | | | |
Collapse
|
12
|
History of biological metal utilization inferred through phylogenomic analysis of protein structures. Proc Natl Acad Sci U S A 2010; 107:10567-72. [PMID: 20498051 DOI: 10.1073/pnas.0912491107] [Citation(s) in RCA: 184] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The fundamental chemistry of trace elements dictates the molecular speciation and reactivity both within cells and the environment at large. Using protein structure and comparative genomics, we elucidate several major influences this chemistry has had upon biology. All of life exhibits the same proteome size-dependent scaling for the number of metal-binding proteins within a proteome. This fundamental evolutionary constant shows that the selection of one element occurs at the exclusion of another, with the eschewal of Fe for Zn and Ca being a defining feature of eukaryotic proteomes. Early life lacked both the structures required to control intracellular metal concentrations and the metal-binding proteins that catalyze electron transport and redox transformations. The development of protein structures for metal homeostasis coincided with the emergence of metal-specific structures, which predominantly bound metals abundant in the Archean ocean. Potentially, this promoted the diversification of emerging lineages of Archaea and Bacteria through the establishment of biogeochemical cycles. In contrast, structures binding Cu and Zn evolved much later, providing further evidence that environmental availability influenced the selection of the elements. The late evolving Zn-binding proteins are fundamental to eukaryotic cellular biology, and Zn bioavailability may have been a limiting factor in eukaryotic evolution. The results presented here provide an evolutionary timeline based on genomic characteristics, and key hypotheses can be tested by alternative geochemical methods.
Collapse
|
13
|
Kim KM, Caetano-Anollés G. Emergence and evolution of modern molecular functions inferred from phylogenomic analysis of ontological data. Mol Biol Evol 2010; 27:1710-33. [PMID: 20418223 DOI: 10.1093/molbev/msq106] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The biological processes that characterize the phenotypes of a living system are embodied in the function of molecules and hold the key to evolutionary history, delimiting natural selection and change. These processes and functions provide direct insight into the emergence, development, and organization of cellular life. However, detailed molecular functions make up a network-like hierarchy of relationships that tells little of evolutionary links between structure and function in biology. For example, Gene Ontology terms represent widely-used vocabularies of processes and functions with evolutionary relationships that are implicit but not defined. Here, we uncover patterns of global evolutionary history in ontological terms associated with the sequence of 38 genomes. These patterns unfold the metabolic origins of modern molecular functions and major biological transitions in evolution toward complex life. Phylogenies reveal the primordial appearance of hydrolases and transferases, with ATPase, GTPase, and helicase activities being the most ancient. This indicates that ancient catalysts were crucial for binding and transport, the emergence of nucleic acids and protein biopolymers, and the communication of primordial cells with the environment. Finally, the history of biological processes showed that cellular biopolymer metabolic processes preceded biopolymer biosynthesis and essential processes related to macromolecular formation, directly challenging the existence of an RNA world. Phylogenomic systematization of biological function takes the structure and function paradigm to a completely new level of abstraction, demonstrating a "metabolic first" origin of life. The approach uncovers patterns in the morphing of function that are unprecedented and necessary for systematic views in biology.
Collapse
Affiliation(s)
- Kyung Mo Kim
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, IL, USA
| | | |
Collapse
|
14
|
Biotech news and views. Biotechnol J 2009. [DOI: 10.1002/biot.200990078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|