51
|
Groussin M, Hobbs JK, Szöllősi GJ, Gribaldo S, Arcus VL, Gouy M. Toward more accurate ancestral protein genotype-phenotype reconstructions with the use of species tree-aware gene trees. Mol Biol Evol 2014; 32:13-22. [PMID: 25371435 PMCID: PMC4271536 DOI: 10.1093/molbev/msu305] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The resurrection of ancestral proteins provides direct insight into how natural selection has shaped proteins found in nature. By tracing substitutions along a gene phylogeny, ancestral proteins can be reconstructed in silico and subsequently synthesized in vitro. This elegant strategy reveals the complex mechanisms responsible for the evolution of protein functions and structures. However, to date, all protein resurrection studies have used simplistic approaches for ancestral sequence reconstruction (ASR), including the assumption that a single sequence alignment alone is sufficient to accurately reconstruct the history of the gene family. The impact of such shortcuts on conclusions about ancestral functions has not been investigated. Here, we show with simulations that utilizing information on species history using a model that accounts for the duplication, horizontal transfer, and loss (DTL) of genes statistically increases ASR accuracy. This underscores the importance of the tree topology in the inference of putative ancestors. We validate our in silico predictions using in vitro resurrection of the LeuB enzyme for the ancestor of the Firmicutes, a major and ancient bacterial phylum. With this particular protein, our experimental results demonstrate that information on the species phylogeny results in a biochemically more realistic and kinetically more stable ancestral protein. Additional resurrection experiments with different proteins are necessary to statistically quantify the impact of using species tree-aware gene trees on ancestral protein phenotypes. Nonetheless, our results suggest the need for incorporating both sequence and DTL information in future studies of protein resurrections to accurately define the genotype-phenotype space in which proteins diversify.
Collapse
Affiliation(s)
- Mathieu Groussin
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France
| | - Joanne K Hobbs
- Department of Biological Sciences, University of Waikato, Hamilton, New Zealand
| | - Gergely J Szöllősi
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France ELTE-MTA "Lendület" Biophysics Research Group, Pázmány, Budapest, Hungary
| | - Simonetta Gribaldo
- Unité de Biologie Moléculaire du Gène chez les Extrêmophiles, Département de Microbiologie, Institut Pasteur, Paris cedex, France
| | - Vickery L Arcus
- Department of Biological Sciences, University of Waikato, Hamilton, New Zealand
| | - Manolo Gouy
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France
| |
Collapse
|
52
|
Scutt CP, Vandenbussche M. Current trends and future directions in flower development research. ANNALS OF BOTANY 2014; 114:1399-406. [PMID: 25335868 PMCID: PMC4204790 DOI: 10.1093/aob/mcu224] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Accepted: 09/24/2014] [Indexed: 05/05/2023]
Abstract
Flowers, the reproductive structures of the approximately 400 000 extant species of flowering plants, exist in a tremendous range of forms and sizes, mainly due to developmental differences involving the number, arrangement, size and form of the floral organs of which they consist. However, this tremendous diversity is underpinned by a surprisingly robust basic floral structure in which a central group of carpels forms on an axis of determinate growth, almost invariably surrounded by two successive zones containing stamens and perianth organs, respectively. Over the last 25 years, remarkable progress has been achieved in describing the molecular mechanisms that control almost all aspects of flower development, from the phase change that initiates flowering to the final production of fruits and seeds. However, this work has been performed almost exclusively in a small number of eudicot model species, chief among which is Arabidopsis thaliana. Studies of flower development must now be extended to a much wider phylogenetic range of flowering plants and, indeed, to their closest living relatives, the gymnosperms. Studies of further, more wide-ranging models should provide insights that, for various reasons, cannot be obtained by studying the major existing models alone. The use of further models should also help to explain how the first flowering plants evolved from an unknown, although presumably gymnosperm-like ancestor, and rapidly diversified to become the largest major plant group and to dominate the terrestrial flora. The benefits for society of a thorough understanding of flower development are self-evident, as human life depends to a large extent on flowering plants and on the fruits and seeds they produce. In this preface to the Special Issue, we introduce eleven articles on flower development, representing work in both established and further models, including gymnosperms. We also present some of our own views on current trends and future directions of the flower development field.
Collapse
Affiliation(s)
- Charlie P Scutt
- Laboratoire de Reproduction et Développement des Plantes, (Unité mixte de recherche 5667: CNRS-INRA-Université de Lyon), Ecole Normale Supérieure de Lyon, 46 allée d'Italie, 69364 Lyon Cedex 07, France
| | - Michiel Vandenbussche
- Laboratoire de Reproduction et Développement des Plantes, (Unité mixte de recherche 5667: CNRS-INRA-Université de Lyon), Ecole Normale Supérieure de Lyon, 46 allée d'Italie, 69364 Lyon Cedex 07, France
| |
Collapse
|
53
|
Hone DWE, Faulkes CG. A proposed framework for establishing and evaluating hypotheses about the behaviour of extinct organisms. J Zool (1987) 2014. [DOI: 10.1111/jzo.12114] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- D. W. E. Hone
- School of Biological and Chemical Sciences; Queen Mary University of London; London UK
| | - C. G. Faulkes
- School of Biological and Chemical Sciences; Queen Mary University of London; London UK
| |
Collapse
|
54
|
|
55
|
Reconstructed ancestral Myo-inositol-3-phosphate synthases indicate that ancestors of the Thermococcales and Thermotoga species were more thermophilic than their descendants. PLoS One 2013; 8:e84300. [PMID: 24391933 PMCID: PMC3877268 DOI: 10.1371/journal.pone.0084300] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 11/19/2013] [Indexed: 01/06/2023] Open
Abstract
The bacterial genomes of Thermotoga species show evidence of significant interdomain horizontal gene transfer from the Archaea. Members of this genus acquired many genes from the Thermococcales, which grow at higher temperatures than Thermotoga species. In order to study the functional history of an interdomain horizontally acquired gene we used ancestral sequence reconstruction to examine the thermal characteristics of reconstructed ancestral proteins of the Thermotoga lineage and its archaeal donors. Several ancestral sequence reconstruction methods were used to determine the possible sequences of the ancestral Thermotoga and Archaea myo-inositol-3-phosphate synthase (MIPS). These sequences were predicted to be more thermostable than the extant proteins using an established sequence composition method. We verified these computational predictions by measuring the activities and thermostabilities of purified proteins from the Thermotoga and the Thermococcales species, and eight ancestral reconstructed proteins. We found that the ancestral proteins from both the archaeal donor and the Thermotoga most recent common ancestor recipient were more thermostable than their descendants. We show that there is a correlation between the thermostability of MIPS protein and the optimal growth temperature (OGT) of its host, which suggests that the OGT of the ancestors of these species of Archaea and the Thermotoga grew at higher OGTs than their descendants.
Collapse
|
56
|
Reisinger B, Sperl J, Holinski A, Schmid V, Rajendran C, Carstensen L, Schlee S, Blanquart S, Merkl R, Sterner R. Evidence for the Existence of Elaborate Enzyme Complexes in the Paleoarchean Era. J Am Chem Soc 2013; 136:122-9. [DOI: 10.1021/ja4115677] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Bernd Reisinger
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Josef Sperl
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Alexandra Holinski
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Veronika Schmid
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Chitra Rajendran
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Linn Carstensen
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Sandra Schlee
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Samuel Blanquart
- Equipe
Bonsai,
Institut National de Recherche en Informatique et en Automatique, INRIA Lille Nord Europe, 40 avenue Halley, 59650 Villeneuve d’Ascq, France
| | - Rainer Merkl
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| | - Reinhard Sterner
- Institute
of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93053 Regensburg, Germany
| |
Collapse
|
57
|
Daly TK, Sutherland-Smith AJ, Penny D. In silico resurrection of the major vault protein suggests it is ancestral in modern eukaryotes. Genome Biol Evol 2013; 5:1567-83. [PMID: 23887922 PMCID: PMC3762200 DOI: 10.1093/gbe/evt113] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Vaults are very large oligomeric ribonucleoproteins conserved among a variety of species. The rat vault 3D structure shows an ovoid oligomeric particle, consisting of 78 major vault protein monomers, each of approximately 861 amino acids. Vaults are probably the largest ribonucleoprotein structures in eukaryote cells, being approximately 70 nm in length with a diameter of 40 nm—the size of three ribosomes and with a lumen capacity of 50 million Å3. We use both protein sequences and inferred ancestral sequences for in silico virtual resurrection of tertiary and quaternary structures to search for vaults in a wide variety of eukaryotes. We find that the vault’s phylogenetic distribution is widespread in eukaryotes, but is apparently absent in some notable model organisms. Our conclusion from the distribution of vaults is that they were present in the last eukaryote common ancestor but they have apparently been lost from a number of groups including fungi, insects, and probably plants. Our approach of inferring ancestral 3D and quaternary structures is expected to be useful generally.
Collapse
Affiliation(s)
- Toni K Daly
- Institute of Fundamental Sciences, Massey University, Palmerston North, New Zealand.
| | | | | |
Collapse
|
58
|
Abstract
AbstractEuparkeria capensis has long been considered an archetype for the ancestral archosaur morphology, and has been placed just outside of crown Archosauria by nearly all cladistic analyses. Six species are currently considered to be putative members of a clade Euparkeriidae, and have been collected from Olenekian- or Anisian-aged deposits in South Africa (Euparkeria capensis – the only definitive member of the group), China (Halazhaisuchus qiaoensis, Wangisuchus tzeyii, ‘Turfanosuchus’ shageduensis), Russia (Dorosuchus neoetus) and Poland (Osmolskina czatkowicensis). Four other species (Turfanosuchus dabanensis, Xilousuchus sapingensis, Platyognathus hsui, Dongusia colorata) were historically assigned to Euparkeriidae, but have been removed by recent work. Recent authors deemed Osmolskina czatkowicensis and Dorosuchus neoetus to be the most likely taxa to form a euparkeriid clade with Euparkeria capensis, but Osmolskina czatkowicensis and Euparkeria capensis were not found as sister taxa by the only cladistic analysis to have tested euparkeriid monophyly. Euparkeria capensis was small (<1 m), insectivorous or carnivorous, probably had vision adapted to low-light conditions and a semi-erect crocodile-like stance, and may have been facultatively bipedal. Bone histology demonstrates that Euparkeria capensis had a slow growth rate, which has been suggested to have been an adaptation to relatively stable environmental conditions.
Collapse
Affiliation(s)
- Roland B. Sookias
- GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner-Straße 10, D-80333 Munich, Germany
| | - Richard J. Butler
- GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner-Straße 10, D-80333 Munich, Germany
| |
Collapse
|
59
|
Ashkenazy H, Penn O, Doron-Faigenboim A, Cohen O, Cannarozzi G, Zomer O, Pupko T. FastML: a web server for probabilistic reconstruction of ancestral sequences. Nucleic Acids Res 2012; 40:W580-4. [PMID: 22661579 PMCID: PMC3394241 DOI: 10.1093/nar/gks498] [Citation(s) in RCA: 229] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Ancestral sequence reconstruction is essential to a variety of evolutionary studies. Here, we present the FastML web server, a user-friendly tool for the reconstruction of ancestral sequences. FastML implements various novel features that differentiate it from existing tools: (i) FastML uses an indel-coding method, in which each gap, possibly spanning multiples sites, is coded as binary data. FastML then reconstructs ancestral indel states assuming a continuous time Markov process. FastML provides the most likely ancestral sequences, integrating both indels and characters; (ii) FastML accounts for uncertainty in ancestral states: it provides not only the posterior probabilities for each character and indel at each sequence position, but also a sample of ancestral sequences from this posterior distribution, and a list of the k-most likely ancestral sequences; (iii) FastML implements a large array of evolutionary models, which makes it generic and applicable for nucleotide, protein and codon sequences; and (iv) a graphical representation of the results is provided, including, for example, a graphical logo of the inferred ancestral sequences. The utility of FastML is demonstrated by reconstructing ancestral sequences of the Env protein from various HIV-1 subtypes. FastML is freely available for all academic users and is available online at http://fastml.tau.ac.il/.
Collapse
Affiliation(s)
- Haim Ashkenazy
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, 69978 Tel Aviv, Israel
| | | | | | | | | | | | | |
Collapse
|
60
|
Hobbs JK, Shepherd C, Saul DJ, Demetras NJ, Haaning S, Monk CR, Daniel RM, Arcus VL. On the Origin and Evolution of Thermophily: Reconstruction of Functional Precambrian Enzymes from Ancestors of Bacillus. Mol Biol Evol 2011; 29:825-35. [DOI: 10.1093/molbev/msr253] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
61
|
Konno A, Kitagawa A, Watanabe M, Ogawa T, Shirai T. Tracing protein evolution through ancestral structures of fish galectin. Structure 2011; 19:711-21. [PMID: 21565705 DOI: 10.1016/j.str.2011.02.014] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2010] [Revised: 02/08/2011] [Accepted: 02/16/2011] [Indexed: 11/16/2022]
Abstract
Ancestral structures of fish galectins (congerins) were determined. The extant isoforms I and II of congerin are the components of a fish biological defense system and have rapidly differentiated under natural selection pressure, by which congerin I has experienced a protein-fold evolution. The dimer structure of the ancestral congerin demonstrated intermediate features of the extant isoforms. The protein-fold evolution was not observed in the ancestral structure, indicating it specifically occurred in congerin I lineage. Details of hydrogen bonding pattern at the dimer interface and the carbohydrate-binding site of the ancestor were different from the current proteins. The differences implied these proteins were under selection pressure for stabilizing dimer structure and differentiation in carbohydrate specificity. The ancestor had rather low cytotoxic activity than offspring, indicating selection was made to enhance this activity of congerins. Combined with functional analyses, the structure revealed atomic details of the differentiation process of the proteins.
Collapse
Affiliation(s)
- Ayumu Konno
- Department of Biomolecular Science, Graduate School of Life Sciences, Tohoku University, Sendai 980-8577, Japan
| | | | | | | | | |
Collapse
|
62
|
Losos JB. Seeing the forest for the trees: the limitations of phylogenies in comparative biology. (American Society of Naturalists Address). Am Nat 2011; 177:709-27. [PMID: 21597249 DOI: 10.1086/660020] [Citation(s) in RCA: 159] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
The past 30 years have seen a revolution in comparative biology. Before that time, systematics was not at the forefront of the biological sciences, and few scientists considered phylogenetic relationships when investigating evolutionary questions. By contrast, systematic biology is now one of the most vigorous disciplines in biology, and the use of phylogenies not only is requisite in macroevolutionary studies but also has been applied to a wide range of topics and fields that no one could possibly have envisioned 30 years ago. My message is simple: phylogenies are fundamental to comparative biology, but they are not the be-all and end-all. Phylogenies are powerful tools for understanding the past, but like any tool, they have their limitations. In addition, phylogenies are much more informative about pattern than they are about process. The best way to fully understand the past-both pattern and process-is to integrate phylogenies with other types of historical data as well as with direct studies of evolutionary process.
Collapse
Affiliation(s)
- Jonathan B Losos
- Museum of Comparative Zoology and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.
| |
Collapse
|
63
|
Schmitz L, Motani R. Nocturnality in Dinosaurs Inferred from Scleral Ring and Orbit Morphology. Science 2011; 332:705-8. [DOI: 10.1126/science.1200043] [Citation(s) in RCA: 111] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
64
|
Perez-Jimenez R, Inglés-Prieto A, Zhao ZM, Sanchez-Romero I, Alegre-Cebollada J, Kosuri P, Garcia-Manyes S, Kappock TJ, Tanokura M, Holmgren A, Sanchez-Ruiz JM, Gaucher EA, Fernandez JM. Single-molecule paleoenzymology probes the chemistry of resurrected enzymes. Nat Struct Mol Biol 2011; 18:592-6. [PMID: 21460845 PMCID: PMC3087858 DOI: 10.1038/nsmb.2020] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2010] [Accepted: 01/24/2011] [Indexed: 01/01/2023]
Abstract
A journey back in time is possible at the molecular level by reconstructing proteins from extinct organisms. Here we report the reconstruction, based on sequence predicted by phylogenetic analysis, of seven Precambrian thioredoxin enzymes (Trx), dating back between ~1.4 and ~4 billion years (Gyr). The reconstructed enzymes are up to 32° C more stable than modern enzymes and the oldest show significantly higher activity than extant ones at pH 5. We probed their mechanisms of reduction using single-molecule force spectroscopy. From the force-dependency of the rate of reduction of an engineered substrate, we conclude that ancient Trxs utilize chemical mechanisms of reduction similar to those of modern enzymes. While Trx enzymes have maintained their reductase chemistry unchanged, they have adapted over a 4 Gyr time span to the changes in temperature and ocean acidity that characterize the evolution of the global environment from ancient to modern Earth.
Collapse
Affiliation(s)
- Raul Perez-Jimenez
- Department of Biological Sciences, Columbia University, New York, New York, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
65
|
Abstract
The visual pigment rhodopsin (rh1) constitutes the first step in the sensory transduction cascade in the rod photoreceptors of the vertebrate eye, forming the basis of vision at low light levels. In most vertebrates, rhodopsin is a single-copy gene whose function in rod photoreceptors is highly conserved. We found evidence for a second rhodopsin-like gene (rh1-2) in the zebrafish genome. This novel gene was not the product of a zebrafish-specific gene duplication event and contains a number of unique amino acid substitutions. Despite these differences, expression of rh1-2 in vitro yielded a protein that not only bound chromophore, producing an absorption spectrum in the visible range (λmax ≈ 500 nm), but also activated in response to light. Unlike rh1, rh1-2 is not expressed during the first 4 days of embryonic development; it is expressed in the retina of adult fish but not the brain or muscle. Similar rh1-2 sequences were found in two other Danio species, as well as a more distantly related cyprinid, Epalzeorhynchos bicolor. While sequences were only identified in cyprinid fish, phylogenetic analyses suggest an older origin for this gene family. Our study suggests that rh1-2 is a functional opsin gene that is expressed in the retina later in development. The discovery of a new previously uncharacterized opsin gene in zebrafish retina is surprising given its status as a model system for studies of vertebrate vision and visual development.
Collapse
|
66
|
Abstract
It is now widely accepted that the climate of our planet is changing, but it is still hard to predict the consequences of these changes on ecosystems. The impact is worst at the poles, with scientists concerned that impacts at lower latitudes will follow suit. Canada has a great responsibility and potential for studying the effects of climate changes on the ecological dynamics, given its geographical location and its scientific leadership in this field. The 5th annual meeting of the Canadian Society for Ecology and Evolution was held in the International Year of Biodiversity, to share recent advances in a wide variety of disciplines ranging from molecular biology to behavioural ecology, and to integrate them into a general view that will help us preserve biodiversity and limit the impact of climate change on ecosystems.
Collapse
Affiliation(s)
- Carole Di Poi
- PROTEO, Université Laval, , Pavillon Charles-Eugène-Marchand, 1030, Avenue de la Médecine, Québec, Canada , G1V 0A6
| | | | | |
Collapse
|
67
|
Lakner C, Holder MT, Goldman N, Naylor GJP. What's in a Likelihood? Simple Models of Protein Evolution and the Contribution of Structurally Viable Reconstructions to the Likelihood. Syst Biol 2011; 60:161-74. [DOI: 10.1093/sysbio/syq088] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Affiliation(s)
- Clemens Lakner
- Department of Biological Science, Section of Ecology and Evolution
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306-4120, USA
| | - Mark T. Holder
- Department of Ecology and Evolution, University of Kansas, 6031 Haworth, 1200 Sunnyside Avenue, Lawrence, KS 66045
| | - Nick Goldman
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gavin J. P. Naylor
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306-4120, USA
| |
Collapse
|
68
|
Ovchinnikov IV, Kholina OI. Genome digging: insight into the mitochondrial genome of Homo. PLoS One 2010; 5:e14278. [PMID: 21151557 PMCID: PMC3000329 DOI: 10.1371/journal.pone.0014278] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2010] [Accepted: 11/17/2010] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND A fraction of the Neanderthal mitochondrial genome sequence has a similarity with a 5,839-bp nuclear DNA sequence of mitochondrial origin (numt) on the human chromosome 1. This fact has never been interpreted. Although this phenomenon may be attributed to contamination and mosaic assembly of Neanderthal mtDNA from short sequencing reads, we explain the mysterious similarity by integration of this numt (mtAncestor-1) into the nuclear genome of the common ancestor of Neanderthals and modern humans not long before their reproductive split. PRINCIPAL FINDINGS Exploiting bioinformatics, we uncovered an additional numt (mtAncestor-2) with a high similarity to the Neanderthal mtDNA and indicated that both numts represent almost identical replicas of the mtDNA sequences ancestral to the mitochondrial genomes of Neanderthals and modern humans. In the proteins, encoded by mtDNA, the majority of amino acids distinguishing chimpanzees from humans and Neanderthals were acquired by the ancestral hominins. The overall rate of nonsynonymous evolution in Neanderthal mitochondrial protein-coding genes is not higher than in other lineages. The model incorporating the ancestral hominin mtDNA sequences estimates the average divergence age of the mtDNAs of Neanderthals and modern humans to be 450,000-485,000 years. The mtAncestor-1 and mtAncestor-2 sequences were incorporated into the nuclear genome approximately 620,000 years and 2,885,000 years ago, respectively. CONCLUSIONS This study provides the first insight into the evolution of the mitochondrial DNA in hominins ancestral to Neanderthals and humans. We hypothesize that mtAncestor-1 and mtAncestor-2 are likely to be molecular fossils of the mtDNAs of Homo heidelbergensis and a stem Homo lineage. The d(N)/d(S) dynamics suggests that the effective population size of extinct hominins was low. However, the hominin lineage ancestral to humans, Neanderthals and H. heidelbergensis, had a larger effective population size and possessed genetic diversity comparable with those of chimpanzee and gorilla.
Collapse
Affiliation(s)
- Igor V Ovchinnikov
- Department of Biology, University of North Dakota, Grand Forks, North Dakota, United States of America.
| | | |
Collapse
|
69
|
Gaucher EA, Kratzer JT, Randall RN. Deep phylogeny--how a tree can help characterize early life on Earth. Cold Spring Harb Perspect Biol 2010; 2:a002238. [PMID: 20182607 DOI: 10.1101/cshperspect.a002238] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The Darwinian concept of biological evolution assumes that life on Earth shares a common ancestor. The diversification of this common ancestor through speciation events and vertical transmission of genetic material implies that the classification of life can be illustrated in a tree-like manner, commonly referred to as the Tree of Life. This article describes features of the Tree of Life, such as how the tree has been both pruned and become bushier throughout the past century as our knowledge of biology has expanded. We present current views that the classification of life may be best illustrated as a ring or even a coral with tree-like characteristics. This article also discusses how the organization of the Tree of Life offers clues about ancient life on Earth. In particular, we focus on the environmental conditions and temperature history of Precambrian life and show how chemical, biological, and geological data can converge to better understand this history."You know, a tree is a tree. How many more do you need to look at?"--Ronald Reagan (Governor of California), quoted in the Sacramento Bee, opposing expansion of Redwood National Park, March 3, 1966.
Collapse
Affiliation(s)
- Eric A Gaucher
- School of Biology, School of Chemistry, and Parker H. Petit Institute for Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, Georgia, USA.
| | | | | |
Collapse
|
70
|
Helaers R, Milinkovitch MC. MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics. BMC Bioinformatics 2010; 11:379. [PMID: 20633263 PMCID: PMC2912891 DOI: 10.1186/1471-2105-11-379] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 07/15/2010] [Indexed: 11/11/2022] Open
Abstract
Background The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Results Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. Conclusions The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.
Collapse
|
71
|
Abstract
Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations of genetic variants. In this study, we present the reconstruction and characterization of a probable ancestral virion of CVB5. Phylogenetic analyses based on capsid protein-encoding regions (the VP1 gene of 41 clinical isolates and the entire P1 region of eight clinical isolates) of CVB5 revealed two major cocirculating lineages. Ancestral capsid sequences were inferred from sequences of these contemporary CVB5 isolates by using maximum likelihood methods. By using Bayesian phylodynamic analysis, the inferred VP1 ancestral sequence dated back to 1854 (1807 to 1898). In order to study the properties of the putative ancestral capsid, the entire ancestral P1 sequence was synthesized de novo and inserted into the replicative backbone of an infectious CVB5 cDNA clone. Characterization of the recombinant virus in cell culture showed that fully functional infectious virus particles were assembled and that these viruses displayed properties similar to those of modern isolates in terms of receptor preferences, plaque phenotypes, growth characteristics, and cell tropism. This is the first report describing the resurrection and characterization of a picornavirus with a putative ancestral capsid. Our approach, including a phylogenetics-based reconstruction of viral predecessors, could serve as a starting point for experimental studies of viral evolution and might also provide an alternative strategy for the development of vaccines.
Collapse
|
72
|
Morrow JM, Chang BSW. The p1D4-hrGFP II expression vector: a tool for expressing and purifying visual pigments and other G protein-coupled receptors. Plasmid 2010; 64:162-9. [PMID: 20627111 DOI: 10.1016/j.plasmid.2010.07.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2010] [Revised: 06/29/2010] [Accepted: 07/05/2010] [Indexed: 11/19/2022]
Abstract
The heterologous expression of membrane proteins such as G protein-coupled receptors can be a notoriously difficult task. We have engineered an expression vector, p1D4-hrGFP II, in order to efficiently express visual pigments in mammalian cell culture. This expression vector is based on pIRES-hrGFP II (Stratagene), with the addition of a C-terminal 1D4 epitope tag for immunoblotting and immunoaffinity purification. This vector employs the CMV promoter and hrGFP II, a co-translated reporter gene. We measured the effectiveness of pIRES-hrGFP II in expressing bovine rhodopsin, and showed a 3.9- to 5.7-fold increase in expression as measured by absorbance spectroscopy as compared with the pMT vector, a common choice for visual pigment expression. We then expressed zebrafish RH2-1 using p1D4-hrGFP II in order to assess its utility in expressing cone opsins, known to be less stable and more difficult to express than bovine rhodopsin. We show a λ(280)/λ(MAX) value of 3.3, one third of that reported in previous studies, suggesting increased expression levels and decreased levels of misfolded, non-functional visual pigment. Finally, we monitored HEK293T cell growth following transfection with pIRES-hrGFP II using fluorescence microscopy to illustrate the benefits of having a co-translated reporter during heterologous expression studies.
Collapse
Affiliation(s)
- James M Morrow
- Department of Cell & Systems Biology, University of Toronto, Room 501, Toronto, Ontario, Canada
| | | |
Collapse
|
73
|
Hanson-Smith V, Kolaczkowski B, Thornton JW. Robustness of ancestral sequence reconstruction to phylogenetic uncertainty. Mol Biol Evol 2010; 27:1988-99. [PMID: 20368266 PMCID: PMC2922618 DOI: 10.1093/molbev/msq081] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Ancestral sequence reconstruction (ASR) is widely used to formulate and test hypotheses about the sequences, functions, and structures of ancient genes. Ancestral sequences are usually inferred from an alignment of extant sequences using a maximum likelihood (ML) phylogenetic algorithm, which calculates the most likely ancestral sequence assuming a probabilistic model of sequence evolution and a specific phylogeny—typically the tree with the ML. The true phylogeny is seldom known with certainty, however. ML methods ignore this uncertainty, whereas Bayesian methods incorporate it by integrating the likelihood of each ancestral state over a distribution of possible trees. It is not known whether Bayesian approaches to phylogenetic uncertainty improve the accuracy of inferred ancestral sequences. Here, we use simulation-based experiments under both simplified and empirically derived conditions to compare the accuracy of ASR carried out using ML and Bayesian approaches. We show that incorporating phylogenetic uncertainty by integrating over topologies very rarely changes the inferred ancestral state and does not improve the accuracy of the reconstructed ancestral sequence. Ancestral state reconstructions are robust to uncertainty about the underlying tree because the conditions that produce phylogenetic uncertainty also make the ancestral state identical across plausible trees; conversely, the conditions under which different phylogenies yield different inferred ancestral states produce little or no ambiguity about the true phylogeny. Our results suggest that ML can produce accurate ASRs, even in the face of phylogenetic uncertainty. Using Bayesian integration to incorporate this uncertainty is neither necessary nor beneficial.
Collapse
|
74
|
Li G, Ma J, Zhang L. Greedy selection of species for ancestral state reconstruction on phylogenies: elimination is better than insertion. PLoS One 2010; 5:e8985. [PMID: 20140213 PMCID: PMC2816206 DOI: 10.1371/journal.pone.0008985] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2009] [Accepted: 01/05/2010] [Indexed: 12/26/2022] Open
Abstract
Accurate reconstruction of ancestral character states on a phylogeny is crucial in many genomics studies. We study how to select species to achieve the best reconstruction of ancestral character states on a phylogeny. We first show that the marginal maximum likelihood has the monotonicity property that more taxa give better reconstruction, but the Fitch method does not have it even on an ultrametric phylogeny. We further validate a greedy approach for species selection using simulation. The validation tests indicate that backward greedy selection outperforms forward greedy selection. In addition, by applying our selection strategy, we obtain a set of the ten most informative species for the reconstruction of the genomic sequence of the so-called boreoeutherian ancestor of placental mammals. This study has broad relevance in comparative genomics and paleogenomics since limited research resources do not allow researchers to sequence the large number of descendant species required to reconstruct an ancestral sequence.
Collapse
Affiliation(s)
- Guoliang Li
- Computational & Mathematical Biology, Genome Institute of Singapore, Singapore, Singapore
| | - Jian Ma
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Louxin Zhang
- Department of Mathematics, National University of Singapore, Singapore, Singapore
- * E-mail:
| |
Collapse
|
75
|
Abstract
While a variety of methods exist to reconstruct ancestral sequences, all of them assume that a single phylogeny underlies all the positions in the alignment and therefore that recombination has not taken place. Using computer simulations we show that recombination can severely bias ancestral sequence reconstruction (ASR), and quantify this effect. If recombination is ignored, the ancestral sequences recovered can be quite distinct from the grand most recent common ancestor (GMRCA) of the sample and better resemble the concatenate of partial most recent common ancestors (MRCAs) at each recombination fragment. When independent phylogenetic trees are assumed for the different recombinant segments, the estimation of the fragment MRCAs improves significantly. Importantly, we show that recombination can change the biological predictions derived from ASRs carried out with real data. Given that recombination is widespread on nuclear genes and in particular in RNA viruses and some bacteria, the reconstruction of ancestral sequences in these cases should consider the potential impact of recombination and ideally be carried out using approaches that accommodate recombination.
Collapse
|
76
|
Bradley RK, Holmes I. Evolutionary triplet models of structured RNA. PLoS Comput Biol 2009; 5:e1000483. [PMID: 19714212 PMCID: PMC2725318 DOI: 10.1371/journal.pcbi.1000483] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2008] [Accepted: 07/23/2009] [Indexed: 12/31/2022] Open
Abstract
The reconstruction and synthesis of ancestral RNAs is a feasible goal for paleogenetics. This will require new bioinformatics methods, including a robust statistical framework for reconstructing histories of substitutions, indels and structural changes. We describe a "transducer composition" algorithm for extending pairwise probabilistic models of RNA structural evolution to models of multiple sequences related by a phylogenetic tree. This algorithm draws on formal models of computational linguistics as well as the 1985 protosequence algorithm of David Sankoff. The output of the composition algorithm is a multiple-sequence stochastic context-free grammar. We describe dynamic programming algorithms, which are robust to null cycles and empty bifurcations, for parsing this grammar. Example applications include structural alignment of non-coding RNAs, propagation of structural information from an experimentally-characterized sequence to its homologs, and inference of the ancestral structure of a set of diverged RNAs. We implemented the above algorithms for a simple model of pairwise RNA structural evolution; in particular, the algorithms for maximum likelihood (ML) alignment of three known RNA structures and a known phylogeny and inference of the common ancestral structure. We compared this ML algorithm to a variety of related, but simpler, techniques, including ML alignment algorithms for simpler models that omitted various aspects of the full model and also a posterior-decoding alignment algorithm for one of the simpler models. In our tests, incorporation of basepair structure was the most important factor for accurate alignment inference; appropriate use of posterior-decoding was next; and fine details of the model were least important. Posterior-decoding heuristics can be substantially faster than exact phylogenetic inference, so this motivates the use of sum-over-pairs heuristics where possible (and approximate sum-over-pairs). For more exact probabilistic inference, we discuss the use of transducer composition for ML (or MCMC) inference on phylogenies, including possible ways to make the core operations tractable.
Collapse
Affiliation(s)
- Robert K. Bradley
- Biophysics Graduate Group, University of California, Berkeley, California, United States of America
| | - Ian Holmes
- Biophysics Graduate Group, University of California, Berkeley, California, United States of America
- Department of Bioengineering, University of California, Berkeley, California, United States of America
- * E-mail:
| |
Collapse
|
77
|
Bagwill A, Sever DM, Elsey RM. Seasonal variation of the oviduct of the American alligator,Alligator mississippiensis(Reptilia: Crocodylia). J Morphol 2009; 270:702-13. [DOI: 10.1002/jmor.10714] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
78
|
Liberles DA. Reading the Story in DNA: A Beginner's Guide to Molecular Evolution. Syst Biol 2009. [DOI: 10.1093/sysbio/syp003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
79
|
Elucidation of phenotypic adaptations: Molecular analyses of dim-light vision proteins in vertebrates. Proc Natl Acad Sci U S A 2008; 105:13480-5. [PMID: 18768804 DOI: 10.1073/pnas.0802426105] [Citation(s) in RCA: 196] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Vertebrate ancestors appeared in a uniform, shallow water environment, but modern species flourish in highly variable niches. A striking array of phenotypes exhibited by contemporary animals is assumed to have evolved by accumulating a series of selectively advantageous mutations. However, the experimental test of such adaptive events at the molecular level is remarkably difficult. One testable phenotype, dim-light vision, is mediated by rhodopsins. Here, we engineered 11 ancestral rhodopsins and show that those in early ancestors absorbed light maximally (lambda(max)) at 500 nm, from which contemporary rhodopsins with variable lambda(max)s of 480-525 nm evolved on at least 18 separate occasions. These highly environment-specific adaptations seem to have occurred largely by amino acid replacements at 12 sites, and most of those at the remaining 191 ( approximately 94%) sites have undergone neutral evolution. The comparison between these results and those inferred by commonly-used parsimony and Bayesian methods demonstrates that statistical tests of positive selection can be misleading without experimental support and that the molecular basis of spectral tuning in rhodopsins should be elucidated by mutagenesis analyses using ancestral pigments.
Collapse
|
80
|
Affiliation(s)
- Shozo Yokoyama
- Department of Biology, Emory University, Atlanta, Georgia 30322;
| |
Collapse
|
81
|
Hult EF, Weadick CJ, Chang BSW, Tobe SS. Reconstruction of ancestral FGLamide-type insect allatostatins: a novel approach to the study of allatostatin function and evolution. JOURNAL OF INSECT PHYSIOLOGY 2008; 54:959-968. [PMID: 18541257 DOI: 10.1016/j.jinsphys.2008.04.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2007] [Revised: 03/19/2008] [Accepted: 04/03/2008] [Indexed: 05/26/2023]
Abstract
Allatostatins (ASTs) are a class of regulatory neuropeptides, with diverse functions, found in an array of invertebrate phyla. ASTs have complex gene structure, in which individual ASTs are cleaved from a precursor peptide. Little is known about the molecular evolution of AST structure and function, even in extensively studied groups such as cockroaches. This paper presents the application of a novel technique for the analysis of this system, that of ancestral reconstruction, whereby ancestral amino acid sequences are resurrected in the laboratory. We inferred the ancestral sequences of a well-characterized peptide, AST 7, for the insect ancestor, as well as several cockroach ancestors. Peptides were assayed for in vitro inhibition of JH production in Diploptera punctata and Periplaneta americana. Our results surprisingly, indicate a decrease in potency of the ancestral cockroach AST7 peptide in comparison with more ancient ones such as the ancestral insect peptide, as well as more recently evolved cockroach peptides. We propose that this unexpected decrease in peptide potency at the cockroach ancestor may be related to the concurrent increase in peptide copy number in the lineages leading to cockroaches. This model is consistent with current physiological data, and may be linked to the increased role of ASTs in the regulation of reproductive processes in the cockroaches.
Collapse
Affiliation(s)
- Ekaterina F Hult
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ont., Canada M5S 3G5
| | | | | | | |
Collapse
|
82
|
Whelan S. Spatial and Temporal Heterogeneity in Nucleotide Sequence Evolution. Mol Biol Evol 2008; 25:1683-94. [DOI: 10.1093/molbev/msn119] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
83
|
de Farias ST, Guimarães RC. Aminoacyl-tRNA synthetase classes and groups in prokaryotes. J Theor Biol 2007; 250:221-9. [PMID: 17983631 DOI: 10.1016/j.jtbi.2007.09.025] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2006] [Revised: 08/29/2007] [Accepted: 09/18/2007] [Indexed: 01/01/2023]
Abstract
Knowledge on the evolution of aminoacyl-tRNA synthetases is crucial to studies on the origins of life. The relationships between the different aminoacyl-tRNA synthetase specificities in prokaryotic organisms are studied in this work. We reconstructed the ancestor sequences and the phylogenetic relationships utilizing the Maximum Likelihood method. The results suggest that in class I the evolution of the N-terminal segment was strongly influenced by the amino acid hydropathy in both domains of prokaryotes. The results for the C-terminal segments of class I were different in the two domains, indicating that its evolution was strongly influenced by the specific types of tRNA modification in each domain. The class II groups in Archaea were more heterogeneous with respect to the hydropathy of amino acids, indicating the interference of other influences. In bacteria, the configuration was also complex but the overall consensual division in two groups was maintained, group IIa forming a single branch with the five hydroapathetic amino acid specificities and group IIb containing the specificities for the moderately hydrophobic together with the hydrophilic amino acids. It is indicated that the aminoacyl-tRNA synthetase in both domains were subjected to different selective forces in diverse parts of the proteins, resulting in complex phylogenetic patterns.
Collapse
Affiliation(s)
- Sávio Torres de Farias
- Dept. Biologia Geral, Inst. Ciências Biológicas, Univ. Federal de Minas Gerais, 31270.901 Belo Horizonte, MG, Brazil.
| | | |
Collapse
|
84
|
Dean AM, Thornton JW. Mechanistic approaches to the study of evolution: the functional synthesis. Nat Rev Genet 2007; 8:675-88. [PMID: 17703238 PMCID: PMC2488205 DOI: 10.1038/nrg2160] [Citation(s) in RCA: 259] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
An emerging synthesis of evolutionary biology and experimental molecular biology is providing much stronger and deeper inferences about the dynamics and mechanisms of evolution than were possible in the past. The new approach combines statistical analyses of gene sequences with manipulative molecular experiments to reveal how ancient mutations altered biochemical processes and produced novel phenotypes. This functional synthesis has set the stage for major advances in our understanding of fundamental questions in evolutionary biology. Here we describe this emerging approach, highlight important new insights that it has made possible, and suggest future directions for the field.
Collapse
Affiliation(s)
- Antony M Dean
- University of Minnesota, St Paul, Minnesota 55108, USA.
| | | |
Collapse
|
85
|
Müller J, Tsuji LA. Impedance-Matching Hearing in Paleozoic Reptiles: Evidence of Advanced Sensory Perception at an Early Stage of Amniote Evolution. PLoS One 2007; 2:e889. [PMID: 17849018 PMCID: PMC1964539 DOI: 10.1371/journal.pone.0000889] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2007] [Accepted: 08/20/2007] [Indexed: 11/29/2022] Open
Abstract
Background Insights into the onset of evolutionary novelties are key to the understanding of amniote origins and diversification. The possession of an impedance-matching tympanic middle ear is characteristic of all terrestrial vertebrates with a sophisticated hearing sense and an adaptively important feature of many modern terrestrial vertebrates. Whereas tympanic ears seem to have evolved multiple times within tetrapods, especially among crown-group members such as frogs, mammals, squamates, turtles, crocodiles, and birds, the presence of true tympanic ears has never been recorded in a Paleozoic amniote, suggesting they evolved fairly recently in amniote history. Methodology/Principal Findings In the present study, we performed a morphological examination and a phylogenetic analysis of poorly known parareptiles from the Middle Permian of the Mezen River Basin in Russia. We recovered a well-supported clade that is characterized by a unique cheek morphology indicative of a tympanum stretching across large parts of the temporal region to an extent not seen in other amniotes, fossil or extant, and a braincase specialized in showing modifications clearly related to an increase in auditory function, unlike the braincase of any other Paleozoic tetrapod. In addition, we estimated the ratio of the tympanum area relative to the stapedial footplate for the basalmost taxon of the clade, which, at 23∶1, is in close correspondence to that of modern amniotes capable of efficient impedance-matching hearing. Conclusions/Significance Using modern amniotes as analogues, the possession of an impedance-matching middle ear in these parareptiles suggests unique ecological adaptations potentially related to living in dim-light environments. More importantly, our results demonstrate that already at an early stage of amniote diversification, and prior to the Permo-Triassic extinction event, the complexity of terrestrial vertebrate ecosystems had reached a level that proved advanced sensory perception to be of notable adaptive significance.
Collapse
Affiliation(s)
- Johannes Müller
- Humboldt-Universität zu Berlin, Museum für Naturkunde, Berlin, Germany.
| | | |
Collapse
|
86
|
Sweeney AM, Des Marais DL, Ban YEA, Johnsen S. Evolution of graded refractive index in squid lenses. J R Soc Interface 2007; 4:685-98. [PMID: 17293312 PMCID: PMC2373386 DOI: 10.1098/rsif.2006.0210] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A lens with a graded refractive index is required for vision in aquatic animals with camera-type eyes. This optical design entails a radial gradient of protein density, with low density in external layers and high density in internal layers. To maintain the optical stability of the eye, different material properties are required for proteins in different regions of the lens. In low-density regions of the lens where slight protein aggregation causes significant light scattering, aggregation must be minimized. Squid lens S-crystallin proteins are evolutionarily derived from the glutathione S-transferase protein family. We used biochemistry, optical modelling and phylogenetics to study the evolution and material properties of S-crystallins. S-crystallins are differentially expressed in a radial gradient, suggesting a role in refractive index. This gradient in S-crystallin expression is correlated with their evolutionary history and biochemistry. S-crystallins have been under positive selection. This selection appears to have resulted in stabilization of derived S-crystallins via mutations in the dimer interface and extended electrostatic fields. These derived S-crystallins probably cause the glassy organization and stability of low refractive index lens layers. Our work elucidates the molecular and evolutionary mechanisms underlying the production and maintenance of camera-like optics in squid lenses.
Collapse
|
87
|
Rolland M, Jensen MA, Nickle DC, Yan J, Learn GH, Heath L, Weiner D, Mullins JI. Reconstruction and function of ancestral center-of-tree human immunodeficiency virus type 1 proteins. J Virol 2007; 81:8507-14. [PMID: 17537854 PMCID: PMC1951385 DOI: 10.1128/jvi.02683-06] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The extensive diversity of human immunodeficiency virus type 1 (HIV-1) and its capacity to mutate and escape host immune responses are major challenges for AIDS vaccine development. Ancestral sequences, which minimize the genetic distance to circulating strains, provide an opportunity to design immunogens with the potential to elicit broad recognition of HIV epitopes. We developed a phylogenetics-informed algorithm to reconstruct ancestral HIV sequences, called Center of Tree (COT). COT sequences have potentially significant benefits over isolate-based strategies, as they minimize the evolutionary distances to circulating strains. COT sequences are designed to surmount the potential pitfalls stemming from sampling bias with the consensus method and outlier bias with the most-recent-common-ancestor approach. We computationally derived COT sequences from circulating HIV-1 subtype B sequences for the genes encoding the major viral structural protein (Gag) and two regulatory proteins, Tat and Nef. COT genes were synthesized de novo and expressed in mammalian cells, and the proteins were characterized. COT Gag was shown to generate virus-like particles, while COT Tat transactivated gene expression from the HIV-1 long terminal repeat and COT Nef mediated downregulation of cell surface major histocompatibility complex class I. Thus, retrodicted ancestral COT proteins can retain the biological functions of extant HIV-1 proteins. Additionally, COT proteins were immunogenic, as they elicited antigen-specific cytotoxic T-lymphocyte responses in mice. These data support the utility of the COT approach to create novel and biologically active ancestral proteins as a starting point for studies of the structure, function, and biological fitness of highly variable genes, as well as for the rational design of globally relevant vaccine candidates.
Collapse
MESH Headings
- AIDS Vaccines/genetics
- AIDS Vaccines/immunology
- Algorithms
- Amino Acid Sequence
- Animals
- Antigens, Viral/classification
- Antigens, Viral/genetics
- Antigens, Viral/immunology
- Base Sequence
- Directed Molecular Evolution/methods
- Epitopes/genetics
- Epitopes/immunology
- Female
- Gene Products, gag/classification
- Gene Products, gag/genetics
- Gene Products, gag/immunology
- Gene Products, nef/classification
- Gene Products, nef/genetics
- Gene Products, nef/immunology
- Gene Products, tat/classification
- Gene Products, tat/genetics
- Gene Products, tat/immunology
- HIV-1/genetics
- HIV-1/immunology
- Humans
- Mice
- Mice, Inbred BALB C
- Molecular Sequence Data
- Phylogeny
- nef Gene Products, Human Immunodeficiency Virus
- tat Gene Products, Human Immunodeficiency Virus
Collapse
Affiliation(s)
- Morgane Rolland
- Department of Microbiology SC-42, University of Washington, Seattle, WA 98195-8070, USA
| | | | | | | | | | | | | | | |
Collapse
|
88
|
Benner SA, Sassi SO, Gaucher EA. Molecular paleoscience: systems biology from the past. ACTA ACUST UNITED AC 2007; 75:1-132, xi. [PMID: 17124866 DOI: 10.1002/9780471224464.ch1] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]
Abstract
Experimental paleomolecular biology, paleobiochemistry, and paleogenetics are closely related emerging fields that infer the sequences of ancient genes and proteins from now-extinct organisms, and then resurrect them for study in the laboratory. The goal of paleogenetics is to use information from natural history to solve the conundrum of modern genomics: How can we understand deeply the function of biomolecular structures uncovered and described by modern chemical biology? Reviewed here are the first 20 cases where biomolecular resurrections have been achieved. These show how paleogenetics can lead to an understanding of the function of biomolecules, analyze changing function, and put meaning to genomic sequences, all in ways that are not possible with traditional molecular biological studies.
Collapse
Affiliation(s)
- Steven A Benner
- Foundation for Applied Molecular Evolution, 1115 NW 4th Street, Gainesville, FL 32601, USA
| | | | | |
Collapse
|
89
|
Ryan BJ, Barrett R. ProteinParser--a community based tool for the generation of a detailed protein consensus and FASTA output. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2007; 85:69-76. [PMID: 17079048 DOI: 10.1016/j.cmpb.2006.09.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2006] [Revised: 09/28/2006] [Accepted: 09/29/2006] [Indexed: 05/12/2023]
Abstract
Comparison of bioinformatic data is a common application in the life sciences and beyond. In this communication, a novel Java based software tool, ProteinParser, is outlined. This software tool calculates a detailed consensus, or most common, amino acid at a given position in an aligned protein set, whilst also generating a full consensus protein FASTA output. A second application of this software tool, computing a consensus amino acid given a tolerance threshold, is also demonstrated. The phytase and the common bacterial beta-lactamase proteins are analysed as 'proof of concept' examples. Consensus proteins, as generated by ProteinParser, are regularly utilised in the selection of residues for protein stabilisation mutagenesis; however, this widely applicable software tool will find many alternative applications in areas such as protein homology modelling.
Collapse
Affiliation(s)
- Barry J Ryan
- School of Biotechnology and National Centre for Sensor Research, Dublin City University, Dublin 9, Ireland.
| | | |
Collapse
|
90
|
Serb JM, Oakley TH. Hierarchical phylogenetics as a quantitative analytical framework for evolutionary developmental biology. Bioessays 2006; 27:1158-66. [PMID: 16237676 DOI: 10.1002/bies.20291] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Phylogenetics has inherent utility in evolutionary developmental biology (EDB) as it is an established methodology for estimating evolutionary relationships and for making comparisons between levels of biological organization. However, explicit phylogenetic methods generally have been limited to two levels of organization in EDB-the species and the gene. We demonstrate that phylogenetic methods can be applied broadly to other organizational levels, such as morphological structures or cell types, to identify evolutionary patterns. We present examples at and between different hierarchical levels of organization to address questions central to EDB. We argue that this application of "hierarchical phylogenetics" can be a unifying analytical approach to the field of EDB.
Collapse
Affiliation(s)
- Jeanne M Serb
- Ecology, Evolution and Organismal Biology, Iowa State University, Ames, Iowa, USA
| | | |
Collapse
|
91
|
Abstract
In the recent Dover trial, and elsewhere, the 'Intelligent Design' movement has championed the bacterial flagellum as an irreducibly complex system that, it is claimed, could not have evolved through natural selection. Here we explore the arguments in favour of viewing bacterial flagella as evolved, rather than designed, entities. We dismiss the need for any great conceptual leaps in creating a model of flagellar evolution and speculate as to how an experimental programme focused on this topic might look.
Collapse
Affiliation(s)
- Mark J Pallen
- Division of Immunity & Infection, Medical School, University of Birmingham, Birmingham, B15 2TT UK.
| | | |
Collapse
|
92
|
Skovgaard M, Kodra JT, Gram DX, Knudsen SM, Madsen D, Liberles DA. Using evolutionary information and ancestral sequences to understand the sequence-function relationship in GLP-1 agonists. J Mol Biol 2006; 363:977-88. [PMID: 16989858 DOI: 10.1016/j.jmb.2006.08.066] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2006] [Revised: 08/04/2006] [Accepted: 08/15/2006] [Indexed: 12/31/2022]
Abstract
Glucagon-like peptide-1 (GLP-1) is an incretin hormone with therapeutic potential for type 2 diabetes. A variety of GLP-1 sequences are known from amphibian species, and some of these have been tested here and found to be able to bind and activate the human GLP-1 receptor. While little difference was observed for the in vitro potency for the human GLP-1 receptor, larger differences were found in the enzymatic stability of these peptides. Two peptides showed increased enzymatic stability, and they group together phylogenetically, though they originate from Amphibia and Reptilia. We have used ancestral sequence reconstruction to analyze the evolution of these GLP-1 molecules, including the synthesis of new peptides. We find that the increased stability could not be observed in the resurrected peptides from the common ancestor of frogs, even though they maintain the ability to activate the human GLP-1 receptor. Another method, using residue mapping on evolutionary branches yielded peptides that had maintained potency towards the receptor and also showed increased stability. This represents a new approach using evolutionary data in protein engineering.
Collapse
Affiliation(s)
- Marie Skovgaard
- Novo Nordisk A/S, Novo Nordisk Park, DK-2760 Måløv, Denmark.
| | | | | | | | | | | |
Collapse
|
93
|
Poole AM, Ranganathan R. Knowledge-based potentials in protein design. Curr Opin Struct Biol 2006; 16:508-13. [PMID: 16843652 DOI: 10.1016/j.sbi.2006.06.013] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2006] [Revised: 06/07/2006] [Accepted: 06/30/2006] [Indexed: 02/03/2023]
Abstract
Knowledge-based potentials are statistical parameters derived from databases of known protein properties that empirically capture aspects of the physical chemistry of protein structure and function. These potentials play a key role in protein design by improving the accuracy of physics-based models of interatomic interactions and enhancing the computational efficiency of the design process by limiting the complexity of searching sequence space. Recently, knowledge-based potentials (in isolation or in combination with physics-based potentials) have been applied to the modification of existing protein function, the redesign of natural protein folds and the complete design of a non-natural protein fold. In addition, knowledge-based potentials appear to be providing important information about the global topology of amino acid interactions in natural proteins. A detailed study of the methods and products of these protein design efforts promises to greatly expand our understanding of proteins and the evolutionary process that created them.
Collapse
Affiliation(s)
- Alan M Poole
- Howard Hughes Medical Institute, Department of Pharmacology and the Green Comprehensive Center Division for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050, USA
| | | |
Collapse
|
94
|
Williams PD, Pollock DD, Blackburne BP, Goldstein RA. Assessing the accuracy of ancestral protein reconstruction methods. PLoS Comput Biol 2006; 2:e69. [PMID: 16789817 PMCID: PMC1480538 DOI: 10.1371/journal.pcbi.0020069] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2005] [Accepted: 05/04/2006] [Indexed: 11/18/2022] Open
Abstract
The phylogenetic inference of ancestral protein sequences is a powerful technique for the study of molecular evolution, but any conclusions drawn from such studies are only as good as the accuracy of the reconstruction method. Every inference method leads to errors in the ancestral protein sequence, resulting in potentially misleading estimates of the ancestral protein's properties. To assess the accuracy of ancestral protein reconstruction methods, we performed computational population evolution simulations featuring near-neutral evolution under purifying selection, speciation, and divergence using an off-lattice protein model where fitness depends on the ability to be stable in a specified target structure. We were thus able to compare the thermodynamic properties of the true ancestral sequences with the properties of “ancestral sequences” inferred by maximum parsimony, maximum likelihood, and Bayesian methods. Surprisingly, we found that methods such as maximum parsimony and maximum likelihood that reconstruct a “best guess” amino acid at each position overestimate thermostability, while a Bayesian method that sometimes chooses less-probable residues from the posterior probability distribution does not. Maximum likelihood and maximum parsimony apparently tend to eliminate variants at a position that are slightly detrimental to structural stability simply because such detrimental variants are less frequent. Other properties of ancestral proteins might be similarly overestimated. This suggests that ancestral reconstruction studies require greater care to come to credible conclusions regarding functional evolution. Inferred functional patterns that mimic reconstruction bias should be reevaluated. It is now possible to apply computational methods to known current protein sequences to recreate the sequences of ancestral proteins. By synthesising these proteins and measuring their properties in the laboratory, we can gain much information about the nature of evolution, better understand how proteins change and adapt over time, and develop insights into the environments of ancient organisms. Unfortunately, the accuracy of these reconstructions is difficult to evaluate. We simulate protein evolution using a simplified computational model and apply the various reconstruction methods to the sequences that arise from our simulations. Because we have the complete record of the evolutionary history, we can evaluate the reconstruction accuracy directly. We demonstrate that the reconstruction procedures in common use may have a bias toward overestimating the properties of these ancestral proteins, opposite to what has been assumed previously. An alternative method of creating these sequences is presented, Bayesian sampling, that can eliminate this bias and provide more robust conclusions.
Collapse
Affiliation(s)
- Paul D Williams
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David D Pollock
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Benjamin P Blackburne
- Division of Mathematical Biology, National Institute of Medical Research, Mill Hill, London, United Kingdom
| | - Richard A Goldstein
- Division of Mathematical Biology, National Institute of Medical Research, Mill Hill, London, United Kingdom
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
95
|
Ren F, Tanaka H, Yang Z. An empirical examination of the utility of codon-substitution models in phylogeny reconstruction. Syst Biol 2006; 54:808-18. [PMID: 16243764 DOI: 10.1080/10635150500354688] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Abstract
Models of codon substitution have been commonly used to compare protein-coding DNA sequences and are particularly effective in detecting signals of natural selection acting on the protein. Their utility in reconstructing molecular phylogenies and in dating species divergences has not been explored. Codon models naturally accommodate synonymous and nonsynonymous substitutions, which occur at very different rates and may be informative for recent and ancient divergences, respectively. Thus codon models may be expected to make an efficient use of phylogenetic information in protein-coding DNA sequences. Here we applied codon models to 106 protein-coding genes from eight yeast species to reconstruct phylogenies using the maximum likelihood method, in comparison with nucleotide- and amino acid-based analyses. The results appeared to confirm that expectation. Nucleotide-based analysis, under simplistic substitution models, were efficient in recovering recent divergences whereas amino acid-based analysis performed better at recovering deep divergences. Codon models appeared to combine the advantages of amino acid and nucleotide data and had good performance at recovering both recent and deep divergences. Estimation of relative species divergence times using amino acid and codon models suggested that translation of gene sequences into proteins led to information loss of from 30% for deep nodes to 66% for recent nodes. Although computational burden makes codon models unfeasible for tree search in large data sets, we suggest that they may be useful for comparing candidate trees. Nucleotide models that accommodate the differences in evolutionary dynamics at the three codon positions also performed well, at much less computational cost. We discuss the relationship between a model's fit to data and its utility in phylogeny reconstruction and caution against use of overly complex substitution models.
Collapse
Affiliation(s)
- Fengrong Ren
- Advanced Biomedical Information, Center for Information Medicine, Tokyo Medical and Dental University, Japan
| | | | | |
Collapse
|
96
|
Marsh L. Evolution of Structural Shape in Bacterial Globin-Related Proteins. J Mol Evol 2006; 62:575-87. [PMID: 16612536 DOI: 10.1007/s00239-005-0025-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2005] [Accepted: 12/31/2005] [Indexed: 10/24/2022]
Abstract
The globin family of proteins has a characteristic structural pattern of helix interactions that nonetheless exhibits some variation. A simplified model for globin structural evolution was developed in which protein shape evolved by random change of contacts between helices. A conserved globin domain of 15 bacterial proteins representing four structural families was studied. Using a parsimony approach ancestral structural states could be reconstructed. The distribution of number of contact changes per site for a fixed topology tree fit a gamma distribution. Homoplasy was high, with multiple changes per site and no support for an invariant class of residue-residue contacts. Contacts changed more slowly than sequence. A phylogenetic reconstruction using a distance measure based on the proportion of shared contacts was generally consistent with a sequence-based phylogeny but not highly resolved. Contact pattern convergence between members of different globin family proteins could not be detected. Simulation studies indicated the convergence test was sensitive enough to have detected convergence involving only 10% of the contacts, suggesting a limit on the extent of selection for a specific contact pattern. Contact site methods may provide additional approaches to study the relationship between protein structure and sequence evolution.
Collapse
Affiliation(s)
- Lorraine Marsh
- Department of Biology, Long Island University, 1 University Plaza, Brooklyn, NY 11201, USA.
| |
Collapse
|
97
|
Abstract
There are a variety of reasons to reconstruct the sequences of ancient proteins, but whatever the reason, the value of the reconstructed protein depends on the accuracy with which the ancient sequence is inferred. This study uses sequences simulated by a sequence-evolution simulation program that compares parsimony, maximum likelihood, and the Bayesian methods of inferring ancestral sequences and concludes that the Bayesian method, as implemented by MRBAYES 3.11, is preferred. Estimated ancestral sequences are of necessity the same length as the alignment on which the underlying phylogeny is based. A highly accurate method for correcting the estimated sequences is introduced, and it is shown that the correction permits inferring the sequences of ancient protein sequences with a very high degree of accuracy.
Collapse
Affiliation(s)
- Barry G Hall
- Bellingham Research Institute, 218 Chuckanut Point Road, Bellingham, WA 98229, USA.
| |
Collapse
|
98
|
Gaucher EA, De Kee DW, Benner SA. Application of DETECTER, an evolutionary genomic tool to analyze genetic variation, to the cystic fibrosis gene family. BMC Genomics 2006; 7:44. [PMID: 16522197 PMCID: PMC1420294 DOI: 10.1186/1471-2164-7-44] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2005] [Accepted: 03/07/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The medical community requires computational tools that distinguish missense genetic differences having phenotypic impact within the vast number of sense mutations that do not. Tools that do this will become increasingly important for those seeking to use human genome sequence data to predict disease, make prognoses, and customize therapy to individual patients. RESULTS An approach, termed DETECTER, is proposed to identify sites in a protein sequence where amino acid replacements are likely to have a significant effect on phenotype, including causing genetic disease. This approach uses a model-dependent tool to estimate the normalized replacement rate at individual sites in a protein sequence, based on a history of those sites extracted from an evolutionary analysis of the corresponding protein family. This tool identifies sites that have higher-than-average, average, or lower-than-average rates of change in the lineage leading to the sequence in the population of interest. The rates are then combined with sequence data to determine the likelihoods that particular amino acids were present at individual sites in the evolutionary history of the gene family. These likelihoods are used to predict whether any specific amino acid replacements, if introduced at the site in a modern human population, would have a significant impact on fitness. The DETECTER tool is used to analyze the cystic fibrosis transmembrane conductance regulator (CFTR) gene family. CONCLUSION In this system, DETECTER retrodicts amino acid replacements associated with the cystic fibrosis disease with greater accuracy than alternative approaches. While this result validates this approach for this particular family of proteins only, the approach may be applicable to the analysis of polymorphisms generally, including SNPs in a human population.
Collapse
Affiliation(s)
- Eric A Gaucher
- Foundation for Applied Molecular Evolution, Gainesville, FL USA
| | - Danny W De Kee
- Foundation for Applied Molecular Evolution, Gainesville, FL USA
| | - Steven A Benner
- Department of Chemistry, University of Florida, Gainesville, FL USA
| |
Collapse
|
99
|
Bradley ME, Benner SA. Integrating protein structures and precomputed genealogies in the Magnum database: examples with cellular retinoid binding proteins. BMC Bioinformatics 2006; 7:89. [PMID: 16504077 PMCID: PMC1475641 DOI: 10.1186/1471-2105-7-89] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2005] [Accepted: 02/23/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. RESULTS The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. CONCLUSION We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins.
Collapse
Affiliation(s)
- Michael E Bradley
- Department of Chemistry, University of Florida, P.O. Box 117200, Gainesville, FL, 32611, USA
- Division of Biological Sciences, Department of Ecology and Evolution, University of Chicago, 1101 East 57Street, Chicago, IL, 60615, USA
| | - Steven A Benner
- Foundation for Applied Molecular Evolution, 1115 NW 14Avenue, Gainesville, FL, 32601, USA
| |
Collapse
|
100
|
Lucena B, Haussler D. Counterexample to a claim about the reconstruction of ancestral character states. Syst Biol 2006; 54:693-5. [PMID: 16126665 DOI: 10.1080/10635150590950344] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Affiliation(s)
- Brian Lucena
- Division of Computer Science, University of California, Berkeley, California, USA.
| | | |
Collapse
|