1. Chen Q, Yang H, Feng X, Chen Q, Shi S, Wu CI, He Z. Two decades of suspect evidence for adaptive molecular evolution – Negative selection confounding positive selection signals. Natl Sci Rev 2021; 9:nwab217. [PMID: 35663241 PMCID: PMC9154339 DOI: 10.1093/nsr/nwab217] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 11/09/2021] [Accepted: 11/21/2021] [Indexed: 11/21/2022] Open
Abstract
There has been a large literature in the last two decades affirming adaptive DNA sequence evolution between species. The main lines of evidence are from (i) the McDonald-Kreitman (MK) test, which compares divergence and polymorphism data, and (ii) the phylogenetic analysis by maximum likelihood (PAML) test, which analyzes multispecies divergence data. Here, we apply these two tests concurrently to genomic data of Drosophila and Arabidopsis. To our surprise, the >100 genes identified by the two tests do not overlap beyond random expectation. Because the non-concordance could be due to low powers leading to high false negatives, we merge every 20–30 genes into a ‘supergene’. At the supergene level, the power of detection is large but the calls still do not overlap. We rule out methodological reasons for the non-concordance. In particular, extensive simulations fail to find scenarios whereby positive selection can only be detected by either MK or PAML, but not both. Since molecular evolution is governed by positive and negative selection concurrently, a fundamental assumption for estimating one of these (say, positive selection) is that the other is constant. However, in a broad survey of primates, birds, Drosophila and Arabidopsis, we found that negative selection rarely stays constant for long in evolution. As a consequence, the variation in negative selection is often misconstrued as a signal of positive selection. In conclusion, MK, PAML and any method that examines genomic sequence evolution has to explicitly address the variation in negative selection before estimating positive selection. In a companion study, we propose a possible path forward in two stages—first, by mapping out the changes in negative selection and then using this map to estimate positive selection. For now, the large literature on positive selection between species has to await reassessment.
Affiliation(s)
- Qipian Chen
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Hao Yang
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Xiao Feng
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Qingjian Chen
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Suhua Shi
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Chung-I Wu
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
- Ziwen He
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
2. Ma S, Liu H, Sun W, Mustafa A, Xi Y, Pu F, Li Y, Han C, Bai L, Hua H. Molecular evolution of the ATP-binding cassette subfamily G member 2 gene subfamily and its paralogs in birds. BMC Evol Biol 2020; 20:85. [PMID: 32664916 PMCID: PMC7362505 DOI: 10.1186/s12862-020-01654-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 02/15/2019] [Accepted: 07/07/2020] [Indexed: 12/11/2022] Open
Abstract
Background ATP-binding cassette (ABC) transporters are involved in the active transportation of various endogenous or exogenous substances. Two ABCG2 gene subfamily members have been identified in birds. A detailed comparative study of the ABCG2 and ABCG2-like genes aids our understanding of their evolutionary history at the molecular level and provides a theoretical reference for studying the specific functions of ABCG2 and ABCG2-like genes in birds. Results We first identified 77 ABCG2/ABCG2-like gene sequences in the genomes of 41 birds. Further analysis showed that both the nucleic acid and amino acid sequences of ABCG2 and ABCG2-like genes were highly conserved and exhibited high homology in birds. However, significant differences in the N-terminal structure were found between the ABCG2 and ABCG2-like amino acid sequences. A selective pressure analysis showed that the ABCG2 and ABCG2-like genes were affected by purifying selection during the process of bird evolution. Conclusions We believe that multiple members of the ABCG2 gene subfamily exist on chromosome 4 in the ancestors of birds. Over the long course of evolution, only the ABCG2 gene was retained on chromosome 4 in birds. The ABCG2-like gene on chromosome 6 might have originated from chromosome replication or fusion. The structural differences between the N terminus of ABCG2 protein and those of ABCG2-like proteins might lead to functional differences between the corresponding genes.
Affiliation(s)
- Shengchao Ma
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Hehe Liu
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Wenqiang Sun
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Ahsan Mustafa
  - Institute of Animal Nutrition, Key Laboratory for Animal Disease-Resistance Nutrition of China, Ministry of Education, Sichuan Agricultural University, Chengdu, P.R. China
- Yang Xi
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Fajun Pu
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Yanying Li
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Chunchun Han
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- Lili Bai
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
- He Hua
  - Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
3. Grant JC, González-Beiras C, Amick KM, Fortney KR, Gangaiah D, Humphreys TL, Mitjà O, Abecasis A, Spinola SM. Multiple Class I and Class II Haemophilus ducreyi Strains Cause Cutaneous Ulcers in Children on an Endemic Island. Clin Infect Dis 2019; 67:1768-1774. [PMID: 29897409 DOI: 10.1093/cid/ciy343] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 02/19/2018] [Accepted: 04/18/2018] [Indexed: 12/14/2022] Open
Abstract
Background Together with Treponema pallidum subspecies pertenue, Haemophilus ducreyi is a major cause of exudative cutaneous ulcers (CUs) in children. For H. ducreyi, both class I and class II strains, asymptomatic colonization, and environmental reservoirs have been found in endemic regions, but the epidemiology of this infection is unknown. Methods Based on published whole-genome sequences of H. ducreyi CU strains, a single-locus typing system was developed and applied to H. ducreyi-positive CU samples obtained prior to, 1 year after, and 2 years after the initiation of a mass drug administration campaign to eradicate CU on Lihir Island in Papua New Guinea. DNA from the CU samples was amplified with class I and class II dsrA-specific primers and sequenced; the samples were classified into dsrA types, which were geospatially mapped. Selection pressure analysis was performed on the dsrA sequences. Results Thirty-seven samples contained class I sequences, 27 contained class II sequences, and 13 contained both. There were 5 class I and 4 class II types circulating on the island; 3 types accounted for approximately 87% of the strains. The composition and geospatial distribution of the types varied little over time and there was no evidence of selection pressure. Conclusions Multiple strains of H. ducreyi cause CU on an endemic island and coinfections are common. In contrast to recent findings with T. pallidum pertenue, strain composition is not affected by antibiotic pressure, consistent with environmental reservoirs of H. ducreyi. Such reservoirs must be addressed to achieve eradication of H. ducreyi.
Affiliation(s)
- Jacob C Grant
  - Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
- Kristen M Amick
  - Department of Biology, Allegheny College, Meadville, Pennsylvania
- Kate R Fortney
  - Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
- Dharanesh Gangaiah
  - Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
- Oriol Mitjà
  - Lihir Medical Centre-International SOS, Newcrest Mining, Lihir Island, Papua New Guinea
  - Barcelona Institute for Global Health, Hospital Clinic, University of Barcelona, Spain
  - School of Medicine and Health Sciences, University of Papua New Guinea, Port Moresby
- Ana Abecasis
  - Global Health and Tropical Medicine, Institute of Hygiene and Tropical Medicine, Lisbon, Portugal
- Stanley M Spinola
  - Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
  - Department of Medicine, Indiana University School of Medicine, Indianapolis
  - Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis
4. Looking for Darwin in Genomic Sequences: Validity and Success Depends on the Relationship Between Model and Data. Methods Mol Biol 2019; 1910:399-426. [PMID: 31278672 DOI: 10.1007/978-1-4939-9074-0_13] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Indexed: 01/01/2023]
Abstract
Codon substitution models (CSMs) are commonly used to infer the history of natural selection for a set of protein-coding sequences, often with the explicit goal of detecting the signature of positive Darwinian selection. However, the validity and success of CSMs used in conjunction with the maximum likelihood (ML) framework is sometimes challenged with claims that the approach might too often support false conclusions. In this chapter, we use a case study approach to identify four legitimate statistical difficulties associated with inference of evolutionary events using CSMs. These include: (1) model misspecification, (2) low information content, (3) the confounding of processes, and (4) phenomenological load, or PL. While past criticisms of CSMs can be connected to these issues, the historical critiques were often misdirected, or overstated, because they failed to recognize that the success of any model-based approach depends on the relationship between model and data. Here, we explore this relationship and provide a candid assessment of the limitations of CSMs to extract historical information from extant sequences. To aid in this assessment, we provide a brief overview of: (1) a more realistic way of thinking about the process of codon evolution framed in terms of population genetic parameters, and (2) a novel presentation of the ML statistical framework. We then divide the development of CSMs into two broad phases of scientific activity and show that the latter phase is characterized by increases in model complexity that can sometimes negatively impact inference of evolutionary mechanisms. Such problems are not yet widely appreciated by the users of CSMs. These problems can be avoided by using a model that is appropriate for the data; but, understanding the relationship between the data and a fitted model is a difficult task. We argue that the only way to properly understand that relationship is to perform in silico experiments using a generating process that can mimic the data as closely as possible. The mutation-selection modeling framework (MutSel) is presented as the basis of such a generating process. We contend that if complex CSMs continue to be developed for testing explicit mechanistic hypotheses, then additional analyses such as those described here (e.g., penalized LRTs and estimation of PL) will need to be applied alongside the more traditional inferential methods.
5. Evolutionary Genetics of Hypoxia and Cold Tolerance in Mammals. J Mol Evol 2018; 86:618-634. [PMID: 30327830 DOI: 10.1007/s00239-018-9870-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Received: 05/07/2018] [Accepted: 10/03/2018] [Indexed: 01/30/2023]
Abstract
Low oxygen and fluctuant ambient temperature pose serious challenges to mammalian survival. Physiological adaptations in mammals to hypoxia and low temperatures have been intensively investigated, yet their underlying molecular mechanisms need further exploration. Independent invasions of high-altitude plateaus, subterranean burrows and marine environments by different mammals provide opportunities to conduct such analyses. Here, we focused on six genes in the hypoxia inducible factor (HIF) pathway and two non-shivering thermogenesis (NST)-related genes [PPAR co-activator 1 (PGC-1) and uncoupling protein 1 (UCP1)] in representative species of pikas and other mammals to understand whether these loci were targeted by natural selection during independent invasions to conditions characterized by hypoxia and temperature fluctuations by high-altitude, subterranean and marine mammals. Our analyses revealed pervasive positive selection signals in the HIF pathway genes of mammals occupying high-altitude, subterranean and aquatic ecosystems; however, the mechanisms underlying their independent adaptations to hypoxic environments varied by taxa, since different genes were positively selected in each taxon and expression levels of individual genes varied among species. Additionally, parallel amino acid substitutions were also detected in hypoxia-tolerant mammals, indicating that convergent evolution may play a role in their independent adaptations to hypoxic environments. However, divergent evolutionary histories of NST-related genes were noted, since significant evidence of positive selection was observed in PGC-1 and UCP1 in high-altitude species and subterranean rodents; however, UCP1 may have already lost its function in diving cetaceans, which may be related to the thick blubber layer of adipose and connective tissue in these mammals.
6. Twort VG, Dennis AB, Park D, Lomas KF, Newcomb RD, Buckley TR. Positive selection and comparative molecular evolution of reproductive proteins from New Zealand tree weta (Orthoptera, Hemideina). PLoS One 2017; 12:e0188147. [PMID: 29131842 PMCID: PMC5683631 DOI: 10.1371/journal.pone.0188147] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 05/22/2017] [Accepted: 11/01/2017] [Indexed: 11/18/2022] Open
Abstract
Animal reproductive proteins, especially those in the seminal fluid, have been shown to have higher levels of divergence than non-reproductive proteins and are often evolving adaptively. Seminal fluid proteins have been implicated in the formation of reproductive barriers between diverging lineages, and hence represent interesting candidates underlying speciation. RNA-seq was used to generate the first male reproductive transcriptome for the New Zealand tree weta species Hemideina thoracica and H. crassidens. We identified 865 putative reproductive associated proteins across both species, encompassing a diverse range of functional classes. Candidate gene sequencing of nine genes across three Hemideina, and two Deinacrida species suggests that H. thoracica has the highest levels of intraspecific genetic diversity. Non-monophyly was observed in the majority of sequenced genes indicating that either gene flow may be occurring between the species, or that reciprocal monophyly at these loci has yet to be attained. Evidence for positive selection was found for one lectin-related reproductive protein, with an overall omega of 7.65 and one site in particular being under strong positive selection. This candidate gene represents the first step in the identification of proteins underlying the evolutionary basis of weta reproduction and speciation.
Affiliation(s)
- Victoria G. Twort
  - School of Biological Sciences, The University of Auckland, Auckland, New Zealand
  - Landcare Research, Auckland, New Zealand
- Richard D. Newcomb
  - School of Biological Sciences, The University of Auckland, Auckland, New Zealand
  - The New Zealand Institute for Plant and Food Research Ltd, Auckland, New Zealand
- Thomas R. Buckley
  - School of Biological Sciences, The University of Auckland, Auckland, New Zealand
  - Landcare Research, Auckland, New Zealand
7. Antunes A, Ramos MJ. Gathering Computational Genomics and Proteomics to Unravel Adaptive Evolution. Evol Bioinform Online 2017. [DOI: 10.1177/117693430700300004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Indexed: 11/16/2022] Open
Abstract
A recent editorial in PLoS Biology by MacCallum and Hill (2006) pointed out the inappropriateness of studies evaluating signatures of positive selection based solely on single-site analyses. The rising number of recently published articles claiming positive selection therefore raises the question of how to improve bioinformatics standards for reliably unravelling positive selection. Deeper integrative efforts using state-of-the-art methodologies at the gene level and protein level are improving positive selection studies. Here we provide some computational guidelines to thoroughly document molecular adaptation.
Affiliation(s)
- Agostinho Antunes
  - REQUIMTE, Departamento de Química, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 687; 4169-007 Porto, Portugal
- Maria João Ramos
  - REQUIMTE, Departamento de Química, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 687; 4169-007 Porto, Portugal
8. Almeida D, Maldonado E, Khan I, Silva L, Gilbert MTP, Zhang G, Jarvis ED, O'Brien SJ, Johnson WE, Antunes A. Whole-Genome Identification, Phylogeny, and Evolution of the Cytochrome P450 Family 2 (CYP2) Subfamilies in Birds. Genome Biol Evol 2016; 8:1115-31. [PMID: 26979796 PMCID: PMC4860681 DOI: 10.1093/gbe/evw041] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Accepted: 02/27/2016] [Indexed: 12/19/2022] Open
Abstract
The cytochrome P450 (CYP) superfamily defends organisms from endogenous and noxious environmental compounds, and thus is crucial for survival. However, beyond mammals the molecular evolution of CYP2 subfamilies is poorly understood. Here, we characterized the CYP2 family across 48 avian whole genomes representing all major extant bird clades. Overall, 12 CYP2 subfamilies were identified, including the first description of the CYP2F, CYP2G, and several CYP2AF genes in avian genomes. Some of the CYP2 genes previously described as being lineage-specific, such as CYP2K and CYP2W, are ubiquitous to all avian groups. Furthermore, we identified a large number of CYP2J copies, which have been associated previously with water reabsorption. We detected positive selection in the avian CYP2C, CYP2D, CYP2H, CYP2J, CYP2K, and CYP2AC subfamilies. Moreover, we identified new substrate recognition sites (SRS0, SRS2_SRS3, and SRS3.1) and heme binding areas of functional importance, which influence CYP2 structure and function, as being under significant positive selection. Some of the positively selected sites in avian CYP2D are located within the same SRS1 region that was previously linked with the metabolism of plant toxins. Additionally, we find that selective constraint variations in some avian CYP2 subfamilies are consistently associated with different feeding habits (CYP2H and CYP2J), habitats (CYP2D, CYP2H, CYP2J, and CYP2K), and migratory behaviors (CYP2D, CYP2H, and CYP2J). Overall, our findings indicate that there has been active enzyme site selection on CYP2 subfamilies and differential selection associated with different life history traits among birds.
Affiliation(s)
- Daniela Almeida
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
  - Department of Biology, Faculty of Sciences, University of Porto, Portugal
- Emanuel Maldonado
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
- Imran Khan
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
  - Department of Biology, Faculty of Sciences, University of Porto, Portugal
- Liliana Silva
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
  - Department of Biology, Faculty of Sciences, University of Porto, Portugal
- M Thomas P Gilbert
  - Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Denmark
- Guojie Zhang
  - China National GeneBank, BGI-Shenzhen, Shenzhen, China
  - Centre for Social Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Erich D Jarvis
  - Centre for Social Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
  - Howard Hughes Medical Institute, Chevy Chase, Maryland
- Stephen J O'Brien
  - Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, Russia
  - Oceanographic Center, Nova Southeastern University, Ft Lauderdale
- Warren E Johnson
  - National Zoological Park, Smithsonian Conservation Biology Institute, Washington DC
- Agostinho Antunes
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
  - Department of Biology, Faculty of Sciences, University of Porto, Portugal
9. Kamau LM, Skilton RA, Githaka N, Kiara H, Kabiru E, Shah T, Musoke AJ, Bishop RP. Extensive polymorphism of Ra86 genes in field populations of Rhipicephalus appendiculatus from Kenya. Ticks Tick Borne Dis 2016; 7:772-781. [PMID: 27051976 DOI: 10.1016/j.ttbdis.2016.03.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 08/11/2015] [Revised: 03/15/2016] [Accepted: 03/16/2016] [Indexed: 10/22/2022]
Abstract
Commercial vaccines based on recombinant forms of the Bm86 tick gut antigen are used to control the southern cattle tick, Rhipicephalus microplus, a 1-host species, in Australia and Latin America. We describe herein sequence polymorphism in genes encoding Ra86 homologues of Bm86 in the brown ear tick, Rhipicephalus appendiculatus, isolated from four Kenyan field populations and one laboratory colony. Sequencing of 19 Ra86 sequences defined two alleles differentiated by indels, encoding 693 amino acids (aa) and 654 aa respectively, from the Muguga laboratory reference strain. Ra86 sequences were also determined from gut cDNA from four field populations of R. appendiculatus collected in different livestock production systems in Kenya. Analysis of approximately 20 Ra86 sequences from each of the four field sites in central and western Kenya (Makuyu, Kiambu, Kakamega and Uasin Gishu) revealed three additional size types differentiated by 39-49 amino acid indels, resulting in a total of 5 indel-defined genotypes. The 693 aa type 5 was isolated only from the laboratory tick stock; genotypes 1, 2 and 3 were identified in ticks from the four Kenyan field sites and appeared to be derivatives of the shorter Ra86 genotype found in Muguga laboratory stock, genotype 4. By contrast, no large indels have yet been observed between R. microplus sequences from Australia, South America or Africa. Evidence that selection contributes to the observed sequence variation was provided by analysis of the ratio of synonymous and non-synonymous substitutions and application of the selective neutrality and neutral evolution tests to the primary data. Phylogenetic analysis clustered sequences from all Ra86 size types and Bm86 into four major clades based on amino acid substitutions, but there was no evidence that these groupings correlated with geographical separation of R. appendiculatus populations.
Affiliation(s)
- L M Kamau
  - Department of Zoological Sciences, Kenyatta University, P.O. Box 43844, Nairobi, Kenya
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- R A Skilton
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- N Githaka
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- H Kiara
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- E Kabiru
  - Department of Zoological Sciences, Kenyatta University, P.O. Box 43844, Nairobi, Kenya
- T Shah
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- A J Musoke
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
- R P Bishop
  - International Livestock Research Institute (ILRI), P.O. Box 30709, Nairobi, Kenya
10. Adaptation of the Mitochondrial Genome in Cephalopods: Enhancing Proton Translocation Channels and the Subunit Interactions. PLoS One 2015; 10:e0135405. [PMID: 26285039 PMCID: PMC4540416 DOI: 10.1371/journal.pone.0135405] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 03/04/2015] [Accepted: 07/21/2015] [Indexed: 01/25/2023] Open
Abstract
Mitochondrial protein-coding genes (mt genes) encode subunits forming complexes of crucial cellular pathways, including those involved in the vital process of oxidative phosphorylation (OXPHOS). Despite the vital role of the mitochondrial genome (mt genome) in the survival of organisms, little is known with respect to its adaptive implications within marine invertebrates. The molluscan Class Cephalopoda is represented by a marine group of species known to occupy contrasting environments ranging from the intertidal to the deep sea, having distinct metabolic requirements, varied body shapes and highly advanced visual and nervous systems that make them highly competitive and successful worldwide predators. Thus, cephalopods are valuable models for testing natural selection acting on their mitochondrial subunits (mt subunits). Here, we used concatenated mt genes from 17 fully sequenced mt genomes of diverse cephalopod species to generate a robust mitochondrial phylogeny for the Class Cephalopoda. We followed an integrative approach considering several branches of interest–covering cephalopods with distinct morphologies, metabolic rates and habitats–to identify sites under positive selection and localize them in the respective protein alignment and/or tridimensional structure of the mt subunits. Our results revealed significant adaptive variation in several mt subunits involved in the energy production pathway of cephalopods: ND5 and ND6 from Complex I, CYTB from Complex III, COX2 and COX3 from Complex IV, and in ATP8 from Complex V. Furthermore, we identified relevant sites involved in protein-interactions, lining proton translocation channels, as well as disease/deficiencies related sites in the aforementioned complexes. A particular case, revealed by this study, is the involvement of some positively selected sites, found in Octopoda lineage in lining proton translocation channels (site 74 from ND5) and in interactions between subunits (site 507 from ND5) of Complex I.
11. Global Diversity within and between Human Herpesvirus 1 and 2 Glycoproteins. J Virol 2015; 89:8206-18. [PMID: 26018161 DOI: 10.1128/jvi.01302-15] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Received: 05/18/2015] [Accepted: 05/19/2015] [Indexed: 12/17/2022] Open
Abstract
Human herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are large-genome DNA viruses that establish a persistent infection in sensory neurons and commonly manifest with recurring oral or genital erosions that transmit virus. HSV encodes 12 predicted glycoproteins that serve various functions, including cellular attachment, entry, and egress. Glycoprotein G is currently the target of an antibody test to differentiate HSV-1 from HSV-2; however, this test has shown reduced capacity to differentiate HSV strains in East Africa. Until the recent availability of 26 full-length HSV-1 and 36 full-length HSV-2 sequences, minimal comparative information was available for these viruses. In this study, we use a variety of sequence analysis methods to compare all available sequence data for HSV-1 and HSV-2 glycoproteins, using viruses isolated in Europe, Asia, North America, the Republic of South Africa, and East Africa. We found numerous differences in diversity, nonsynonymous/synonymous substitution rates, and recombination rates between HSV-1 glycoproteins and their HSV-2 counterparts. Phylogenetic analysis revealed that while most global HSV-2 glycoprotein G sequences did not form clusters within or between continents, one clade (supported at 60.5%) contained 37% of the African sequences analyzed. Accordingly, sequences from this African subset contained unique amino acid signatures, not only in glycoprotein G, but also in glycoproteins I and E, which may account for the failure of sensitive antibody tests to distinguish HSV-1 from HSV-2 in some African individuals. Consensus sequences generated in the study can be used to improve diagnostic assays that differentiate HSV-1 from HSV-2 in global populations.
IMPORTANCE Human herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are large DNA viruses associated with recurring oral or genital erosions that transmit virus. Up to 12 HSV-1 and HSV-2 glycoproteins are involved in HSV cell entry or are required for viral spread in animals, albeit some are dispensable for replication in vitro. The recent availability of comparable numbers of full-length HSV-1 and HSV-2 sequences enabled comparative analysis of gene diversity of glycoproteins within and between HSV types. Overall, we found less glycoprotein sequence diversity within HSV-2 than within the HSV-1 strains studied, while at the same time, several HSV-2 glycoproteins were evolving under less selective pressure. Because HSV glycoproteins are the focus of antibody tests to detect and differentiate between infections with the two strains and are constituents of vaccines in clinical-stage development, these findings will aid in refining the targets for diagnostic tests and vaccines.
|
12
|
Pheromone receptor evolution in the cryptic leafroller species, Ctenopseustis obliquana and C. herana. J Mol Evol 2014; 80:42-56. [PMID: 25252791 DOI: 10.1007/s00239-014-9650-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Received: 05/03/2014] [Accepted: 09/18/2014] [Indexed: 10/24/2022]
Abstract
How new mate recognition systems evolve when changes are required in both the male and female components remains a conundrum. Here, we investigated the molecular basis of pheromone reception in two species of tortricid (leafroller) moth, Ctenopseustis obliquana and C. herana. Male C. obliquana are attracted to a 90:10 blend of (Z)-8-tetradecenyl acetate (Z8-14:OAc) and (Z)-5-tetradecenyl acetate (Z5-14:OAc), whereas C. herana males are attracted to Z5-14:OAc alone. We used a transcriptome sequencing approach from adult male and female antennae to identify 47 olfactory receptors (ORs) from each species and assessed their expression levels in male and female antennae using RNA-Seq counting and quantitative RT-PCR. Three male-biased and one female-biased OR were identified in C. obliquana by quantitative RT-PCR, and four male-biased and one female-biased receptor in C. herana. The male-biased receptors, CoblOR7, CoblOR30, CherOR7, CherOR30, CherOR1a and CherOR1b were tested for their ability to respond to sex pheromone components in a HEK293 cell calcium assay. CoblOR7 and CherOR7 responded to Z8-14:OAc, however, no receptor for Z5-14:OAc was identified. In addition to Z8-14:OAc, CherOR7 also responded to Z7-14:OAc, indicating that this receptor may be under relaxed constraint. Of the 29 amino acid differences between CoblOR7 and CherOR7, significantly more are located in the third and the sixth transmembrane domain regions. Overall, these findings are consistent with studies revealing the presence of neurons tuned to both Z8-14:OAc and Z5-14:OAc in both species, but that for C. herana males, the ability to detect Z8-14:OAc is currently not required.
|
13
|
Garvin MR, Bielawski JP, Sazanov LA, Gharrett AJ. Review and meta-analysis of natural selection in mitochondrial complex I in metazoans. J Zool Syst Evol Res 2014. [DOI: 10.1111/jzs.12079] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Indexed: 12/17/2022]
Affiliation(s)
- Michael R. Garvin
- Fisheries Division; School of Fisheries and Ocean Sciences; University of Alaska Fairbanks; Juneau AK USA
- Joseph P. Bielawski
- Department of Biology; Dalhousie University; Halifax NS Canada
- Department of Mathematics & Statistics; Dalhousie University; Halifax NS Canada
- Anthony J. Gharrett
- Fisheries Division; School of Fisheries and Ocean Sciences; University of Alaska Fairbanks; Juneau AK USA
|
14
|
The Site-Wise Log-Likelihood Score is a Good Predictor of Genes under Positive Selection. J Mol Evol 2013; 76:280-94. [DOI: 10.1007/s00239-013-9557-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 11/26/2012] [Accepted: 03/20/2013] [Indexed: 12/21/2022]
|
15
|
Voordeckers K, Brown CA, Vanneste K, van der Zande E, Voet A, Maere S, Verstrepen KJ. Reconstruction of ancestral metabolic enzymes reveals molecular mechanisms underlying evolutionary innovation through gene duplication. PLoS Biol 2012; 10:e1001446. [PMID: 23239941 PMCID: PMC3519909 DOI: 10.1371/journal.pbio.1001446] [Citation(s) in RCA: 138] [Impact Index Per Article: 11.5] [Received: 02/23/2012] [Accepted: 10/30/2012] [Indexed: 11/24/2022] Open
Abstract
Gene duplications are believed to facilitate evolutionary innovation. However, the mechanisms shaping the fate of duplicated genes remain heavily debated because the molecular processes and evolutionary forces involved are difficult to reconstruct. Here, we study a large family of fungal glucosidase genes that underwent several duplication events. We reconstruct all key ancestral enzymes and show that the very first preduplication enzyme was primarily active on maltose-like substrates, with trace activity for isomaltose-like sugars. Structural analysis and activity measurements on resurrected and present-day enzymes suggest that both activities cannot be fully optimized in a single enzyme. However, gene duplications repeatedly spawned daughter genes in which mutations optimized either isomaltase or maltase activity. Interestingly, similar shifts in enzyme activity were reached multiple times via different evolutionary routes. Together, our results provide a detailed picture of the molecular mechanisms that drove divergence of these duplicated enzymes and show that whereas the classic models of dosage, sub-, and neofunctionalization are helpful to conceptualize the implications of gene duplication, the three mechanisms co-occur and intertwine.
Affiliation(s)
- Karin Voordeckers
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
- Chris A. Brown
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
- Fathom Information Design, Boston, Massachusetts, United States of America
- Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Kevin Vanneste
- VIB Department of Plant Systems Biology, Gent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium
- Elisa van der Zande
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
- Arnout Voet
- Laboratory for Molecular en Structural Biology, KU Leuven, Leuven, Belgium
- Steven Maere
- VIB Department of Plant Systems Biology, Gent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium
- Kevin J. Verstrepen
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
|
16
|
Lin MF, Kheradpour P, Washietl S, Parker BJ, Pedersen JS, Kellis M. Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes. Genome Res 2011; 21:1916-28. [PMID: 21994248 DOI: 10.1101/gr.108753.110] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.5] [Indexed: 02/05/2023]
Abstract
The degeneracy of the genetic code allows protein-coding DNA and RNA sequences to simultaneously encode additional, overlapping functional elements. A sequence in which both protein-coding and additional overlapping functions have evolved under purifying selection should show increased evolutionary conservation compared to typical protein-coding genes--especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ∼2% of their synonymous sites. We collect numerous lines of evidence that the observed synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian genes, despite the vast genomic landscape.
Affiliation(s)
- Michael F Lin
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
|
17
|
Suzuki Y. Statistical methods for detecting natural selection from genomic data. Genes Genet Syst 2011; 85:359-76. [PMID: 21415566 DOI: 10.1266/ggs.85.359] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Indexed: 11/23/2022] Open
Abstract
In the study of molecular and phenotypic evolution, understanding the relative importance of random genetic drift and positive selection as the mechanisms for driving divergences between populations and maintaining polymorphisms within populations has been a central issue. A variety of statistical methods has been developed for detecting natural selection operating at the amino acid and nucleotide sequence levels. These methods may be largely classified into those aimed at detecting recurrent and/or recent/ongoing natural selection by utilizing the divergence and/or polymorphism data. Using these methods, pervasive positive selection has been identified for protein-coding and non-coding sequences in the genomic analysis of some organisms. However, many of these methods have been criticized, on the basis of computer simulation and real data analysis, for producing excessive false positives and for being sensitive to various disturbing factors. Importantly, some of these methods have been invalidated experimentally. These facts indicate that many of the statistical methods for detecting natural selection are unreliable. In addition, the signals that have been regarded as evidence for fixations of advantageous mutations due to positive selection may also be interpreted as evidence for fixations of deleterious mutations due to random genetic drift. The genomic diversity data are rapidly accumulating in various organisms, and detection of natural selection may play a critical role in clarifying the relative roles of random genetic drift and positive selection in molecular and phenotypic evolution. It is therefore important to develop reliable statistical methods for inferring natural selection that are unbiased as well as robust against various disturbing factors.
Affiliation(s)
- Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, Japan.
|
18
|
Woodard SH, Fischman BJ, Venkat A, Hudson ME, Varala K, Cameron SA, Clark AG, Robinson GE. Genes involved in convergent evolution of eusociality in bees. Proc Natl Acad Sci U S A 2011; 108:7472-7. [PMID: 21482769 PMCID: PMC3088614 DOI: 10.1073/pnas.1103457108] [Citation(s) in RCA: 139] [Impact Index Per Article: 10.7] [Indexed: 11/18/2022] Open
Abstract
Eusociality has arisen independently at least 11 times in insects. Despite this convergence, there are striking differences among eusocial lifestyles, ranging from species living in small colonies with overt conflict over reproduction to species in which colonies contain hundreds of thousands of highly specialized sterile workers produced by one or a few queens. Although the evolution of eusociality has been intensively studied, the genetic changes involved in the evolution of eusociality are relatively unknown. We examined patterns of molecular evolution across three independent origins of eusociality by sequencing transcriptomes of nine socially diverse bee species and combining these data with genome sequence from the honey bee Apis mellifera to generate orthologous sequence alignments for 3,647 genes. We found a shared set of 212 genes with a molecular signature of accelerated evolution across all eusocial lineages studied, as well as unique sets of 173 and 218 genes with a signature of accelerated evolution specific to either highly or primitively eusocial lineages, respectively. These results demonstrate that convergent evolution can involve a mosaic pattern of molecular changes in both shared and lineage-specific sets of genes. Genes involved in signal transduction, gland development, and carbohydrate metabolism are among the most prominent rapidly evolving genes in eusocial lineages. These findings provide a starting point for linking specific genetic changes to the evolution of eusociality.
Affiliation(s)
- S Hollis Woodard
- Department of Crop Sciences and Entomology, Institute for Genomic Biology, and Neuroscience Program, University of Illinois, Urbana, IL 61801, USA
|
19
|
Yang Z, dos Reis M. Statistical properties of the branch-site test of positive selection. Mol Biol Evol 2010; 28:1217-28. [PMID: 21087944 DOI: 10.1093/molbev/msq303] [Citation(s) in RCA: 230] [Impact Index Per Article: 16.4] [Indexed: 11/13/2022] Open
Abstract
The branch-site test is a likelihood ratio test to detect positive selection along prespecified lineages on a phylogeny that affects only a subset of codons in a protein-coding gene, with positive selection indicated by accelerated nonsynonymous substitutions (with ω = d(N)/d(S) > 1). This test may have more power than earlier methods, which average nucleotide substitution rates over sites in the protein and/or over branches on the tree. However, a few recent studies questioned the statistical basis of the test and claimed that the test generated too many false positives. In this paper, we examine the null distribution of the test and conduct a computer simulation to examine the false-positive rate and the power of the test. The results suggest that the asymptotic theory is reliable for typical data sets, and indeed in our simulations, the large-sample null distribution was reliable with as few as 20-50 codons in the alignment. We examined the impact of sequence length, the strength of positive selection, and the proportion of sites under positive selection on the power of the branch-site test. We found that the test was far more powerful in detecting episodic positive selection than branch-based tests, which average substitution rates over all codons in the gene and thus miss the signal when most codons are under strong selective constraint. Recent claims of statistical problems with the branch-site test are due to misinterpretations of simulation results. Our results, as well as previous simulation studies that have demonstrated the robustness of the test, suggest that the branch-site test may be a useful tool for detecting episodic positive selection and for generating biological hypotheses for mutation studies and functional analyses. The test is sensitive to sequence and alignment errors and caution should be exercised concerning its use when data quality is in doubt.
Affiliation(s)
- Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, United Kingdom.
|
20
|
Selection and the cell cycle: positive Darwinian selection in a well-known DNA damage response pathway. J Mol Evol 2010; 71:444-57. [PMID: 21057781 DOI: 10.1007/s00239-010-9399-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Received: 04/16/2010] [Accepted: 10/06/2010] [Indexed: 10/18/2022]
Abstract
Cancer is a common occurrence in multi-cellular organisms and is not strictly limited to the elderly in a population. It is therefore possible that individuals with genotypes that protect against early onset cancers have a selective advantage. In this study the patterns of mutation in the proteins of a well-studied DNA damage response pathway have been examined for evidence of adaptive evolutionary change. Using a maximum likelihood framework and the mammalian species phylogeny, together with codon models of evolution, selective pressure variation across the interacting network of proteins has been detected. The presence of signatures of adaptive evolution in BRCA1 and BRCA2 has already been documented but the effect on the entire network of interacting proteins in this damage response pathway has, until now, been unknown. Positive selection is evident throughout the network with a total of 11 proteins out of 15 examined displaying patterns of substitution characteristic of positive selection. It is also shown here that modern human populations display evidence of an ongoing selective sweep in 9 of these DNA damage repair proteins. The results presented here provide the community with new residues that may be relevant to cancer susceptibility while also highlighting those proteins where human and mouse have undergone lineage-specific functional shift. An understanding of this damage response pathway from an evolutionary perspective will undoubtedly contribute to future cancer treatment approaches.
|
21
|
Nei M, Suzuki Y, Nozawa M. The neutral theory of molecular evolution in the genomic era. Annu Rev Genomics Hum Genet 2010; 11:265-89. [PMID: 20565254 DOI: 10.1146/annurev-genom-082908-150129] [Citation(s) in RCA: 162] [Impact Index Per Article: 11.6] [Indexed: 11/09/2022]
Abstract
The neutral theory of molecular evolution has been widely accepted and is the guiding principle for studying evolutionary genomics and the molecular basis of phenotypic evolution. Recent data on genomic evolution are generally consistent with the neutral theory. However, many recently published papers claim the detection of positive Darwinian selection via the use of new statistical methods. Examination of these methods has shown that their theoretical bases are not well established and often result in high rates of false-positive and false-negative results. When the deficiencies of these statistical methods are rectified, the results become largely consistent with the neutral theory. At present, genome-wide analyses of natural selection consist of collections of single-locus analyses. However, because phenotypic evolution is controlled by the interaction of many genes, the study of natural selection ought to take such interactions into account. Experimental studies of evolution will also be crucial.
Affiliation(s)
- Masatoshi Nei
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, PA 16802, USA.
|
22
|
Chen X, Zhang Q, Li J, Cao W, Zhang JX, Zhang L, Zhang W, Shao ZJ, Yan Y. Analysis of recombination and natural selection in human enterovirus 71. Virology 2010; 398:251-61. [DOI: 10.1016/j.virol.2009.12.007] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Received: 10/07/2009] [Revised: 11/13/2009] [Accepted: 12/04/2009] [Indexed: 11/30/2022]
|
23
|
Heidel-Fischer HM, Vogel H, Heckel DG, Wheat CW. Microevolutionary dynamics of a macroevolutionary key innovation in a Lepidopteran herbivore. BMC Evol Biol 2010; 10:60. [PMID: 20181249 PMCID: PMC2841170 DOI: 10.1186/1471-2148-10-60] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Received: 07/20/2009] [Accepted: 02/24/2010] [Indexed: 01/23/2023] Open
Abstract
Background: A molecular population genetics understanding is central to the study of ecological and evolutionary functional genomics. Population genetics identifies genetic variation and its distribution within and among populations, it reveals the demographic history of the populations studied, and can provide indirect insights into historical selection dynamics. Here we use this approach to examine the demographic and selective dynamics acting on a candidate gene involved in plant-insect interactions. Previous work documents the macroevolutionary and historical ecological importance of the nitrile-specifier protein (Nsp), which facilitated the host shift of Pieridae butterflies onto Brassicales host plants ~80 Myr ago.
Results: Here we assess the microevolutionary dynamics of the Nsp gene by studying the within- and among-population variation at Nsp and reference genes in the butterfly Pieris rapae (Small Cabbage White). Nsp exhibits unexpectedly high amounts of amino acid polymorphism, unequally distributed across the gene. The vast majority of genetic variation exists within populations, with little to no genetic differentiation among four populations on two continents. A comparison of synonymous and nonsynonymous substitutions in 70 randomly chosen genes among P. rapae and its close relative Pieris brassicae (Large Cabbage White) finds Nsp to have a significantly relaxed functional constraint compared to housekeeping genes. We find strong evidence for a recent population expansion and no role for strong purifying or directional selection upon the Nsp gene.
Conclusions: The microevolutionary dynamics of the Nsp gene in P. rapae are dominated by recent population expansion and variation in functional constraint across the repeated domains of the Nsp gene. While the high amounts of amino acid diversity suggest there may be significant functional differences among allelic variants segregating within populations, indirect tests of selection could not conclusively identify a signature of historical selection. The importance of using this information for planning future studies of potential performance and fitness consequences of the observed variation is discussed.
|
24
|
Aguileta G, Lengelle J, Marthey S, Chiapello H, Rodolphe F, Gendrault A, Yockteng R, Vercken E, Devier B, Fontaine MC, Wincker P, Dossat C, Cruaud C, Couloux A, Giraud T. Finding candidate genes under positive selection in non-model species: examples of genes involved in host specialization in pathogens. Mol Ecol 2009; 19:292-306. [PMID: 20041992 DOI: 10.1111/j.1365-294x.2009.04454.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Indexed: 11/29/2022]
Abstract
Numerous genes in diverse organisms have been shown to be under positive selection, especially genes involved in reproduction, adaptation to contrasting environments, hybrid inviability, and host-pathogen interactions. Looking for genes under positive selection in pathogens has been a priority in efforts to investigate coevolution dynamics and to develop vaccines or drugs. To elucidate the functions involved in host specialization, here we aimed at identifying candidate sequences that could have evolved under positive selection among closely related pathogens specialized on different hosts. For this goal, we sequenced c. 17,000-32,000 ESTs from each of four Microbotryum species, which are fungal pathogens responsible for anther smut disease on host plants in the Caryophyllaceae. Forty-two of the 372 predicted orthologous genes showed a significant signal of positive selection, which represents a good number of candidate genes for further investigation. Sequencing 16 of these genes in 9 additional Microbotryum species confirmed that they have indeed been rapidly evolving in the pathogen species specialized on different hosts. The genes showing significant signals of positive selection were putatively involved in nutrient uptake from the host, secondary metabolite synthesis and secretion, respiration under stressful conditions and stress response, hyphal growth and differentiation, and regulation of expression by other genes. Many of these genes had transmembrane domains and may therefore also be involved in pathogen recognition by the host. Our approach thus proved fruitful and should be feasible for many non-model organisms for which candidate genes for diversifying selection are needed.
Affiliation(s)
- G Aguileta
- Ecologie, Systématique et Evolution, Université Paris-Sud, F-91405 Orsay cedex, France
|
25
|
Holmes EC. The Evolutionary Genetics of Emerging Viruses. Annu Rev Ecol Evol Syst 2009. [DOI: 10.1146/annurev.ecolsys.110308.120248] [Citation(s) in RCA: 143] [Impact Index Per Article: 9.5] [Indexed: 11/09/2022]
Affiliation(s)
- Edward C. Holmes
- Center for Infectious Disease Dynamics, Department of Biology, Mueller Laboratory, Pennsylvania State University, University Park, Pennsylvania 16802; and Fogarty International Center, National Institutes of Health, Bethesda, Maryland 20892
|
26
|
Arias CF, Escalera-Zamudio M, de los Dolores Soto-Del Río M, Georgina Cobián-Güemes A, Isa P, López S. Molecular Anatomy of 2009 Influenza Virus A (H1N1). Arch Med Res 2009; 40:643-54. [DOI: 10.1016/j.arcmed.2009.10.007] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Received: 09/22/2009] [Accepted: 10/08/2009] [Indexed: 01/29/2023]
|
27
|
Smadja C, Shi P, Butlin RK, Robertson HM. Large gene family expansions and adaptive evolution for odorant and gustatory receptors in the pea aphid, Acyrthosiphon pisum. Mol Biol Evol 2009; 26:2073-86. [PMID: 19542205 DOI: 10.1093/molbev/msp116] [Citation(s) in RCA: 140] [Impact Index Per Article: 9.3] [Indexed: 12/16/2023] Open
Abstract
Gaining insight into the mechanisms of chemoreception in aphids is of primary importance for both integrative studies on the evolution of host plant specialization and applied research in pest control management because aphids rely on their sense of smell and taste to locate and assess their host plants. We made use of the recent genome sequence of the pea aphid, Acyrthosiphon pisum, to address the molecular characterization and evolution of key molecular components of chemoreception: the odorant (Or) and gustatory (Gr) receptor genes. We identified 79 Or and 77 Gr genes in the pea aphid genome and showed that most of them are aphid-specific genes that have undergone recent and rapid expansion in the genome. By addressing selection within sets of paralogous Or and Gr expansions, for the first time in an insect species, we show that the most recently duplicated loci have evolved under positive selection, which might be related to the high degree of ecological specialization of this species. Although more functional studies are still needed for insect chemoreceptors, we provide evidence that Grs and Ors have different sets of positively selected sites, suggesting the possibility that these two gene families might have different binding pockets and bind structurally distinct classes of ligand. The pea aphid is the most basal insect species with a completely sequenced genome to date. The identification of chemoreceptor genes in this species is a key step toward further exploring insect comparative genetics, the genomics of ecological specialization and speciation, and new pest control strategies.
Affiliation(s)
- Carole Smadja
- Animal and Plant Sciences Department, University of Sheffield, Sheffield, United Kingdom.
|
28
|
Wood N, Bhattacharya T, Keele BF, Giorgi E, Liu M, Gaschen B, Daniels M, Ferrari G, Haynes BF, McMichael A, Shaw GM, Hahn BH, Korber B, Seoighe C. HIV evolution in early infection: selection pressures, patterns of insertion and deletion, and the impact of APOBEC. PLoS Pathog 2009; 5:e1000414. [PMID: 19424423 PMCID: PMC2671846 DOI: 10.1371/journal.ppat.1000414] [Citation(s) in RCA: 143] [Impact Index Per Article: 9.5] [Received: 12/29/2008] [Accepted: 04/06/2009] [Indexed: 11/24/2022] Open
Abstract
The pattern of viral diversification in newly infected individuals provides information about the host environment and immune responses typically experienced by the newly transmitted virus. For example, sites that tend to evolve rapidly across multiple early-infection patients could be involved in enabling escape from common early immune responses, could represent adaptation for rapid growth in a newly infected host, or could represent reversion from less fit forms of the virus that were selected for immune escape in previous hosts. Here we investigated the diversification of HIV-1 env coding sequences in 81 very early B subtype infections previously shown to have resulted from transmission or expansion of single viruses (n = 78) or two closely related viruses (n = 3). In these cases, the sequence of the infecting virus can be estimated accurately, enabling inference of both the direction of substitutions as well as distinction between insertion and deletion events. By integrating information across multiple acutely infected hosts, we find evidence of adaptive evolution of HIV-1 env and identify a subset of codon sites that diversified more rapidly than can be explained by a model of neutral evolution. Of 24 such rapidly diversifying sites, 14 were either i) clustered and embedded in CTL epitopes that were verified experimentally or predicted based on the individual's HLA or ii) in a nucleotide context indicative of APOBEC-mediated G-to-A substitutions, despite having excluded heavily hypermutated sequences prior to the analysis. In several cases, a rapidly evolving site was embedded both in an APOBEC motif and in a CTL epitope, suggesting that APOBEC may facilitate early immune escape. Ten rapidly diversifying sites could not be explained by CTL escape or APOBEC hypermutation, including the most frequently mutated site, in the fusion peptide of gp41. 
We also examined the distribution, extent, and sequence context of insertions and deletions, and we provide evidence that the length variation seen in hypervariable loop regions of the envelope glycoprotein is a consequence of selection and not of mutational hotspots. Our results provide a detailed view of the process of diversification of HIV-1 following transmission, highlighting the role of CTL escape and hypermutation in shaping viral evolution during the establishment of new infections. HIV is a rapidly evolving virus, displaying enormous genetic diversity between and even within infected individuals, with implications for vaccine design and drug treatment. Yet, recent research has shown that most new infections result from transmission of a single virus resulting in a homogeneous viral population in early infection. The process of diversification from the transmitted virus provides information about the selection pressures experienced by the virus during the establishment of a new infection. In this paper, we studied early diversification of the envelope gene in a cohort of 81 subjects acutely infected with HIV-1 subtype B and found evidence of adaptive evolution, with a proportion of sites that tended to diversify more rapidly than expected under a model of neutral evolution. Several of these rapidly diversifying sites facilitate escape from early cytotoxic immune responses. Interestingly, hypermutation of the virus, brought about by host proteins as a strategy to restrict infection, appeared to be associated with early immune escape. In addition to single base substitutions, insertions and deletions are an important aspect of HIV evolution. We show that insertion and deletion mutations occur evenly across the gene, but are preferentially fixed in the variable loop regions.
Affiliation(s)
- Natasha Wood
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Observatory, Cape Town, South Africa
- Centre for High-Performance Computing, CSIR Campus, Rosebank, Cape Town, South Africa
- Tanmoy Bhattacharya
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
- Brandon F. Keele
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Elena Giorgi
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, Massachusetts, United States of America
- Michael Liu
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom
- Brian Gaschen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Marcus Daniels
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Guido Ferrari
- Duke University, Durham, North Carolina, United States of America
- Barton F. Haynes
- Duke University, Durham, North Carolina, United States of America
- Andrew McMichael
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom
- George M. Shaw
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Beatrice H. Hahn
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Bette Korber
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
- Cathal Seoighe
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Observatory, Cape Town, South Africa
- Centre for High-Performance Computing, CSIR Campus, Rosebank, Cape Town, South Africa
- School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Ireland
|
29
|
Wu J, Zhao F, Bai J, Deng G, Qin S, Bao Q. Evidence for positive Darwinian selection of Vip gene in Bacillus thuringiensis. J Genet Genomics 2007; 34:649-60. [PMID: 17643951 DOI: 10.1016/s1673-8527(07)60074-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Received: 12/31/2006] [Accepted: 02/24/2007] [Indexed: 10/23/2022]
Abstract
Vegetative insecticidal proteins (VIPs), produced by Bacillus thuringiensis during its vegetative growth stage, are a group of insecticidal proteins and represent the second generation of insecticidal transgenes that will complement the novel delta-endotoxins in the future. Less is known about the structural and functional relationships of Vip proteins than about those of delta-endotoxins. In this study, both maximum-likelihood methods and maximum-parsimony-based sliding window analysis were used to evaluate the molecular evolution of Vip proteins. Strong evidence was found that Vip proteins are subject to high rates of positive selection, and 16 sites were identified as being under positive selection using the Bayes empirical Bayes method. Interestingly, all these positively selected sites are located between site 705 and site 809 in the C-terminus of the Vip proteins. Most of these sites are exposed and clustered in the loop regions when mapped onto the computationally predicted secondary structure and a part of the tertiary structure. It has been postulated that the high divergence in the C-terminus of Vip proteins may result not from a lack of functional constraints, but rather from rapid mutation to adapt to their targeted insects, driven by positive selection. The potential positive selection pressures may be an attempt to adapt to the "arms race" between Vip proteins and the targeted insects, or to enlarge the range of targeted hosts. Sites identified as being under positive selection may be related to the insect host range, which may shed light on the investigation of the structural and functional relationships of Vip proteins.
Affiliation(s)
- Jinyu Wu
- Institute of Biomedical Informatics / Zhejiang Provincial Key Laboratory of Medical Genetics, Wenzhou Medical College, Wenzhou 325000, China
|
30
|
Reliabilities of identifying positive selection by the branch-site and the site-prediction methods. Proc Natl Acad Sci U S A 2009; 106:6700-5. [PMID: 19339501 DOI: 10.1073/pnas.0901855106] [Citation(s) in RCA: 129] [Impact Index Per Article: 8.6] [Indexed: 01/17/2023] Open
Abstract
Natural selection operating in protein-coding genes is often studied by examining the ratio (omega) of the rates of nonsynonymous to synonymous nucleotide substitution. The branch-site method (BSM), based on a likelihood ratio test, is one such test for detecting positive selection on a predetermined branch of a phylogenetic tree. However, because the number of nucleotide substitutions involved is often very small, we conducted a computer simulation to examine the reliability of BSM in comparison with the small-sample method (SSM) based on Fisher's exact test. The results indicate that BSM often generates false positives compared with SSM when the number of nucleotide substitutions is approximately 80 or smaller. Because the omega value is also used for predicting positively selected sites, we examined the reliabilities of the site-prediction methods, using nucleotide sequence data for the dim-light and color vision genes in vertebrates. The results showed that the site-prediction methods have a low probability of identifying experimentally determined functional changes of amino acids and often falsely identify other sites where amino acid substitutions are unlikely to be important. This low rate of predictability occurs because most of the current statistical methods are designed to identify codon sites with high omega values, which may not have anything to do with functional changes. The codon sites showing functional changes generally do not show a high omega value. To understand adaptive evolution, some form of experimental confirmation is necessary.
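As a rough illustration of the small-sample approach mentioned in this abstract, the sketch below applies a one-sided Fisher's exact test to a 2×2 table of changed versus unchanged nonsynonymous and synonymous sites on a branch. The function name, the table layout, and the example counts are assumptions made for illustration and are not taken from the paper; site counts are assumed to have been rounded to integers beforehand.

```python
from math import comb


def fisher_exact_greater(n_changed, s_changed, n_unchanged, s_unchanged):
    """One-sided Fisher's exact test on a 2x2 table (illustrative layout).

                   nonsynonymous   synonymous
      changed            a             b
      unchanged          c             d

    Returns P(X >= a) under the hypergeometric null with fixed margins,
    i.e. the probability of seeing at least this many nonsynonymous
    changes if changes fell on both site classes at the same rate.
    """
    a, b, c, d = n_changed, s_changed, n_unchanged, s_unchanged
    row1 = a + b            # total number of changes
    col1 = a + c            # total number of nonsynonymous sites
    total = a + b + c + d
    denom = comb(total, row1)
    upper = min(row1, col1)
    tail = sum(
        comb(col1, x) * comb(total - col1, row1 - x)
        for x in range(a, upper + 1)
    )
    return tail / denom


# Hypothetical counts for one branch: 9 nonsynonymous and 1 synonymous
# substitution, with 141 and 49 unchanged sites of each class.
p_value = fisher_exact_greater(9, 1, 141, 49)
print(f"one-sided p = {p_value:.4f}")
```

Because the test is exact, it does not rely on a large-sample approximation, which is the property the abstract contrasts with the likelihood ratio test when few substitutions are available.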
|
31
|
Go Y, Niimura Y. Similar Numbers but Different Repertoires of Olfactory Receptor Genes in Humans and Chimpanzees. Mol Biol Evol 2008; 25:1897-907. [DOI: 10.1093/molbev/msn135] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.3] [Indexed: 11/13/2022] Open
|
32
|
Suzuki Y. Inferring natural selection operating on conservative and radical substitution at single amino acid sites. Genes Genet Syst 2008; 82:341-60. [PMID: 17895585 DOI: 10.1266/ggs.82.341] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 11/23/2022] Open
Abstract
Natural selection operating on amino acid substitution at single amino acid sites can be detected by comparing the rates of synonymous (r(S)) and nonsynonymous (r(N)) nucleotide substitution at single codon sites. Amino acid substitutions can be classified as conservative or radical according to whether they retain the properties of the substituted amino acid. Here methods for comparing the rates of conservative (r(C)) and radical (r(R)) nonsynonymous substitution with r(S) at single codon sites were developed to detect natural selection operating on these substitutions at single amino acid sites. A method for comparing r(C) and r(R) at single codon sites was also developed to detect biases toward these substitutions at single amino acid sites. Charge was used as the property of the amino acids. In a computer simulation, false-positive rates of these methods were always < 5%, unless termination sites were included in the computation of the numbers of sites and estimates of transition/transversion rate ratio were highly biased. The frequency of detection of natural selection operating on conservative substitution was almost independent of the presence of natural selection operating on radical substitution, and vice versa. Natural selection operating specifically on conservative and radical substitution was detected more efficiently by comparing r(S) with r(C) and r(S) with r(R) than by comparing r(S) with r(N). These methods also appeared to be robust against the occurrence of recombination during evolution. In an analysis of class I human leukocyte antigen, negative selection operating on conservative substitution, but not positive selection operating on radical substitution, was observed at some of the codon sites with r(R) > r(C), suggesting that r(R) > r(C) may not necessarily be an indicator of positive selection operating on radical substitution.
Affiliation(s)
- Yoshiyuki Suzuki
- Institute of Molecular Evolutionary Genetics and Department of Biology, The Pennsylvania State University, USA.
|
33
|
Negrisolo E, Bargelloni L, Patarnello T, Ozouf-Costaz C, Pisano E, di Prisco G, Verde C. Comparative and evolutionary genomics of globin genes in fish. Methods Enzymol 2008; 436:511-38. [PMID: 18237652 DOI: 10.1016/s0076-6879(08)36029-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 02/07/2023]
Abstract
Sequencing genomes of model organisms is a great challenge for biological sciences. In the past decade, scientists have developed a large number of methods to align and compare sequenced genomes. The analysis of a given sequence provides much information on the genome structure but to a lesser extent on the function. Comparative genomics is a useful tool for functional and evolutionary annotation of genomes. In principle, comparison of genomic sequences may allow for identification of the evolutionary selection (negative or positive) that the functional sequences have been subjected to over time. Positively selected genome regions are the most important ones for evolution, because most changes are adaptive and often induce biological differences in organisms. The draft genomes of five fish species have recently become available. We herewith review and discuss some new insights into comparative genomics in fish globin genes. Special attention will be given to a complementary methodological approach to comparative genomics, fluorescence in situ hybridization (FISH). Internet resources for analyzing sequence alignments and annotations and new bioinformatic tools to address critical problems are thoroughly discussed.
Affiliation(s)
- Enrico Negrisolo
- Department of Public Health, Comparative Pathology, and Veterinary Hygiene, University of Padova, Legnaro, Italy
|
34
|
Tunstall NE, Sirey T, Newcomb RD, Warr CG. Selective pressures on Drosophila chemosensory receptor genes. J Mol Evol 2007; 64:628-36. [PMID: 17541681 DOI: 10.1007/s00239-006-0151-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Received: 06/28/2006] [Accepted: 02/28/2007] [Indexed: 01/06/2023]
Abstract
The evolution and patterns of selection of genes encoding 10 Drosophila odorant receptors (Or) and the sex pheromone receptor Gr68a were investigated by comparing orthologous sequences across five to eight ecologically diverse species of Drosophila. Using maximum likelihood estimates of dN/dS ratios we show that all 11 genes sampled are under purifying selection, indicating functional constraint. Four of these genes (Or33c, Or42a, Or85e, and Gr68a) may be under positive selection, and if so, there is good evidence that 12 specific amino acid sites may be under positive selection. All of these sites are predicted to be located either in loop regions or just inside membrane spanning regions, and interestingly one of the two sites in Gr68a is in a similar position to a previously described polymorphism in Gr5a that causes a shift in sensitivity to its ligand trehalose. For three Ors, possible evidence for positive selection was detected along a lineage. These include Or22a in the lineage leading to D. mauritiana and Or22b in the lineage leading to D. simulans. This is of interest in light of previous data showing a change in ligand response profile for these species in the sensory neuron (ab3A) which expresses both Or22a and Or22b in D. melanogaster. In summary, while the main chemosensory function and/or structural integrity of these 10 Or genes and Gr68a are evolutionarily preserved, positive selection appears to be acting on some of these genes, at specific sites and along certain lineages, and provides testable hypotheses for further functional experimentation.
Affiliation(s)
- Narelle E Tunstall
- School of Biological Sciences, Monash University, Wellington Road, Clayton, VIC, Australia
|
35
|
Radhakrishnan Y, Fares MA, French FS, Hall SH. Comparative genomic analysis of a mammalian beta-defensin gene cluster. Physiol Genomics 2007; 30:213-22. [PMID: 17456736 DOI: 10.1152/physiolgenomics.00263.2006] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Indexed: 01/12/2023] Open
Abstract
Comparative genomic analyses have yielded valuable insights into conserved and divergent aspects of gene function, regulation, and evolution. Herein, we describe the characterization of a mouse beta-defensin gene cluster locus on chromosome 2F6. In addition, we present the evolutionary analysis of this cluster and its human, rhesus, and rat orthologs. Expression analysis in mouse revealed the occurrence of defensin cluster transcripts in multiple tissues, with the highest abundance in the urogenital tract. Molecular evolutionary analysis suggests that this cluster originated by a series of duplication events, and by positive selection occurring even after the rodent-primate split. In addition, the constraints analysis showed higher positive selection in rodents than in primates, especially distal to the six-cysteine array. Positive selection in the evolution of these defensins may relate not only to the evolving enhancement of ancestral host defense but also to functional innovations in reproduction. The multiplicity of defensins and their preferential overexpression in the urogenital tract indicate that defensins function in the protection and maintenance of fertility.
Affiliation(s)
- Yashwanth Radhakrishnan
- Laboratories for Reproductive Biology, Department of Pediatrics, University of North Carolina, Chapel Hill, North Carolina, USA
|
36
|
Buggiotti L, Primmer CR. Molecular evolution of the avian growth hormone gene and comparison with its mammalian counterpart. J Evol Biol 2006; 19:844-54. [PMID: 16674581 DOI: 10.1111/j.1420-9101.2005.01042.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Indexed: 12/20/2022]
Abstract
The molecular evolution of all available avian growth hormone (GH) gene sequences was investigated using both maximum-likelihood and parsimony methods, and the patterns were compared to those found in mammals. In contrast to the rapid bursts of evolution observed for mammalian GH, the evolutionary rate of the avian GH mature peptide appears to have been more constant. However, several positively selected sites were identified at functionally important positions in the avian signal peptide by the site-specific likelihood method. This implies that sequence variation in the avian GH signal peptide may be adaptive, although more conservative parsimony methods failed to confirm this. Nevertheless, the differing patterns of avian and mammalian GH signal peptide molecular evolution are consistent with the apparently differing roles of GH in controlling growth in these taxonomic groups and support the hypothesis that signal peptide sequence variation may in fact be the basis for increased functional complexity.
Affiliation(s)
- L Buggiotti
- Division of Genetics and Physiology, Department of Biology, University of Turku, Turku, Finland
|
37
|
Arnau V, Gallach M, Lucas JI, Marín I. UVPAR: fast detection of functional shifts in duplicate genes. BMC Bioinformatics 2006; 7:174. [PMID: 16569227 PMCID: PMC1570150 DOI: 10.1186/1471-2105-7-174] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 10/11/2005] [Accepted: 03/28/2006] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND The imprint of natural selection on gene sequences is often difficult to detect. A plethora of methods have been devised to detect genetic changes due to selective processes. However, many of those methods depend heavily on underlying assumptions regarding the mode of change of DNA sequences and often require sophisticated mathematical treatments that make them computationally slow. The development of fast and effective methods to detect modifications in the selective constraints of genes is therefore of great interest. RESULTS We describe UVPAR, a program designed to quickly test for changes in the functional constraints of duplicate genes. Starting with alignments of the proteins encoded by pairs of duplicate genes in two different species, UVPAR detects the regions in which modifications of the functional constraints in the paralogs have occurred since the two species diverged. Sequences can be analyzed with UVPAR in just a few minutes on a standard PC. To demonstrate the power of the program, we first show how the results obtained with UVPAR compare to those based on other approaches, using data for vertebrate Hox genes. We then describe a comprehensive study of the RBR family of ubiquitin ligases in which we have performed 529 analyses involving 14 duplicate genes in seven model species. A significant increase in the number of functional shifts was observed for the species Danio rerio and for the gene Ariadne-2. CONCLUSION These results show that UVPAR can be used to generate sensitive analyses to detect changes in the selection constraints acting on paralogs. The high speed of the program allows its application to genome-scale analyses.
Affiliation(s)
- Vicente Arnau
- Departamento de Informática, Universidad de Valencia, Burjassot, Spain
- Miguel Gallach
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
- J Ignasi Lucas
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
- Ignacio Marín
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
|
38
|
β-Defensin evolution: selection complexity and clues for residues of functional importance. Biochem Soc Trans 2006. [DOI: 10.1042/bst0340257] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Indexed: 11/17/2022]
Abstract
We have examined the evolution of the genes at the major human β-defensin locus and the orthologous loci in a range of other primates and mammals. For the first time, these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as in the ancient past. We have used a combination of maximum-likelihood-based tests and a maximum-parsimony-based sliding window approach to give a detailed view of the varying modes of selection operating at this locus. We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. During the divergence of primates, however, variable selective pressures have acted on β-defensin genes in different evolutionary lineages, with episodes of both negative and, more rarely, positive selection. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel rodent-specific β-defensin gene clades. Sites in the second exon have been subject to positive selection and, by implication, are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions that are predicted to be important for the function of β-defensins.
|
39
|
Field SF, Bulina MY, Kelmanson IV, Bielawski JP, Matz MV. Adaptive Evolution of Multicolored Fluorescent Proteins in Reef-Building Corals. J Mol Evol 2006; 62:332-9. [PMID: 16474984 DOI: 10.1007/s00239-005-0129-9] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Received: 05/25/2005] [Accepted: 09/21/2005] [Indexed: 11/28/2022]
Abstract
Here we investigate the evolutionary scenarios that led to the appearance of fluorescent color diversity in reef-building corals. We show that the mutations that have been responsible for the generation of new cyan and red phenotypes from the ancestral green were fixed with the help of positive natural selection. This fact strongly suggests that the color diversity is a product of adaptive evolution. An unexpected finding was a set of residues arranged as an intermolecular binding interface, which was also identified as a target of positive selection but is nevertheless not related to color diversification. We hypothesize that multicolored fluorescent proteins evolved as part of a mechanism regulating the relationships between the coral and its algal endosymbionts (zooxanthellae). We envision that the effect of the proteins' fluorescence on algal physiology may be achieved not only through photosynthesis modulation, but also through regulatory photosensors analogous to phytochromes and cryptochromes of higher plants. Such a regulation would require relatively subtle, but spectrally precise, modifications of the light field. Evolution of such a mechanism would explain both the adaptive diversification of colors and the coevolutionary chase at the putative algae-protein binding interface in coral fluorescent proteins.
Affiliation(s)
- Steven F Field
- Whitney Laboratory for Marine Bioscience, University of Florida, St. Augustine, 32080, USA
|
40
|
Scherer NM, Thompson CE, Freitas LB, Bonatto SL, Salzano FM. Patterns of molecular evolution in pathogenesis-related proteins. Genet Mol Biol 2005. [DOI: 10.1590/s1415-47572005000500001] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Indexed: 11/22/2022] Open
Affiliation(s)
- Nicole M. Scherer
- Universidade Federal do Rio Grande do Sul, Brazil; Heinrich Heine Universität, Germany
|
41
|
Abstract
Charles Darwin proposed that evolution occurs primarily by natural selection, but this view has been controversial from the beginning. Two of the major opposing views have been mutationism and neutralism. Early molecular studies suggested that most amino acid substitutions in proteins are neutral or nearly neutral and the functional change of proteins occurs by a few key amino acid substitutions. This suggestion generated an intense controversy over selectionism and neutralism. This controversy is partially caused by Kimura's definition of neutrality, which was too strict (|2Ns| ≤ 1). If we define neutral mutations as the mutations that do not change the function of gene products appreciably, many controversies disappear because slightly deleterious and slightly advantageous mutations are engulfed by neutral mutations. The ratio of the rate of nonsynonymous nucleotide substitution to that of synonymous substitution is a useful quantity to study positive Darwinian selection operating at highly variable genetic loci, but it does not necessarily detect adaptively important codons. Previously, multigene families were thought to evolve following the model of concerted evolution, but new evidence indicates that most of them evolve by a birth-and-death process of duplicate genes. It is now clear that most phenotypic characters or genetic systems such as the adaptive immune system in vertebrates are controlled by the interaction of a number of multigene families, which are often evolutionarily related and are subject to birth-and-death evolution. Therefore, it is important to study the mechanisms of gene family interaction for understanding phenotypic evolution. Because gene duplication occurs more or less at random, phenotypic evolution contains some fortuitous elements, though the environmental factors also play an important role. The randomness of phenotypic evolution is qualitatively different from allele frequency changes by random genetic drift. 
However, there is some similarity between phenotypic and molecular evolution with respect to functional or environmental constraints and evolutionary rate. It appears that mutation (including gene duplication and other DNA changes) is the driving force of evolution at both the genic and the phenotypic levels.
Affiliation(s)
- Masatoshi Nei
- Department of Biology, Institute of Molecular Evolutionary Genetics, Pennsylvania State University, USA.
|
42
|
Abstract
There is an increasing interest in detecting genes, or genomic regions, that have been targeted by natural selection. The interest stems from a basic desire to learn more about evolutionary processes in humans and other organisms, and from the realization that inferences regarding selection may provide important functional information. This review provides a nonmathematical description of the issues involved in detecting selection from DNA sequences and SNP data and is intended for readers who are not familiar with population genetic theory. Particular attention is placed on issues relating to the analysis of large-scale genomic data sets.
Affiliation(s)
- Rasmus Nielsen
- Center for Bioinformatics and Department of Evolutionary Biology, University of Copenhagen, 2100 Copenhagen Ø, Denmark.
|
43
|
Berlin S, Smith NGC. Testing for adaptive evolution of the female reproductive protein ZPC in mammals, birds and fishes reveals problems with the M7-M8 likelihood ratio test. BMC Evol Biol 2005; 5:65. [PMID: 16283938 PMCID: PMC1308811 DOI: 10.1186/1471-2148-5-65] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Received: 07/07/2005] [Accepted: 11/10/2005] [Indexed: 12/11/2022] Open
Abstract
Background Adaptive evolution appears to be a common feature of reproductive proteins across a very wide range of organisms. A promising way of addressing the evolutionary forces responsible for this general phenomenon is to test for adaptive evolution in the same gene but among groups of species, which differ in their reproductive biology. One can then test evolutionary hypotheses by asking whether the variation in adaptive evolution is consistent with the variation in reproductive biology. We have attempted to apply this approach to the study of a female reproductive protein, zona pellucida C (ZPC), which has been previously shown by the use of likelihood ratio tests (LRTs) to be under positive selection in mammals. Results We tested for evidence of adaptive evolution of ZPC in 15 mammalian species, in 11 avian species and in six fish species using three different LRTs (M1a-M2a, M7-M8, and M8a-M8). The only significant findings of adaptive evolution came from the M7-M8 test in mammals and fishes. Since LRTs of adaptive evolution may yield false positives in some situations, we examined the properties of the LRTs by several different simulation methods. When we simulated data to test the robustness of the LRTs, we found that the pattern of evolution in ZPC generates an excess of false positives for the M7-M8 LRT but not for the M1a-M2a or M8a-M8 LRTs. This bias is strong enough to have generated the significant M7-M8 results for mammals and fishes. Conclusion We conclude that there is no strong evidence for adaptive evolution of ZPC in any of the vertebrate groups we studied, and that the M7-M8 LRT can be biased towards false inference of adaptive evolution by certain patterns of non-adaptive evolution.
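The likelihood ratio tests discussed in this abstract compare nested codon models through twice the difference in their maximised log-likelihoods. A minimal sketch follows, assuming the conventional chi-square reference distribution with 2 degrees of freedom for the M7 versus M8 comparison, for which the tail probability has the closed form exp(-x/2). The log-likelihood values are hypothetical and the function name is illustrative, not part of any package.

```python
import math


def lrt_pvalue_df2(lnl_null, lnl_alt):
    """Likelihood ratio test against a chi-square with 2 degrees of freedom.

    lnl_null: maximised log-likelihood of the null model (e.g. M7)
    lnl_alt:  maximised log-likelihood of the alternative model (e.g. M8)
    Returns (test statistic, approximate p-value).
    """
    # 2 * delta lnL; clamp at zero in case optimisation noise makes the
    # nested null model appear to fit slightly better.
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)
    # Survival function of chi-square with df = 2 is exp(-x / 2).
    p_value = math.exp(-stat / 2.0)
    return stat, p_value


# Hypothetical log-likelihoods for one gene alignment.
stat, p_value = lrt_pvalue_df2(lnl_null=-1000.0, lnl_alt=-997.0)
print(f"2*dlnL = {stat:.2f}, p = {p_value:.4f}")
```

As the simulations described in the abstract indicate, a small p-value from this approximation is not by itself strong evidence of adaptive evolution, since some patterns of non-adaptive evolution can inflate the M7-M8 statistic.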
Affiliation(s)
- Sofia Berlin
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvagen 18 D, 752 36 Uppsala, Sweden
- Current address: Department of Genetics and Genomics, Roslin Institute (Edinburgh), Roslin, Midlothian EH25 9PS, UK
- Nick GC Smith
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvagen 18 D, 752 36 Uppsala, Sweden
- Department of Mathematics and Statistics, Lancaster University, Lancaster LA1 4YF, UK
|
44
|
Suzuki Y. Statistical properties of the methods for detecting positively selected amino acid sites. Gene 2005; 365:125-9. [PMID: 16256279 DOI: 10.1016/j.gene.2005.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 04/28/2005] [Revised: 06/30/2005] [Accepted: 09/07/2005] [Indexed: 10/25/2022]
Abstract
Parsimony and Bayesian methods have been developed for detecting positively selected amino acid sites. It has been reported that the parsimony method is generally conservative. In contrast, the Bayesian method is known to identify more positively selected sites than the parsimony method, especially when the number of sequences analyzed is small, although the interpretation of results obtained from the former method is controversial. Here I show that the likelihood-ratio test (LRT) of the Bayesian method corresponds to the parsimony method with window analysis, by analyzing the nucleotide sequences encoding the trans-activator (tax) gene of human T-cell lymphotropic virus type I (HTLV-I). It is also indicated that in the parsimony method, the test of selective neutrality using the binomial probability tends to be conservative, but the Monte Carlo simulation is useful for solving this problem. In addition, in the Bayesian method, the bootstrap method appears to produce similar results to the LRT. This information may be useful for improving the methods for detecting positively selected amino acid sites.
Affiliation(s)
- Yoshiyuki Suzuki
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima-shi, Shizuoka-ken 411-8540, Japan.
|
45
|
Vasemägi A, Primmer CR. Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies. Mol Ecol 2005; 14:3623-42. [PMID: 16202085 DOI: 10.1111/j.1365-294x.2005.02690.x] [Citation(s) in RCA: 239] [Impact Index Per Article: 12.6] [Indexed: 12/25/2022]
Abstract
Strategies for the identification of functional genetic variation underlying phenotypic traits of ecological and evolutionary importance have received considerable attention in the literature recently. This paper aims to bring together and compare the relative strengths and limitations of various potentially useful research strategies for dissecting functionally important genetic variation in a wide range of organisms. We briefly explore the relative strengths and limitations of traditional and emerging approaches and evaluate their potential use in free-living populations. While it is likely that much of the progress in functional genetic analyses will rely on progress in traditional model species, it is clear that with prudent choices of methods and appropriate sampling designs, much headway can also be made in a diverse range of species. We suggest that combining research approaches targeting different functional and biological levels can potentially increase understanding of the genetic basis of ecological and evolutionary processes in both model and non-model organisms.
Affiliation(s)
- A Vasemägi
- Department of Biology, University of Turku, Finland
|
46
|
Podar M, Eads JR, Richardson TH. Evolution of a microbial nitrilase gene family: a comparative and environmental genomics study. BMC Evol Biol 2005; 5:42. [PMID: 16083508 PMCID: PMC1199592 DOI: 10.1186/1471-2148-5-42] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Received: 05/05/2005] [Accepted: 08/06/2005] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Completed genomes and environmental genomic sequences are bringing a significant contribution to understanding the evolution of gene families, microbial metabolism and community eco-physiology. Here, we used comparative genomics and phylogenetic analyses in conjunction with enzymatic data to probe the evolution and functions of a microbial nitrilase gene family. Nitrilases are relatively rare in bacterial genomes, their biological function being unclear.
RESULTS We examined the genetic neighborhood of the different subfamily genes and discovered conserved gene clusters or operons associated with specific nitrilase clades. The inferred evolutionary transitions that separate nitrilases which belong to different gene clusters correlated with changes in their enzymatic properties. We present evidence that Darwinian adaptation acted during one of those transitions and identified sites in the enzyme that may have been under positive selection.
CONCLUSION Changes in the observed biochemical properties of the nitrilases associated with the different gene clusters are consistent with a hypothesis that those enzymes have been recruited to a novel metabolic pathway following gene duplication and neofunctionalization. These results demonstrate the benefits of combining environmental genomic sampling and completed genomes data with evolutionary and biochemical analyses in the study of gene families. They also open new directions for studying the functions of nitrilases and the genes they are associated with.
Affiliation(s)
- Mircea Podar
- Diversa Corporation, 4955 Directors Place, San Diego, CA 92131 USA
- Jonathan R Eads
- Diversa Corporation, 4955 Directors Place, San Diego, CA 92131 USA
|
47
|
Lynn DJ, Freeman AR, Murray C, Bradley DG. A genomics approach to the detection of positive selection in cattle: adaptive evolution of the T-cell and natural killer cell-surface protein CD2. Genetics 2005; 170:1189-96. [PMID: 15802510 PMCID: PMC1451189 DOI: 10.1534/genetics.104.039040] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Received: 11/30/2004] [Accepted: 02/16/2005] [Indexed: 11/18/2022] Open
Abstract
The detection of adaptive evolution at the molecular level is of interest not only as an insight into the process of evolution but also because of its functional implications for genes of interest. Here, we present the first genomics approach to detecting positive selection operating on the Bos taurus lineage, an important domestic species. This analysis led to the identification of the T-cell and natural killer (NK) cell receptor cluster of differentiation 2 (CD2) as having a strong signal of selection. Further detailed investigation of CD2 revealed that this gene was subject to positive selection during the evolution of a number of mammalian lineages. Moreover, we show that selection has operated primarily on the extracellular domain of CD2 and discuss the implications of this for an important regulator of the adaptive immune response.
Affiliation(s)
- David J. Lynn
- Department of Genetics, Smurfit Institute, Trinity College, Dublin 2, Ireland
- Abigail R. Freeman
- Department of Genetics, Smurfit Institute, Trinity College, Dublin 2, Ireland
- Caitriona Murray
- Department of Genetics, Smurfit Institute, Trinity College, Dublin 2, Ireland
- Daniel G. Bradley
- Department of Genetics, Smurfit Institute, Trinity College, Dublin 2, Ireland
|
48
|
Krieger MJB, Ross KG. Molecular evolutionary analyses of the odorant-binding protein gene Gp-9 in fire ants and other Solenopsis species. Mol Biol Evol 2005; 22:2090-103. [PMID: 15987877 DOI: 10.1093/molbev/msi203] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Indexed: 01/05/2023] Open
Abstract
The fire ant Solenopsis invicta exists in two social forms, one with colonies headed by a single reproductive queen (monogyne form) and the other with colonies containing multiple queens (polygyne form). This variation in social organization is associated with variation at the gene Gp-9, with monogyne colonies harboring only the B allelic variant and polygyne colonies containing b-like variants as well. We generated new Gp-9 sequences from 15 Solenopsis species and combined these with previously published sequences to conduct a comprehensive, phylogenetically based study of the molecular evolution of this important gene. The exon/intron structure and the respective lengths of the five exons of Gp-9 are identical across all species examined, and we detected no evidence for intragenic recombination. These data conform to a previous suggestion that Gp-9 lies in a genomic region with low recombination, and they indicate that evolution of the coding region in Solenopsis has involved point substitutions only. Our results confirm a link between the presence of b-like alleles and the expression of polygyny in all South American fire ant species known to possess colonies of both social forms. Moreover, phylogenetic analyses show that b-like alleles comprise a derived clade of Gp-9 sequences within the socially polymorphic species, lending further support to the hypothesis that monogyny preceded polygyny in this group of fire ants. Site-specific maximum likelihood tests identified several amino acids that have experienced positive selection, two of which are adjacent to the inferred binding-pocket residues in the GP-9 protein. Four other binding-pocket residues are variable among fire ant species, although selection is not implicated in this variation. Branch-specific tests revealed strong positive selection on the stem lineage of the b-like allele clade, as expected if selection drove the amino acid replacements crucial to the expression of polygyne social organization. Such selection may have operated via the ligand-binding properties of GP-9, as one of the two amino acids uniquely shared by all b-like alleles is predicted to be a binding-pocket residue.
|
49
|
Semple CAM, Maxwell A, Gautier P, Kilanowski FM, Eastwood H, Barran PE, Dorin JR. The complexity of selection at the major primate beta-defensin locus. BMC Evol Biol 2005; 5:32. [PMID: 15904491 PMCID: PMC1156880 DOI: 10.1186/1471-2148-5-32] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Received: 02/01/2005] [Accepted: 05/18/2005] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND We have examined the evolution of the genes at the major human beta-defensin locus and the orthologous loci in a range of other primates and mouse. For the first time these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as the ancient past. We have used a combination of maximum likelihood based tests and a maximum parsimony based sliding window approach to give a detailed view of the varying modes of selection operating at this locus.
RESULTS We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. Consequently variable selective pressures have acted on beta-defensin genes in different evolutionary lineages, with episodes both of negative, and more rarely positive selection, during the divergence of primates. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel, rodent-specific beta-defensin genes. These observations allow a fuller understanding of the evolution of mammalian innate immunity. In both the rodent and primate lineages, sites in the second exon have been subject to positive selection and by implication are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions predicted to be important for the specificity of the antimicrobial or chemoattractant properties of beta-defensins. Surprisingly, sites within the prepropeptide region were also implicated as being subject to significant positive selection, suggesting previously unappreciated functional significance for this region.
CONCLUSIONS Identification of these putatively functional sites has important implications for our understanding of beta-defensin function and for novel antibiotic design.
Affiliation(s)
- Colin AM Semple
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
- Alison Maxwell
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
- Philippe Gautier
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
- Fiona M Kilanowski
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
- Hayden Eastwood
- School of Chemistry, The University of Edinburgh, The King's Buildings, West Mains Road, Edinburgh, EH9 3JJ, UK
- Perdita E Barran
- School of Chemistry, The University of Edinburgh, The King's Buildings, West Mains Road, Edinburgh, EH9 3JJ, UK
- Julia R Dorin
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
|
50
|
Abstract
Homeobox genes encode transcription factors that function in various developmental processes and are usually evolutionarily conserved in their sequences. However, two X-chromosome-linked testis-expressed homeobox genes, one from rodents and the other from fruit flies, are known to evolve rapidly under positive Darwinian selection. Here we report yet another case, from primates. TGIFLX is an X-linked homeobox gene that originated by retroposition of the autosomal gene TGIF2, most likely in a common ancestor of rodents and primates. While TGIF2 is ubiquitously expressed, TGIFLX is exclusively expressed in adult testis. A comparison of the TGIFLX sequences among 16 anthropoid primates revealed a significantly higher rate of nonsynonymous nucleotide substitution (d(N)) than synonymous substitution (d(S)), strongly suggesting the action of positive selection. Although the high d(N)/d(S) ratio is most evident outside the homeobox, the homeobox has a d(N)/d(S) of approximately 0.89 and includes two codons that are likely under selection. Furthermore, the rate of radical amino acid substitutions that alter amino acid charge is significantly greater than that of conservative substitutions, suggesting that the selection promotes diversity of the protein charge profile. More interestingly, an analysis of 64 orthologous homeobox genes from humans and mice shows substantially higher rates of amino acid substitution in X-linked testis-expressed genes than in other genes. These results suggest a general pattern of rapid evolution of mammalian X-linked testis-expressed homeobox genes. Although the physiological function of and the exact selective agent on TGIFLX and other rapidly evolving homeobox genes are unclear, the common expression pattern of these transcription factor genes led us to conjecture that the selection is related to one or more aspects of male reproduction and may contribute to speciation.
Affiliation(s)
- Xiaoxia Wang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
|