1
|
Resequencing and SNP discovery of Amur ide (Leuciscus waleckii) provides insights into local adaptations to extreme environments. Sci Rep 2021; 11:5064. [PMID: 33658614 PMCID: PMC7930030 DOI: 10.1038/s41598-021-84652-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 02/18/2021] [Indexed: 01/31/2023] Open
Abstract
Amur ide (Leuciscus waleckii), a Cyprinid species, is broadly distributed in Northeast Asia. Different from its freshwater counterparts, the population in Lake Dali Nor has a strong alkalinity tolerance and can adapt to extremely alkali-saline water with bicarbonate over 50 mmol/L. To uncover the genetic basis of its alkaline adaptation, three populations, including one alkali form from Lake Dali Nor (DL), one freshwater form from its adjacent sister Lake Ganggeng Nor (GG), and one freshwater form from its historical origin, namely, the Songhua River (SH), were analyzed using genome resequencing technology. A total of 679.82 Gb clean data and 38,091,163 high-quality single-nucleotide polymorphism (SNP) loci were detected in the three populations. Nucleotide diversity and population structure analysis revealed that the DL and GG populations have lower nucleotide diversities and different genetic structures than those of the SH population. Selective sweeping showed 21 genes involved in osmoregulatory regulation (DLG1, VIPR1, AKT1, and GNAI1), inflammation and immune responses (DLG1, BRINP1, CTSL, TRAF6, AKT1, STAT3, GNAI1, SEC22b, and PSME4b), and cardiorespiratory development (TRAF6, PSME4b, STAT3, AKT1, and COL9A1) to be associated with alkaline adaption of the DL population. Interestingly, selective pressure (CodeML, MEME, and FEL) methods identified two functional codon sites of VIPR1 to be under positive selection in the DL population. The subsequent 3D protein modeling confirmed that these selected sites will incur changes in protein structure and function in the DL population. In brief, this study provides molecular evidence of population divergence and alkaline adaptation, which will be very useful for revealing the genetic basis of alkaline adaptation in Amur ide.
Collapse
|
2
|
Ma S, Liu H, Sun W, Mustafa A, Xi Y, Pu F, Li Y, Han C, Bai L, Hua H. Molecular evolution of the ATP-binding cassette subfamily G member 2 gene subfamily and its paralogs in birds. BMC Evol Biol 2020; 20:85. [PMID: 32664916 PMCID: PMC7362505 DOI: 10.1186/s12862-020-01654-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 07/07/2020] [Indexed: 12/11/2022] Open
Abstract
Background ATP-binding cassette (ABC) transporters are involved in the active transportation of various endogenous or exogenous substances. Two ABCG2 gene subfamily members have been identified in birds. A detailed comparative study of the ABCG2 and ABCG2-like genes aid our understanding of their evolutionary history at the molecular level and provide a theoretical reference for studying the specific functions of ABCG2 and ABCG2-like genes in birds. Results We first identified 77 ABCG2/ABCG2-like gene sequences in the genomes of 41 birds. Further analysis showed that both the nucleic acid and amino acid sequences of ABCG2 and ABCG2-like genes were highly conserved and exhibited high homology in birds. However, significant differences in the N-terminal structure were found between the ABCG2 and ABCG2-like amino acid sequences. A selective pressure analysis showed that the ABCG2 and ABCG2-like genes were affected by purifying selection during the process of bird evolution. Conclusions We believe that multiple members of the ABCG2 gene subfamily exist on chromosome 4 in the ancestors of birds. Over the long course of evolution, only the ABCG2 gene was retained on chromosome 4 in birds. The ABCG2-like gene on chromosome 6 might have originated from chromosome replication or fusion. The structural differences between the N terminus of ABCG2 protein and those of ABCG2-like proteins might lead to functional differences between the corresponding genes.
Collapse
Affiliation(s)
- Shengchao Ma
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Hehe Liu
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China.
| | - Wenqiang Sun
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Ahsan Mustafa
- Institute of Animal Nutrition, Key Laboratory for Animal Disease-Resistance Nutrition of China, Ministry of Education, Sichuan Agricultural University, Chengdu, P.R. China
| | - Yang Xi
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Fajun Pu
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Yanying Li
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Chunchun Han
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - Lili Bai
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| | - He Hua
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, 611130, P.R. China
| |
Collapse
|
3
|
Grant JC, González-Beiras C, Amick KM, Fortney KR, Gangaiah D, Humphreys TL, Mitjà O, Abecasis A, Spinola SM. Multiple Class I and Class II Haemophilus ducreyi Strains Cause Cutaneous Ulcers in Children on an Endemic Island. Clin Infect Dis 2019; 67:1768-1774. [PMID: 29897409 DOI: 10.1093/cid/ciy343] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Accepted: 04/18/2018] [Indexed: 12/14/2022] Open
Abstract
Background Together with Treponema pallidum subspecies pertenue, Haemophilus ducreyi is a major cause of exudative cutaneous ulcers (CUs) in children. For H. ducreyi, both class I and class II strains, asymptomatic colonization, and environmental reservoirs have been found in endemic regions, but the epidemiology of this infection is unknown. Methods Based on published whole-genome sequences of H. ducreyi CU strains, a single-locus typing system was developed and applied to H. ducreyi-positive CU samples obtained prior to, 1 year after, and 2 years after the initiation of a mass drug administration campaign to eradicate CU on Lihir Island in Papua New Guinea. DNA from the CU samples was amplified with class I and class II dsrA-specific primers and sequenced; the samples were classified into dsrA types, which were geospatially mapped. Selection pressure analysis was performed on the dsrA sequences. Results Thirty-seven samples contained class I sequences, 27 contained class II sequences, and 13 contained both. There were 5 class I and 4 class II types circulating on the island; 3 types accounted for approximately 87% of the strains. The composition and geospatial distribution of the types varied little over time and there was no evidence of selection pressure. Conclusions Multiple strains of H. ducreyi cause CU on an endemic island and coinfections are common. In contrast to recent findings with T. pallidum pertenue, strain composition is not affected by antibiotic pressure, consistent with environmental reservoirs of H. ducreyi. Such reservoirs must be addressed to achieve eradication of H. ducreyi.
Collapse
Affiliation(s)
- Jacob C Grant
- Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
| | | | - Kristen M Amick
- Department of Biology, Allegheny College, Meadville, Pennsylvania
| | - Kate R Fortney
- Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
| | - Dharanesh Gangaiah
- Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis
| | | | - Oriol Mitjà
- Lihir Medical Centre-International SOS, Newcrest Mining, Lihir Island, Papua New Guinea.,Barcelona Institute for Global Health, Hospital Clinic, University of Barcelona, Spain.,School of Medicine and Health Sciences, University of Papua New Guinea, Port Moresby
| | - Ana Abecasis
- Global Health and Tropical Medicine, Institute of Hygiene and Tropical Medicine, Lisbon, Portugal
| | - Stanley M Spinola
- Department of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis.,Department of Medicine, Indiana University School of Medicine, Indianapolis.,Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis
| |
Collapse
|
4
|
Lang D, Lim BK, Gao Y, Wang X. Adaptive evolutionary expansion of the ribonuclease 6 in Rodentia. Integr Zool 2019; 14:306-317. [PMID: 30688011 DOI: 10.1111/1749-4877.12382] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Ribonuclease 6 (RNase6 or RNase K6) is a protein that belongs to a superfamily thought to be the sole verte-brate-specific enzyme known for a wide range of physiological functions, including digestion, cytotoxicity, angiogenesis, male reproduction and host defense. In our study, 51 functional genes and 11 pseudogenes were identified from 27 Rodentia species. Intriguingly, in the 3 main lineages of rodents there were multiple RNase6s identified in all species of Ctenohystrica, whereas only a single RNase6 was observed in other Rodentia species examined except for 2 species in the mouse-related clade. The evolutionary scenario of "birth (gene duplication) and death (gene deactivation)" and gene sorting have been demonstrated in Ctenohystrica. In addition, bursts of positive selection, diversification of isoelectric point and positive net charge have been identified in Ctenohystrica, especially at two key sites that are involved in antimicrobial function. Site Trp30 has undergone positive selection and Ile45 has changed into other residues in Group B and Group C of the Ctenohystrica. Our results demonstrated a complex and intriguing evolutionary pattern of rodent RNase6, and indicated that functional modification may have occurred, which establishes an important theoretical foundation for future functional assays in rodent RNase6.
Collapse
Affiliation(s)
- Datian Lang
- Agronomy and Life Science Department, Zhaotong University, Zhaotong, China.,State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Science, Kunming, China
| | - Burton K Lim
- Department of Natural History, Royal Ontario Museum, Toronto, Ontario, Canada
| | - Yun Gao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Science, Kunming, China
| | - Xiaoping Wang
- School of Life Sciences, Yunnan University, Kunming, China.,State Key Laboratory for Conservation and Utilization of Bio-resource in Yunnan, Yunnan University, Kunming, China.,Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, Yunnan University, Kunming, China
| |
Collapse
|
5
|
Looking for Darwin in Genomic Sequences: Validity and Success Depends on the Relationship Between Model and Data. Methods Mol Biol 2019; 1910:399-426. [PMID: 31278672 DOI: 10.1007/978-1-4939-9074-0_13] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Codon substitution models (CSMs) are commonly used to infer the history of natural section for a set of protein-coding sequences, often with the explicit goal of detecting the signature of positive Darwinian selection. However, the validity and success of CSMs used in conjunction with the maximum likelihood (ML) framework is sometimes challenged with claims that the approach might too often support false conclusions. In this chapter, we use a case study approach to identify four legitimate statistical difficulties associated with inference of evolutionary events using CSMs. These include: (1) model misspecification, (2) low information content, (3) the confounding of processes, and (4) phenomenological load, or PL. While past criticisms of CSMs can be connected to these issues, the historical critiques were often misdirected, or overstated, because they failed to recognize that the success of any model-based approach depends on the relationship between model and data. Here, we explore this relationship and provide a candid assessment of the limitations of CSMs to extract historical information from extant sequences. To aid in this assessment, we provide a brief overview of: (1) a more realistic way of thinking about the process of codon evolution framed in terms of population genetic parameters, and (2) a novel presentation of the ML statistical framework. We then divide the development of CSMs into two broad phases of scientific activity and show that the latter phase is characterized by increases in model complexity that can sometimes negatively impact inference of evolutionary mechanisms. Such problems are not yet widely appreciated by the users of CSMs. These problems can be avoided by using a model that is appropriate for the data; but, understanding the relationship between the data and a fitted model is a difficult task. We argue that the only way to properly understand that relationship is to perform in silico experiments using a generating process that can mimic the data as closely as possible. The mutation-selection modeling framework (MutSel) is presented as the basis of such a generating process. We contend that if complex CSMs continue to be developed for testing explicit mechanistic hypotheses, then additional analyses such as those described in here (e.g., penalized LRTs and estimation of PL) will need to be applied alongside the more traditional inferential methods.
Collapse
|
6
|
Twort VG, Dennis AB, Park D, Lomas KF, Newcomb RD, Buckley TR. Positive selection and comparative molecular evolution of reproductive proteins from New Zealand tree weta (Orthoptera, Hemideina). PLoS One 2017; 12:e0188147. [PMID: 29131842 PMCID: PMC5683631 DOI: 10.1371/journal.pone.0188147] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 11/01/2017] [Indexed: 11/18/2022] Open
Abstract
Animal reproductive proteins, especially those in the seminal fluid, have been shown to have higher levels of divergence than non-reproductive proteins and are often evolving adaptively. Seminal fluid proteins have been implicated in the formation of reproductive barriers between diverging lineages, and hence represent interesting candidates underlying speciation. RNA-seq was used to generate the first male reproductive transcriptome for the New Zealand tree weta species Hemideina thoracica and H. crassidens. We identified 865 putative reproductive associated proteins across both species, encompassing a diverse range of functional classes. Candidate gene sequencing of nine genes across three Hemideina, and two Deinacrida species suggests that H. thoracica has the highest levels of intraspecific genetic diversity. Non-monophyly was observed in the majority of sequenced genes indicating that either gene flow may be occurring between the species, or that reciprocal monophyly at these loci has yet to be attained. Evidence for positive selection was found for one lectin-related reproductive protein, with an overall omega of 7.65 and one site in particular being under strong positive selection. This candidate gene represents the first step in the identification of proteins underlying the evolutionary basis of weta reproduction and speciation.
Collapse
Affiliation(s)
- Victoria G. Twort
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Landcare Research, Auckland, New Zealand
- * E-mail:
| | | | | | | | - Richard D. Newcomb
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- The New Zealand Institute for Plant and Food Research Ltd, Auckland, New Zealand
| | - Thomas R. Buckley
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Landcare Research, Auckland, New Zealand
| |
Collapse
|
7
|
Ahmad HI, Liu G, Jiang X, Liu C, Chong Y, Huarong H. Adaptive molecular evolution of MC1R gene reveals the evidence for positive diversifying selection in indigenous goat populations. Ecol Evol 2017; 7:5170-5180. [PMID: 28770057 PMCID: PMC5528238 DOI: 10.1002/ece3.2919] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 02/10/2017] [Accepted: 02/13/2017] [Indexed: 12/16/2022] Open
Abstract
Detecting signatures of selection can provide a new insight into the mechanism of contemporary breeding and artificial selection and further reveal the causal genes associated to the phenotypic variation. However, the signatures of selection on genes entailing for profitable traits between Chinese commercial and indigenous goats have been poorly interpreted. We noticed footprints of positive selection at MC1R gene containing SNPs genotyped in five Chinese native goat breeds. An experimental distribution of FST was built based on approximations of FST for each SNP across five breeds. We identified selection using the high FST outlier method and found that MC1R candidate gene show evidence of positive selection. Furthermore, adaptive selection pressure on specific codons was determined using different codon based on maximum‐likelihood methods; signature of positive selection in mammalian MC1R was explored in individual codons. Evolutionary analyses were inferred under maximum likelihood models, the HyPhy package implemented in the DATAMONKEY Web Server. The results of codon selection displayed positive diversifying selection at the sites were mainly involved in development of genetic variations in coat color in various mammalian species. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat MC1R provides new insights that the gene evolution may have been modulated by domestication events in goats.
Collapse
Affiliation(s)
- Hafiz Ishfaq Ahmad
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Guiqiong Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Xunping Jiang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Chenhui Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Yuqing Chong
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Huang Huarong
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| |
Collapse
|
8
|
Tao J, Feng C, Ai B, Kang M. Adaptive molecular evolution of the two-pore channel 1 gene TPC1 in the karst-adapted genus Primulina (Gesneriaceae). ANNALS OF BOTANY 2016; 118:1257-1268. [PMID: 27582362 PMCID: PMC5155596 DOI: 10.1093/aob/mcw168] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2016] [Revised: 05/26/2016] [Accepted: 06/30/2016] [Indexed: 05/29/2023]
Abstract
BACKGROUND AND AIMS Limestone karst areas possess high floral diversity and endemism. The genus Primulina, which contributes to the unique calcicole flora, has high species richness and exhibit specific soil-based habitat associations that are mainly distributed on calcareous karst soils. The adaptive molecular evolutionary mechanism of the genus to karst calcium-rich environments is still not well understood. The Ca2+-permeable channel TPC1 was used in this study to test whether its gene is involved in the local adaptation of Primulina to karst high-calcium soil environments. METHODS Specific amplification and sequencing primers were designed and used to amplify the full-length coding sequences of TPC1 from cDNA of 76 Primulina species. The sequence alignment without recombination and the corresponding reconstructed phylogeny tree were used in molecular evolutionary analyses at the nucleic acid level and amino acid level, respectively. Finally, the identified sites under positive selection were labelled on the predicted secondary structure of TPC1. KEY RESULTS Seventy-six full-length coding sequences of Primulina TPC1 were obtained. The length of the sequences varied between 2220 and 2286 bp and the insertion/deletion was located at the 5' end of the sequences. No signal of substitution saturation was detected in the sequences, while significant recombination breakpoints were detected. The molecular evolutionary analyses showed that TPC1 was dominated by purifying selection and the selective pressures were not significantly different among species lineages. However, significant signals of positive selection were detected at both TPC1 codon level and amino acid level, and five sites under positive selective pressure were identified by at least three different methods. CONCLUSIONS The Ca2+-permeable channel TPC1 may be involved in the local adaptation of Primulina to karst Ca2+-rich environments. Different species lineages suffered similar selective pressure associated with calcium in karst environments, and episodic diversifying selection at a few sites may play a major role in the molecular evolution of Primulina TPC1.
Collapse
Affiliation(s)
- Junjie Tao
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China and
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China and
| | - Chao Feng
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China and
| | - Bin Ai
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China and
| | - Ming Kang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China and
| |
Collapse
|
9
|
Almeida D, Maldonado E, Khan I, Silva L, Gilbert MTP, Zhang G, Jarvis ED, O'Brien SJ, Johnson WE, Antunes A. Whole-Genome Identification, Phylogeny, and Evolution of the Cytochrome P450 Family 2 (CYP2) Subfamilies in Birds. Genome Biol Evol 2016; 8:1115-31. [PMID: 26979796 PMCID: PMC4860681 DOI: 10.1093/gbe/evw041] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2016] [Indexed: 12/19/2022] Open
Abstract
The cytochrome P450 (CYP) superfamily defends organisms from endogenous and noxious environmental compounds, and thus is crucial for survival. However, beyond mammals the molecular evolution of CYP2 subfamilies is poorly understood. Here, we characterized the CYP2 family across 48 avian whole genomes representing all major extant bird clades. Overall, 12 CYP2 subfamilies were identified, including the first description of the CYP2F, CYP2G, and several CYP2AF genes in avian genomes. Some of the CYP2 genes previously described as being lineage-specific, such as CYP2K and CYP2W, are ubiquitous to all avian groups. Furthermore, we identified a large number of CYP2J copies, which have been associated previously with water reabsorption. We detected positive selection in the avian CYP2C, CYP2D, CYP2H, CYP2J, CYP2K, and CYP2AC subfamilies. Moreover, we identified new substrate recognition sites (SRS0, SRS2_SRS3, and SRS3.1) and heme binding areas that influence CYP2 structure and function of functional importance as under significant positive selection. Some of the positively selected sites in avian CYP2D are located within the same SRS1 region that was previously linked with the metabolism of plant toxins. Additionally, we find that selective constraint variations in some avian CYP2 subfamilies are consistently associated with different feeding habits (CYP2H and CYP2J), habitats (CYP2D, CYP2H, CYP2J, and CYP2K), and migratory behaviors (CYP2D, CYP2H, and CYP2J). Overall, our findings indicate that there has been active enzyme site selection on CYP2 subfamilies and differential selection associated with different life history traits among birds.
Collapse
Affiliation(s)
- Daniela Almeida
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Portugal
| | - Emanuel Maldonado
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal
| | - Imran Khan
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Portugal
| | - Liliana Silva
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Portugal
| | - M Thomas P Gilbert
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Denmark
| | - Guojie Zhang
- China National GeneBank, BGI-Shenzhen, Shenzen, China Centre for Social Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Erich D Jarvis
- Centre for Social Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark Howard Hughes Medical Institute, Chevy Chase, Maryland
| | - Stephen J O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, Russia Oceanographic Center, Nova Southeastern University, Ft Lauderdale
| | - Warren E Johnson
- National Zoological Park, Smithsonian Conservation Biology Institute, Washington DC
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Portugal
| |
Collapse
|
10
|
Adaptation of the Mitochondrial Genome in Cephalopods: Enhancing Proton Translocation Channels and the Subunit Interactions. PLoS One 2015; 10:e0135405. [PMID: 26285039 PMCID: PMC4540416 DOI: 10.1371/journal.pone.0135405] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 07/21/2015] [Indexed: 01/25/2023] Open
Abstract
Mitochondrial protein-coding genes (mt genes) encode subunits forming complexes of crucial cellular pathways, including those involved in the vital process of oxidative phosphorylation (OXPHOS). Despite the vital role of the mitochondrial genome (mt genome) in the survival of organisms, little is known with respect to its adaptive implications within marine invertebrates. The molluscan Class Cephalopoda is represented by a marine group of species known to occupy contrasting environments ranging from the intertidal to the deep sea, having distinct metabolic requirements, varied body shapes and highly advanced visual and nervous systems that make them highly competitive and successful worldwide predators. Thus, cephalopods are valuable models for testing natural selection acting on their mitochondrial subunits (mt subunits). Here, we used concatenated mt genes from 17 fully sequenced mt genomes of diverse cephalopod species to generate a robust mitochondrial phylogeny for the Class Cephalopoda. We followed an integrative approach considering several branches of interest–covering cephalopods with distinct morphologies, metabolic rates and habitats–to identify sites under positive selection and localize them in the respective protein alignment and/or tridimensional structure of the mt subunits. Our results revealed significant adaptive variation in several mt subunits involved in the energy production pathway of cephalopods: ND5 and ND6 from Complex I, CYTB from Complex III, COX2 and COX3 from Complex IV, and in ATP8 from Complex V. Furthermore, we identified relevant sites involved in protein-interactions, lining proton translocation channels, as well as disease/deficiencies related sites in the aforementioned complexes. A particular case, revealed by this study, is the involvement of some positively selected sites, found in Octopoda lineage in lining proton translocation channels (site 74 from ND5) and in interactions between subunits (site 507 from ND5) of Complex I.
Collapse
|
11
|
Global Diversity within and between Human Herpesvirus 1 and 2 Glycoproteins. J Virol 2015; 89:8206-18. [PMID: 26018161 DOI: 10.1128/jvi.01302-15] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 05/19/2015] [Indexed: 12/17/2022] Open
Abstract
UNLABELLED Human herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are large-genome DNA viruses that establish a persistent infection in sensory neurons and commonly manifest with recurring oral or genital erosions that transmit virus. HSV encodes 12 predicted glycoproteins that serve various functions, including cellular attachment, entry, and egress. Glycoprotein G is currently the target of an antibody test to differentiate HSV-1 from HSV-2; however, this test has shown reduced capacity to differentiate HSV strains in East Africa. Until the recent availability of 26 full-length HSV-1 and 36 full-length HSV-2 sequences, minimal comparative information was available for these viruses. In this study, we use a variety of sequence analysis methods to compare all available sequence data for HSV-1 and HSV-2 glycoproteins, using viruses isolated in Europe, Asia, North America, the Republic of South Africa, and East Africa. We found numerous differences in diversity, nonsynonymous/synonymous substitution rates, and recombination rates between HSV-1 glycoproteins and their HSV-2 counterparts. Phylogenetic analysis revealed that while most global HSV-2 glycoprotein G sequences did not form clusters within or between continents, one clade (supported at 60.5%) contained 37% of the African sequences analyzed. Accordingly, sequences from this African subset contained unique amino acid signatures, not only in glycoprotein G, but also in glycoproteins I and E, which may account for the failure of sensitive antibody tests to distinguish HSV-1 from HSV-2 in some African individuals. Consensus sequences generated in the study can be used to improve diagnostic assays that differentiate HSV-1 from HSV-2 in global populations. IMPORTANCE Human herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are large DNA viruses associated with recurring oral or genital erosions that transmit virus. Up to 12 HSV-1 and HSV-2 glycoproteins are involved in HSV cell entry or are required for viral spread in animals, albeit some are dispensable for replication in vitro. The recent availability of comparable numbers of full-length HSV-1 and HSV-2 sequences enabled comparative analysis of gene diversity of glycoproteins within and between HSV types. Overall, we found less glycoprotein sequence diversity within HSV-2 than within the HSV-1 strains studied, while at the same time, several HSV-2 glycoproteins were evolving under less selective pressure. Because HSV glycoproteins are the focus of antibody tests to detect and differentiate between infections with the two strains and are constituents of vaccines in clinical-stage development, these findings will aid in refining the targets for diagnostic tests and vaccines.
Collapse
|
12
|
Pheromone receptor evolution in the cryptic leafroller species, Ctenopseustis obliquana and C. herana. J Mol Evol 2014; 80:42-56. [PMID: 25252791 DOI: 10.1007/s00239-014-9650-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2014] [Accepted: 09/18/2014] [Indexed: 10/24/2022]
Abstract
How new mate recognition systems evolve when changes are required in both the male and female components remains a conundrum. Here, we investigated the molecular basis of pheromone reception in two species of tortricid (leafroller) moth, Ctenopseustis obliquana and C. herana. Male C. obliquana are attracted to a 90:10 blend of (Z)-8-tetradecenyl acetate (Z8-14:OAc) and (Z)-5-tetradecenyl acetate (Z5-14:OAc), whereas C. herana males are attracted to Z5-14:OAc alone. We used a transcriptome sequencing approach from adult male and female antennae to identify 47 olfactory receptors (ORs) from each species and assessed their expression levels in male and female antennae using RNA-Seq counting and quantitative RT-PCR. Three male-biased and one female-biased OR were identified in C. obliquana by quantitative RT-PCR, and four male-biased and one female-biased receptor in C. herana. The male-biased receptors, CoblOR7, CoblOR30, CherOR7, CherOR30, CherOR1a and CherOR1b were tested for their ability to respond to sex pheromone components in a HEK293 cell calcium assay. CoblOR7 and CherOR7 responded to Z8-14:OAc, however, no receptor for Z5-14:OAc was identified. In addition to Z8-14:OAc, CherOR7 also responded to Z7-14:OAc, indicating that this receptor may be under relaxed constraint. Of the 29 amino acid differences between CoblOR7 and CherOR7, significantly more are located in the third and the sixth transmembrane domain regions. Overall, these findings are consistent with studies revealing the presence of neurons tuned to both Z8-14:OAc and Z5-14:OAc in both species, but that for C. herana males, the ability to detect Z8-14:OAc is currently not required.
Collapse
|
13
|
Evolutionary and functional novelty of pancreatic ribonuclease: a study of Musteloidea (order Carnivora). Sci Rep 2014; 4:5070. [PMID: 24861105 PMCID: PMC5381406 DOI: 10.1038/srep05070] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Accepted: 04/30/2014] [Indexed: 11/08/2022] Open
Abstract
Pancreatic ribonuclease (RNASE1) is a digestive enzyme that has been one of the key models in studies of evolutionary innovation and functional diversification. It has been believed that the RNASE1 gene duplications are correlated with the plant-feeding adaptation of foregut-fermenting herbivores. Here, we characterized RNASE1 genes from Caniformia, which has a simple digestive system and lacks microbial digestion typical of herbivores, in an unprecedented scope based on both gene sequence and tissue expression analyses. Remarkably, the results yielded new hypotheses regarding the evolution and the function of Caniformia RNASE1 genes. Four independent gene duplication events in the families of superfamily Musteloidea, including Procyonidae, Ailuridae, Mephitidae and Mustelidae, were recovered, rejecting previous Mustelidae-specific duplication hypothesis, but supporting Musteloidea duplication hypothesis. Moreover, our analyses revealed pronounced differences among the RNASE1 gene copies regarding their selection pressures, pI values and tissue expression patterns, suggesting the differences in their physiological functions. Notably, the expression analyses detected the transcription of a RNASE1 pseudogene in several tissues, raising the possibility that pseudogenes are also a potential source during the RNase functional diversification. In sum, the present work demonstrated a far more complex and intriguing evolutionary pattern and functional diversity of mammalian ribonuclease than previously thought.
Collapse
|
14
|
Li J, Zhang KQ. Independent expansion of zincin metalloproteinases in Onygenales fungi may be associated with their pathogenicity. PLoS One 2014; 9:e90225. [PMID: 24587291 PMCID: PMC3938660 DOI: 10.1371/journal.pone.0090225] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Accepted: 01/28/2014] [Indexed: 11/24/2022] Open
Abstract
To get a comprehensive view of fungal M35 family (deuterolysin) and M36 family (fungalysin) genes, we conducted genome-wide investigations and phylogenetic analyses of genes in these two families from 50 sequenced Ascomycota fungi with different life styles. Large variations in the number of M35 family and M36 family genes were found among different fungal genomes, indicating that these two gene families have been highly dynamic through fungal evolution. Moreover, we found obvious expansions of Meps in two families of Onygenales: Onygenaceae and Arthodermataceae, whereas species in family Ajellomycetace did not show expansion of these genes. The strikingly different gene duplication and loss patterns in Onygenales may be associated with the different pathogenicity of these species. Interestingly, likelihood ratio tests (LRT) of both M35 family and M36 family genes suggested that several branches leading to the duplicated genes in dermatophytic and Coccidioides fungi had signatures of positive selection, indicating that the duplicated Mep genes have likely diverged functionally to play important roles during the evolution of pathogenicity of dermatophytic and Coccidioides fungi. The potentially positively selected residues discovered by our analysis may have contributed to the development of new physiological functions of the duplicated Mep genes in dermatophytic fungi and Coccidioides species. Our study adds to the current knowledge of the evolution of Meps in fungi and also establishes a theoretical foundation for future experimental investigations.
Collapse
Affiliation(s)
- Juan Li
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, P.R. China
| | - Ke-Qin Zhang
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, P.R. China
- * E-mail:
| |
Collapse
|
15
|
Chen Z, Wang Z, Xu S, Zhou K, Yang G. Characterization of hairless (Hr) and FGF5 genes provides insights into the molecular basis of hair loss in cetaceans. BMC Evol Biol 2013; 13:34. [PMID: 23394579 PMCID: PMC3608953 DOI: 10.1186/1471-2148-13-34] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2012] [Accepted: 01/30/2013] [Indexed: 11/29/2022] Open
Abstract
Background Hair is one of the main distinguishing characteristics of mammals and it has many important biological functions. Cetaceans originated from terrestrial mammals and they have evolved a series of adaptations to aquatic environments, which are of evolutionary significance. However, the molecular mechanisms underlying their aquatic adaptations have not been well explored. This study provided insights into the evolution of hair loss during the transition from land to water by investigating and comparing two essential regulators of hair follicle development and hair follicle cycling, i.e., the Hairless (Hr) and FGF5 genes, in representative cetaceans and their terrestrial relatives. Results The full open reading frame sequences of the Hr and FGF5 genes were characterized in seven cetaceans. The sequence characteristics and evolutionary analyses suggested the functional loss of the Hr gene in cetaceans, which supports the loss of hair during their full adaptation to aquatic habitats. By contrast, positive selection for the FGF5 gene was found in cetaceans where a series of positively selected amino acid residues were identified. Conclusions This is the first study to investigate the molecular basis of the hair loss in cetaceans. Our investigation of Hr and FGF5, two indispensable regulators of the hair cycle, provide some new insights into the molecular basis of hair loss in cetaceans. The results suggest that positive selection for the FGF5 gene might have promoted the termination of hair growth and early entry into the catagen stage of hair follicle cycling. Consequently, the hair follicle cycle was disrupted and the hair was lost completely due to the loss of the Hr gene function in cetaceans. This suggests that cetaceans have evolved an effective and complex mechanism for hair loss.
Collapse
Affiliation(s)
- Zhuo Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | | | | | | | | |
Collapse
|
16
|
Voordeckers K, Brown CA, Vanneste K, van der Zande E, Voet A, Maere S, Verstrepen KJ. Reconstruction of ancestral metabolic enzymes reveals molecular mechanisms underlying evolutionary innovation through gene duplication. PLoS Biol 2012; 10:e1001446. [PMID: 23239941 PMCID: PMC3519909 DOI: 10.1371/journal.pbio.1001446] [Citation(s) in RCA: 138] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2012] [Accepted: 10/30/2012] [Indexed: 11/24/2022] Open
Abstract
Gene duplications are believed to facilitate evolutionary innovation. However, the mechanisms shaping the fate of duplicated genes remain heavily debated because the molecular processes and evolutionary forces involved are difficult to reconstruct. Here, we study a large family of fungal glucosidase genes that underwent several duplication events. We reconstruct all key ancestral enzymes and show that the very first preduplication enzyme was primarily active on maltose-like substrates, with trace activity for isomaltose-like sugars. Structural analysis and activity measurements on resurrected and present-day enzymes suggest that both activities cannot be fully optimized in a single enzyme. However, gene duplications repeatedly spawned daughter genes in which mutations optimized either isomaltase or maltase activity. Interestingly, similar shifts in enzyme activity were reached multiple times via different evolutionary routes. Together, our results provide a detailed picture of the molecular mechanisms that drove divergence of these duplicated enzymes and show that whereas the classic models of dosage, sub-, and neofunctionalization are helpful to conceptualize the implications of gene duplication, the three mechanisms co-occur and intertwine.
Collapse
Affiliation(s)
- Karin Voordeckers
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
| | - Chris A. Brown
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
- Fathom Information Design, Boston, Massachusetts, United States of America
- Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Kevin Vanneste
- VIB Department of Plant Systems Biology, Gent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium
| | - Elisa van der Zande
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
| | - Arnout Voet
- Laboratory for Molecular en Structural Biology, KU Leuven, Leuven, Belgium
| | - Steven Maere
- VIB Department of Plant Systems Biology, Gent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium
| | - Kevin J. Verstrepen
- VIB Laboratory for Systems Biology, Leuven, Belgium
- CMPG Laboratory for Genetics and Genomics, KU Leuven, Leuven, Belgium
| |
Collapse
|
17
|
Xu S, Chen Y, Cheng Y, Yang D, Zhou X, Xu J, Zhou K, Yang G. Positive selection at the ASPM gene coincides with brain size enlargements in cetaceans. Proc Biol Sci 2012; 279:4433-40. [PMID: 22977148 DOI: 10.1098/rspb.2012.1729] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The enlargement of cetacean brain size represents an enigmatic event in mammalian evolution, yet its genetic basis remains poorly explored. One candidate gene associated with brain size evolution is the abnormal spindle-like microcephaly associated (ASPM), as mutations in this gene cause severe reductions in the cortical size of humans. Here, we investigated the ASPM gene in representative cetacean lineages and previously published sequences from other mammals to test whether the expansion of the cetacean brain matched adaptive ASPM evolution patterns. Our analyses yielded significant evidence of positive selection on the ASPM gene during cetacean evolution, especially for the Odontoceti and Delphinoidea lineages. These molecular patterns were associated with two major events of relative brain size enlargement in odontocetes and delphinoids. It is of particular interest to find that positive selection was restricted to cetaceans and primates, two distant lineages both characterized by a massive expansion of brain size. This result is suggestive of convergent molecular evolution, although no site-specific convergence at the amino acid level was found.
Collapse
Affiliation(s)
- Shixia Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing, People's Republic of China
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Li J, Yu L, Tian Y, Zhang KQ. Molecular evolution of the deuterolysin (M35) family genes in Coccidioides. PLoS One 2012; 7:e31536. [PMID: 22363666 PMCID: PMC3282736 DOI: 10.1371/journal.pone.0031536] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2011] [Accepted: 01/09/2012] [Indexed: 11/18/2022] Open
Abstract
Coccidioides is a primary fungal pathogen of humans, causing life-threatening respiratory disease known as coccidioidomycosis (Valley fever) in immunocompromised individuals. Recently, Sharpton et al (2009) found that the deuterolysin (M35) family genes were significantly expanded in both the Coccidioides genus and in U. reesii, and that Coccidioides has acquired three more M35 family genes than U. reesii. In the present work, phylogenetic analyses based on a total of 28 M35 family genes using different alignments and tree-building methods consistently revealed five clades with high nodal supports. Interestingly, likelihood ratio tests suggested significant differences in selective pressure on the ancestral lineage of three additional duplicated M35 family genes from Coccidioides species compared to the other lineages in the phylogeny, which may be associated with novel functional adaptations of M35 family genes in the Coccidioides species, e.g., recent pathogenesis acquisition. Our study adds to the expanding view of M35 family gene evolution and functions as well as establishes a theoretical foundation for future experimental investigations.
Collapse
Affiliation(s)
- Juan Li
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, People's Republic of China
| | - Li Yu
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, People's Republic of China
| | - Yanmei Tian
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, People's Republic of China
| | - Ke-Qin Zhang
- Laboratory for Conservation and Utilization of Bio-resources, and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, People's Republic of China
- * E-mail:
| |
Collapse
|
19
|
Evidence for positive selection on the leptin gene in Cetacea and Pinnipedia. PLoS One 2011; 6:e26579. [PMID: 22046310 PMCID: PMC3203152 DOI: 10.1371/journal.pone.0026579] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2011] [Accepted: 09/29/2011] [Indexed: 01/21/2023] Open
Abstract
The leptin gene has received intensive attention and scientific investigation for its importance in energy homeostasis and reproductive regulation in mammals. Furthermore, study of the leptin gene is of crucial importance for public health, particularly for its role in obesity, as well as for other numerous physiological roles that it plays in mammals. In the present work, we report the identification of novel leptin genes in 4 species of Cetacea, and a comparison with 55 publicly available leptin sequences from mammalian genome assemblies and previous studies. Our study provides evidence for positive selection in the suborder Odontoceti (toothed whales) of the Cetacea and the family Phocidae (earless seals) of the Pinnipedia. We also detected positive selection in several leptin gene residues in these two lineages. To test whether leptin and its receptor evolved in a coordinated manner, we analyzed 24 leptin receptor gene (LPR) sequences from available mammalian genome assemblies and other published data. Unlike the case of leptin, our analyses did not find evidence of positive selection for LPR across the Cetacea and Pinnipedia lineages. In line with this, positively selected sites identified in the leptin genes of these two lineages were located outside of leptin receptor binding sites, which at least partially explains why co-evolution of leptin and its receptor was not observed in the present study. Our study provides interesting insights into current understanding of the evolution of mammalian leptin genes in response to selective pressures from life in an aquatic environment, and leads to a hypothesis that new tissue specificity or novel physiologic functions of leptin genes may have arisen in both odontocetes and phocids. Additional data from other species encompassing varying life histories and functional tests of the adaptive role of the amino acid changes identified in this study will help determine the factors that promote the adaptive evolution of the leptin genes in marine mammals.
Collapse
|
20
|
Kobayashi Y, Horie M, Tomonaga K, Suzuki Y. No evidence for natural selection on endogenous borna-like nucleoprotein elements after the divergence of Old World and New World monkeys. PLoS One 2011; 6:e24403. [PMID: 21912690 PMCID: PMC3166317 DOI: 10.1371/journal.pone.0024403] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 07/18/2011] [Indexed: 12/02/2022] Open
Abstract
Endogenous Borna-like nucleoprotein (EBLNs) elements were recently discovered as non-retroviral RNA virus elements derived from bornavirus in the genomes of various animals. Most of EBLNs appeared to be defective, but some of primate EBLN-1 to -4, which appeared to be originated from four independent integrations of bornavirus nucleoprotein (N) gene, have retained an open reading frame (ORF) for more than 40 million years. It was therefore possible that primate EBLNs have encoded functional proteins during evolution. To examine this possibility, natural selection operating on all ORFs of primate EBLN-1 to -4 was examined by comparing the rates of synonymous and nonsynonymous substitutions. The expected number of premature termination codons in EBLN-1 generated after the divergence of Old World and New World monkeys under the selective neutrality was also examined by the Monte Carlo simulation. As a result, natural selection was not identified for the entire region as well as parts of ORFs in the pairwise analysis of primate EBLN-1 to -4 and for any branch of the phylogenetic trees for EBLN-1 to -4 after the divergence of Old World and New World monkeys. Computer simulation also indicated that the absence of premature termination codon in the present-day EBLN-1 does not necessarily support the maintenance of function after the divergence of Old World and New World monkeys. These results suggest that EBLNs have not generally encoded functional proteins after the divergence of Old World and New World monkeys.
Collapse
Affiliation(s)
- Yuki Kobayashi
- Graduate School of Natural Sciences, Nagoya City University, Nagoya, Aichi, Japan
| | - Masayuki Horie
- Research Institute for Microbial Diseases (BIKEN), Osaka University, Suita, Osaka, Japan
- Institute for Virus Research, Kyoto University, Sakyo-ku, Kyoto, Japan
| | - Keizo Tomonaga
- Research Institute for Microbial Diseases (BIKEN), Osaka University, Suita, Osaka, Japan
- Institute for Virus Research, Kyoto University, Sakyo-ku, Kyoto, Japan
| | - Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, Nagoya, Aichi, Japan
- * E-mail:
| |
Collapse
|
21
|
Suzuki Y. Statistical methods for detecting natural selection from genomic data. Genes Genet Syst 2011; 85:359-76. [PMID: 21415566 DOI: 10.1266/ggs.85.359] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
In the study of molecular and phenotypic evolution, understanding the relative importance of random genetic drift and positive selection as the mechanisms for driving divergences between populations and maintaining polymorphisms within populations has been a central issue. A variety of statistical methods has been developed for detecting natural selection operating at the amino acid and nucleotide sequence levels. These methods may be largely classified into those aimed at detecting recurrent and/or recent/ongoing natural selection by utilizing the divergence and/or polymorphism data. Using these methods, pervasive positive selection has been identified for protein-coding and non-coding sequences in the genomic analysis of some organisms. However, many of these methods have been criticized by using computer simulation and real data analysis to produce excessive false-positives and to be sensitive to various disturbing factors. Importantly, some of these methods have been invalidated experimentally. These facts indicate that many of the statistical methods for detecting natural selection are unreliable. In addition, the signals that have been believed as the evidence for fixations of advantageous mutations due to positive selection may also be interpreted as the evidence for fixations of deleterious mutations due to random genetic drift. The genomic diversity data are rapidly accumulating in various organisms, and detection of natural selection may play a critical role for clarifying the relative role of random genetic drift and positive selection in molecular and phenotypic evolution. It is therefore important to develop reliable statistical methods that are unbiased as well as robust against various disturbing factors, for inferring natural selection.
Collapse
Affiliation(s)
- Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, Japan.
| |
Collapse
|
22
|
Yu L, Wang X, Ting N, Zhang Y. Mitogenomic analysis of Chinese snub-nosed monkeys: Evidence of positive selection in NADH dehydrogenase genes in high-altitude adaptation. Mitochondrion 2011; 11:497-503. [PMID: 21292038 DOI: 10.1016/j.mito.2011.01.004] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2010] [Revised: 01/14/2011] [Accepted: 01/24/2011] [Indexed: 11/26/2022]
Abstract
Chinese snub-nosed monkeys belong to the genus Rhinopithecus and are limited in distribution to six isolated mountainous areas in the temperate regions of Central and Southwest China. Compared to the other members of the subfamily Colobinae (or leaf-eating monkeys), these endangered primates are unique in being adapted to a high altitude environment and display a remarkable ability to tolerate low temperatures and hypoxia. They thus offer an interesting organismal model of adaptation to extreme environmental stress. Mitochondria generate energy by oxidative phosphorylation (OXPHOS) and play important roles in oxygen usage and energy metabolism. We analyzed the mitochondrial genomes of two Chinese snub-nosed monkey species and eight other colobines in the first attempt to understand the genetic basis of high altitude adaptation in non-human primates. We found significant evidence of positive selection in one Chinese snub-nosed monkey, Rhinopithecus roxellana, which is suggestive of adaptive change related to high altitude and cold weather stress. In addition, our study identified two potentially important adaptive amino acid residues (533 and 3307) in the NADH2 and NADH6 genes, respectively. Surprisingly, no evidence for positive selection was found in Rhinopithecus bieti (the other Chinese snub-nosed monkey analyzed). This finding is intriguing, especially considering that R. bieti inhabits a higher altitudinal distribution than R. roxellana. We hypothesize that a different adaptive genetic basis to high altitude survival exists in R. bieti from those seen in other mammals, and that positive selection and functionally associated mutations in this species may be detected in nuclear genes related to energy and oxygen metabolism. More information on the structure, function, and evolution of mitochondrial and nuclear genomes in Chinese snub-nosed monkeys is required to reveal the molecular mechanisms that underlie adaptations to high altitude survival in non-human primates.
Collapse
Affiliation(s)
- Li Yu
- Laboratory for Conservation and Utilization of Bio-resource & Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, 650091, PR, China.
| | | | | | | |
Collapse
|
23
|
Yang Z, dos Reis M. Statistical properties of the branch-site test of positive selection. Mol Biol Evol 2010; 28:1217-28. [PMID: 21087944 DOI: 10.1093/molbev/msq303] [Citation(s) in RCA: 230] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The branch-site test is a likelihood ratio test to detect positive selection along prespecified lineages on a phylogeny that affects only a subset of codons in a protein-coding gene, with positive selection indicated by accelerated nonsynonymous substitutions (with ω = d(N)/d(S) > 1). This test may have more power than earlier methods, which average nucleotide substitution rates over sites in the protein and/or over branches on the tree. However, a few recent studies questioned the statistical basis of the test and claimed that the test generated too many false positives. In this paper, we examine the null distribution of the test and conduct a computer simulation to examine the false-positive rate and the power of the test. The results suggest that the asymptotic theory is reliable for typical data sets, and indeed in our simulations, the large-sample null distribution was reliable with as few as 20-50 codons in the alignment. We examined the impact of sequence length, the strength of positive selection, and the proportion of sites under positive selection on the power of the branch-site test. We found that the test was far more powerful in detecting episodic positive selection than branch-based tests, which average substitution rates over all codons in the gene and thus miss the signal when most codons are under strong selective constraint. Recent claims of statistical problems with the branch-site test are due to misinterpretations of simulation results. Our results, as well as previous simulation studies that have demonstrated the robustness of the test, suggest that the branch-site test may be a useful tool for detecting episodic positive selection and for generating biological hypotheses for mutation studies and functional analyses. The test is sensitive to sequence and alignment errors and caution should be exercised concerning its use when data quality is in doubt.
Collapse
Affiliation(s)
- Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, United Kingdom.
| | | |
Collapse
|
24
|
Cohen O, Ashkenazy H, Belinky F, Huchon D, Pupko T. GLOOME: gain loss mapping engine. Bioinformatics 2010; 26:2914-5. [PMID: 20876605 DOI: 10.1093/bioinformatics/btq549] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED The evolutionary analysis of presence and absence profiles (phyletic patterns) is widely used in biology. It is assumed that the observed phyletic pattern is the result of gain and loss dynamics along a phylogenetic tree. Examples of characters that are represented by phyletic patterns include restriction sites, gene families, introns and indels, to name a few. Here, we present a user-friendly web server that accurately infers branch-specific and site-specific gain and loss events. The novel inference methodology is based on a stochastic mapping approach utilizing models that reliably capture the underlying evolutionary processes. A variety of features are available including the ability to analyze the data with various evolutionary models, to infer gain and loss events using either stochastic mapping or maximum parsimony, and to estimate gain and loss rates for each character analyzed. AVAILABILITY Freely available for use at http://gloome.tau.ac.il/.
Collapse
Affiliation(s)
- Ofir Cohen
- Department of Cell Research and Immunology, Tel-Aviv University, Tel Aviv 69978, Israel
| | | | | | | | | |
Collapse
|
25
|
Yu L, Wang XY, Jin W, Luan PT, Ting N, Zhang YP. Adaptive evolution of digestive RNASE1 genes in leaf-eating monkeys revisited: new insights from ten additional colobines. Mol Biol Evol 2010; 27:121-31. [PMID: 19748931 DOI: 10.1093/molbev/msp216] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Pancreatic RNase genes implicated in the adaptation of the colobine monkeys to leaf eating have long intrigued evolutionary biologists since the identification of a duplicated RNASE1 gene with enhanced digestive efficiencies in Pygathrix nemaeus. The recent emergence of two contrasting hypotheses, that is, independent duplication and one-duplication event hypotheses, make it into focus again. Current understanding of Colobine RNASE1 gene evolution of colobine monkeys largely depends on the analyses of few colobine species. The present study with more intensive taxonomic and character sampling not only provides a clearer picture of Colobine RNASE1 gene evolution but also allows to have a more thorough understanding about the molecular basis underlying the adaptation of Colobinae to the unique leaf-feeding lifestyle. The present broader and detailed phylogenetic analyses yielded two important findings: 1) All trees based on the analyses of coding, noncoding, and both regions provided consistent evidence, indicating RNASE1 duplication occurred after Asian and African colobines speciation, that is, independent duplication hypothesis; 2) No obvious evidence of gene conversion in RNASE1 gene was found, favoring independent evolution of Colobine RNASE1 gene duplicates. The conclusion drawn from previous studies that gene conversion has played a significant role in the evolution of Colobine RNASE1 was not supported. Our selective constraint analyses also provided interesting insights, with significant evidence of positive selection detected on ancestor lineages leading to duplicated gene copies. The identification of a handful of new adaptive sites and amino acid changes that have not been characterized previously also provide a necessary foundation for further experimental investigations of RNASE1 functional evolution in Colobinae.
Collapse
Affiliation(s)
- Li Yu
- Laboratory for Conservation and Utilization of Bio-resource and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming, China
| | | | | | | | | | | |
Collapse
|
26
|
Chen X, Zhang Q, Li J, Cao W, Zhang JX, Zhang L, Zhang W, Shao ZJ, Yan Y. Analysis of recombination and natural selection in human enterovirus 71. Virology 2010; 398:251-61. [DOI: 10.1016/j.virol.2009.12.007] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2009] [Revised: 11/13/2009] [Accepted: 12/04/2009] [Indexed: 11/30/2022]
|
27
|
Heidel-Fischer HM, Vogel H, Heckel DG, Wheat CW. Microevolutionary dynamics of a macroevolutionary key innovation in a Lepidopteran herbivore. BMC Evol Biol 2010; 10:60. [PMID: 20181249 PMCID: PMC2841170 DOI: 10.1186/1471-2148-10-60] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Accepted: 02/24/2010] [Indexed: 01/23/2023] Open
Abstract
Background A molecular population genetics understanding is central to the study of ecological and evolutionary functional genomics. Population genetics identifies genetic variation and its distribution within and among populations, it reveals the demographic history of the populations studied, and can provide indirect insights into historical selection dynamics. Here we use this approach to examine the demographic and selective dynamics acting of a candidate gene involved in plant-insect interactions. Previous work documents the macroevolutionary and historical ecological importance of the nitrile-specifier protein (Nsp), which facilitated the host shift of Pieridae butterflies onto Brassicales host plants ~80 Myr ago. Results Here we assess the microevolutionary dynamics of the Nsp gene by studying the within and among-population variation at Nsp and reference genes in the butterfly Pieris rapae (Small Cabbage White). Nsp exhibits unexpectedly high amounts of amino acid polymorphism, unequally distributed across the gene. The vast majority of genetic variation exists within populations, with little to no genetic differentiation among four populations on two continents. A comparison of synonymous and nonsynonymous substitutions in 70 randomly chosen genes among P. rapae and its close relative Pieris brassicae (Large Cabbage White) finds Nsp to have a significantly relaxed functional constraint compared to housekeeping genes. We find strong evidence for a recent population expansion and no role for strong purifying or directional selection upon the Nsp gene. Conclusions The microevolutionary dynamics of the Nsp gene in P. rapae are dominated by recent population expansion and variation in functional constraint across the repeated domains of the Nsp gene. While the high amounts of amino acid diversity suggest there may be significant functional differences among allelic variants segregating within populations, indirect tests of selection could not conclusively identify a signature of historical selection. The importance of using this information for planning future studies of potential performance and fitness consequences of the observed variation is discussed.
Collapse
|
28
|
Abstract
Amastin is a transmembrane glycoprotein found on the cell surfaces of trypanosomatid parasites. Encoded by a large, diverse gene family, amastin was initially described from the intracellular, amastigote stage of Trypanosoma cruzi and Leishmania donovani. Genome sequences have subsequently shown that the amastin repertoire is much larger in Leishmania relative to Trypanosoma. However, it is not known when this expansion occurred, whether it is associated with the origins of Leishmania and vertebrate parasitism itself, or prior to this. To examine the timing of amastin diversification, as well as the evolutionary mechanisms regulating gene repertoire and sequence diversity, this study sequenced the genomic regions containing amastin loci from two related insect parasites (Leptomonas seymouri and Crithidia sp.) and estimated a phylogeny for these and other amastin sequences. The phylogeny shows that amastin includes four subfamilies with distinct genomic positions, secondary structures, and evolution, which were already differentiated in the ancestral trypanosomatid. Diversification in Leishmania was initiated from a single ancestral locus on chromosome 34, with rapid derivation of novel loci through transposition and accelerated sequence divergence. This is absent from related organisms showing that diversification occurred after the origin of Leishmania. These results describe a substantial elaboration of amastin repertoire directly associated with the origin of Leishmania, suggesting that some amastin genes evolved novel functions crucial to cell function in leishmanial parasites after the acquisition of a vertebrate host.
Collapse
Affiliation(s)
- Andrew P Jackson
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.
| |
Collapse
|
29
|
Reliabilities of identifying positive selection by the branch-site and the site-prediction methods. Proc Natl Acad Sci U S A 2009; 106:6700-5. [PMID: 19339501 DOI: 10.1073/pnas.0901855106] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Natural selection operating in protein-coding genes is often studied by examining the ratio (omega) of the rates of nonsynonymous to synonymous nucleotide substitution. The branch-site method (BSM) based on a likelihood ratio test is one of such tests to detect positive selection for a predetermined branch of a phylogenetic tree. However, because the number of nucleotide substitutions involved is often very small, we conducted a computer simulation to examine the reliability of BSM in comparison with the small-sample method (SSM) based on Fisher's exact test. The results indicate that BSM often generates false positives compared with SSM when the number of nucleotide substitutions is approximately 80 or smaller. Because the omega value is also used for predicting positively selected sites, we examined the reliabilities of the site-prediction methods, using nucleotide sequence data for the dim-light and color vision genes in vertebrates. The results showed that the site-prediction methods have a low probability of identifying functional changes of amino acids experimentally determined and often falsely identify other sites where amino acid substitutions are unlikely to be important. This low rate of predictability occurs because most of the current statistical methods are designed to identify codon sites with high omega values, which may not have anything to do with functional changes. The codon sites showing functional changes generally do not show a high omega value. To understand adaptive evolution, some form of experimental confirmation is necessary.
Collapse
|
30
|
Tunstall NE, Sirey T, Newcomb RD, Warr CG. Selective pressures on Drosophila chemosensory receptor genes. J Mol Evol 2007; 64:628-36. [PMID: 17541681 DOI: 10.1007/s00239-006-0151-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2006] [Accepted: 02/28/2007] [Indexed: 01/06/2023]
Abstract
The evolution and patterns of selection of genes encoding 10 Drosophila odorant receptors (Or) and the sex pheromone receptor Gr68a were investigated by comparing orthologous sequences across five to eight ecologically diverse species of Drosophila. Using maximum likelihood estimates of dN/dS ratios we show that all 11 genes sampled are under purifying selection, indicating functional constraint. Four of these genes (Or33c, Or42a, Or85e, and Gr68a) may be under positive selection, and if so, there is good evidence that 12 specific amino acid sites may be under positive selection. All of these sites are predicted to be located either in loop regions or just inside membrane spanning regions, and interestingly one of the two sites in Gr68a is in a similar position to a previously described polymorphism in Gr5a that causes a shift in sensitivity to its ligand trehalose. For three Ors, possible evidence for positive selection was detected along a lineage. These include Or22a in the lineage leading to D. mauritiana and Or22b in the lineage leading to D. simulans. This is of interest in light of previous data showing a change in ligand response profile for these species in the sensory neuron (ab3A) which expresses both Or22a and Or22b in D. melanogaster. In summary, while the main chemosensory function and/or structural integrity of these 10 Or genes and Gr68a are evolutionarily preserved, positive selection appears to be acting on some of these genes, at specific sites and along certain lineages, and provides testable hypotheses for further functional experimentation.
Collapse
Affiliation(s)
- Narelle E Tunstall
- School of Biological Sciences, Monash University, Wellington Road, Clayton, VIC, Australia
| | | | | | | |
Collapse
|
31
|
Padhi A, Verghese B, Otta SK. Detecting the form of selection in the outer membrane protein C of Enterobacter aerogenes strains and Salmonella species. Microbiol Res 2007; 164:282-9. [PMID: 17418551 DOI: 10.1016/j.micres.2006.12.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 12/13/2006] [Indexed: 11/22/2022]
Abstract
The types of selective pressure operating on the outer membrane protein C (ompC) of Enterobacter aerogenes strains, the causative agent for nosocomial infections, and Salmonella sp., the hazardous pathogen are investigated using the maximum likelihood-based codon substitution models. Although the rate of amino acid replacement to the silent substitution (omega) across the entire codon sites of ompC of E. aerogenes (omega=0.3194) and Salmonella sp. (omega=0.2047) indicate that the gene is subjected to purifying selection (i.e. omega<1), approximately 3.7% of ompC codon sites in E. aerogenes (omega=21.52) are under the influence of positive Darwinian selection (i.e. omega>1). Such contrast in the intensity of selective pressures in both pathogens could be associated with the differential response to the adverse environmental changes. In E. aerogenes, majority of the positively selected sites are located in the hypervariable cell-surface-exposed domains whereas the trans-membrane domains are functionally highly constrained.
Collapse
Affiliation(s)
- Abinash Padhi
- Department of Biological Science, University of Tulsa, Tulsa, OK-74104, USA.
| | | | | |
Collapse
|
32
|
Harlin-Cognato A, Hoffman EA, Jones AG. Gene cooption without duplication during the evolution of a male-pregnancy gene in pipefish. Proc Natl Acad Sci U S A 2006; 103:19407-12. [PMID: 17158805 PMCID: PMC1748239 DOI: 10.1073/pnas.0603000103] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Comparative studies of developmental processes suggest that novel traits usually evolve through the cooption of preexisting genes and proteins, mainly via gene duplication and functional specialization of paralogs. However, an alternative hypothesis is that novel protein function can evolve without gene duplication, through changes in the spatiotemporal patterns of gene expression (e.g., via cis-regulatory elements), or functional modifications (e.g., addition of functional domains) of the proteins they encode, or both. Here we present an astacin metalloprotease, dubbed patristacin, which has been coopted without duplication, via alteration in the expression of a preexisting gene from the kidney and liver of bony fishes, for a novel role in the brood pouch of pregnant male pipefish. We examined the molecular evolution of patristacin and found conservation of astacin-specific motifs but also several positively selected amino acids that may represent functional modifications for male pregnancy. Overall, our results pinpoint a clear case in which gene cooption occurred without gene duplication during the genesis of an evolutionarily significant novel structure, the male brood pouch. These findings contribute to a growing understanding of morphological innovation, a critically important but poorly understood process in evolutionary biology.
Collapse
Affiliation(s)
- April Harlin-Cognato
- Department of Biology, Texas A&M University, 3258 TAMU, College Station, TX 77843-3258, USA.
| | | | | |
Collapse
|
33
|
Preston JC, Kellogg EA. Reconstructing the evolutionary history of paralogous APETALA1/FRUITFULL-like genes in grasses (Poaceae). Genetics 2006; 174:421-37. [PMID: 16816429 PMCID: PMC1569798 DOI: 10.1534/genetics.106.057125] [Citation(s) in RCA: 128] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2006] [Accepted: 06/23/2006] [Indexed: 01/12/2023] Open
Abstract
Gene duplication is an important mechanism for the generation of evolutionary novelty. Paralogous genes that are not silenced may evolve new functions (neofunctionalization) that will alter the developmental outcome of preexisting genetic pathways, partition ancestral functions (subfunctionalization) into divergent developmental modules, or function redundantly. Functional divergence can occur by changes in the spatio-temporal patterns of gene expression and/or by changes in the activities of their protein products. We reconstructed the evolutionary history of two paralogous monocot MADS-box transcription factors, FUL1 and FUL2, and determined the evolution of sequence and gene expression in grass AP1/FUL-like genes. Monocot AP1/FUL-like genes duplicated at the base of Poaceae and codon substitutions occurred under relaxed selection mostly along the branch leading to FUL2. Following the duplication, FUL1 was apparently lost from early diverging taxa, a pattern consistent with major changes in grass floral morphology. Overlapping gene expression patterns in leaves and spikelets indicate that FUL1 and FUL2 probably share some redundant functions, but that FUL2 may have become temporally restricted under partial subfunctionalization to particular stages of floret development. These data have allowed us to reconstruct the history of AP1/FUL-like genes in Poaceae and to hypothesize a role for this gene duplication in the evolution of the grass spikelet.
Collapse
Affiliation(s)
- Jill C Preston
- Department of Biology, University of Missouri, Saint Louis, Missouri 63121, USA.
| | | |
Collapse
|
34
|
Codoñer FM, Fares MA, Elena SF. Adaptive covariation between the coat and movement proteins of prunus necrotic ringspot virus. J Virol 2006; 80:5833-40. [PMID: 16731922 PMCID: PMC1472603 DOI: 10.1128/jvi.00122-06] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions.
Collapse
Affiliation(s)
- Francisco M Codoñer
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-UPV, 46022 València, Spain
| | | | | |
Collapse
|
35
|
Chen SL, Hung CS, Xu J, Reigstad CS, Magrini V, Sabo A, Blasiar D, Bieri T, Meyer RR, Ozersky P, Armstrong JR, Fulton RS, Latreille JP, Spieth J, Hooton TM, Mardis ER, Hultgren SJ, Gordon JI. Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach. Proc Natl Acad Sci U S A 2006; 103:5977-82. [PMID: 16585510 PMCID: PMC1424661 DOI: 10.1073/pnas.0600938103] [Citation(s) in RCA: 422] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Escherichia coli is a model laboratory bacterium, a species that is widely distributed in the environment, as well as a mutualist and pathogen in its human hosts. As such, E. coli represents an attractive organism to study how environment impacts microbial genome structure and function. Uropathogenic E. coli (UPEC) must adapt to life in several microbial communities in the human body, and has a complex life cycle in the bladder when it causes acute or recurrent urinary tract infection (UTI). Several studies designed to identify virulence factors have focused on genes that are uniquely represented in UPEC strains, whereas the role of genes that are common to all E. coli has received much less attention. Here we describe the complete 5,065,741-bp genome sequence of a UPEC strain recovered from a patient with an acute bladder infection and compare it with six other finished E. coli genome sequences. We searched 3,470 ortholog sets for genes that are under positive selection only in UPEC strains. Our maximum likelihood-based analysis yielded 29 genes involved in various aspects of cell surface structure, DNA metabolism, nutrient acquisition, and UTI. These results were validated by resequencing a subset of the 29 genes in a panel of 50 urinary, periurethral, and rectal E. coli isolates from patients with UTI. These studies outline a computational approach that may be broadly applicable for studying strain-specific adaptation and pathogenesis in other bacteria.
Collapse
Affiliation(s)
| | | | - Jian Xu
- *Center for Genome Sciences
- Genome Sequencing Center, and Departments of
- Genetics, and
| | - Christopher S. Reigstad
- *Center for Genome Sciences
- **Molecular Biology and Pharmacology, Washington University School of Medicine, St. Louis, MO 63110; and
| | | | - Aniko Sabo
- Genome Sequencing Center, and Departments of
| | | | | | | | | | | | | | | | - John Spieth
- Genome Sequencing Center, and Departments of
| | - Thomas M. Hooton
- Department of Medicine, University of Washington, Seattle, WA 98195
| | | | | | - Jeffrey I. Gordon
- *Center for Genome Sciences
- **Molecular Biology and Pharmacology, Washington University School of Medicine, St. Louis, MO 63110; and
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
36
|
Arnau V, Gallach M, Lucas JI, Marín I. UVPAR: fast detection of functional shifts in duplicate genes. BMC Bioinformatics 2006; 7:174. [PMID: 16569227 PMCID: PMC1570150 DOI: 10.1186/1471-2105-7-174] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2005] [Accepted: 03/28/2006] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND The imprint of natural selection on gene sequences is often difficult to detect. A plethora of methods have been devised to detect genetic changes due to selective processes. However, many of those methods depend heavily on underlying assumptions regarding the mode of change of DNA sequences and often require sophisticated mathematical treatments that made them computationally slow. The development of fast and effective methods to detect modifications in the selective constraints of genes is therefore of great interest. RESULTS We describe UVPAR, a program designed to quickly test for changes in the functional constraints of duplicate genes. Starting with alignments of the proteins encoded by couples of duplicate genes in two different species, UVPAR detects the regions in which modifications of the functional constraints in the paralogs occurred since both species diverged. Sequences can be analyzed with UVPAR in just a few minutes on a standard PC computer. To demonstrate the power of the program, we first show how the results obtained with UVPAR compare to those based on other approaches, using data for vertebrate Hox genes. We then describe a comprehensive study of the RBR family of ubiquitin ligases in which we have performed 529 analyses involving 14 duplicate genes in seven model species. A significant increase in the number of functional shifts was observed for the species Danio rerio and for the gene Ariadne-2. CONCLUSION These results show that UVPAR can be used to generate sensitive analyses to detect changes in the selection constraints acting on paralogs. The high speed of the program allows its application to genome-scale analyses.
Collapse
Affiliation(s)
- Vicente Arnau
- Departamento de Informática, Universidad de Valencia, Burjassot, Spain
| | - Miguel Gallach
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
| | - J Ignasi Lucas
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
| | - Ignacio Marín
- Departamento de Genética, Universidad de Valencia, Burjassot, Spain
| |
Collapse
|
37
|
β-Defensin evolution: selection complexity and clues for residues of functional importance. Biochem Soc Trans 2006. [DOI: 10.1042/bst0340257] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We have examined the evolution of the genes at the major human β-defensin locus and the orthologous loci in a range of other primates and mammals. For the first time, these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as in the ancient past. We have used a combination of maximum-likelihood-based tests and a maximum-parsimony-based sliding window approach to give a detailed view of the varying modes of selection operating at this locus. We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. During the divergence of primates, however, variable selective pressures have acted on β-defensin genes in different evolutionary lineages, with episodes of both negative and, more rarely, positive selection. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel rodent-specific β-defensin gene clades. Sites in the second exon have been subject to positive selection and, by implication, are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions that are predicted to be important for the function of β-defensins.
Collapse
|
38
|
Ryynänen HJ, Primmer CR. Varying signals of the effects of natural selection during teleost growth hormone gene evolution. Genome 2006; 49:42-53. [PMID: 16462900 DOI: 10.1139/g05-079] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The growth hormone (GH) gene of teleost fish exhibits a higher degree of variability compared with other vertebrate groups. However, the different selective constraints at the sequence level are not well understood. In this study, maximum-likelihood (ML) models of codon substitutions were used to investigate Darwinian adaptive evolution of the GH gene in teleost fishes. Complete GH gene sequences of 54 fish species were classified into 4 orders, and the variable nature of GH was examined by determining the dNand dSrate variation and the rates of molecular evolution for each teleost order. The results indicate that although the overall evolution rate for teleost GH is high ((1.15 ± 0.01) × 10–9substitutions/(aa site·y)) compared with the "slow phases" in mammals ((0.21 to 0.28 ± 0.05) × 10–9), the vital structure of this gene has been retained. While the majority of the amino acid changes appear to be due to relaxation of purifying selection, some positively selected sites were detected in regions with no specifically identified role in protein function. The positively selected regions observed in salmoniformes lineage suggests a possible role for positive selection driving functional divergence in paralogous forms of the GH gene after whole-genome duplication in this lineage.Key words: teleost fish, growth hormone, positive selection, synonymous substitution, non-synonymous substitution, molecular evolution.
Collapse
Affiliation(s)
- Heikki J Ryynänen
- Department of Biological and Environmental Sciences, University of Helsinki, Finland
| | | |
Collapse
|
39
|
Abstract
Charles Darwin proposed that evolution occurs primarily by natural selection, but this view has been controversial from the beginning. Two of the major opposing views have been mutationism and neutralism. Early molecular studies suggested that most amino acid substitutions in proteins are neutral or nearly neutral and the functional change of proteins occurs by a few key amino acid substitutions. This suggestion generated an intense controversy over selectionism and neutralism. This controversy is partially caused by Kimura's definition of neutrality, which was too strict (|2Ns|< or =1). If we define neutral mutations as the mutations that do not change the function of gene products appreciably, many controversies disappear because slightly deleterious and slightly advantageous mutations are engulfed by neutral mutations. The ratio of the rate of nonsynonymous nucleotide substitution to that of synonymous substitution is a useful quantity to study positive Darwinian selection operating at highly variable genetic loci, but it does not necessarily detect adaptively important codons. Previously, multigene families were thought to evolve following the model of concerted evolution, but new evidence indicates that most of them evolve by a birth-and-death process of duplicate genes. It is now clear that most phenotypic characters or genetic systems such as the adaptive immune system in vertebrates are controlled by the interaction of a number of multigene families, which are often evolutionarily related and are subject to birth-and-death evolution. Therefore, it is important to study the mechanisms of gene family interaction for understanding phenotypic evolution. Because gene duplication occurs more or less at random, phenotypic evolution contains some fortuitous elements, though the environmental factors also play an important role. The randomness of phenotypic evolution is qualitatively different from allele frequency changes by random genetic drift. However, there is some similarity between phenotypic and molecular evolution with respect to functional or environmental constraints and evolutionary rate. It appears that mutation (including gene duplication and other DNA changes) is the driving force of evolution at both the genic and the phenotypic levels.
Collapse
Affiliation(s)
- Masatoshi Nei
- Department of Biology, Institute of Molecular Evolutionary Genetics, , Pennsylvania State University, USA.
| |
Collapse
|
40
|
Suzuki Y. Statistical properties of the methods for detecting positively selected amino acid sites. Gene 2005; 365:125-9. [PMID: 16256279 DOI: 10.1016/j.gene.2005.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2005] [Revised: 06/30/2005] [Accepted: 09/07/2005] [Indexed: 10/25/2022]
Abstract
Parsimony and Bayesian methods have been developed for detecting positively selected amino acid sites. It has been reported that the parsimony method is generally conservative. In contrast, the Bayesian method is known to identify more positively selected sites than the parsimony method, especially when the number of sequences analyzed is small, although the interpretation of results obtained from the former method is controversial. Here I show that the likelihood-ratio test (LRT) of the Bayesian method corresponds to the parsimony method with window analysis, by analyzing the nucleotide sequences encoding the trans-activator (tax) gene of human T-cell lymphotropic virus type I (HTLV-I). It is also indicated that in the parsimony method, the test of selective neutrality using the binomial probability tends to be conservative, but the Monte Carlo simulation is useful for solving this problem. In addition, in the Bayesian method, the bootstrap method appears to produce similar results to the LRT. This information may be useful for improving the methods for detecting positively selected amino acid sites.
Collapse
Affiliation(s)
- Yoshiyuki Suzuki
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima-shi, Shizuoka-ken 411-8540, Japan.
| |
Collapse
|
41
|
Vasemägi A, Primmer CR. Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies. Mol Ecol 2005; 14:3623-42. [PMID: 16202085 DOI: 10.1111/j.1365-294x.2005.02690.x] [Citation(s) in RCA: 239] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Strategies for the identification of functional genetic variation underlying phenotypic traits of ecological and evolutionary importance have received considerable attention in the literature recently. This paper aims to bring together and compare the relative strengths and limitations of various potentially useful research strategies for dissecting functionally important genetic variation in a wide range of organisms. We briefly explore the relative strengths and limitations of traditional and emerging approaches and evaluate their potential use in free-living populations. While it is likely that much of the progress in functional genetic analyses will rely on progress in traditional model species, it is clear that with prudent choices of methods and appropriate sampling designs, much headway can be also made in a diverse range of species. We suggest that combining research approaches targeting different functional and biological levels can potentially increase understanding the genetic basis of ecological and evolutionary processes both in model and non-model organisms.
Collapse
Affiliation(s)
- A Vasemägi
- Department of Biology, University of Turku, Finland
| | | |
Collapse
|
42
|
Bishop JG. Directed mutagenesis confirms the functional importance of positively selected sites in polygalacturonase inhibitor protein. Mol Biol Evol 2005; 22:1531-4. [PMID: 15829620 DOI: 10.1093/molbev/msi146] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Polygalacturonase inhibitor proteins (PGIPs) protect plants against invasion by diverse microbial and invertebrate enemies that use polygalacturonase (PG) to breach the plant cell wall. Directed mutagenesis has identified specific natural mutations conferring novel defensive capability in green bean PGIP against a specific fungal PG. These same sites are identified as positively selected by phylogenetic codon-substitution models, demonstrating the utility of such models for connecting retrospective comparative analyses with contemporary, ecologically relevant variation.
Collapse
|
43
|
Fitzpatrick DA, Creevey CJ, McInerney JO. Evidence of positive Darwinian selection in putative meningococcal vaccine antigens. J Mol Evol 2005; 61:90-8. [PMID: 16007491 DOI: 10.1007/s00239-004-0290-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2004] [Accepted: 02/25/2005] [Indexed: 10/25/2022]
Abstract
Meningococcal meningitidis is a life-threatening disease. In Europe and the United States the majority of cases are caused by virulent meningococcal strains belonging to serogroup B. Presently there is no effective vaccine against serogroup B strains, as traditional vaccine antigens such as polysaccharide capsules are unusable as they lead to autoimmunity. The year 2000 saw the publication of the complete genome of Neisseria meningitidis MC58, a virulent serogroup B bacterium. Working in conjunction with the sequencing project, researchers endeavored to locate highly conserved membrane-associated proteins that elicit an immune response. It is hoped that these proteins will provide a basis for novel vaccines against serogroup B strains. A number of potential vaccine antigens have been located and are presently in phase I clinical trials. Recently many reports pertaining to the evidence of positive Darwinian selection in membrane proteins of pathogens have been reported. This study utilized in silico methods to test for evidence of historical positive Darwinian selection in seven such vaccine candidates. We found that two of these proteins show signatures of adaptive evolution, while the remaining proteins show evidence of strong purifying selection. This has significant implications for the design of a vaccine against serogroup B strains, as it has been shown that vaccines that target epitopes that are under strong purifying selection are better than those that target variable epitopes.
Collapse
Affiliation(s)
- David A Fitzpatrick
- Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
| | | | | |
Collapse
|
44
|
Podlaha O, Webb DM, Tucker PK, Zhang J. Positive selection for indel substitutions in the rodent sperm protein catsper1. Mol Biol Evol 2005; 22:1845-52. [PMID: 15930155 PMCID: PMC1351353 DOI: 10.1093/molbev/msi178] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Catsper1 is a voltage-gated calcium channel located in the plasma membrane of the sperm tail and is necessary for sperm motility and fertility in mice. We here examine the evolutionary pattern of Catsper1 from nine species of the rodent subfamily Murinae of family Muridae. We show that the rate of insertion/deletion (indel) substitutions in exon 1 of the gene is 4-15 times that in introns or neutral genomic regions, suggesting the presence of strong positive selection that promotes fixations of indel mutations in exon 1. The number of indel polymorphisms within species appears higher than expected from interspecific comparisons, although there are too little data to provide a statistically significant conclusion. These results, together with an earlier report in primates, indicate that positive selection promoting length variation in Catsper1 may be widespread in mammals. A structural model of Catsper1 suggested the importance of the exon 1-encoded region in regulating channel inactivation, which may affect sperm mobility and sperm competition. Our findings provide a necessary foundation for future experimental investigations of Catsper1's function in sperm physiology and role in sperm competition using rodent models.
Collapse
Affiliation(s)
| | | | - Priscilla K. Tucker
- Department of Ecology and Evolutionary Biology and
- Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology and
- *Correspondence to: Jianzhi Zhang, Department of Ecology and Evolutionary Biology, University of Michigan, 1075 Natural Science Building, 830 North University Avenue, Ann Arbor, MI 48109, Phone: 734-763-0527, Fax: 734-763-0544,
| |
Collapse
|
45
|
Semple CAM, Maxwell A, Gautier P, Kilanowski FM, Eastwood H, Barran PE, Dorin JR. The complexity of selection at the major primate beta-defensin locus. BMC Evol Biol 2005; 5:32. [PMID: 15904491 PMCID: PMC1156880 DOI: 10.1186/1471-2148-5-32] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2005] [Accepted: 05/18/2005] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND We have examined the evolution of the genes at the major human beta-defensin locus and the orthologous loci in a range of other primates and mouse. For the first time these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as the ancient past. We have used a combination of maximum likelihood based tests and a maximum parsimony based sliding window approach to give a detailed view of the varying modes of selection operating at this locus. RESULTS We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. Consequently variable selective pressures have acted on beta-defensin genes in different evolutionary lineages, with episodes both of negative, and more rarely positive selection, during the divergence of primates. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel, rodent-specific beta-defensin genes. These observations allow a fuller understanding of the evolution of mammalian innate immunity. In both the rodent and primate lineages, sites in the second exon have been subject to positive selection and by implication are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions predicted to be important for the specificity of the antimicrobial or chemoattractant properties of beta-defensins. Surprisingly, sites within the prepropeptide region were also implicated as being subject to significant positive selection, suggesting previously unappreciated functional significance for this region. CONCLUSIONS Identification of these putatively functional sites has important implications for our understanding of beta-defensin function and for novel antibiotic design.
Collapse
Affiliation(s)
- Colin AM Semple
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Alison Maxwell
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Philippe Gautier
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Fiona M Kilanowski
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Hayden Eastwood
- School of Chemistry, The University of Edinburgh, The King's Buildings, West Mains Road, Edinburgh, EH9 3JJ, UK
| | - Perdita E Barran
- School of Chemistry, The University of Edinburgh, The King's Buildings, West Mains Road, Edinburgh, EH9 3JJ, UK
| | - Julia R Dorin
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, EH4 2XU, UK
| |
Collapse
|
46
|
Fitzpatrick DA, McInerney JO. Evidence of positive Darwinian selection in Omp85, a highly conserved bacterial outer membrane protein essential for cell viability. J Mol Evol 2005; 60:268-73. [PMID: 15785855 DOI: 10.1007/s00239-004-0194-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2004] [Accepted: 08/16/2004] [Indexed: 10/25/2022]
Abstract
Omp85 is a highly conserved outer membrane protein found in all gram-negative bacteria. It is essential for bacterial cell viability and plays an integral function in the positioning and folding of other outer membrane proteins into the bacterial outer membrane. We have employed a maximum likelihood and a maximum parsimony approach to detect evidence of positive Darwinian selection in Omp85 homologues from 10 delta-proteobacteria and have identified 14 amino acid sites that show evidence of being under the influence of adaptive evolution. Interestingly all sites bar one are concentrated within surface loops of the protein that most likely interact with host immune response or the surrounding environment. Alternatively amino acids within membrane-spanning regions of the protein are found to be under purifying selection most likely as a result of structural constraints.
Collapse
Affiliation(s)
- David A Fitzpatrick
- Department of Biology, National University of Ireland, Maynooth, Country Kildare, Ireland
| | | |
Collapse
|
47
|
Bos DH. Natural selection during functional divergence to LMP7 and proteasome subunit X (PSMB5) following gene duplication. J Mol Evol 2005; 60:221-8. [PMID: 15785850 DOI: 10.1007/s00239-004-0120-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2004] [Accepted: 09/09/2004] [Indexed: 11/29/2022]
Abstract
The LMP7 and PSMB5 genes were created through an ancient gene duplication event of their ancestral locus. These proteins contain an active site of proteolysis, and LMP7 replaces PSMB5 as a component of the 20S proteasome after stimulation of cells by interferon-gamma. Replacement of PSMB5 by LMP7 changes the profile of the products of 20S proteasome processing, predisposing digested peptides for transport to and display by the immune system. The purpose of this study is to investigate evolutionary forces influencing functional divergence between LMP7 and PSMB5 following duplication. Levels of synonymous and nonsynonymous substitution rates are estimated to infer differences in levels of natural selection. Estimates of substitution rates indicate that natural selection elevated rates of nonsynonymous substitution in LMP7 following gene duplication, whereas PSMB5 experienced an increase in substitution rate that was not likely due to diversifying natural selection following duplication. Following initial divergence, nearly neutral mutations have dominated gene evolution in both lineages. The LMP7 gene locus provides a rare example of a protein with specialized function arising from duplication and divergence of a housekeeping protein by way of natural selection.
Collapse
Affiliation(s)
- David H Bos
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand.
| |
Collapse
|
48
|
Wong WSW, Yang Z, Goldman N, Nielsen R. Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites. Genetics 2005; 168:1041-51. [PMID: 15514074 PMCID: PMC1448811 DOI: 10.1534/genetics.104.031153] [Citation(s) in RCA: 447] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The parsimony method of Suzuki and Gojobori (1999) and the maximum likelihood method developed from the work of Nielsen and Yang (1998) are two widely used methods for detecting positive selection in homologous protein coding sequences. Both methods consider an excess of nonsynonymous (replacement) substitutions as evidence for positive selection. Previously published simulation studies comparing the performance of the two methods show contradictory results. Here we conduct a more thorough simulation study to cover and extend the parameter space used in previous studies. We also reanalyzed an HLA data set that was previously proposed to cause problems when analyzed using the maximum likelihood method. Our new simulations and a reanalysis of the HLA data demonstrate that the maximum likelihood method has good power and accuracy in detecting positive selection over a wide range of parameter values. Previous studies reporting poor performance of the method appear to be due to numerical problems in the optimization algorithms and did not reflect the true performance of the method. The parsimony method has a very low rate of false positives but very little power for detecting positive selection or identifying positively selected sites.
Collapse
Affiliation(s)
- Wendy S W Wong
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA.
| | | | | | | |
Collapse
|
49
|
Suzuki Y. New methods for detecting positive selection at single amino acid sites. J Mol Evol 2005; 59:11-9. [PMID: 15383903 PMCID: PMC1513646 DOI: 10.1007/s00239-004-2599-6] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2003] [Accepted: 12/29/2003] [Indexed: 11/28/2022]
Abstract
Inferring positive selection at single amino acid sites is of particular importance for studying evolutionary mechanisms of a protein. For this purpose, Suzuki and Gojobori (1999) developed a method (SG method) for comparing the rates of synonymous and nonsynonymous substitutions at each codon site in a protein-coding nucleotide sequence, using ancestral codons at interior nodes of the phylogenetic tree as inferred by the maximum parsimony method. In the SG method, however, selective neutrality of nucleotide substitutions cannot be tested at codon sites, where only termination codons are inferred at any interior node or the number of equally parsimonious inferences of ancestral codons at all interior nodes exceeds 10,000. Here I present a modified SG method which is free from these problems. Specifically, I use the distance-based Bayesian method for inferring the single most likely ancestral codon from 61 sense codons at each interior node. In the computer simulation and real data analysis, the modified SG method showed a higher overall efficiency of detecting positive selection than the original SG method, particularly at highly polymorphic codon sites. These results indicate that the modified SG method is useful for inferring positive selection at codon sites where neutrality cannot be tested by the original SG method. I also discuss that the p-distance is preferable to the number of synonymous substitutions for inferring the phylogenetic tree in the SG method, and present a maximum likelihood method for detecting positive selection at single amino acid sites, which produced reasonable results in the real data analysis.
Collapse
Affiliation(s)
- Yoshiyuki Suzuki
- Institute of Molecular Evolutionary Genetics and Department of Biology, The Pennsylvania State University, 328 Mueller Laboratory, University Park, PA 16802, USA.
| |
Collapse
|
50
|
Kosakovsky Pond SL, Frost SDW. Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol 2005; 22:1208-22. [PMID: 15703242 DOI: 10.1093/molbev/msi105] [Citation(s) in RCA: 1595] [Impact Index Per Article: 83.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We consider three approaches for estimating the rates of nonsynonymous and synonymous changes at each site in a sequence alignment in order to identify sites under positive or negative selection: (1) a suite of fast likelihood-based "counting methods" that employ either a single most likely ancestral reconstruction, weighting across all possible ancestral reconstructions, or sampling from ancestral reconstructions; (2) a random effects likelihood (REL) approach, which models variation in nonsynonymous and synonymous rates across sites according to a predefined distribution, with the selection pressure at an individual site inferred using an empirical Bayes approach; and (3) a fixed effects likelihood (FEL) method that directly estimates nonsynonymous and synonymous substitution rates at each site. All three methods incorporate flexible models of nucleotide substitution bias and variation in both nonsynonymous and synonymous substitution rates across sites, facilitating the comparison between the methods. We demonstrate that the results obtained using these approaches show broad agreement in levels of Type I and Type II error and in estimates of substitution rates. Counting methods are well suited for large alignments, for which there is high power to detect positive and negative selection, but appear to underestimate the substitution rate. A REL approach, which is more computationally intensive than counting methods, has higher power than counting methods to detect selection in data sets of intermediate size but may suffer from higher rates of false positives for small data sets. A FEL approach appears to capture the pattern of rate variation better than counting methods or random effects models, does not suffer from as many false positives as random effects models for data sets comprising few sequences, and can be efficiently parallelized. Our results suggest that previously reported differences between results obtained by counting methods and random effects models arise due to a combination of the conservative nature of counting-based methods, the failure of current random effects models to allow for variation in synonymous substitution rates, and the naive application of random effects models to extremely sparse data sets. We demonstrate our methods on sequence data from the human immunodeficiency virus type 1 env and pol genes and simulated alignments.
Collapse
|