101
|
Computational model for analyzing the evolutionary patterns of the neuraminidase gene of influenza A/H1N1. Comput Biol Chem 2012; 36:23-30. [DOI: 10.1016/j.compbiolchem.2011.12.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2011] [Revised: 12/02/2011] [Accepted: 12/21/2011] [Indexed: 11/19/2022]
|
102
|
Ocaña KACS, de Oliveira D, Horta F, Dias J, Ogasawara E, Mattoso M. Exploring Molecular Evolution Reconstruction Using a Parallel Cloud Based Scientific Workflow. ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY 2012. [DOI: 10.1007/978-3-642-31927-3_16] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
103
|
Gu M, Dong X, Shi L, Shi L, Lin K, Huang X, Chu J. Differences in mtDNA whole sequence between Tibetan and Han populations suggesting adaptive selection to high altitude. Gene 2011; 496:37-44. [PMID: 22233893 DOI: 10.1016/j.gene.2011.12.016] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Revised: 12/02/2011] [Accepted: 12/06/2011] [Indexed: 10/14/2022]
Abstract
We performed a mitochondrial whole-genome comparison study in 40 Tibetan and 50 Han Chinese. All subjects could be classified into 13 haplogroups pertained to the Macrohaplogroup M and N that pitched different quadrants by principal component analysis. We observed a difference in the M9 haplogroup and identified 18 significant variants by comparing whole sequences between Tibetan and Han populations. Variants in ND2, COX2, tRNA alanine and 12S rRNA were predicted to confer increased protein stability in Tibetans. We compared the base substitutions of nonsynonymous (NS) versus synonymous (S) of 13 protein-encoding genes and found the NS/S values of the ATP6, ATP8, and Cyt b genes were larger (>1) in Tibetans than that in Han population. Our findings provide clues for the existence of adaptive selection for the ATP6, ATP8, Cyt b, ND2, COX2, tRNA alanine and 12S rRNA genes in Tibetans which likely contributed to adaptation to their specific geographic environment, such as high altitude.
Collapse
Affiliation(s)
- Mingliang Gu
- Department of Medical Genetics, Institute of Medical Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Kunming, China
| | | | | | | | | | | | | |
Collapse
|
104
|
Polato NR, Vera JC, Baums IB. Gene discovery in the threatened elkhorn coral: 454 sequencing of the Acropora palmata transcriptome. PLoS One 2011; 6:e28634. [PMID: 22216101 PMCID: PMC3247206 DOI: 10.1371/journal.pone.0028634] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 11/12/2011] [Indexed: 12/18/2022] Open
Abstract
Background Cnidarians, including corals and anemones, offer unique insights into metazoan evolution because they harbor genetic similarities with vertebrates beyond that found in model invertebrates and retain genes known only from non-metazoans. Cataloging genes expressed in Acropora palmata, a foundation-species of reefs in the Caribbean and western Atlantic, will advance our understanding of the genetic basis of ecologically important traits in corals and comes at a time when sequencing efforts in other cnidarians allow for multi-species comparisons. Results A cDNA library from a sample enriched for symbiont free larval tissue was sequenced on the 454 GS-FLX platform. Over 960,000 reads were obtained and assembled into 42,630 contigs. Annotation data was acquired for 57% of the assembled sequences. Analysis of the assembled sequences indicated that 83–100% of all A. palmata transcripts were tagged, and provided a rough estimate of the total number genes expressed in our samples (∼18,000–20,000). The coral annotation data contained many of the same molecular components as in the Bilateria, particularly in pathways associated with oxidative stress and DNA damage repair, and provided evidence that homologs of p53, a key player in DNA repair pathways, has experienced selection along the branch separating Cnidaria and Bilateria. Transcriptome wide screens of paralog groups and transition/transversion ratios highlighted genes including: green fluorescent proteins, carbonic anhydrase, and oxidative stress proteins; and functional groups involved in protein and nucleic acid metabolism, and the formation of structural molecules. These results provide a starting point for study of adaptive evolution in corals. Conclusions Currently available transcriptome data now make comparative studies of the mechanisms underlying coral's evolutionary success possible. Here we identified candidate genes that enable corals to maintain genomic integrity despite considerable exposure to genotoxic stress over long life spans, and showed conservation of important physiological pathways between corals and bilaterians.
Collapse
Affiliation(s)
- Nicholas R. Polato
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - J. Cristobal Vera
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Iliana B. Baums
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
105
|
Lüke L, Vicens A, Serra F, Luque-Larena JJ, Dopazo H, Roldan ERS, Gomendio M. Sexual selection halts the relaxation of protamine 2 among rodents. PLoS One 2011; 6:e29247. [PMID: 22216223 PMCID: PMC3244444 DOI: 10.1371/journal.pone.0029247] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2011] [Accepted: 11/23/2011] [Indexed: 12/13/2022] Open
Abstract
Sexual selection has been proposed as the driving force promoting the rapid evolutionary changes observed in some reproductive genes including protamines. We test this hypothesis in a group of rodents which show marked differences in the intensity of sexual selection. Levels of sperm competition were not associated with the evolutionary rates of protamine 1 but, contrary to expectations, were negatively related to the evolutionary rate of cleaved- and mature-protamine 2. Since both domains were found to be under relaxation, our findings reveal an unforeseen role of sexual selection: to halt the degree of degeneration that proteins within families may experience due to functional redundancy. The degree of relaxation of protamine 2 in this group of rodents is such that in some species it has become dysfunctional and it is not expressed in mature spermatozoa. In contrast, protamine 1 is functionally conserved but shows directed positive selection on specific sites which are functionally relevant such as DNA-anchoring domains and phosphorylation sites. We conclude that in rodents protamine 2 is under relaxation and that sexual selection removes deleterious mutations among species with high levels of sperm competition to maintain the protein functional and the spermatozoa competitive.
Collapse
Affiliation(s)
- Lena Lüke
- Reproductive Ecology and Biology Group, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Alberto Vicens
- Reproductive Ecology and Biology Group, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Francois Serra
- Evolutionary Genomics Laboratory, Bioinformatics and Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Juan Jose Luque-Larena
- Area de Zoología, Departamento de Ciencias Agroforestales, E.T.S. Ingenierías Agrarias, Universidad de Valladolid, Campus La Yutera, Palencia, Spain
| | - Hernán Dopazo
- Evolutionary Genomics Laboratory, Bioinformatics and Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Eduardo R. S. Roldan
- Reproductive Ecology and Biology Group, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Montserrat Gomendio
- Reproductive Ecology and Biology Group, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| |
Collapse
|
106
|
Brites D, Encinas-Viso F, Ebert D, Du Pasquier L, Haag CR. Population genetics of duplicated alternatively spliced exons of the Dscam gene in Daphnia and Drosophila. PLoS One 2011; 6:e27947. [PMID: 22174757 PMCID: PMC3236188 DOI: 10.1371/journal.pone.0027947] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2011] [Accepted: 10/28/2011] [Indexed: 01/02/2023] Open
Abstract
In insects and crustaceans, the Down syndrome cell adhesion molecule (Dscam) occurs in many different isoforms. These are produced by mutually exclusive alternative splicing of dozens of tandem duplicated exons coding for parts or whole immunoglobulin (Ig) domains of the Dscam protein. This diversity plays a role in the development of the nervous system and also in the immune system. Structural analysis of the protein suggested candidate epitopes where binding to pathogens could occur. These epitopes are coded by regions of the duplicated exons and are therefore diverse within individuals. Here we apply molecular population genetics and molecular evolution analyses using Daphnia magna and several Drosophila species to investigate the potential role of natural selection in the divergence between orthologs of these duplicated exons among species, as well as between paralogous exons within species. We found no evidence for a role of positive selection in the divergence of these paralogous exons. However, the power of this test was low, and the fact that no signs of gene conversion between paralogous exons were found suggests that paralog diversity may nonetheless be maintained by selection. The analysis of orthologous exons in Drosophila and in Daphnia revealed an excess of non-synonymous polymorphisms in the epitopes putatively involved in pathogen binding. This may be a sign of balancing selection. Indeed, in Dr. melanogaster the same derived non-synonymous alleles segregate in several populations around the world. Yet other hallmarks of balancing selection were not found. Hence, we cannot rule out that the excess of non-synonymous polymorphisms is caused by segregating slightly deleterious alleles, thus potentially indicating reduced selective constraints in the putative pathogen binding epitopes of Dscam.
Collapse
Affiliation(s)
- Daniela Brites
- Zoologisches Institut, Evolutionsbiologie, University of Basel, Basel, Switzerland.
| | | | | | | | | |
Collapse
|
107
|
Fatakia SN, Costanzi S, Chow CC. Molecular evolution of the transmembrane domains of G protein-coupled receptors. PLoS One 2011; 6:e27813. [PMID: 22132149 PMCID: PMC3221663 DOI: 10.1371/journal.pone.0027813] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2010] [Accepted: 10/25/2011] [Indexed: 11/19/2022] Open
Abstract
G protein-coupled receptors (GPCRs) are a superfamily of integral membrane proteins vital for signaling and are important targets for pharmaceutical intervention in humans. Previously, we identified a group of ten amino acid positions (called key positions), within the seven transmembrane domain (7TM) interhelical region, which had high mutual information with each other and many other positions in the 7TM. Here, we estimated the evolutionary selection pressure at those key positions. We found that the key positions of receptors for small molecule natural ligands were under strong negative selection. Receptors naturally activated by lipids had weaker negative selection in general when compared to small molecule-activated receptors. Selection pressure varied widely in peptide-activated receptors. We used this observation to predict that a subgroup of orphan GPCRs not under strong selection may not possess a natural small-molecule ligand. In the subgroup of MRGX1-type GPCRs, we identified a key position, along with two non-key positions, under statistically significant positive selection.
Collapse
Affiliation(s)
- Sarosh N. Fatakia
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Stefano Costanzi
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Carson C. Chow
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, United States of America
| |
Collapse
|
108
|
Analyses of 32 loci clarify phylogenetic relationships among Trypanosoma cruzi lineages and support a single hybridization prior to human contact. PLoS Negl Trop Dis 2011; 5:e1272. [PMID: 21829751 PMCID: PMC3149036 DOI: 10.1371/journal.pntd.0001272] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Accepted: 06/22/2011] [Indexed: 01/23/2023] Open
Abstract
Background The genetic diversity of Trypanosoma cruzi, the etiological agent of Chagas disease, has been traditionally divided in two major groups, T. cruzi I and II, corresponding to discrete typing units TcI and TcII-VI under a recently proposed nomenclature. The two major groups of T. cruzi seem to differ in important biological characteristics, and are thus thought to represent a natural division relevant for epidemiological studies and development of prophylaxis. To understand the potential connection between the different manifestations of Chagas disease and variability of T. cruzi strains, it is essential to have a correct reconstruction of the evolutionary history of T. cruzi. Methodology/Principal Findings Nucleotide sequences from 32 unlinked loci (>26 Kilobases of aligned sequence) were used to reconstruct the evolutionary history of strains representing the known genetic variability of T. cruzi. Thorough phylogenetic analyses show that the original classification of T. cruzi in two major lineages does not reflect its evolutionary history and that there is only strong evidence for one major and recent hybridization event in the history of this species. Furthermore, estimates of divergence times using Bayesian methods show that current extant lineages of T. cruzi diverged very recently, within the last 3 million years, and that the major hybridization event leading to hybrid lineages TcV and TcVI occurred less than 1 million years ago, well before the contact of T. cruzi with humans in South America. Conclusions/Significance The described phylogenetic relationships among the six major genetic subdivisions of T. cruzi should serve as guidelines for targeted epidemiological and prophylaxis studies. We suggest that it is important to reconsider conclusions from previous studies that have attempted to uncover important biological differences between the two originally defined major lineages of T. cruzi especially if those conclusions were obtained from single or few strains. Trypanosoma cruzi is the protozoan parasite that causes Chagas disease, a major health problem in Latin America. The genetic diversity of this parasite has been traditionally divided in two major groups: T. cruzi I and II, which can be further divided in six major genetic subdivisions (subgroups TcI-TcVI). T. cruzi I and II seem to differ in important biological characteristics, and are thought to represent a natural division relevant for epidemiological studies and development of prophylaxis. Having a correct reconstruction of the evolutionary history of T. cruzi is essential for understanding the potential connection between the genetic and phenotypic variability of T. cruzi with the different manifestations of Chagas disease. Here we present results from a comprehensive phylogenetic analysis of T. cruzi using more than 26 Kb of aligned sequence data. We show strong evidence that T. cruzi II (TcII-VI) is not a natural evolutionary group but a paraphyletic lineage and that all major lineages of T. cruzi evolved recently (<3 million years ago [mya]). Furthermore, the sequence data is consistent with one major hybridization event having occurred in this species recently (< 1 mya) but well before T. cruzi entered in contact with humans in South America.
Collapse
|
109
|
Wei RX, Ge S. Evolutionary history and complementary selective relaxation of the duplicated PI genes in grasses. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2011; 53:682-693. [PMID: 21615687 DOI: 10.1111/j.1744-7909.2011.01058.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Gene duplication plays an important role in the evolution of organisms by allowing functional innovation and the divergence of duplicate genes. Previous studies found two PI-like genes in grass species, suggesting functional divergence between the paralogous copies. Here, we reconstructed the evolutionary history of two PI genes from major lineages of grasses and other monocot species, and demonstrated that two PI genes (PI1 and PI2) arose from a whole genome duplication that occurred in a common ancestor of extant grasses. Molecular evolutionary analyses at the family and tribal levels found strong purifying selection acting on two genes in grasses, consistent with the conserved class B function of the PI genes. Importantly, we detected different patterns of selective relaxation between the duplicated PI genes although no signature of positive selection was found. Likelihood ratio tests revealed that the ω ratio for M domain is significantly higher in PI1 than in PI2 but that for K domain is significantly higher in PI2 than in PI1. These findings imply that complementary selective relaxation occurs in two PI genes after duplication, and provide additional molecular evidence for the subfunctionalization of the duplicated PI genes in grasses.
Collapse
Affiliation(s)
- Ruo-Xun Wei
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, the Chinese Academy of Sciences, Beijing, China
| | | |
Collapse
|
110
|
Crisci JL, Wong A, Good JM, Jensen JD. On characterizing adaptive events unique to modern humans. Genome Biol Evol 2011; 3:791-8. [PMID: 21803765 PMCID: PMC3163466 DOI: 10.1093/gbe/evr075] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Ever since the first draft of the human genome was completed in 2001, there has been increased interest in identifying genetic changes that are uniquely human, which could account for our distinct morphological and cognitive capabilities with respect to other apes. Recently, draft sequences of two extinct hominin genomes, a Neanderthal and Denisovan, have been released. These two genomes provide a much greater resolution to identify human-specific genetic differences than the chimpanzee, our closest extant relative. The Neanderthal genome paper presented a list of regions putatively targeted by positive selection around the time of the human–Neanderthal split. We here seek to characterize the evolutionary history of these candidate regions—examining evidence for selective sweeps in modern human populations as well as for accelerated adaptive evolution across apes. Results indicate that 3 of the top 20 candidate regions show evidence of selection in at least one modern human population (P < 5 × 105). Additionally, four genes within the top 20 regions show accelerated amino acid substitutions across multiple apes (P < 0.01), suggesting importance across deeper evolutionary time. These results highlight the importance of evaluating evolutionary processes across both recent and ancient evolutionary timescales and intriguingly suggest a list of candidate genes that may have been uniquely important around the time of the human–Neanderthal split.
Collapse
Affiliation(s)
- Jessica L Crisci
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, USA.
| | | | | | | |
Collapse
|
111
|
Reed KM, Bauer MM, Monson MS, Benoit B, Chaves LD, O'Hare TH, Delany ME. Defining the turkey MHC: identification of expressed class I- and class IIB-like genes independent of the MHC-B. Immunogenetics 2011; 63:753-71. [PMID: 21710346 DOI: 10.1007/s00251-011-0549-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2011] [Accepted: 06/07/2011] [Indexed: 12/14/2022]
Abstract
The MHC of the turkey (Meleagris gallopavo) is divided into two genetically unlinked regions; the MHC-B and MHC-Y. Although previous studies found the turkey MHC-B to be highly similar to that of the chicken, little is known of the gene content and extent of the MHC-Y. This study describes two partially overlapping large-insert BAC clones that genetically and physically map to the turkey MHC chromosome (MGA18) but to a region that assorts independently of MHC-B. Within the sequence assembly, 14 genes were predicted including new class I- and class IIB-like loci. Additional unassembled sequences corresponded to multiple copies of the ribosomal RNA repeat unit (18S-5.8S-28S). Thus, this newly identified MHC region appears to represent a physical boundary of the turkey MHC-Y. High-resolution multi-color fluorescence in situ hybridization studies confirm rearrangement of MGA18 relative to the orthologous chicken chromosome (GGA16) in regard to chromosome architecture, but not gene order. The difference in centromere position between the species is indicative of multiple chromosome rearrangements or alternate events such as neocentromere formation/centromere inactivation in the evolution of the MHC chromosome. Comparative sequencing of commercial turkeys (six amplicons totaling 7.6 kb) identified 68 single nucleotide variants defining nine MHC-Y haplotypes. Sequences of the new class I- and class IIB-like genes are most similar to MHC-Y genes in the chicken. All three loci are expressed in the spleen. Differential transcription of the MHC-Y class IIB-like loci was evident as one class IIB-like locus was only expressed in some individuals.
Collapse
Affiliation(s)
- Kent M Reed
- Department of Veterinary and Biomedical Sciences, College of Veterinary Medicine, University of Minnesota, St Paul, MN 55108, USA,
| | | | | | | | | | | | | |
Collapse
|
112
|
Kosakovsky Pond SL, Murrell B, Fourment M, Frost SDW, Delport W, Scheffler K. A random effects branch-site model for detecting episodic diversifying selection. Mol Biol Evol 2011; 28:3033-43. [PMID: 21670087 DOI: 10.1093/molbev/msr125] [Citation(s) in RCA: 328] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Adaptive evolution frequently occurs in episodic bursts, localized to a few sites in a gene, and to a small number of lineages in a phylogenetic tree. A popular class of "branch-site" evolutionary models provides a statistical framework to search for evidence of such episodic selection. For computational tractability, current branch-site models unrealistically assume that all branches in the tree can be partitioned a priori into two rigid classes--"foreground" branches that are allowed to undergo diversifying selective bursts and "background" branches that are negatively selected or neutral. We demonstrate that this assumption leads to unacceptably high rates of false positives or false negatives when the evolutionary process along background branches strongly deviates from modeling assumptions. To address this problem, we extend Felsenstein's pruning algorithm to allow efficient likelihood computations for models in which variation over branches (and not just sites) is described in the random effects likelihood framework. This enables us to model the process at every branch-site combination as a mixture of three Markov substitution models--our model treats the selective class of every branch at a particular site as an unobserved state that is chosen independently of that at any other branch. When benchmarked on a previously published set of simulated sequences, our method consistently matched or outperformed existing branch-site tests in terms of power and error rates. Using three empirical data sets, previously analyzed for episodic selection, we discuss how modeling assumptions can influence inference in practical situations.
Collapse
|
113
|
Sen L, Fares MA, Liang B, Gao L, Wang B, Wang T, Su YJ. Molecular evolution of rbcL in three gymnosperm families: identifying adaptive and coevolutionary patterns. Biol Direct 2011; 6:29. [PMID: 21639885 PMCID: PMC3129321 DOI: 10.1186/1745-6150-6-29] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Accepted: 06/03/2011] [Indexed: 11/10/2022] Open
Abstract
Background The chloroplast-localized ribulose-1, 5-biphosphate carboxylase/oxygenase (Rubisco), the primary enzyme responsible for autotrophy, is instrumental in the continual adaptation of plants to variations in the concentrations of CO2. The large subunit (LSU) of Rubisco is encoded by the chloroplast rbcL gene. Although adaptive processes have been previously identified at this gene, characterizing the relationships between the mutational dynamics at the protein level may yield clues on the biological meaning of such adaptive processes. The role of such coevolutionary dynamics in the continual fine-tuning of RbcL remains obscure. Results We used the timescale and phylogenetic analyses to investigate and search for processes of adaptive evolution in rbcL gene in three gymnosperm families, namely Podocarpaceae, Taxaceae and Cephalotaxaceae. To understand the relationships between regions identified as having evolved under adaptive evolution, we performed coevolutionary analyses using the software CAPS. Importantly, adaptive processes were identified at amino acid sites located on the contact regions among the Rubisco subunits and on the interface between Rubisco and its activase. Adaptive amino acid replacements at these regions may have optimized the holoenzyme activity. This hypothesis was pinpointed by evidence originated from our analysis of coevolution that supported the correlated evolution between Rubisco and its activase. Interestingly, the correlated adaptive processes between both these proteins have paralleled the geological variation history of the concentration of atmospheric CO2. Conclusions The gene rbcL has experienced bursts of adaptations in response to the changing concentration of CO2 in the atmosphere. These adaptations have emerged as a result of a continuous dynamic of mutations, many of which may have involved innovation of functional Rubisco features. Analysis of the protein structure and the functional implications of such mutations put forward the conclusion that this evolutionary scenario has been possible through a complex interplay between adaptive mutations, often structurally destabilizing, and compensatory mutations. Our results unearth patterns of evolution that have likely optimized the Rubisco activity and uncover mutational dynamics useful in the molecular engineering of enzymatic activities. Reviewers This article was reviewed by Prof. Christian Blouin (nominated by Dr W Ford Doolittle), Dr Endre Barta (nominated by Dr Sandor Pongor), and Dr Nicolas Galtier.
Collapse
Affiliation(s)
- Lin Sen
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
| | | | | | | | | | | | | |
Collapse
|
114
|
Meslin C, Brimau F, Nagnan-Le Meillour P, Callebaut I, Pascal G, Monget P. The evolutionary history of the SAL1 gene family in eutherian mammals. BMC Evol Biol 2011; 11:148. [PMID: 21619679 PMCID: PMC3128046 DOI: 10.1186/1471-2148-11-148] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2011] [Accepted: 05/28/2011] [Indexed: 11/16/2022] Open
Abstract
Background SAL1 (salivary lipocalin) is a member of the OBP (Odorant Binding Protein) family and is involved in chemical sexual communication in pig. SAL1 and its relatives may be involved in pheromone and olfactory receptor binding and in pre-mating behaviour. The evolutionary history and the selective pressures acting on SAL1 and its orthologous genes have not yet been exhaustively described. The aim of the present work was to study the evolution of these genes, to elucidate the role of selective pressures in their evolution and the consequences for their functions. Results Here, we present the evolutionary history of SAL1 gene and its orthologous genes in mammals. We found that (1) SAL1 and its related genes arose in eutherian mammals with lineage-specific duplications in rodents, horse and cow and are lost in human, mouse lemur, bushbaby and orangutan, (2) the evolution of duplicated genes of horse, rat, mouse and guinea pig is driven by concerted evolution with extensive gene conversion events in mouse and guinea pig and by positive selection mainly acting on paralogous genes in horse and guinea pig, (3) positive selection was detected for amino acids involved in pheromone binding and amino acids putatively involved in olfactory receptor binding, (4) positive selection was also found for lineage, indicating a species-specific strategy for amino acid selection. Conclusions This work provides new insights into the evolutionary history of SAL1 and its orthologs. On one hand, some genes are subject to concerted evolution and to an increase in dosage, suggesting the need for homogeneity of sequence and function in certain species. On the other hand, positive selection plays a role in the diversification of the functions of the family and in lineage, suggesting adaptive evolution, with possible consequences for speciation and for the reinforcement of prezygotic barriers.
Collapse
Affiliation(s)
- Camille Meslin
- UMR85 Physiologie de la Reproduction et des Comportements, INRA, Nouzilly, France
| | | | | | | | | | | |
Collapse
|
115
|
Khan MMG, Rydén AM, Chowdhury MS, Hasan MA, Kazi JU. Maximum likelihood analysis of mammalian p53 indicates the presence of positively selected sites and higher tumorigenic mutations in purifying sites. Gene 2011; 483:29-35. [PMID: 21640173 DOI: 10.1016/j.gene.2011.05.011] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2011] [Revised: 05/15/2011] [Accepted: 05/16/2011] [Indexed: 01/10/2023]
Abstract
The tumor suppressor gene TP53 (p53) maintains genome stability. Mutation or loss of p53 is found in most cancers. Analysis of evolutionary constrains and p53 mutations reveal important sites for concomitant functional studies. In this study, phylogenetic analyses of the coding sequences of p53 from 26 mammals were carried out by applying a maximum likelihood method. The results display two branches under adaptive evolution in mammals. Moreover, each codon of p53 was analyzed by the PAML method for presence of positively selected sites. PAML identified several statistically significant amino acids that undergo positive selection. The data indicates that amino acids responsible for the core functions of p53 are highly conserved, while positively selected sites are predominantly located in the N- and C-terminus of p53. Further analysis of evolutionary pressure and mutations showed the occurrence of more frequent tumorigenic mutations in purifying sites of p53.
Collapse
Affiliation(s)
- Maola M G Khan
- Biochemistry and Molecular Biology, Jahangirnagar University, Bangladesh
| | | | | | | | | |
Collapse
|
116
|
Li W, Shi W, Qiao H, Ho SYW, Luo A, Zhang Y, Zhu C. Positive selection on hemagglutinin and neuraminidase genes of H1N1 influenza viruses. Virol J 2011; 8:183. [PMID: 21507270 PMCID: PMC3094300 DOI: 10.1186/1743-422x-8-183] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2010] [Accepted: 04/21/2011] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Since its emergence in March 2009, the pandemic 2009 H1N1 influenza A virus has posed a serious threat to public health. To trace the evolutionary path of these new pathogens, we performed a selection-pressure analysis of a large number of hemagglutinin (HA) and neuraminidase (NA) gene sequences of H1N1 influenza viruses from different hosts. RESULTS Phylogenetic analysis revealed that both HA and NA genes have evolved into five distinct clusters, with further analyses indicating that the pandemic 2009 strains have experienced the strongest positive selection. We also found evidence of strong selection acting on the seasonal human H1N1 isolates. However, swine viruses from North America and Eurasia were under weak positive selection, while there was no significant evidence of positive selection acting on the avian isolates. A site-by-site analysis revealed that the positively selected sites were located in both of the cleaved products of HA (HA1 and HA2), as well as NA. In addition, the pandemic 2009 strains were subject to differential selection pressures compared to seasonal human, North American swine and Eurasian swine H1N1 viruses. CONCLUSIONS Most of these positively and/or differentially selected sites were situated in the B-cell and/or T-cell antigenic regions, suggesting that selection at these sites might be responsible for the antigenic variation of the viruses. Moreover, some sites were also associated with glycosylation and receptor-binding ability. Thus, selection at these positions might have helped the pandemic 2009 H1N1 viruses to adapt to the new hosts after they were introduced from pigs to humans. Positive selection on position 274 of NA protein, associated with drug resistance, might account for the prevalence of drug-resistant variants of seasonal human H1N1 influenza viruses, but there was no evidence that positive selection was responsible for the spread of the drug resistance of the pandemic H1N1 strains.
Collapse
Affiliation(s)
- Wenfu Li
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Graduate University of Chinese Academy of Sciences, Beijing 100049, China
| | - Weifeng Shi
- UCD Conway Institute of Biomolecular and Biomedical Sciences, University College Dublin, Dublin 4, Ireland
| | - Huijie Qiao
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Simon YW Ho
- School of Biological Sciences, University of Sydney, Sydney NSW 2006, Australia
| | - Arong Luo
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Graduate University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanzhou Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Chaodong Zhu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
117
|
Lenz TL. Computational prediction of MHC II-antigen binding supports divergent allele advantage and explains trans-species polymorphism. Evolution 2011; 65:2380-90. [PMID: 21790583 DOI: 10.1111/j.1558-5646.2011.01288.x] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
The major histocompatibility complex (MHC), coding for antigen presenting molecules of the adaptive immune system, represents one of the most polymorphic regions in the vertebrate genome. The exceptional polymorphism, which is potentially maintained by balancing selection under host-parasite coevolution, comprises excessive sequence divergence among alleles as well as ancient allelic lineages that predate species divergence (trans-species polymorphism). Here, the mechanisms that are proposed to maintain such sequence divergence and ancient lineages are investigated. Established computational antigen-binding prediction algorithms, which are based on empirical databases, are employed to determine the overlap in bound antigens among individual MHC class IIB alleles. The results show that genetically more divergent allele pairs experience less overlap and thus present a broader range of potential antigens. These findings support the divergent allele advantage hypothesis and furthermore suggest an evolutionary advantage explaining the maintenance of divergent allelic lineages, that is, trans-species polymorphism. In addressing a quantitative rather than qualitative aspect of MHC alleles, these insights highlight a new direction for future research on MHC evolution.
Collapse
Affiliation(s)
- Tobias L Lenz
- Department of Evolutionary Ecology, Max Planck Institute for Evolutionary Biology, August-Thienemann-Str. 2, 24306 Plön, Germany.
| |
Collapse
|
118
|
Rokyta DR, Wray KP, Lemmon AR, Lemmon EM, Caudle SB. A high-throughput venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) and evidence for pervasive positive selection across toxin classes. Toxicon 2011; 57:657-71. [PMID: 21255598 DOI: 10.1016/j.toxicon.2011.01.008] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2010] [Revised: 01/05/2011] [Accepted: 01/10/2011] [Indexed: 10/18/2022]
Abstract
Despite causing considerable human mortality and morbidity, animal toxins represent a valuable source of pharmacologically active macromolecules, a unique system for studying molecular adaptation, and a powerful framework for examining structure-function relationships in proteins. Snake venoms are particularly useful in the latter regard as they consist primarily of a moderate number of proteins and peptides that have been found to belong to just a handful of protein families. As these proteins and peptides are produced in dedicated glands, transcriptome sequencing has proven to be an effective approach to identifying the expressed toxin genes. We generated a venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) using Roche 454 sequencing technology. In the current work, we focus on transcripts encoding toxins. We identified 40 unique toxin transcripts, 30 of which have full-length coding sequences, and 10 have only partial coding sequences. These toxins account for 24% of the total sequencing reads. We found toxins from 11 previously described families of snake-venom toxins and have discovered two putative, previously undescribed toxin classes. The most diverse and highly expressed toxin classes in the C. adamanteus venom-gland transcriptome are the serine proteinases, metalloproteinases, and C-type lectins. The serine proteinases are the most abundant class, accounting for 35% of the toxin sequencing reads. Metalloproteinases are the most diverse; 11 different forms have been identified. Using our sequences and those available in public databases, we detected positive selection in seven of the eight toxin families for which sufficient sequences were available for the analysis. We find that the vast majority of the genes that contribute directly to this vertebrate trait show evidence for a role for positive selection in their evolutionary history.
Collapse
Affiliation(s)
- Darin R Rokyta
- Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295, USA.
| | | | | | | | | |
Collapse
|
119
|
Rytkönen KT, Williams TA, Renshaw GM, Primmer CR, Nikinmaa M. Molecular Evolution of the Metazoan PHD–HIF Oxygen-Sensing System. Mol Biol Evol 2011; 28:1913-26. [DOI: 10.1093/molbev/msr012] [Citation(s) in RCA: 105] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
120
|
Farfán M, Miñana-Galbis D, Garreta A, Lorén JG, Fusté MC. Malate dehydrogenase: A useful phylogenetic marker for the genus Aeromonas. Syst Appl Microbiol 2010; 33:427-35. [DOI: 10.1016/j.syapm.2010.09.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2010] [Revised: 09/21/2010] [Accepted: 09/23/2010] [Indexed: 11/16/2022]
|
121
|
Yang Z, dos Reis M. Statistical properties of the branch-site test of positive selection. Mol Biol Evol 2010; 28:1217-28. [PMID: 21087944 DOI: 10.1093/molbev/msq303] [Citation(s) in RCA: 239] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The branch-site test is a likelihood ratio test to detect positive selection along prespecified lineages on a phylogeny that affects only a subset of codons in a protein-coding gene, with positive selection indicated by accelerated nonsynonymous substitutions (with ω = d(N)/d(S) > 1). This test may have more power than earlier methods, which average nucleotide substitution rates over sites in the protein and/or over branches on the tree. However, a few recent studies questioned the statistical basis of the test and claimed that the test generated too many false positives. In this paper, we examine the null distribution of the test and conduct a computer simulation to examine the false-positive rate and the power of the test. The results suggest that the asymptotic theory is reliable for typical data sets, and indeed in our simulations, the large-sample null distribution was reliable with as few as 20-50 codons in the alignment. We examined the impact of sequence length, the strength of positive selection, and the proportion of sites under positive selection on the power of the branch-site test. We found that the test was far more powerful in detecting episodic positive selection than branch-based tests, which average substitution rates over all codons in the gene and thus miss the signal when most codons are under strong selective constraint. Recent claims of statistical problems with the branch-site test are due to misinterpretations of simulation results. Our results, as well as previous simulation studies that have demonstrated the robustness of the test, suggest that the branch-site test may be a useful tool for detecting episodic positive selection and for generating biological hypotheses for mutation studies and functional analyses. The test is sensitive to sequence and alignment errors and caution should be exercised concerning its use when data quality is in doubt.
Collapse
Affiliation(s)
- Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, United Kingdom.
| | | |
Collapse
|
122
|
Demogines A, East AM, Lee JH, Grossman SR, Sabeti PC, Paull TT, Sawyer SL. Ancient and recent adaptive evolution of primate non-homologous end joining genes. PLoS Genet 2010; 6:e1001169. [PMID: 20975951 PMCID: PMC2958818 DOI: 10.1371/journal.pgen.1001169] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 09/21/2010] [Indexed: 02/07/2023] Open
Abstract
In human cells, DNA double-strand breaks are repaired primarily by the non-homologous end joining (NHEJ) pathway. Given their critical nature, we expected NHEJ proteins to be evolutionarily conserved, with relatively little sequence change over time. Here, we report that while critical domains of these proteins are conserved as expected, the sequence of NHEJ proteins has also been shaped by recurrent positive selection, leading to rapid sequence evolution in other protein domains. In order to characterize the molecular evolution of the human NHEJ pathway, we generated large simian primate sequence datasets for NHEJ genes. Codon-based models of gene evolution yielded statistical support for the recurrent positive selection of five NHEJ genes during primate evolution: XRCC4, NBS1, Artemis, POLλ, and CtIP. Analysis of human polymorphism data using the composite of multiple signals (CMS) test revealed that XRCC4 has also been subjected to positive selection in modern humans. Crystal structures are available for XRCC4, Nbs1, and Polλ; and residues under positive selection fall exclusively on the surfaces of these proteins. Despite the positive selection of such residues, biochemical experiments with variants of one positively selected site in Nbs1 confirm that functions necessary for DNA repair and checkpoint signaling have been conserved. However, many viruses interact with the proteins of the NHEJ pathway as part of their infectious lifecycle. We propose that an ongoing evolutionary arms race between viruses and NHEJ genes may be driving the surprisingly rapid evolution of these critical genes. Because all cells experience DNA damage, they must also have mechanisms for repairing DNA. When the proteins that repair DNA malfunction, mutation and disease often result. Based on their fundamental importance, DNA repair proteins would be expected to be well preserved over evolutionary time in order to ensure optimal DNA repair function. However, a previous genome-wide study of molecular evolution in Saccharomyces yeast identified the non-homologous end joining (NHEJ) DNA repair pathway as one of the two most rapidly evolving pathways in the yeast genome. In order to analyze the evolution of this pathway in humans, we have generated large evolutionary sequence sets of NHEJ genes from our primate relatives. Similar to the scenario in yeast, several genes in this pathway are evolving rapidly in primate genomes and in modern human populations. Thus, complex and seemingly opposite selective forces are shaping the evolution of these important DNA repair genes. The finding that NHEJ genes are rapidly evolving in species groups as diverse as yeasts and primates indicates a systematic perturbation of the NHEJ pathway, one that is potentially important to human health.
Collapse
Affiliation(s)
- Ann Demogines
- Section of Molecular Genetics and Microbiology, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
| | - Alysia M. East
- Section of Molecular Genetics and Microbiology, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
| | - Ji-Hoon Lee
- Section of Molecular Genetics and Microbiology, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
- The Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| | - Sharon R. Grossman
- FAS Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Pardis C. Sabeti
- FAS Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Tanya T. Paull
- Section of Molecular Genetics and Microbiology, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
- The Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| | - Sara L. Sawyer
- Section of Molecular Genetics and Microbiology, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
- * E-mail:
| |
Collapse
|
123
|
Teng H, Cai W, Zhou L, Zhang J, Liu Q, Wang Y, Dai W, Zhao M, Sun Z. Evolutionary mode and functional divergence of vertebrate NMDA receptor subunit 2 genes. PLoS One 2010; 5:e13342. [PMID: 20976280 PMCID: PMC2954789 DOI: 10.1371/journal.pone.0013342] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2010] [Accepted: 09/21/2010] [Indexed: 11/19/2022] Open
Abstract
Background Ionotropic glutamate receptors in the central nervous system play a major role in numerous brain functions including learning and memory in many vertebrate species. NR2 subunits have been regarded as rate-limiting molecules in controlling the optimal N-methyl-D-aspartate (NMDA) receptor's coincidence-detection property and subsequent learning and memory function across multi-species. However, its evolutionary mode among vertebrate species remains unclear. Results With extensive analysis of phylogeny, exon structure, protein domain, paralogon and synteny, we demonstrated that two-round genome duplication generated quartet GRIN2 genes and the third-round fish-specific genome duplication generated extra copies of fish GRIN2 genes. In addition, in-depth investigation has enabled the identification of three novel genes, GRIN2C_Gg, GRIN2D-1_Ol and GRIN2D-2_Tr in the chicken, medaka and fugu genome, respectively. Furthermore, we showed functional divergence of NR2 genes mostly occurred at the first-round duplication, amino acid residues located at the N-terminal Lig_chan domain were responsible for type I functional divergence between these GRIN2 subfamilies and purifying selection has been the prominent natural pressure operating on these diversified GRIN2 genes. Conclusion and Significance These findings provide intriguing subjects for testing the 2R and 3R hypothesis and we expect it could provide new insights into the underlying evolution mechanisms of cognition in vertebrate.
Collapse
Affiliation(s)
- Huajing Teng
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Wanshi Cai
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - LingLin Zhou
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - Jing Zhang
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Qi Liu
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - Yongqing Wang
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Wei Dai
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Mei Zhao
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Zhongsheng Sun
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
- * E-mail:
| |
Collapse
|
124
|
MHC class II DRB diversity in raccoons (Procyon lotor) reveals associations with raccoon rabies virus (Lyssavirus). Immunogenetics 2010; 63:103-13. [DOI: 10.1007/s00251-010-0485-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Accepted: 09/15/2010] [Indexed: 10/19/2022]
|
125
|
Evidence for positive selection in the gene fruitless in Anastrepha fruit flies. BMC Evol Biol 2010; 10:293. [PMID: 20868501 PMCID: PMC2958917 DOI: 10.1186/1471-2148-10-293] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Accepted: 09/24/2010] [Indexed: 01/22/2023] Open
Abstract
Background Many genes involved in the sex determining cascade have indicated signals of positive selection and rapid evolution across different species. Even though fruitless is an important gene involved mostly in several aspects of male courtship behavior, the few studies so far have explained its high rates of evolution by relaxed selective constraints. This would indicate that a large portion of this gene has evolved neutrally, contrary to what has been observed for other genes in the sex cascade. Results Here we test whether the fruitless gene has evolved neutrally or under positive selection in species of Anastrepha (Tephritidae: Diptera) using two different approaches, a long-term evolutionary analysis and a populational genetic data analysis. The first analysis was performed by using sequences of three species of Anastrepha and sequences from several species of Drosophila using the ratio of nonsynonymous to synonymous rates of evolution in PAML, which revealed that the fru region here studied has evolved by positive selection. Using Bayes Empirical Bayes we estimated that 16 sites located in the connecting region of the fruitless gene were evolving under positive selection. We also investigated for signs of this positive selection using populational data from 50 specimens from three species of Anastrepha from different localities in Brazil. The use of standard tests of selection and a new test that compares patterns of differential survival between synonymous and nonsynonymous in evolutionary time also provide evidence of positive selection across species and of a selective sweep for one of the species investigated. Conclusions Our data indicate that the high diversification of fru connecting region in Anastrepha flies is due at least in part to positive selection, not merely as a consequence of relaxed selective constraint. These conclusions are based not only on the comparison of distantly related taxa that show long-term divergence time, but also on recently diverged lineages and suggest that episodes of adaptive evolution in fru may be related to sexual selection and/or conflict related to its involvement in male courtship behavior.
Collapse
|
126
|
Datta S, Prado R, Rodríguez A, Escalante AA. Characterizing molecular adaptation: a hierarchical approach to assess the selective influence of amino acid properties. ACTA ACUST UNITED AC 2010; 26:2818-25. [PMID: 20847216 DOI: 10.1093/bioinformatics/btq532] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION A number of methods for detecting positive selection in protein coding DNA sequences are based on whether each site/region has a non-synonymous to synonymous substitution rates ratio ω greater than one. However, a site/region may show a relatively large number of non-synonymous mutations that conserve a particular property. Recent methods have proposed to consider as evidence for molecular adaptations how conserving, or radically different, non-synonymous mutations are with respect to some key amino acid properties. While such methods have been useful in providing a qualitative assessment of molecular adaptation, they rely on independent statistical analyses for each amino acid property and typically do not properly adjust for multiple comparisons when selection needs to be assessed at several sites. RESULTS We consider a Bayesian hierarchical model that allows us to jointly determine if a set of amino acid properties are being conserved or radically changed while simultaneously adjusting for multiple comparisons at the codon level. We illustrate how this model can be used to characterize molecular adaptation in two datasets: an alignment from six class I alleles of the human major histocompatibility complex and a sperm lysin alignment from 25 abalone species. We compare the results obtained with the proposed hierarchical models to those obtained with alternative methods. Our analyses indicate that a more complete quantitative and qualitative characterization of molecular adaptation is achieved by taking into account changes in amino acid properties. AVAILABILITY The R code for implementing the hierarchical models is freely available at http://www.ams.ucsc.edu/∼raquel/software/.
Collapse
Affiliation(s)
- Saheli Datta
- Department of Applied Mathematics and Statistics, University of California Santa Cruz, CA, USA
| | | | | | | |
Collapse
|
127
|
Forbi JC, Vaughan G, Purdy MA, Campo DS, Xia GL, Ganova-Raeva LM, Ramachandran S, Thai H, Khudyakov YE. Epidemic history and evolutionary dynamics of hepatitis B virus infection in two remote communities in rural Nigeria. PLoS One 2010; 5:e11615. [PMID: 20657838 PMCID: PMC2906510 DOI: 10.1371/journal.pone.0011615] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2010] [Accepted: 06/14/2010] [Indexed: 02/08/2023] Open
Abstract
Background In Nigeria, hepatitis B virus (HBV) infection has reached hyperendemic levels and its nature and origin have been described as a puzzle. In this study, we investigated the molecular epidemiology and epidemic history of HBV infection in two semi-isolated rural communities in North/Central Nigeria. It was expected that only a few, if any, HBV strains could have been introduced and effectively transmitted among these residents, reflecting limited contacts of these communities with the general population in the country. Methods and Findings Despite remoteness and isolation, ∼11% of the entire population in these communities was HBV-DNA seropositive. Analyses of the S-gene sequences obtained from 55 HBV-seropositive individuals showed the circulation of 37 distinct HBV variants. These HBV isolates belong predominantly to genotype E (HBV/E) (n = 53, 96.4%), with only 2 classified as sub-genotype A3 (HBV/A3). Phylogenetic analysis showed extensive intermixing between HBV/E variants identified in these communities and different countries in Africa. Quasispecies analysis of 22 HBV/E strains using end-point limiting-dilution real-time PCR, sequencing and median joining networks showed extensive intra-host heterogeneity and inter-host variant sharing. To investigate events that resulted in such remarkable HBV/E diversity, HBV full-size genome sequences were obtained from 47 HBV/E infected persons and P gene was subjected to Bayesian coalescent analysis. The time to the most recent common ancestor (tMRCA) for these HBV/E variants was estimated to be year 1952 (95% highest posterior density (95% HPD): 1927–1970). Using additional HBV/E sequences from other African countries, the tMRCA was estimated to be year 1948 (95% HPD: 1924–1966), indicating that HBV/E in these remote communities has a similar time of origin with multiple HBV/E variants broadly circulating in West/Central Africa. Phylogenetic analysis and statistical neutrality tests suggested rapid HBV/E population expansion. Additionally, skyline plot analysis showed an increase in the size of the HBV/E-infected population over the last ∼30–40 years. Conclusions Our data suggest a massive introduction and relatively recent HBV/E expansion in the human population in Africa. Collectively, these data show a significant shift in the HBV/E epidemic dynamics in Africa over the last century.
Collapse
Affiliation(s)
- Joseph C Forbi
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America.
| | | | | | | | | | | | | | | | | |
Collapse
|
128
|
Epidemic history and evolutionary dynamics of hepatitis B virus infection in two remote communities in rural Nigeria. PLoS One 2010. [PMID: 20657838 DOI: 10.1371/journal.pone.0011615.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND In Nigeria, hepatitis B virus (HBV) infection has reached hyperendemic levels and its nature and origin have been described as a puzzle. In this study, we investigated the molecular epidemiology and epidemic history of HBV infection in two semi-isolated rural communities in North/Central Nigeria. It was expected that only a few, if any, HBV strains could have been introduced and effectively transmitted among these residents, reflecting limited contacts of these communities with the general population in the country. METHODS AND FINDINGS Despite remoteness and isolation, approximately 11% of the entire population in these communities was HBV-DNA seropositive. Analyses of the S-gene sequences obtained from 55 HBV-seropositive individuals showed the circulation of 37 distinct HBV variants. These HBV isolates belong predominantly to genotype E (HBV/E) (n=53, 96.4%), with only 2 classified as sub-genotype A3 (HBV/A3). Phylogenetic analysis showed extensive intermixing between HBV/E variants identified in these communities and different countries in Africa. Quasispecies analysis of 22 HBV/E strains using end-point limiting-dilution real-time PCR, sequencing and median joining networks showed extensive intra-host heterogeneity and inter-host variant sharing. To investigate events that resulted in such remarkable HBV/E diversity, HBV full-size genome sequences were obtained from 47 HBV/E infected persons and P gene was subjected to Bayesian coalescent analysis. The time to the most recent common ancestor (tMRCA) for these HBV/E variants was estimated to be year 1952 (95% highest posterior density (95% HPD): 1927-1970). Using additional HBV/E sequences from other African countries, the tMRCA was estimated to be year 1948 (95% HPD: 1924-1966), indicating that HBV/E in these remote communities has a similar time of origin with multiple HBV/E variants broadly circulating in West/Central Africa. Phylogenetic analysis and statistical neutrality tests suggested rapid HBV/E population expansion. Additionally, skyline plot analysis showed an increase in the size of the HBV/E-infected population over the last approximately 30-40 years. CONCLUSIONS Our data suggest a massive introduction and relatively recent HBV/E expansion in the human population in Africa. Collectively, these data show a significant shift in the HBV/E epidemic dynamics in Africa over the last century.
Collapse
|
129
|
Gómez D, Conejeros P, Marshall SH, Consuegra S. MHC evolution in three salmonid species: a comparison between class II alpha and beta genes. Immunogenetics 2010; 62:531-42. [PMID: 20521040 DOI: 10.1007/s00251-010-0456-x] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Accepted: 05/15/2010] [Indexed: 11/26/2022]
Abstract
The genes of the major histocompatibility complex (MHC) are amongst the most variable in vertebrates and represent some of the best candidates to study processes of adaptive evolution. However, despite the number of studies available, most of the information on the structure and function of these genes come from studies in mammals and birds in which the MHC class I and II genes are tightly linked and class II alpha exhibits low variability in many cases. Teleost fishes are among the most primitive vertebrates with MHC and represent good organisms for the study of MHC evolution because their class I and class II loci are not physically linked, allowing for independent evolution of both classes of genes. We have compared the diversity and molecular mechanisms of evolution of classical MH class II alpha and class II beta loci in farm populations of three salmonid species: Oncorhynchus kisutch, Oncorhynchus mykiss and Salmo salar. We found single classical class II loci and high polymorphism at both class II alpha and beta genes in the three species. Mechanisms of evolution were common for both class II genes, with recombination and point mutation involved in generating diversity and positive selection acting on the peptide-binding residues. These results suggest that the maintenance of variability at the class IIalpha gene could be a mechanism to increase diversity in the MHC class II in salmonids in order to compensate for the expression of one single classical locus and to respond to a wider array of parasites.
Collapse
Affiliation(s)
- Daniela Gómez
- Instituto de Biología, Facultad de Ciencias Básicas y Matemáticas, Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
| | | | | | | |
Collapse
|
130
|
Prokupek AM, Eyun SI, Ko L, Moriyama EN, Harshman LG. Molecular evolutionary analysis of seminal receptacle sperm storage organ genes of Drosophila melanogaster. J Evol Biol 2010; 23:1386-98. [PMID: 20500366 DOI: 10.1111/j.1420-9101.2010.01998.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Sperm storage organs are common and broadly distributed among animal taxa. However, little is known about how these organs function at the molecular level. Additionally, there is a paucity of knowledge about the evolution of genes expressed in these organs. This investigation is an evolutionary expressed sequence tag (EST) study of genes expressed in the seminal receptacle, one of the sperm storage organs in Drosophila. The incidence of positive selection is higher for the seminal receptacle genes than Drosophila reproductive genes as a whole, but lower than genes associated with the spermatheca, a second type of Drosophila sperm storage organ. By identifying overrepresented classes of proteins and classes for which sperm storage function is suggested by the nature of the proteins, candidate genes were discovered. These candidates belong to protein classes such as muscle contraction, odorant binding and odorant receptor, protease inhibitor and immunity.
Collapse
Affiliation(s)
- A M Prokupek
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, NE 68588, USA
| | | | | | | | | |
Collapse
|
131
|
Torres-Morquecho A, Giono-Cerezo S, Camorlinga-Ponce M, Vargas-Mendoza CF, Torres J. Evolution of bacterial genes: evidences of positive Darwinian selection and fixation of base substitutions in virulence genes of Helicobacter pylori. INFECTION GENETICS AND EVOLUTION 2010; 10:764-76. [PMID: 20434592 DOI: 10.1016/j.meegid.2010.04.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2009] [Revised: 04/15/2010] [Accepted: 04/18/2010] [Indexed: 01/30/2023]
Abstract
Gene diversity in Helicobacter pylori from different origins results in a phylogeographic differentiation, and this genetic variation among populations might be driven by random drift or by selective forces. However, only the selective forces would contribute to adaptation of the bacteria to the physiology and environment of its local host and to its association with gastroduodenal diseases. We studied evolutionary forces acting on variable regions of virulence genes cagA, babA and oipA, which present geographic differences among H. pylori strains from different human groups. Gene sequences in H. pylori strains from Asia, Europe and America were analysed using state of the art analytical methods like the Maximum Likelihood method. The rate and nature of polymorphisms in these virulence genes were also compared among populations using the AMOVA and McDonald-Kreitman tests. We found strong and significant positive selection acting on variable regions of cagA, babA and oipA. We found in cagA from Asian strains regions under positive selection, which localised in amino acid sites defining the Asian fingerprint for this gene and in sites with important biological activity. Different evolutionary forces are acting on the variable region of virulence genes; they partly explain the source of genetic diversity and the differences in risk for gastroduodenal diseases among different human populations.
Collapse
Affiliation(s)
- Araceli Torres-Morquecho
- Infectious Diseases Research Unit, Hospital de Pediatría, Instituto Mexicano del Seguro Social, Mexico
| | | | | | | | | |
Collapse
|
132
|
Lau C, Dölle C, Gossmann TI, Agledal L, Niere M, Ziegler M. Isoform-specific targeting and interaction domains in human nicotinamide mononucleotide adenylyltransferases. J Biol Chem 2010; 285:18868-76. [PMID: 20388704 DOI: 10.1074/jbc.m110.107631] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Several important signaling pathways require NAD as substrate, thereby leading to significant consumption of the molecule. Because NAD is also an essential redox carrier, its continuous resynthesis is vital. In higher eukaryotes, maintenance of compartmentalized NAD pools is critical, but so far rather little is known about the regulation and subcellular distribution of NAD biosynthetic enzymes. The key step in NAD biosynthesis is the formation of the dinucleotide by nicotinamide/nicotinic acid mononucleotide adenylyltransferases (NMNATs). The three human isoforms were localized to the nucleus, the Golgi complex, and mitochondria. Here, we show that their genes contain unique exons that encode isoform-specific domains to mediate subcellular targeting and post-translational modifications. These domains are dispensable for catalytic activity, consistent with their absence from NMNATs of lower organisms. We further demonstrate that the Golgi-associated NMNAT is palmitoylated at two adjacent cysteine residues of its isoform-specific domain and thereby anchored at the cytoplasmic surface, a potential mechanism to regulate the cytosolic NAD pool. Insertion of unique domains thus provides a yet unrecognized enzyme targeting mode, which has also been adapted to modulate subcellular NAD supply.
Collapse
Affiliation(s)
- Corinna Lau
- Department of Molecular Biology, University of Bergen, N-5008 Bergen, Norway
| | | | | | | | | | | |
Collapse
|
133
|
Findlay GD, Swanson WJ. Proteomics enhances evolutionary and functional analysis of reproductive proteins. Bioessays 2010; 32:26-36. [PMID: 20020477 DOI: 10.1002/bies.200900127] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Reproductive proteins maintain species-specific barriers to fertilization, affect the outcome of sperm competition, mediate reproductive conflicts between the sexes, and potentially contribute to the formation of new species. However, the specific proteins and molecular mechanisms that underlie these processes are understood in only a handful of cases. Advances in genomic and proteomic technologies enable the identification of large suites of reproductive proteins, making it possible to dissect reproductive phenotypes at the molecular level. We first review these technological advances and describe how reproductive proteins are identified in diverse animal taxa. We then discuss the dynamic evolution of reproductive proteins and the potential selective forces that act on them. Finally, we describe molecular and genomic tools for functional analysis and detail how evolutionary data may be used to make predictions about interactions among reproductive proteins.
Collapse
Affiliation(s)
- Geoffrey D Findlay
- Department of Genome Sciences, University of Washington, Seattle, 98195-5065, USA.
| | | |
Collapse
|
134
|
Axelsson E, Albrechtsen A, van AP, Li L, Megens HJ, Vereijken ALJ, Crooijmans RPMA, Groenen MAM, Ellegren H, Willerslev E, Nielsen R. Segregation distortion in chicken and the evolutionary consequences of female meiotic drive in birds. Heredity (Edinb) 2010; 105:290-8. [DOI: 10.1038/hdy.2009.193] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
|
135
|
Elgoyhen AB, Franchini LF. Prestin and the cholinergic receptor of hair cells: positively-selected proteins in mammals. Hear Res 2010; 273:100-8. [PMID: 20056140 DOI: 10.1016/j.heares.2009.12.028] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/05/2009] [Revised: 12/28/2009] [Accepted: 12/29/2009] [Indexed: 11/26/2022]
Abstract
The hair cells of the vertebrate inner ear posses active mechanical processes to amplify their inputs. The stereocilia bundle of various vertebrate animals can produce active movements. Though standard stereocilia-based mechanisms to promote amplification persist in mammals, an additional radically different mechanism evolved: the so-called somatic electromotility which refers to the elongation/contraction of the outer hair cells' (OHC) cylindrical cell body in response to membrane voltage changes. Somatic electromotility in OHCs, as the basis for cochlear amplification, is a mammalian novelty and it is largely dependent upon the properties of the unique motor protein prestin. We review recent literature which has demonstrated that although the gene encoding prestin is present in all vertebrate species, mammalian prestin has been under positive selective pressure to acquire motor properties, probably rendering it fit to serve somatic motility in outer hair cells. Moreover, we discuss data which indicates that a modified α10 nicotinic cholinergic receptor subunit has co-evolved in mammals, most likely to give the auditory feedback system the capability to control somatic electromotility.
Collapse
Affiliation(s)
- Ana Belén Elgoyhen
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires 1428, Argentina.
| | | |
Collapse
|
136
|
Lu H, Lin L, Sato S, Xing Y, Lee CJ. Predicting functional alternative splicing by measuring RNA selection pressure from multigenome alignments. PLoS Comput Biol 2009; 5:e1000608. [PMID: 20019791 PMCID: PMC2784930 DOI: 10.1371/journal.pcbi.1000608] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 11/12/2009] [Indexed: 12/31/2022] Open
Abstract
High-throughput methods such as EST sequencing, microarrays and deep sequencing have identified large numbers of alternative splicing (AS) events, but studies have shown that only a subset of these may be functional. Here we report a sensitive bioinformatics approach that identifies exons with evidence of a strong RNA selection pressure ratio (RSPR)--i.e., evolutionary selection against mutations that change only the mRNA sequence while leaving the protein sequence unchanged--measured across an entire evolutionary family, which greatly amplifies its predictive power. Using the UCSC 28 vertebrate genome alignment, this approach correctly predicted half to three-quarters of AS exons that are known binding targets of the NOVA splicing regulatory factor, and predicted 345 strongly selected alternative splicing events in human, and 262 in mouse. These predictions were strongly validated by several experimental criteria of functional AS such as independent detection of the same AS event in other species, reading frame-preservation, and experimental evidence of tissue-specific regulation: 75% (15/20) of a sample of high-RSPR exons displayed tissue specific regulation in a panel of ten tissues, vs. only 20% (4/20) among a sample of low-RSPR exons. These data suggest that RSPR can identify exons with functionally important splicing regulation, and provides biologists with a dataset of over 600 such exons. We present several case studies, including both well-studied examples (GRIN1) and novel examples (EXOC7). These data also show that RSPR strongly outperforms other approaches such as standard sequence conservation (which fails to distinguish amino acid selection pressure from RNA selection pressure), or pairwise genome comparison (which lacks adequate statistical power for predicting individual exons).
Collapse
Affiliation(s)
- Hongchao Lu
- Molecular Biology Institute, Center for Computational Biology, Institute for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
| | - Lan Lin
- Department of Internal Medicine, University of Iowa, Iowa City, Iowa, United States of America
| | - Seiko Sato
- Department of Internal Medicine, University of Iowa, Iowa City, Iowa, United States of America
| | - Yi Xing
- Department of Internal Medicine, University of Iowa, Iowa City, Iowa, United States of America
- Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa, United States of America
| | - Christopher J. Lee
- Molecular Biology Institute, Center for Computational Biology, Institute for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
137
|
Carter R, Drouin G. Structural differentiation of the three eukaryotic RNA polymerases. Genomics 2009; 94:388-96. [DOI: 10.1016/j.ygeno.2009.08.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2009] [Revised: 08/18/2009] [Accepted: 08/22/2009] [Indexed: 12/27/2022]
|
138
|
Jiang X, Fares MA. Identifying coevolutionary patterns in human leukocyte antigen (HLA) molecules. Evolution 2009; 64:1429-45. [PMID: 19930454 DOI: 10.1111/j.1558-5646.2009.00903.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The antigenic peptide, major histocompatibility complex molecule (MHC; also called human leukocyte antigen, HLA), coreceptor CD8, or CD4 and T-cell receptor (TCR) function as a complex to initiate effectors' mechanisms of the immune system. The tight functional and physical interaction among these molecules may have involved strong coevolution links among domains within and between proteins. Despite the importance of unraveling such dependencies to understand the arms race of host-pathogen interaction, no previous studies have aimed at achieving such an objective. Here, we perform an exhaustive coevolution analysis and show that indeed such dependencies are strongly shaping the evolution and probably the function of these molecules. We identify intramolecular coevolution in HLA class I and II at domains important for their immune activity. Most of the amino acid sites identified to be coevolving in HLAI have been also detected to undergo positive Darwinian selection highlighting therefore their adaptive value. We also identify coevolution among antigen-binding pockets (P1-P9) and among these and TCR-binding sites. Conversely to HLAI, coevolution is weaker in HLAII. Our results support that such coevolutionary patterns are due to selective pressures of host-pathogen coevolution and cooperative binding of TCRs, antigenic peptides, and CD8/CD4 to HLAI and HLAII.
Collapse
Affiliation(s)
- Xiaowei Jiang
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin 2, Ireland.
| | | |
Collapse
|
139
|
Pond SLK, Scheffler K, Gravenor MB, Poon AFY, Frost SDW. Evolutionary fingerprinting of genes. Mol Biol Evol 2009; 27:520-36. [PMID: 19864470 DOI: 10.1093/molbev/msp260] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Over time, natural selection molds every gene into a unique mosaic of sites evolving rapidly or resisting change-an "evolutionary fingerprint" of the gene. Aspects of this evolutionary fingerprint, such as the site-specific ratio of nonsynonymous to synonymous substitution rates (dN/dS), are commonly used to identify genetic features of potential biological interest; however, no framework exists for comparing evolutionary fingerprints between genes. We hypothesize that protein-coding genes with similar protein structure and/or function tend to have similar evolutionary fingerprints and that comparing evolutionary fingerprints can be useful for discovering similarities between genes in a way that is analogous to, but independent of, discovery of similarity via sequence-based comparison tools such as Blast. To test this hypothesis, we develop a novel model of coding sequence evolution that uses a general bivariate discrete parameterization of the evolutionary rates. We show that this approach provides a better fit to the data using a smaller number of parameters than existing models. Next, we use the model to represent evolutionary fingerprints as probability distributions and present a methodology for comparing these distributions in a way that is robust against variations in data set size and divergence. Finally, using sequences of three rapidly evolving RNA viruses (HIV-1, hepatitis C virus, and influenza A virus), we demonstrate that genes within the same functional group tend to have similar evolutionary fingerprints. Our framework provides a sound statistical foundation for efficient inference and comparison of evolutionary rate patterns in arbitrary collections of gene alignments, clustering homologous and nonhomologous genes, and investigation of biological and functional correlates of evolutionary rates.
Collapse
|
140
|
Chaves LD, Krueth SB, Reed KM. Defining the turkey MHC: sequence and genes of the B locus. THE JOURNAL OF IMMUNOLOGY 2009; 183:6530-7. [PMID: 19864609 DOI: 10.4049/jimmunol.0901310] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The MHC, the most polymorphic and gene dense region in the vertebrate genome, contains many loci essential to immunity. In mammals, this region spans approximately 4 Mb. Studies of avian species have found the MHC to be greatly reduced in size and gene content with an overall locus organization differing from that of mammals. The chicken MHC has been mapped to two distinct regions (MHC-B and -Y) of a single chromosome. MHC-B haplotypes possess tightly linked genes encoding the classical MHC molecules and few other disease resistance genes. Furthermore, chicken haplotypes possess a dominantly expressed class I and class II B locus that have a significant effect on the progression or regression of pathogenic disease. In this study, we present the MHC-B region of the turkey (Meleagris gallopavo) as a similarly constricted locus, with 34 genes identified within a 0.2-Mb region in near-perfect synteny with that of the chicken MHC-B. Notable differences between the two species are three BG and class II B loci in the turkey compared with one BG and two class II B loci in the chicken MHC-B. The relative size and high level of similarity of the turkey MHC in relation to that of the chicken suggest that similar associations with disease susceptibility and resistance may also be found in turkey.
Collapse
Affiliation(s)
- Lee D Chaves
- Department of Veterinary and Biomedical Sciences, College of Veterinary Medicine, University of Minnesota, St. Paul, MN 55108, USA.
| | | | | |
Collapse
|
141
|
Layeghifard M, Pirhaji L, Rabani R. Adaptive evolution in thePergene family of vertebrates: neofunctionalization by positive Darwinian selection after two major gene duplications. BIOL RHYTHM RES 2009. [DOI: 10.1080/09291010802553733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
142
|
Abstract
The evolution of protein function appears to involve alternating periods of conservative evolution and of relatively rapid change. Evidence for such episodic evolution, consistent with some theoretical expectations, comes from the application of increasingly sophisticated models of evolution to large sequence datasets. We present here some of the recent methods to detect functional shifts, using amino acid or codon models. Both provide evidence for punctual shifts in patterns of amino acid conservation, including the fixation of key changes by positive selection. Although a link to gene duplication, a presumed source of functional changes, has been difficult to establish, this episodic model appears to apply to a wide variety of proteins and organisms.
Collapse
|
143
|
Janzac B, Fabre F, Palloix A, Moury B. Constraints on evolution of virus avirulence factors predict the durability of corresponding plant resistances. MOLECULAR PLANT PATHOLOGY 2009; 10:599-610. [PMID: 19694951 PMCID: PMC6640373 DOI: 10.1111/j.1364-3703.2009.00554.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
SUMMARY Understanding the factors driving pathogen emergence and re-emergence is a major challenge, particularly in agriculture, where the use of resistant plant cultivars imposes strong selective pressures on plant pathogen populations and leads frequently to 'resistance breakdown'. Presently, durable resistances are only identified after a long period of large-scale cultivation of resistant cultivars. We propose a new predictor of the durability of plant resistance. Because resistance breakdown involves modifications in the avirulence factors of pathogens, we tested for correlations between the evolutionary constraints acting on avirulence factors or their diversity and the durability of the corresponding resistance genes in the case of plant-virus interactions. An analysis performed on 20 virus species-resistance gene combinations revealed that the selective constraints applied on amino acid substitutions in virus avirulence factors correlate with the observed durability of the corresponding resistance genes. On the basis of this result, a model predicting the potential durability of resistance genes as a function of the selective constraints applied on the corresponding avirulence factors is proposed to help breeders to select the most durable resistance genes.
Collapse
Affiliation(s)
- Berenger Janzac
- INRA, UR407 Pathologie Végétale, Domaine Saint Maurice, BP94, F-84140 Montfavet, France.
| | | | | | | |
Collapse
|
144
|
Xia Z, Jin G, Zhu J, Zhou R. Using a mutual information-based site transition network to map the genetic evolution of influenza A/H3N2 virus. ACTA ACUST UNITED AC 2009; 25:2309-17. [PMID: 19706746 DOI: 10.1093/bioinformatics/btp423] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Mapping the antigenic and genetic evolution pathways of influenza A is of critical importance in the vaccine development and drug design of influenza virus. In this article, we have analyzed more than 4000 A/H3N2 hemagglutinin (HA) sequences from 1968 to 2008 to model the evolutionary path of the influenza virus, which allows us to predict its future potential drifts with specific mutations. RESULTS The mutual information (MI) method was used to design a site transition network (STN) for each amino acid site in the A/H3N2 HA sequence. The STN network indicates that most of the dynamic interactions are positioned around the epitopes and the receptor binding domain regions, with strong preferences in both the mutation sites and amino acid types being mutated to. The network also shows that antigenic changes accumulate over time, with occasional large changes due to multiple co-occurring mutations at antigenic sites. Furthermore, the cluster analysis by subdividing the STN into several subnetworks reveals a more detailed view about the features of the antigenic change: the characteristic inner sites and the connecting inter-subnetwork sites are both responsible for the drifts. A novel five-step prediction algorithm based on the STN shows a reasonable accuracy in reproducing historical HA mutations. For example, our method can reproduce the 2003-2004 A/H3N2 mutations with approximately 70% accuracy. The method also predicts seven possible mutations for the next antigenic drift in the coming 2009-2010 season. The STN approach also agrees well with the phylogenetic tree and antigenic maps based on HA inhibition assays. AVAILABILITY All code and data are available at http://ibi.zju.edu.cn/birdflu/.
Collapse
Affiliation(s)
- Zhen Xia
- Institute of Bioinformatics, Zhejiang University, Hangzhou, PR China
| | | | | | | |
Collapse
|
145
|
Selection at the MHC class IIB locus across guppy (Poecilia reticulata) populations. Heredity (Edinb) 2009; 104:155-67. [DOI: 10.1038/hdy.2009.99] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
|
146
|
Farfán M, Miñana-Galbis D, Fusté MC, Lorén JG. Divergent evolution and purifying selection of the flaA gene sequences in Aeromonas. Biol Direct 2009; 4:23. [PMID: 19622168 PMCID: PMC2724415 DOI: 10.1186/1745-6150-4-23] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2009] [Accepted: 07/21/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The bacterial flagellum is the most important organelle of motility in bacteria and plays a key role in many bacterial lifestyles, including virulence. The flagellum also provides a paradigm of how hierarchical gene regulation, intricate protein-protein interactions and controlled protein secretion can result in the assembly of a complex multi-protein structure tightly orchestrated in time and space. As if to stress its importance, plants and animals produce receptors specifically dedicated to the recognition of flagella. Aside from motility, the flagellum also moonlights as an adhesion and has been adapted by humans as a tool for peptide display. Flagellar sequence variation constitutes a marker with widespread potential uses for studies of population genetics and phylogeny of bacterial species. RESULTS We sequenced the complete flagellin gene (flaA) in 18 different species and subspecies of Aeromonas. Sequences ranged in size from 870 (A. allosaccharophila) to 921 nucleotides (A. popoffii). The multiple alignment displayed 924 sites, 66 of which presented alignment gaps. The phylogenetic tree revealed the existence of two groups of species exhibiting different FlaA flagellins (FlaA1 and FlaA2). Maximum likelihood models of codon substitution were used to analyze flaA sequences. Likelihood ratio tests suggested a low variation in selective pressure among lineages, with an omega ratio of less than 1 indicating the presence of purifying selection in almost all cases. Only one site under potential diversifying selection was identified (isoleucine in position 179). However, 17 amino acid positions were inferred as sites that are likely to be under positive selection using the branch-site model. Ancestral reconstruction revealed that these 17 amino acids were among the amino acid changes detected in the ancestral sequence. CONCLUSION The models applied to our set of sequences allowed us to determine the possible evolutionary pathway followed by the flaA gene in Aeromonas, suggesting that this gene have probably been evolving independently in the two groups of Aeromonas species since the divergence of a distant common ancestor after one or several episodes of positive selection. REVIEWERS This article was reviewed by Alexey Kondrashov, John Logsdon and Olivier Tenaillon (nominated by Laurence D Hurst).
Collapse
Affiliation(s)
- Maribel Farfán
- Departament de Microbiologia i Parasitologia Sanitàries, Facultat de Farmàcia, Universitat de Barcelona, Av. Joan XXIII s/n, 08028 Barcelona, Spain.
| | | | | | | |
Collapse
|
147
|
Clark NL, Gasper J, Sekino M, Springer SA, Aquadro CF, Swanson WJ. Coevolution of interacting fertilization proteins. PLoS Genet 2009; 5:e1000570. [PMID: 19629160 PMCID: PMC2704960 DOI: 10.1371/journal.pgen.1000570] [Citation(s) in RCA: 105] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 06/23/2009] [Indexed: 01/01/2023] Open
Abstract
Reproductive proteins are among the fastest evolving in the proteome, often due to the consequences of positive selection, and their rapid evolution is frequently attributed to a coevolutionary process between interacting female and male proteins. Such a process could leave characteristic signatures at coevolving genes. One signature of coevolution, predicted by sexual selection theory, is an association of alleles between the two genes. Another predicted signature is a correlation of evolutionary rates during divergence due to compensatory evolution. We studied female-male coevolution in the abalone by resequencing sperm lysin and its interacting egg coat protein, VERL, in populations of two species. As predicted, we found intergenic linkage disequilibrium between lysin and VERL, despite our demonstration that they are not physically linked. This finding supports a central prediction of sexual selection using actual genotypes, that of an association between a male trait and its female preference locus. We also created a novel likelihood method to show that lysin and VERL have experienced correlated rates of evolution. These two signatures of coevolution can provide statistical rigor to hypotheses of coevolution and could be exploited for identifying coevolving proteins a priori. We also present polymorphism-based evidence for positive selection and implicate recent selective events at the specific structural regions of lysin and VERL responsible for their species-specific interaction. Finally, we observed deep subdivision between VERL alleles in one species, which matches a theoretical prediction of sexual conflict. Thus, abalone fertilization proteins illustrate how coevolution can lead to reproductive barriers and potentially drive speciation.
Collapse
Affiliation(s)
- Nathaniel L. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Joe Gasper
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Masashi Sekino
- Tohoku National Fisheries Research Institute, Fisheries Research Agency, Shiogama, Miyagi, Japan
| | - Stevan A. Springer
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Charles F. Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Willie J. Swanson
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
148
|
Miller PJ, Kim LM, Ip HS, Afonso CL. Evolutionary dynamics of Newcastle disease virus. Virology 2009; 391:64-72. [PMID: 19564032 DOI: 10.1016/j.virol.2009.05.033] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2009] [Revised: 03/27/2009] [Accepted: 05/22/2009] [Indexed: 12/26/2022]
Abstract
A comprehensive dataset of NDV genome sequences was evaluated using bioinformatics to characterize the evolutionary forces affecting NDV genomes. Despite evidence of recombination in most genes, only one event in the fusion gene of genotype V viruses produced evolutionarily viable progenies. The codon-associated rate of change for the six NDV proteins revealed that the highest rate of change occurred at the fusion protein. All proteins were under strong purifying (negative) selection; the fusion protein displayed the highest number of amino acids under positive selection. Regardless of the phylogenetic grouping or the level of virulence, the cleavage site motif was highly conserved implying that mutations at this site that result in changes of virulence may not be favored. The coding sequence of the fusion gene and the genomes of viruses from wild birds displayed higher yearly rates of change in virulent viruses than in viruses of low virulence, suggesting that an increase in virulence may accelerate the rate of NDV evolution.
Collapse
Affiliation(s)
- Patti J Miller
- Southeast Poultry Research Laboratories, USDA ARS, Southeast Poultry Research Laboratory, Athens, GA 30605, USA
| | | | | | | |
Collapse
|
149
|
Polymorphism and selection in the major histocompatibility complex DRA and DQA genes in the family Equidae. Immunogenetics 2009; 61:513-27. [PMID: 19557406 DOI: 10.1007/s00251-009-0380-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2009] [Accepted: 06/05/2009] [Indexed: 12/24/2022]
Abstract
The major histocompatibility complex genes coding for antigen binding and presenting molecules are the most polymorphic genes in the vertebrate genome. We studied the DRA and DQA gene polymorphism of the family Equidae. In addition to 11 previously reported DRA and 24 DQA alleles, six new DRA sequences and 13 new DQA alleles were identified in the genus Equus. Phylogenetic analysis of both DRA and DQA sequences provided evidence for trans-species polymorphism in the family Equidae. The phylogenetic trees differed from species relationships defined by standard taxonomy of Equidae and from trees based on mitochondrial or neutral gene sequence data. Analysis of selection showed differences between the less variable DRA and more variable DQA genes. DRA alleles were more often shared by more species. The DQA sequences analysed showed strong amongst-species positive selection; the selected amino acid positions mostly corresponded to selected positions in rodent and human DQA genes.
Collapse
|
150
|
Zhang Z, Townsend JP. Maximum-likelihood model averaging to profile clustering of site types across discrete linear sequences. PLoS Comput Biol 2009; 5:e1000421. [PMID: 19557160 PMCID: PMC2695770 DOI: 10.1371/journal.pcbi.1000421] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2009] [Accepted: 05/21/2009] [Indexed: 11/19/2022] Open
Abstract
A major analytical challenge in computational biology is the detection and description of clusters of specified site types, such as polymorphic or substituted sites within DNA or protein sequences. Progress has been stymied by a lack of suitable methods to detect clusters and to estimate the extent of clustering in discrete linear sequences, particularly when there is no a priori specification of cluster size or cluster count. Here we derive and demonstrate a maximum likelihood method of hierarchical clustering. Our method incorporates a tripartite divide-and-conquer strategy that models sequence heterogeneity, delineates clusters, and yields a profile of the level of clustering associated with each site. The clustering model may be evaluated via model selection using the Akaike Information Criterion, the corrected Akaike Information Criterion, and the Bayesian Information Criterion. Furthermore, model averaging using weighted model likelihoods may be applied to incorporate model uncertainty into the profile of heterogeneity across sites. We evaluated our method by examining its performance on a number of simulated datasets as well as on empirical polymorphism data from diverse natural alleles of the Drosophila alcohol dehydrogenase gene. Our method yielded greater power for the detection of clustered sites across a breadth of parameter ranges, and achieved better accuracy and precision of estimation of clusters, than did the existing empirical cumulative distribution function statistics.
Collapse
Affiliation(s)
- Zhang Zhang
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
| | - Jeffrey P. Townsend
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|