1
|
Lucaci AG, Zehr JD, Enard D, Thornton JW, Kosakovsky Pond SL. Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses. Mol Biol Evol 2023; 40:msad150. [PMID: 37395787 PMCID: PMC10336034 DOI: 10.1093/molbev/msad150] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/15/2023] [Accepted: 06/26/2023] [Indexed: 07/04/2023] Open
Abstract
Inference and interpretation of evolutionary processes, in particular of the types and targets of natural selection affecting coding sequences, are critically influenced by the assumptions built into statistical models and tests. If certain aspects of the substitution process (even when they are not of direct interest) are presumed absent or are modeled with too crude of a simplification, estimates of key model parameters can become biased, often systematically, and lead to poor statistical performance. Previous work established that failing to accommodate multinucleotide (or multihit, MH) substitutions strongly biases dN/dS-based inference towards false-positive inferences of diversifying episodic selection, as does failing to model variation in the rate of synonymous substitution (SRV) among sites. Here, we develop an integrated analytical framework and software tools to simultaneously incorporate these sources of evolutionary complexity into selection analyses. We found that both MH and SRV are ubiquitous in empirical alignments, and incorporating them has a strong effect on whether or not positive selection is detected (1.4-fold reduction) and on the distributions of inferred evolutionary rates. With simulation studies, we show that this effect is not attributable to reduced statistical power caused by using a more complex model. After a detailed examination of 21 benchmark alignments and a new high-resolution analysis showing which parts of the alignment provide support for positive selection, we show that MH substitutions occurring along shorter branches in the tree explain a significant fraction of discrepant results in selection detection. Our results add to the growing body of literature which examines decades-old modeling assumptions (including MH) and finds them to be problematic for comparative genomic data analysis. Because multinucleotide substitutions have a significant impact on natural selection detection even at the level of an entire gene, we recommend that selection analyses of this type consider their inclusion as a matter of routine. To facilitate this procedure, we developed, implemented, and benchmarked a simple and well-performing model testing selection detection framework able to screen an alignment for positive selection with two biologically important confounding processes: site-to-site synonymous rate variation, and multinucleotide instantaneous substitutions.
Collapse
Affiliation(s)
- Alexander G Lucaci
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
| | - Jordan D Zehr
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona
| | - Joseph W Thornton
- Department of Human Genetics, University of Chicago, Chicago, Illinois
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois
| | | |
Collapse
|
2
|
Ahmad MZ, Shah Z, Ullah A, Ahmed S, Ahmad B, Khan A. Genome wide and evolutionary analysis of heat shock protein 70 proteins in tomato and their role in response to heat and drought stress. Mol Biol Rep 2022; 49:11229-11241. [PMID: 35788950 DOI: 10.1007/s11033-022-07734-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 06/22/2022] [Indexed: 11/24/2022]
Abstract
Heat shock protein 70 (HSP70) proteins play a crucial role in mitigating the detrimental effects of abiotic stresses in plants. In the present study, 21 full length non-redundant SlHSP70 genes were detected and characterized in tomato (Solanum lycopersicum L.). The SlHSP70 genes were classified into four groups based on phylogenetic analysis. Similarities were observed in gene features and motif structures of SlHSP70s belonging to the same group. SlHSP70 genes were unevenly and unequally mapped on 11 chromosomes. Segmental and tandem duplication are the main events that have contributed to the expansion of the SlHSP70 genes. A large number of groups and sub-groups were generated during comparative analysis of HSP70 genes in multiple plant species including tomato. These findings indicated a common ancestor which created diverse sub-groups prior to a mono-dicot split. The selection pressure on specific codons was identified through a maximum-likelihood approach and we found some important coding sites in the coding region of all groups. Diversifying positive selection was indirectly associated with evolutionary changes in SlHSP70 proteins and suggests that gene evolution modulated the tomato domestication event. In addition, expression analysis using RNA-seq revealed that 21 SlHSP70 genes were differentially expressed in response to drought and heat stress. SlHSP70-5 was down-regulated by heat treatment and up-regulated by drought stress. Furthermore, the expression of some of the duplicate genes was partially redundant, while others showed functional diversity. Our results indicate the diverse role of HSP70 gene family in S. lycopersicum under drought and heat stress conditions and open the gate for further investigation of HSP70 gene family functions, especially under drought and heat stress.
Collapse
Affiliation(s)
- Muhammad Zulfiqar Ahmad
- Department of Plant Breeding and Genetics, Faculty of Agriculture, University of Agriculture, D.I. Khan, Pakistan.
| | - Zamarud Shah
- Department of Biotechnology, University of Science and Technology, Bannu, Pakistan
| | - Arif Ullah
- Department of Biotechnology, University of Science and Technology, Bannu, Pakistan
| | - Shakeel Ahmed
- Institute de Farmacia, Facultad de Ciencias, Universidad Austral de Chile, Campus Isla Teja, 5090000, Valdivia, Chile
| | - Bushra Ahmad
- Department of Biochemistry, Shaheed Benazir Bhutto Women University, Peshawar, Pakistan
| | - Afrasyab Khan
- Department of Biotechnology, University of Science and Technology, Bannu, Pakistan
| |
Collapse
|
3
|
Kabir N, Lin H, Kong X, Liu L, Qanmber G, Wang Y, Zhang L, Sun Z, Yang Z, Yu Y, Zhao N. Identification, evolutionary analysis and functional diversification of RAV gene family in cotton (G. hirsutum L.). PLANTA 2021; 255:14. [PMID: 34862931 DOI: 10.1007/s00425-021-03782-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 11/08/2021] [Indexed: 06/13/2023]
Abstract
Genome wide analysis, expression pattern analysis, and functional characterization of RAV genes highlight their roles in roots, stem development and hormonal response. RAV (Related to ABI3 and VP1) gene family members have been involved in tissues/organs growth and hormone signaling in various plant species. Here, we identified 247 RAVs from 12 different species with 33 RAV genes from G. hirsutum. Phylogenetic analysis classified RAV genes into four distinct groups. Analysis of gene structure showed that most GhRAVs lack introns. Motif distribution pattern and protein sequence logos indicated that GhRAV genes were highly conserved during the process of evolution. Promotor cis-acting elements revealed that promotor regions of GhRAV genes encode numerous elements related to plant growth, abiotic stresses and phytohormones. Chromosomal location information showed uneven distribution of 33 GhRAV genes on different chromosomes. Collinearity analysis identified 628 and 52 orthologous/ paralogous gene pairs in G. hirsutum and G. barbadense, respectively. Ka/Ks values indicated that GhRAV and GbRAV genes underwent strong purifying selection pressure. Selecton model and codon model selection revealed that GhRAV amino acids were under purifying selection and adaptive evolution exists among GhRAV proteins. Three dimensional structure of GhRAVs indicated the presence of numerous alpha helix and beta-barrels. Expression level revealed that some GhRAV genes exhibited high expression in roots (GhRAV3, GhRAV4, GhRAV11, GhRAV18, GhRAV20 and GhRAV30) and stem (GhRAV3 and GhRAV18), indicating their potential role in roots and stem development. GhRAV genes can be regulated by phytohormonal stresses (BL, JA and IAA). Our study provides a reference for future studies related to the functional analysis of GhRAVs in cotton.
Collapse
Affiliation(s)
- Nosheen Kabir
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Hai Lin
- Key Laboratory of China Northwestern Inland Region, Ministry of Agriculture and Rural Affairs, Cotton Research Institute of Xinjiang Academy of Agricultural and Reclamation Science, Shehezi, 832000, Xinjiang, China
| | - Xianhui Kong
- Key Laboratory of China Northwestern Inland Region, Ministry of Agriculture and Rural Affairs, Cotton Research Institute of Xinjiang Academy of Agricultural and Reclamation Science, Shehezi, 832000, Xinjiang, China
| | - Le Liu
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou, 450001, Henan, China
| | - Ghulam Qanmber
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - YuXuan Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Lian Zhang
- Key Laboratory of China Northwestern Inland Region, Ministry of Agriculture and Rural Affairs, Cotton Research Institute of Xinjiang Academy of Agricultural and Reclamation Science, Shehezi, 832000, Xinjiang, China
| | - Zhuojing Sun
- Development Center for Science and Technology, Ministry of Agriculture and Rural Affairs, Beijing, 100122, China
| | - Zuoren Yang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou, 450001, Henan, China
- Key Laboratory of China Northwestern Inland Region, Ministry of Agriculture and Rural Affairs, Cotton Research Institute of Xinjiang Academy of Agricultural and Reclamation Science, Shehezi, 832000, Xinjiang, China
| | - Yu Yu
- Key Laboratory of China Northwestern Inland Region, Ministry of Agriculture and Rural Affairs, Cotton Research Institute of Xinjiang Academy of Agricultural and Reclamation Science, Shehezi, 832000, Xinjiang, China.
| | - Na Zhao
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou, 450001, Henan, China.
| |
Collapse
|
4
|
Weak selection on synonymous codons substantially inflates dN/dS estimates in bacteria. Proc Natl Acad Sci U S A 2021; 118:2023575118. [PMID: 33972434 DOI: 10.1073/pnas.2023575118] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Synonymous codon substitutions are not always selectively neutral as revealed by several types of analyses, including studies of codon usage patterns among genes. We analyzed codon usage in 13 bacterial genomes sampled from across a large order of bacteria, Enterobacterales, and identified presumptively neutral and selected classes of synonymous substitutions. To estimate substitution rates, given a neutral/selected classification of synonymous substitutions, we developed a flexible [Formula: see text] substitution model that allows multiple classes of synonymous substitutions. Under this multiclass synonymous substitution (MSS) model, the denominator of [Formula: see text] includes only the strictly neutral class of synonymous substitutions. On average, the value of [Formula: see text] under the MSS model was 80% of that under the standard codon model in which all synonymous substitutions are assumed to be neutral. The indication is that conventional [Formula: see text] analyses overestimate these values and thus overestimate the frequency of positive diversifying selection and underestimate the strength of purifying selection. To quantify the strength of selection necessary to explain this reduction, we developed a model of selected compensatory codon substitutions. The reduction in synonymous substitution rate, and thus the contribution that selection makes to codon bias variation among genes, can be adequately explained by very weak selection, with a mean product of population size and selection coefficient, [Formula: see text].
Collapse
|
5
|
Kosakovsky Pond SL, Wisotsky SR, Escalante A, Magalis BR, Weaver S. Contrast-FEL-A Test for Differences in Selective Pressures at Individual Sites among Clades and Sets of Branches. Mol Biol Evol 2021; 38:1184-1198. [PMID: 33064823 PMCID: PMC7947784 DOI: 10.1093/molbev/msaa263] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
A number of evolutionary hypotheses can be tested by comparing selective pressures among sets of branches in a phylogenetic tree. When the question of interest is to identify specific sites within genes that may be evolving differently, a common approach is to perform separate analyses on subsets of sequences and compare parameter estimates in a post hoc fashion. This approach is statistically suboptimal and not always applicable. Here, we develop a simple extension of a popular fixed effects likelihood method in the context of codon-based evolutionary phylogenetic maximum likelihood testing, Contrast-FEL. It is suitable for identifying individual alignment sites where any among the K≥2 sets of branches in a phylogenetic tree have detectably different ω ratios, indicative of different selective regimes. Using extensive simulations, we show that Contrast-FEL delivers good power, exceeding 90% for sufficiently large differences, while maintaining tight control over false positive rates, when the model is correctly specified. We conclude by applying Contrast-FEL to data from five previously published studies spanning a diverse range of organisms and focusing on different evolutionary questions.
Collapse
Affiliation(s)
| | - Sadie R Wisotsky
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
| | - Ananias Escalante
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
| | - Brittany Rife Magalis
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
- Emerging Pathogens Institute, University of Florida, Gainesville, FL
| | - Steven Weaver
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
| |
Collapse
|
6
|
Adaptive evolution of peptidoglycan recognition protein family regulates the innate signaling against microbial pathogens in vertebrates. Microb Pathog 2020; 147:104361. [PMID: 32622926 DOI: 10.1016/j.micpath.2020.104361] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Revised: 04/28/2020] [Accepted: 06/22/2020] [Indexed: 12/16/2022]
Abstract
The innate immune system is the first line of defense in vertebrates against microbial pathogens. This defense system depends on the peptidoglycan pathogen recognition of receptors (PGRPs) existing in both invertebrates and vertebrates. Although some studies revealed the structural and functional differences between them, however, the evolutionary history and the selection pressures on these genes during adaptive evolution are poorly understood. In this study, we examined four (PGLYRP1, PGLYRP2, PGLYRP3, and PGLYRP4) genes of 127 vertebrates' species, conserved across vertebrates to evaluate positive selection pressure drives by adaptive evolution. The codons under positive selection were recognized through likelihood tests by comparing different models based on ω ratios in these genes across the vertebrate species. The positive selection test used two sets of models M1a vs. M2a and M7 vs. M8. The results showed that the test of these genes in M1a vs. M2a was not significant with the likelihood value 2ΔlnL = 0, while the likelihood ratios (2ΔlnL) were 2ΔlnL = 12.386, 2ΔlnL = 4.9283, 2ΔlnL = 24.031, and 2ΔlnL = 103.39 for PGLYRP1, PGLYRP2, PGLYRP3, and PGLYRP4 in M7 vs. M8, respectively. Our study identified the evidence of robust positive selection for these four genes across the vertebrates. These protuberant changes in PGRPs evolution of vertebrates reveal their role in innate immunity. Our study provides an insight based on PGRP genes to understand the evolution of host and pathogens interaction that leads to the progress of the novel conducts for immune diseases that include proteins linked to the recognition of pathogens.
Collapse
|
7
|
Adaptive Molecular Evolution of AKT3 Gene for Positive Diversifying Selection in Mammals. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2584627. [PMID: 32550227 PMCID: PMC7256775 DOI: 10.1155/2020/2584627] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 01/12/2020] [Accepted: 02/14/2020] [Indexed: 01/17/2023]
Abstract
The V-Akt Murine Thymoma Viral Oncogene Homolog 3 (AKT3) gene is of the serine/threonine-protein kinase family and influences the production of milk fats and cholesterol by acting on the sterol administrative area restricting protein (SREBP). The AKT3 gene is highly preserved in animals, and during lactation in cattle, its expression increases. The AKT3 gene is expressed in the digestive system, mammary gland, and immune cells. A phylogenetic investigation was performed to clarify the evolutionary role of AKT3, by maximum probability. The AKT3 gene sequence data of various mammalian species was evident even with animals undergoing breeding selection. From 39 mammalian species studied, there was a signal of positive diversifying selection with Hominidae at 13Q, 16G, 23R, 24P, 121P, 294K, 327V, 376L, 397K, 445T, and 471F among other codon sites of the AKT3 gene. These sites were codes for amino acids such as arginine, proline, lysine, and leucine indicating major roles for the function of immunological proteins, and in particular, the study highlighted the importance of changes in gene expression of AKT3 on immunity.
Collapse
|
8
|
Adaptation to host-specific bacterial pathogens drive rapid evolution of novel PhoP/PhoQ regulation pathway modulating the virulence. Microb Pathog 2020; 141:103997. [PMID: 31982569 DOI: 10.1016/j.micpath.2020.103997] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 01/14/2020] [Accepted: 01/22/2020] [Indexed: 01/18/2023]
Abstract
The presence of the PhoP-PhoQ system is usually different in various bacterial groups, suggesting that PhoP can control the expression of different genes in species. However, little is known about the evolution of the PhoP-PhoQ system among bacterial pathogens. Here, we study the evolution of PhoP and PhoQ regulation in 15 species of Enterobacteriaceae family. We have determined that the regulatory objectives adopted by PhoP and PhoQ are mainly different, due to the result of horizontal gene transfer events and even the change in the genetic content between closely related species. We have compared many possibilities tests (M1 vs. M2 and M7 with M8) to determine the positive selection. Estimating parameters at M1 and M2, with positive selection in M2 of the two proteins. The proportions of positive selection sites significant with ω = 4.53076 for PhoP and ω = 4.21041 PhQ. M8 was significant for PhoP and PhQ proteins. To further confirm the positive selection results, we used the Selecton server to confer positive selection on individual sites using the Mechanistic-Empirical Combination model, and we noticed that several sites had been identified under selection pressure during the evolution. There was a strong indication for the positive selection in bacterial genes of PhoP and PhoQ showed the results. By the use of REL and IFEL, the positive selection for PhoP was detected 14 and 11 sites respectively at different codon positions. The positively selected sites of amino acids such as Arginine, Alanine, Lysine, and Leucine are more important for the production of signals. Our results suggest that the positive selection of PhoP-PhoQ genes in host adaptation during evolution raises an intriguing possibility causes subtle variations in actions of PhoP-PhoQ and also increases the opportunities that cause modification in protein structure for the evolution of increasing pathogenicity in bacterial pathogens.
Collapse
|
9
|
Non-neutral evolution of H3.3-encoding genes occurs without alterations in protein sequence. Sci Rep 2019; 9:8472. [PMID: 31186448 PMCID: PMC6560044 DOI: 10.1038/s41598-019-44800-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 05/14/2019] [Indexed: 11/08/2022] Open
Abstract
Histone H3.3 is a developmentally essential variant encoded by two independent genes in human (H3F3A and H3F3B). While this two-gene arrangement is evolutionarily conserved, its origins and function remain unknown. Phylogenetics, synteny and gene structure analyses of H3.3 genes from 32 metazoan genomes indicate independent evolutionary paths for H3F3A and H3F3B. While H3F3B bears similarities with H3.3 genes in distant organisms and with canonical H3 genes, H3F3A is sarcopterygian-specific and evolves under strong purifying selection. Additionally, H3F3B codon-usage preferences resemble those of broadly expressed genes and 'cell differentiation-induced' genes, while codon-usage of H3F3A resembles that of 'cell proliferation-induced' genes. We infer that H3F3B is more similar to the ancestral H3.3 gene and likely evolutionarily adapted for a broad expression pattern in diverse cellular programs, while H3F3A adapted for a subset of gene expression programs. Thus, the arrangement of two independent H3.3 genes facilitates fine-tuning of H3.3 expression across cellular programs.
Collapse
|
10
|
Hendrikse LD, Kambli A, Kayko C, Canuti M, Rodrigues B, Stevens B, Vashon J, Lang AS, Needle DB, Troyer RM. Identification of a Novel Gammaherpesvirus in Canada lynx ( Lynx canadensis). Viruses 2019; 11:v11040363. [PMID: 31010021 PMCID: PMC6520957 DOI: 10.3390/v11040363] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 04/15/2019] [Accepted: 04/17/2019] [Indexed: 12/25/2022] Open
Abstract
Gammaherpesviruses (GHVs) infect many animal species and are associated with lymphoproliferative disorders in some. Previously, we identified several novel GHVs in North American felids; however, a GHV had never been identified in Canada lynx (Lynx canadensis). We, therefore, hypothesized the existence of an unidentified GHV in lynx. Using degenerate nested and subsequently virus-specific PCR, we amplified and sequenced 3.4 kb of DNA from a novel GHV in lynx, which we named Lynx canadensis gammaherpesvirus 1 (LcaGHV1). Phylogenetic analysis determined that LcaGHV1 is a distinct GHV species belonging to the genus Percavirus. We then estimated the prevalence of LcaGHV1 in lynx by developing a PCR-based assay and detected LcaGHV1 DNA in 36% (95% CI: 22-53%) of lynx spleen DNA samples from Maine, USA and 17% (95% CI: 8-31%) from Newfoundland, Canada. The LcaGHV1 DNA sequences from Maine and Newfoundland lynx were nearly identical to each other (two nucleotide substitutions in 3.4 kb), suggesting that the unique lynx subspecies present on the island of Newfoundland (Lynx canadensis subsolanus) is infected with virus that very closely resembles virus found in mainland lynx. The potential ecologic and pathologic consequences of this novel virus for Canada lynx populations warrant further study.
Collapse
Affiliation(s)
- Liam D Hendrikse
- Department of Microbiology and Immunology, University of Western Ontario, 1151 Richmond St., London, ON N6A 5C1, Canada.
| | - Ankita Kambli
- Department of Microbiology and Immunology, University of Western Ontario, 1151 Richmond St., London, ON N6A 5C1, Canada.
| | - Caroline Kayko
- Map and Data Centre, Western Libraries, University of Western Ontario, 1151 Richmond St., London, ON N6A 5C1, Canada.
| | - Marta Canuti
- Department of Biology, Memorial University of Newfoundland, 232 Elizabeth Ave., St. John's, NF A1B 3X9, Canada.
| | - Bruce Rodrigues
- Wildlife Division, Newfoundland and Labrador Department of Fisheries and Land Resources, P.O. Box 2007, Corner Brook, NF A2H 7S1, Canada.
| | - Brian Stevens
- New Hampshire Veterinary Diagnostic Laboratory, College of Life Sciences and Agriculture, University of New Hampshire, 21 Botanical Lane, Durham, NH 03824, USA.
- Canadian Wildlife Health Cooperative⁻Ontario/Nunavut, Guelph, ON N1G 2W1, Canada.
| | - Jennifer Vashon
- Maine Department of Inland Fisheries and Wildlife, 650 State St., Bangor, ME 04401, USA.
| | - Andrew S Lang
- Department of Biology, Memorial University of Newfoundland, 232 Elizabeth Ave., St. John's, NF A1B 3X9, Canada.
| | - David B Needle
- New Hampshire Veterinary Diagnostic Laboratory, College of Life Sciences and Agriculture, University of New Hampshire, 21 Botanical Lane, Durham, NH 03824, USA.
| | - Ryan M Troyer
- Department of Microbiology and Immunology, University of Western Ontario, 1151 Richmond St., London, ON N6A 5C1, Canada.
| |
Collapse
|
11
|
Dunn KA, Kenney T, Gu H, Bielawski JP. Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates. BMC Evol Biol 2019; 19:22. [PMID: 30642241 PMCID: PMC6332903 DOI: 10.1186/s12862-018-1326-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/11/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND An excess of nonsynonymous substitutions, over neutrality, is considered evidence of positive Darwinian selection. Inference for proteins often relies on estimation of the nonsynonymous to synonymous ratio (ω = dN/dS) within a codon model. However, to ease computational difficulties, ω is typically estimated assuming an idealized substitution process where (i) all nonsynonymous substitutions have the same rate (regardless of impact on organism fitness) and (ii) instantaneous double and triple (DT) nucleotide mutations have zero probability (despite evidence that they can occur). It follows that estimates of ω represent an imperfect summary of the intensity of selection, and that tests based on the ω > 1 threshold could be negatively impacted. RESULTS We developed a general-purpose parametric (GPP) modelling framework for codons. This novel approach allows specification of all possible instantaneous codon substitutions, including multiple nonsynonymous rates (MNRs) and instantaneous DT nucleotide changes. Existing codon models are specified as special cases of the GPP model. We use GPP models to implement likelihood ratio tests for ω > 1 that accommodate MNRs and DT mutations. Through both simulation and real data analysis, we find that failure to model MNRs and DT mutations reduces power in some cases and inflates false positives in others. False positives under traditional M2a and M8 models were very sensitive to DT changes. This was exacerbated by the choice of frequency parameterization (GY vs. MG), with rates sometimes > 90% under MG. By including MNRs and DT mutations, accuracy and power was greatly improved under the GPP framework. However, we also find that over-parameterized models can perform less well, and this can contribute to degraded performance of LRTs. CONCLUSIONS We suggest GPP models should be used alongside traditional codon models. Further, all codon models should be deployed within an experimental design that includes (i) assessing robustness to model assumptions, and (ii) investigation of non-standard behaviour of MLEs. As the goal of every analysis is to avoid false conclusions, more work is needed on model selection methods that consider both the increase in fit engendered by a model parameter and the degree to which that parameter is affected by un-modelled evolutionary processes.
Collapse
Affiliation(s)
- Katherine A. Dunn
- Department of Biology, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Toby Kenney
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Hong Gu
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Joseph P. Bielawski
- Department of Biology, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
- Centre Comparative Genomics and Evolutionary Bioinformatics (CGEB) at Dalhousie University, Halifax, Canada
| |
Collapse
|
12
|
Forni D, Cagliani R, Pontremoli C, Pozzoli U, Vertemara J, De Gioia L, Clerici M, Sironi M. Evolutionary Analysis Provides Insight Into the Origin and Adaptation of HCV. Front Microbiol 2018; 9:854. [PMID: 29765366 PMCID: PMC5938362 DOI: 10.3389/fmicb.2018.00854] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 04/13/2018] [Indexed: 12/12/2022] Open
Abstract
Hepatitis C virus (HCV) belongs to the Hepacivirus genus and is genetically heterogeneous, with seven major genotypes further divided into several recognized subtypes. HCV origin was previously dated in a range between ∼200 and 1000 years ago. Hepaciviruses have been identified in several domestic and wild mammals, the largest viral diversity being observed in bats and rodents. The closest relatives of HCV were found in horses/donkeys (equine hepaciviruses, EHV). However, the origin of HCV as a human pathogen is still an unsolved puzzle. Using a selection-informed evolutionary model, we show that the common ancestor of extant HCV genotypes existed at least 3000 years ago (CI: 3192–5221 years ago), with the oldest genotypes being endemic to Asia. EHV originated around 1100 CE (CI: 291–1640 CE). These time estimates exclude that EHV transmission was mainly sustained by widespread veterinary practices and suggest that HCV originated from a single zoonotic event with subsequent diversification in human populations. We also describe a number of biologically important sites in the major HCV genotypes that have been positively selected and indicate that drug resistance-associated variants are significantly enriched at positively selected sites. HCV exploits several cell-surface molecules for cell entry, but only two of these (CD81 and OCLN) determine the species-specificity of infection. Herein evolutionary analyses do not support a long-standing association between primates and hepaciviruses, and signals of positive selection at CD81 were only observed in Chiroptera. No evidence of selection was detected for OCLN in any mammalian order. These results shed light on the origin of HCV and provide a catalog of candidate genetic modulators of HCV phenotypic diversity.
Collapse
Affiliation(s)
- Diego Forni
- Bioinformatics Laboratory, Scientific Institute IRCCS E.Medea, Bosisio Parini, Italy
| | - Rachele Cagliani
- Bioinformatics Laboratory, Scientific Institute IRCCS E.Medea, Bosisio Parini, Italy
| | - Chiara Pontremoli
- Bioinformatics Laboratory, Scientific Institute IRCCS E.Medea, Bosisio Parini, Italy
| | - Uberto Pozzoli
- Bioinformatics Laboratory, Scientific Institute IRCCS E.Medea, Bosisio Parini, Italy
| | - Jacopo Vertemara
- Department of Biotechnology and Biosciences, University of Milan-Bicocca, Milan, Italy
| | - Luca De Gioia
- Department of Biotechnology and Biosciences, University of Milan-Bicocca, Milan, Italy
| | - Mario Clerici
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy.,Don C. Gnocchi Foundation Onlus, IRCCS, Milan, Italy
| | - Manuela Sironi
- Bioinformatics Laboratory, Scientific Institute IRCCS E.Medea, Bosisio Parini, Italy
| |
Collapse
|
13
|
Hyperendemic dengue transmission and identification of a locally evolved DENV-3 lineage, Papua New Guinea 2007-2010. PLoS Negl Trop Dis 2018; 12:e0006254. [PMID: 29494580 PMCID: PMC5849365 DOI: 10.1371/journal.pntd.0006254] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Revised: 03/13/2018] [Accepted: 01/19/2018] [Indexed: 11/19/2022] Open
Abstract
Background Dengue is endemic in the Western Pacific and Oceania and the region reports more than 200,000 cases annually. Outbreaks of dengue and severe dengue occur regularly and movement of virus throughout the region has been reported. Disease surveillance systems, however, in many areas are not fully established and dengue incidence is underreported. Dengue epidemiology is likely least understood in Papua New Guinea (PNG), where the prototype DENV-2 strain New Guinea C was first isolated by Sabin in 1944 but where routine surveillance is not undertaken and little incidence and prevalence data is available. Methodology/Principal findings Serum samples from individuals with recent acute febrile illness or with non-febrile conditions collected between 2007–2010 were tested for anti-DENV neutralizing antibody. Responses were predominantly multitypic and seroprevalence increased with age, a pattern indicative of endemic dengue. DENV-1, DENV-2 and DENV-3 genomes were detected by RT-PCR within a nine-month period and in several instances, two serotypes were identified in individuals sampled within a period of 10 days. Phylogenetic analysis of whole genome sequences identified a DENV-3 Genotype 1 lineage which had evolved on the northern coast of PNG which was likely exported to the western Pacific five years later, in addition to a DENV-2 Cosmopolitan Genotype lineage which had previously circulated in the region. Conclusions/Significance We show that dengue is hyperendemic in PNG and identify an endemic, locally evolved lineage of DENV-3 that was associated with an outbreak of severe dengue in Pacific countries in subsequent years, although severe disease was not identified in PNG. Additional studies need to be undertaken to understand dengue epidemiology and burden of disease in PNG. Dengue virus (DENV) was first identified in Papua New Guinea (PNG) in 1944. Dengue is currently assumed to be an endemic disease in PNG although there is little incidence or prevalence data, and the evidence consensus for dengue presence is low. Routine surveillance is not undertaken and dengue is not a notifiable disease. Severe dengue is rarely identified by local clinicians and the reasons for this are unclear but may be related to poor recognition of dengue and a low index of suspicion, despite high incidence and prevalence rates in neighbouring countries. For example, Indonesia shares borders with PNG and regularly reports outbreaks of severe dengue and transmission of multiple DENV serotypes. DENV infection is identified in travellers from PNG however there are no data on locally circulating strains and how they may compare to viruses associated with severe dengue epidemics in other countries in the Asia Pacific region. We identified evidence for previous infection with all four DENV serotypes among people living on the northern coast of PNG, in Madang, and on Lihir Island in the Bismarck Archipelago off the northeastern coast. We also detected DENV-1, DENV-2, and DENV-3 virus in febrile patients, and we describe the first whole genome sequences of endemically circulating DENV since the prototype 1944 DENV-2New Guinea C strain was characterized. Of note, severe dengue was not diagnosed in any patient infected with these viruses in PNG although introduction of the PNG DENV-3 strain into the Solomon Islands five years later resulted in a large outbreak of severe dengue with hospitalizations and deaths in that country. Dengue epidemiology and burden of disease should be investigated in PNG.
Collapse
|
14
|
Origin and dispersal of Hepatitis E virus. Emerg Microbes Infect 2018; 7:11. [PMID: 29410449 PMCID: PMC5837148 DOI: 10.1038/s41426-017-0009-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Revised: 11/17/2017] [Accepted: 12/03/2017] [Indexed: 12/21/2022]
Abstract
Hepatitis E virus (HEV, genus Orthohepevirus) is a common cause of hepatitis worldwide. Human-infecting HEV strains (Orthohepevirus A) include human-restricted and enzootic genotypes. Viruses in the Orthohepevirus A species also infect rabbits (HEV-3ra), camels, and swine. Using a selection-informed method, we dated the origin of the Orthohepevirus genus at least 21 million years ago, whereas the Orthohepevirus A species originated in Asia, most likely from a human-infecting ancestor that existed ~4500 to 6800 years ago. In this period, the appearance of large human settlements probably facilitated HEV emergence and spread. The earliest events in Orthohepevirus A evolutionary history involved the separation of the enzootic and human-restricted genotypes, as well as the split of the camel-infecting genotypes, which occurred during the time-frame of camel domestication. The place and timing of HEV-3ra divergence also correspond to the circumstances of rabbit domestication. This study clarifies the origin and historical events underlying HEV dispersal.
Collapse
|
15
|
De Grazia S, Lanave G, Bonura F, Urone N, Cappa V, Li Muli S, Pepe A, Gellért A, Banyai K, Martella V, Giammanco GM. Molecular evolutionary analysis of type-1 human astroviruses identifies putative sites under selection pressure on the capsid protein. INFECTION GENETICS AND EVOLUTION 2017; 58:199-208. [PMID: 29288011 DOI: 10.1016/j.meegid.2017.12.023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Revised: 12/06/2017] [Accepted: 12/22/2017] [Indexed: 12/23/2022]
Abstract
Human astroviruses (HAstV) are important enteric pathogens that can be classified into eight sero/genotypes (HAstV-1 to -8). Although the various HAstV types show global spread, type-1 strains tend to be predominant. Molecular analysis of the genomic region encoding the capsid protein (ORF2) has revealed discrete sequence variation, with different lineages within each HAstV type and at least three major lineages have been identified within HAstV-1. Longitudinal epidemiological surveillance has revealed temporal shift of the various HAstV-1 lineages. Metadata analysis of HAstV-1 sequences available in the databases also revealed temporal shifts of the circulation of HAstV-1 lineages, suggesting possible antigenic-related mechanisms of selection at the sub-genotype level. By comparison of HAstV-1 capsid sequences, lineage-defining residues under positive selection were identified. Structural analysis of HAstV-1 capsid allowed identifying at least six residues exposed on the virion surface. Two residues were located in the VP34 (shell region) whilst four residues were mapped in the VP25/27 (protruding region) of HAstV capsid protein, in proximity of the putative receptor binding S site. These findings suggest that mechanisms similar to those observed and/or hypothesized for other enteric viruses are also shaping the evolution of HAstVs, with intra-typic diversification being a possible mechanism to decrease the antigenic pressure to which these viruses are exposed.
Collapse
Affiliation(s)
- Simona De Grazia
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy.
| | - Gianvito Lanave
- Department of Veterinary Medicine, University of Bari, Valenzano, Bari, Italy
| | - Floriana Bonura
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| | - Noemi Urone
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| | - Vincenzo Cappa
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| | - Sara Li Muli
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| | - Arcangelo Pepe
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| | - Akos Gellért
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungarian Academy of Sciences, Hungária krt. 21, Budapest H-1143, Hungary
| | - Krisztian Banyai
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungarian Academy of Sciences, Hungária krt. 21, Budapest H-1143, Hungary
| | - Vito Martella
- Department of Veterinary Medicine, University of Bari, Valenzano, Bari, Italy
| | - Giovanni M Giammanco
- Department of Health Promotion Sciences and Mother and Child Care "G. D'Alessandro", University of Palermo, Palermo, Italy
| |
Collapse
|
16
|
Stejskalova K, Bayerova Z, Futas J, Hrazdilova K, Klumplerova M, Oppelt J, Splichalova P, Di Guardo G, Mazzariol S, Di Francesco CE, Di Francesco G, Terracciano G, Paiu RM, Ursache TD, Modry D, Horin P. Candidate gene molecular markers as tools for analyzing genetic susceptibility to morbillivirus infection in stranded Cetaceans. HLA 2017; 90:343-353. [DOI: 10.1111/tan.13146] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 07/25/2017] [Accepted: 09/06/2017] [Indexed: 12/17/2022]
Affiliation(s)
- K. Stejskalova
- Department of Animal Genetics; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
| | - Z. Bayerova
- Department of Animal Genetics; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
| | - J. Futas
- Department of Animal Genetics; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
- Ceitec VFU, RG Animal Immunogenomics; Brno Czech Republic
| | - K. Hrazdilova
- Ceitec VFU, RG Molecular Microbiology; Brno Czech Republic
| | - M. Klumplerova
- Department of Animal Genetics; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
| | - J. Oppelt
- Ceitec MU, Masaryk University; Brno Czech Republic
- Faculty of Science, National Centre for Biomolecular Research; Masaryk University; Brno Czech Republic
| | - P. Splichalova
- Ceitec VFU, RG Animal Immunogenomics; Brno Czech Republic
| | - G. Di Guardo
- Faculty of Veterinary Medicine; University of Teramo; Teramo Italy
| | - S. Mazzariol
- Department of Comparative Biomedicine and Food Science, Viale dell'Università; University of Padua; Padua Italy
| | | | - G. Di Francesco
- Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise “G. Caporale”; Teramo Italy
| | - G. Terracciano
- Istituto Zooprofilattico Sperimentale del Lazio e della Toscana “M. Aleandri”; Pisa Italy
| | | | - T. D. Ursache
- Faculty of Veterinary Medicine, University of Agricultural Sciences and Veterinary Medicine Cluj-Napoca; Cluj-Napoca Romania
| | - D. Modry
- Ceitec VFU, RG Molecular Microbiology; Brno Czech Republic
- Department of Pathology and Parasitology; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
- Biology Center, Czech Academy of Sciences; České Budějovice Czech Republic
| | - P. Horin
- Department of Animal Genetics; University of Veterinary and Pharmaceutical Sciences; Brno Czech Republic
- Ceitec VFU, RG Animal Immunogenomics; Brno Czech Republic
| |
Collapse
|
17
|
Gasse B, Prasad M, Delgado S, Huckert M, Kawczynski M, Garret-Bernardin A, Lopez-Cazaux S, Bailleul-Forestier I, Manière MC, Stoetzel C, Bloch-Zupan A, Sire JY. Evolutionary Analysis Predicts Sensitive Positions of MMP20 and Validates Newly- and Previously-Identified MMP20 Mutations Causing Amelogenesis Imperfecta. Front Physiol 2017; 8:398. [PMID: 28659819 PMCID: PMC5469888 DOI: 10.3389/fphys.2017.00398] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 05/26/2017] [Indexed: 12/21/2022] Open
Abstract
Amelogenesis imperfecta (AI) designates a group of genetic diseases characterized by a large range of enamel disorders causing important social and health problems. These defects can result from mutations in enamel matrix proteins or protease encoding genes. A range of mutations in the enamel cleavage enzyme matrix metalloproteinase-20 gene (MMP20) produce enamel defects of varying severity. To address how various alterations produce a range of AI phenotypes, we performed a targeted analysis to find MMP20 mutations in French patients diagnosed with non-syndromic AI. Genomic DNA was isolated from saliva and MMP20 exons and exon-intron boundaries sequenced. We identified several homozygous or heterozygous mutations, putatively involved in the AI phenotypes. To validate missense mutations and predict sensitive positions in the MMP20 sequence, we evolutionarily compared 75 sequences extracted from the public databases using the Datamonkey webserver. These sequences were representative of mammalian lineages, covering more than 150 million years of evolution. This analysis allowed us to find 324 sensitive positions (out of the 483 MMP20 residues), pinpoint functionally important domains, and build an evolutionary chart of important conserved MMP20 regions. This is an efficient tool to identify new- and previously-identified mutations. We thus identified six functional MMP20 mutations in unrelated families, finding two novel mutated sites. The genotypes and phenotypes of these six mutations are described and compared. To date, 13 MMP20 mutations causing AI have been reported, making these genotypes and associated hypomature enamel phenotypes the most frequent in AI.
Collapse
Affiliation(s)
- Barbara Gasse
- Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France
| | - Megana Prasad
- Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France
| | - Sidney Delgado
- Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France
| | - Mathilde Huckert
- Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France.,Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France
| | - Marzena Kawczynski
- Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France
| | - Annelyse Garret-Bernardin
- Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Unit of Dentistry, IRCCS, Bambino Gesù Children's HospitalRome, Italy
| | - Serena Lopez-Cazaux
- Faculté de Chirurgie Dentaire, Département d'Odontologie Pédiatrique, Centre de Compétences Maladies Rares, CHU Hôtel Dieu, Service d'odontologie Conservatrice et PédiatriqueNantes, France
| | - Isabelle Bailleul-Forestier
- Faculté de Chirurgie Dentaire, CHU de Toulouse, Centre de Compétences Maladies Rares, Odontologie Pédiatrique, Université Paul SabatierToulouse, France
| | - Marie-Cécile Manière
- Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France
| | - Corinne Stoetzel
- Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France
| | - Agnès Bloch-Zupan
- Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France.,Centre Européen de Recherche en Biologie et en Médecine, Centre National de la Recherche Scientifique UMR7104, Institut National de la Santé et de la Recherche Médicale U964, Institut de Génétique et de Biologie Moléculaire and Cellulaire, Université de StrasbourgIllkirch, France.,Institut d'Etudes Avancées, Université de Strasbourg, USIASStrasbourg, France.,Eastman Dental Institute, University College LondonLondon, United Kingdom
| | - Jean-Yves Sire
- Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France
| |
Collapse
|
18
|
Ahmad HI, Liu G, Jiang X, Liu C, Chong Y, Huarong H. Adaptive molecular evolution of MC1R gene reveals the evidence for positive diversifying selection in indigenous goat populations. Ecol Evol 2017; 7:5170-5180. [PMID: 28770057 PMCID: PMC5528238 DOI: 10.1002/ece3.2919] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 02/10/2017] [Accepted: 02/13/2017] [Indexed: 12/16/2022] Open
Abstract
Detecting signatures of selection can provide a new insight into the mechanism of contemporary breeding and artificial selection and further reveal the causal genes associated to the phenotypic variation. However, the signatures of selection on genes entailing for profitable traits between Chinese commercial and indigenous goats have been poorly interpreted. We noticed footprints of positive selection at MC1R gene containing SNPs genotyped in five Chinese native goat breeds. An experimental distribution of FST was built based on approximations of FST for each SNP across five breeds. We identified selection using the high FST outlier method and found that MC1R candidate gene show evidence of positive selection. Furthermore, adaptive selection pressure on specific codons was determined using different codon based on maximum‐likelihood methods; signature of positive selection in mammalian MC1R was explored in individual codons. Evolutionary analyses were inferred under maximum likelihood models, the HyPhy package implemented in the DATAMONKEY Web Server. The results of codon selection displayed positive diversifying selection at the sites were mainly involved in development of genetic variations in coat color in various mammalian species. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat MC1R provides new insights that the gene evolution may have been modulated by domestication events in goats.
Collapse
Affiliation(s)
- Hafiz Ishfaq Ahmad
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Guiqiong Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Xunping Jiang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Chenhui Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Yuqing Chong
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| | - Huang Huarong
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education College of Animal Science and Technology Huazhong Agricultural University Wuhan China
| |
Collapse
|
19
|
Raven N, Lisovski S, Klaassen M, Lo N, Madsen T, Ho SYW, Ujvari B. Purifying selection and concerted evolution of RNA-sensing toll-like receptors in migratory waders. INFECTION GENETICS AND EVOLUTION 2017; 53:135-145. [PMID: 28528860 DOI: 10.1016/j.meegid.2017.05.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2016] [Revised: 05/15/2017] [Accepted: 05/17/2017] [Indexed: 01/05/2023]
Abstract
Migratory birds encounter a broad range of pathogens during their journeys, making them ideal models for studying immune gene evolution. Despite the potential value of these species to immunoecology and disease epidemiology, previous studies have typically focused on their adaptive immune gene repertoires. In this study, we examined the evolution of innate immune genes in three long-distance migratory waders (order Charadriiformes). We analysed two parts of the extracellular domains of two Toll-like receptors (TLR3 and TLR7) involved in virus recognition in the Sanderling (Calidris alba), Red-necked Stint (Calidris ruficollis), and Ruddy Turnstone (Arenaria interpres). Our analysis was extended to 50 avian species for which whole-genome sequences were available, including two additional waders. We found that the inferred relationships among avian TLR3 and TLR7 do not match the whole-genome phylogeny of birds. Further analyses showed that although both loci are predominantly under purifying selection, the evolution of the extracellular domain of avian TLR3 has also been driven by episodic diversifying selection. TLR7 was found to be duplicated in all five wader species and in two other orders of birds, Cuculiformes and Passeriformes. The duplication is likely to have occurred in the ancestor of each order, and the duplicated copies appear to be undergoing concerted evolution. The phylogenetic relationships of wader TLR7 matched those of the five wader species, but that of TLR3 did not. Instead, the tree inferred from TLR3 showed potential associations with the species' ecology, including migratory behaviour and exposure to pathogens. Our study demonstrates the importance of combining immunological and ecological knowledge to understand the impact of immune gene polymorphism on the evolutionary ecology of infectious diseases.
Collapse
Affiliation(s)
- Nynke Raven
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Waurn Ponds, VIC 3216, Australia
| | - Simeon Lisovski
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Waurn Ponds, VIC 3216, Australia
| | - Marcel Klaassen
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Waurn Ponds, VIC 3216, Australia
| | - Nathan Lo
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | - Thomas Madsen
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Waurn Ponds, VIC 3216, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | - Beata Ujvari
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Waurn Ponds, VIC 3216, Australia.
| |
Collapse
|
20
|
Simões BF, Sampaio FL, Douglas RH, Kodandaramaiah U, Casewell NR, Harrison RA, Hart NS, Partridge JC, Hunt DM, Gower DJ. Visual Pigments, Ocular Filters and the Evolution of Snake Vision. Mol Biol Evol 2016; 33:2483-95. [DOI: 10.1093/molbev/msw148] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
|
21
|
Fish I, Boissinot S. Functional evolution of the OAS1 viral sensor: Insights from old world primates. INFECTION GENETICS AND EVOLUTION 2016; 44:341-350. [PMID: 27393659 DOI: 10.1016/j.meegid.2016.07.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2016] [Revised: 07/01/2016] [Accepted: 07/04/2016] [Indexed: 11/26/2022]
Abstract
Infections with viral pathogens impose considerable selective pressure on host defensive genes. Those genes at the forefront, responsible for identifying and binding exogenous molecular viral components, will carry the hallmarks of this struggle. Oligoadenylate synthetase (OAS) enzymes play a major role in the innate defense against a large number of viruses by acting as sensors of viral infections. Following their up-regulation by the interferon pathway, OASs bind viral dsRNA and then signal ribonuclease L (RNase L) to degrade RNA, shutting down viral and host protein synthesis. We have investigated the evolution of OAS1 in twenty-two Old World monkey species. We identified a total of 35 codons with the earmarks of positive selection and we performed a comprehensive analysis of their functional significance using in silico modeling of the OAS1 protein. Subdividing OAS1 into functional domains revealed intense purifying selection in the active domain but significant positive directional selection in the RNA-binding domain (RBD), the region where OAS1 binds viral dsRNA. The modeling analysis revealed a concentration of rapidly evolving residues in one region of the RBD suggestive of the sub-functionalization of different regions of the RBD. This analysis also identified several positively selected residues circumscribing the entry to the active site suggesting adaptive evasion of viral antagonism and/or selection for production of oligoadenylate of different length.
Collapse
Affiliation(s)
- Ian Fish
- Biology Department, Queens College, The City University of New York, 65-30 Kissena Boulevard, Flushing, NY 11367, USA; Graduate Center, Sub-program in Molecular, Cellular and Developmental Biology, The City University of New York, 365(th) avenue, New York, NY 10016, USA.
| | - Stéphane Boissinot
- New York University Abu Dhabi, P.O. Box 129188, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
22
|
Abdullah T, Faiza M, Pant P, Rayyan Akhtar M, Pant P. An Analysis of Single Nucleotide Substitution in Genetic Codons - Probabilities and Outcomes. Bioinformation 2016; 12:98-104. [PMID: 28149042 PMCID: PMC5267951 DOI: 10.6026/97320630012098] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Revised: 04/12/2016] [Accepted: 04/14/2016] [Indexed: 12/02/2022] Open
Abstract
Background: Single nucleotide substitutions (SNS) in genetic codon are of prime importance due to their ability to alter an amino
acid sequence as a result. Given the nature of genetic code, any SNS is expected to change the protein sequence randomly into any
of the 64 possible codons. In this paper, we present a theoretical analysis of how single nucleotide substitutions in genetic codon
may affect resulting amino acid residue and what is the most likely amino acid that will get selected as a result. Methods: A
probability matrix was developed showing possible changes and routes likely being followed as a result of base substitution
mutation causing changes at the translational level for the amino acid being encoded. Results: We observe that in event of single
base pair substitution in a given amino acid; a chosen set of amino acids is theoretically more probable to be resulted suggesting a
directional rather than a random change. This study also indicates that for a given amino acid coded by a number of synonymous
codons, all synonymous codons will result into same list of amino acids in case of all possible SNS at three positions. Conclusion:
The present work has resulted into development of a theoretical probability matrix which can be used to predict changes in amino
acid residues in a protein sequence caused by single base substitutions.
Collapse
Affiliation(s)
- Tariq Abdullah
- Department of Computer Science, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi , India
| | - Muniba Faiza
- Department of Computer Science, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi , India
| | - Prashant Pant
- Department of Botany,Dyal Singh College, University of Delhi, New Delhi - 110003, India
| | | | - Pratibha Pant
- Department of Botany, Daulat Ram College, University of Delhi, New Delhi, India
| |
Collapse
|
23
|
Arenas M. Trends in substitution models of molecular evolution. Front Genet 2015; 6:319. [PMID: 26579193 PMCID: PMC4620419 DOI: 10.3389/fgene.2015.00319] [Citation(s) in RCA: 79] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 10/09/2015] [Indexed: 11/13/2022] Open
Abstract
Substitution models of evolution describe the process of genetic variation through fixed mutations and constitute the basis of the evolutionary analysis at the molecular level. Almost 40 years after the development of first substitution models, highly sophisticated, and data-specific substitution models continue emerging with the aim of better mimicking real evolutionary processes. Here I describe current trends in substitution models of DNA, codon and amino acid sequence evolution, including advantages and pitfalls of the most popular models. The perspective concludes that despite the large number of currently available substitution models, further research is required for more realistic modeling, especially for DNA coding and amino acid data. Additionally, the development of more accurate complex models should be coupled with new implementations and improvements of methods and frameworks for substitution model selection and downstream evolutionary analysis.
Collapse
Affiliation(s)
- Miguel Arenas
- Institute of Molecular Pathology and Immunology of the University of Porto Porto, Portugal
| |
Collapse
|
24
|
The heptad repeat region is a major selection target in MERS-CoV and related coronaviruses. Sci Rep 2015; 5:14480. [PMID: 26404138 PMCID: PMC4585914 DOI: 10.1038/srep14480] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2015] [Accepted: 09/01/2015] [Indexed: 01/08/2023] Open
Abstract
Middle East respiratory syndrome coronavirus (MERS-CoV) originated in bats and spread to humans via zoonotic transmission from camels. We analyzed the evolution of the spike (S) gene in betacoronaviruses (betaCoVs) isolated from different mammals, in bat coronavirus populations, as well as in MERS-CoV strains from the current outbreak. Results indicated several positively selected sites located in the region comprising the two heptad repeats (HR1 and HR2) and their linker. Two sites (R652 and V1060) were positively selected in the betaCoVs phylogeny and correspond to mutations associated with expanded host range in other coronaviruses. During the most recent evolution of MERS-CoV, adaptive mutations in the HR1 (Q/R/H1020) arose in camels or in a previous host and spread to humans. We determined that different residues at position 1020 establish distinct inter- and intra-helical interactions and affect the stability of the six-helix bundle formed by the HRs. A similar effect on stability was observed for a nearby mutation (T1015N) that increases MERS-CoV infection efficiency in vitro. Data herein indicate that the heptad repeat region was a major target of adaptive evolution in MERS-CoV-related viruses; these results are relevant for the design of fusion inhibitor peptides with antiviral function.
Collapse
|
25
|
Consuegra S, John E, Verspoor E, de Leaniz CG. Patterns of natural selection acting on the mitochondrial genome of a locally adapted fish species. Genet Sel Evol 2015; 47:58. [PMID: 26138253 PMCID: PMC4490732 DOI: 10.1186/s12711-015-0138-0] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 06/23/2015] [Indexed: 11/10/2022] Open
Abstract
Background Mitochondrial DNA (mtDNA) is frequently used in population genetic studies and is usually considered as a neutral marker. However, given the functional importance of the proteins encoded by the mitochondrial genome, and the prominent role of mitochondria in cellular energy production, the assumption of neutrality is increasingly being questioned. Results We tested for evidence of selection on the mitochondrial genome of the Atlantic salmon, which is a locally adapted and widely farmed species and is distributed across a large latitudinal cline. We analysed 20 independent regions of the salmon mtDNA that represented nine genes (ND1, ND2, ND3, COX1, COX2, ATP6, ND4, ND5, and CYTB). These 20 mtDNA regions were sequenced using a 454 approach from samples collected across the entire European range of this species. We found evidence of positive selection at the ND1, ND3 and ND4 genes, which is supported by at least two different codon-based methods and also by differences in the chemical properties of the amino acids involved. The geographical distribution of some of the mutations indicated to be under selection was not random, and some mutations were private to artic populations. We discuss the possibility that selection acting on the Atlantic salmon mtDNA genome might be related to the need for increased metabolic efficiency at low temperatures. Conclusions The analysis of sequences representing nine mitochondrial genes that are involved in the OXPHOS pathway revealed signatures of positive selection in the mitochondrial genome of the Atlantic salmon. The properties of the amino acids involved suggest that some of the mutations that were identified to be under positive selection might have functional implications, possibly in relation to metabolic efficiency. Experimental evidence, and better understanding of regional phylogeographic structuring, are needed to clarify the potential role of selection acting on the mitochondrial genome of Atlantic salmon and other locally adapted fishes. Electronic supplementary material The online version of this article (doi:10.1186/s12711-015-0138-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sofia Consuegra
- Department of BioSciences, Swansea University, Swansea, SA2 8PP, UK.
| | - Elgan John
- Department of BioSciences, Swansea University, Swansea, SA2 8PP, UK.
| | - Eric Verspoor
- Inverness College, University of Highlands and Islands, Inverness, IV1 1SA, Scotland, UK.
| | | |
Collapse
|
26
|
Lillie M, Grueber CE, Sutton JT, Howitt R, Bishop PJ, Gleeson D, Belov K. Selection on MHC class II supertypes in the New Zealand endemic Hochstetter's frog. BMC Evol Biol 2015; 15:63. [PMID: 25886729 PMCID: PMC4415247 DOI: 10.1186/s12862-015-0342-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 03/27/2015] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND The New Zealand native frogs, family Leiopelmatidae, are among the most archaic in the world. Leiopelma hochstetteri (Hochstetter's frog) is a small, semi-aquatic frog with numerous, fragmented populations scattered across New Zealand's North Island. We characterized a major histocompatibility complex (MHC) class II B gene (DAB) in L. hochstetteri from a spleen transcriptome, and then compared its diversity to neutral microsatellite markers to assess the adaptive genetic diversity of five populations ("evolutionarily significant units", ESUs). RESULTS L. hochstetteri possessed very high MHC diversity, with 74 DAB alleles characterized. Extremely high differentiation was observed at the DAB locus, with only two alleles shared between populations, a pattern that was not reflected in the microsatellites. Clustering analysis on putative peptide binding residues of the DAB alleles indicated four functional supertypes, all of which were represented in 4 of 5 populations, albeit at different frequencies. Otawa was an exception to these observations, with only two DAB alleles present. CONCLUSIONS This study of MHC diversity highlights extreme population differentiation at this functional locus. Supertype differentiation was high among populations, suggesting spatial and/or temporal variation in selection pressures. Low DAB diversity in Otawa may limit this population's adaptive potential to future pathogenic challenges.
Collapse
Affiliation(s)
- Mette Lillie
- Faculty of Veterinary Science, University of Sydney, Sydney, Australia.
| | - Catherine E Grueber
- Faculty of Veterinary Science, University of Sydney, Sydney, Australia.
- San Diego Zoo Global, San Diego, USA.
| | - Jolene T Sutton
- Department of Biology, University of Hawai'i at Mānoa, Honolulu, USA.
| | | | - Phillip J Bishop
- Department of Biology, University of Otago, Dunedin, New Zealand.
| | - Dianne Gleeson
- Landcare Research, Auckland, New Zealand.
- Institute for Applied Ecology, University of Canberra, Bruce, Australia.
| | - Katherine Belov
- Faculty of Veterinary Science, University of Sydney, Sydney, Australia.
| |
Collapse
|
27
|
Transmitted/Founder Viruses Rapidly Escape from CD8+ T Cell Responses in Acute Hepatitis C Virus Infection. J Virol 2015; 89:5478-90. [PMID: 25740982 DOI: 10.1128/jvi.03717-14] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2014] [Accepted: 02/25/2015] [Indexed: 12/22/2022] Open
Abstract
UNLABELLED The interaction between hepatitis C virus (HCV) and cellular immune responses during very early infection is critical for disease outcome. To date, the impact of antigen-specific cellular immune responses on the evolution of the viral population establishing infection and on potential escape has not been studied. Understanding these early host-virus dynamics is important for the development of a preventative vaccine. Three subjects who were followed longitudinally from the detection of viremia preseroconversion until disease outcome were analyzed. The evolution of transmitted/founder (T/F) viruses was undertaken using deep sequencing. CD8(+) T cell responses were measured via enzyme-linked immunosorbent spot (ELISpot) assay using HLA class I-restricted T/F epitopes. T/F viruses were rapidly extinguished in all subjects associated with either viral clearance (n = 1) or replacement with viral variants leading to establishment of chronic infection (n = 2). CD8(+) T cell responses against 11 T/F epitopes were detectable by 33 to 44 days postinfection, and 5 of these epitopes had not previously been reported. These responses declined rapidly in those who became chronically infected and were maintained in the subject who cleared infection. Higher-magnitude CD8(+) T cell responses were associated with rapid development of immune escape variants at a rate of up to 0.1 per day. Rapid escape from CD8(+) T cell responses has been quantified for the first time in the early phase of primary HCV infection. These rapid escape dynamics were associated with higher-magnitude CD8(+) T cell responses. These findings raise questions regarding optimal selection of immunogens for HCV vaccine development and suggest that detailed analysis of individual epitopes may be required. IMPORTANCE A major limitation in our detailed understanding of the role of immune response in HCV clearance has been the lack of data on very early primary infection when the transmitted viral variants successfully establish the acute infection. This study was made possible through the availability of specimens from a unique cohort of asymptomatic primary infection cases in whom the first available viremic samples were collected approximately 3 weeks postinfection and at regular intervals thereafter. The study included detailed examination of both the evolution of the viral population and the host cellular immune responses against the T/F viruses. The findings here provide the first evidence of host cellular responses targeting T/F variants and imposing a strong selective force toward viral escape. The results of this study provide useful insight on how virus escapes the host response and consequently on future analysis of vaccine-induced immunity.
Collapse
|
28
|
Smith MD, Wertheim JO, Weaver S, Murrell B, Scheffler K, Kosakovsky Pond SL. Less is more: an adaptive branch-site random effects model for efficient detection of episodic diversifying selection. Mol Biol Evol 2015; 32:1342-53. [PMID: 25697341 DOI: 10.1093/molbev/msv022] [Citation(s) in RCA: 444] [Impact Index Per Article: 49.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Over the past two decades, comparative sequence analysis using codon-substitution models has been honed into a powerful and popular approach for detecting signatures of natural selection from molecular data. A substantial body of work has focused on developing a class of "branch-site" models which permit selective pressures on sequences, quantified by the ω ratio, to vary among both codon sites and individual branches in the phylogeny. We develop and present a method in this class, adaptive branch-site random effects likelihood (aBSREL), whose key innovation is variable parametric complexity chosen with an information theoretic criterion. By applying models of different complexity to different branches in the phylogeny, aBSREL delivers statistical performance matching or exceeding best-in-class existing approaches, while running an order of magnitude faster. Based on simulated data analysis, we offer guidelines for what extent and strength of diversifying positive selection can be detected reliably and suggest that there is a natural limit on the optimal parametric complexity for "branch-site" models. An aBSREL analysis of 8,893 Euteleostomes gene alignments demonstrates that over 80% of branches in typical gene phylogenies can be adequately modeled with a single ω ratio model, that is, current models are unnecessarily complicated. However, there are a relatively small number of key branches, whose identities are derived from the data using a model selection procedure, for which it is essential to accurately model evolutionary complexity.
Collapse
Affiliation(s)
- Martin D Smith
- Graduate Program in Bioinformatics and Systems Biology, University of California San Diego
| | | | - Steven Weaver
- Department of Medicine, University of California San Diego
| | - Ben Murrell
- Department of Medicine, University of California San Diego
| | - Konrad Scheffler
- Department of Medicine, University of California San Diego Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa
| | | |
Collapse
|
29
|
Murrell B, Weaver S, Smith MD, Wertheim JO, Murrell S, Aylward A, Eren K, Pollner T, Martin DP, Smith DM, Scheffler K, Kosakovsky Pond SL. Gene-wide identification of episodic selection. Mol Biol Evol 2015; 32:1365-71. [PMID: 25701167 DOI: 10.1093/molbev/msv035] [Citation(s) in RCA: 352] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
We present BUSTED, a new approach to identifying gene-wide evidence of episodic positive selection, where the non-synonymous substitution rate is transiently greater than the synonymous rate. BUSTED can be used either on an entire phylogeny (without requiring an a priori hypothesis regarding which branches are under positive selection) or on a pre-specified subset of foreground lineages (if a suitable a priori hypothesis is available). Selection is modeled as varying stochastically over branches and sites, and we propose a computationally inexpensive evidence metric for identifying sites subject to episodic positive selection on any foreground branches. We compare BUSTED with existing models on simulated and empirical data. An implementation is available on www.datamonkey.org/busted, with a widget allowing the interactive specification of foreground branches.
Collapse
Affiliation(s)
- Ben Murrell
- Department of Medicine, University of California San Diego
| | - Steven Weaver
- Department of Medicine, University of California San Diego
| | - Martin D Smith
- Graduate program in Bioinformatics and Systems Biology, University of California San Diego
| | | | - Sasha Murrell
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA
| | - Anthony Aylward
- Graduate program in Bioinformatics and Systems Biology, University of California San Diego
| | - Kemal Eren
- Graduate program in Bioinformatics and Systems Biology, University of California San Diego Graduate program in Biomedical Informatics, University of California San Diego
| | | | - Darren P Martin
- Computational Biology Group, Institute of Infectious Diseases and Molecular Medicine, University of Cape Town, Cape Town, South Africa
| | - Davey M Smith
- Department of Medicine, University of California San Diego Veterans Affairs San Diego Healthcare System, San Diego, CA
| | - Konrad Scheffler
- Department of Medicine, University of California San Diego Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa
| | | |
Collapse
|
30
|
Immonen TT, Leitner T. Reduced evolutionary rates in HIV-1 reveal extensive latency periods among replicating lineages. Retrovirology 2014; 11:81. [PMID: 25318357 PMCID: PMC4201670 DOI: 10.1186/s12977-014-0081-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 09/01/2014] [Indexed: 02/01/2023] Open
Abstract
Background HIV-1 can persist for the duration of a patient’s life due in part to its ability to hide from the immune system, and from antiretroviral drugs, in long-lived latent reservoirs. Latent forms of HIV-1 may also be disproportionally involved in transmission. Thus, it is important to detect and quantify latency in the HIV-1 life cycle. Results We developed a novel molecular clock–based phylogenetic tool to investigate the prevalence of HIV-1 lineages that have experienced latency. The method removes alternative sources that may affect evolutionary rates, such as hypermutation, recombination, and selection, to reveal the contribution of generation-time effects caused by latency. Our method was able to recover latent lineages with high specificity and sensitivity, and low false discovery rates, even on relatively short branches on simulated phylogenies. Applying the tool to HIV-1 sequences from 26 patients, we show that the majority of phylogenetic lineages have been affected by generation-time effects in every patient type, whether untreated, elite controller, or under effective or failing treatment. Furthermore, we discovered extensive effects of latency in sequence data (gag, pol, and env) from reservoirs as well as in the replicating plasma population. To better understand our phylogenetic findings, we developed a dynamic model of virus-host interactions to investigate the proportion of lineages in the actively replicating population that have ever been latent. Assuming neutral evolution, our dynamic modeling showed that under most parameter conditions, it is possible for a few activated latent viruses to propagate so that in time, most HIV-1 lineages will have been latent at some time in their past. Conclusions These results suggest that cycling in and out of latency plays a major role in the evolution of HIV-1. Thus, no aspect of HIV-1 evolution can be fully understood without considering latency - including treatment, drug resistance, immune evasion, transmission, and pathogenesis. Electronic supplementary material The online version of this article (doi:10.1186/s12977-014-0081-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Taina T Immonen
- Theoretical Biology & Biophysics, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
| | - Thomas Leitner
- Theoretical Biology & Biophysics, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
| |
Collapse
|
31
|
Abstract
Models of codon evolution have attracted particular interest because of their unique capabilities to detect selection forces and their high fit when applied to sequence evolution. We described here a novel approach for modeling codon evolution, which is based on Kronecker product of matrices. The 61 × 61 codon substitution rate matrix is created using Kronecker product of three 4 × 4 nucleotide substitution matrices, the equilibrium frequency of codons, and the selection rate parameter. The entities of the nucleotide substitution matrices and selection rate are considered as parameters of the model, which are optimized by maximum likelihood. Our fully mechanistic model allows the instantaneous substitution matrix between codons to be fully estimated with only 19 parameters instead of 3,721, by using the biological interdependence existing between positions within codons. We illustrate the properties of our models using computer simulations and assessed its relevance by comparing the AICc measures of our model and other models of codon evolution on simulations and a large range of empirical data sets. We show that our model fits most biological data better compared with the current codon models. Furthermore, the parameters in our model can be interpreted in a similar way as the exchangeability rates found in empirical codon models.
Collapse
Affiliation(s)
- Maryam Zaheri
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Linda Dib
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Nicolas Salamin
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| |
Collapse
|
32
|
Goka EA, Vallely PJ, Mutton KJ, Klapper PE. Mutations associated with severity of the pandemic influenza A(H1N1)pdm09 in humans: a systematic review and meta-analysis of epidemiological evidence. Arch Virol 2014; 159:3167-83. [PMID: 25078388 DOI: 10.1007/s00705-014-2179-z] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 07/12/2014] [Indexed: 10/25/2022]
Abstract
Mutations in the haemagglutinin (HA), non-structural protein 1 (NS1) and polymerase basic protein 2 (PB2) of influenza viruses have been associated with virulence. This study investigated the association between mutations in these genes in influenza A(H1N1)pdm09 virus and the risk of severe or fatal disease. Searches were conducted on the MEDLINE, EMBASE and Web of Science electronic databases and the reference lists of published studies. The PRISMA and STROBE guidelines were followed in assessing the quality of studies and writing-up. Eighteen (18) studies, from all continents, were included in the systematic review (recruiting patients 0 - 77 years old). The mutation D222G was associated with a significant increase in severe disease (pooled RD: 11 %, 95 % CI: 3.0 % - 18.0 %, p = 0.004) and the risk of fatality (RD: 23 %, 95 % CI: 14.0 %-31.0 %, p = < 0.0001). No association was observed between the mutations HA-D222N, D222E, PB2-E627K and NS1-T123V and severe/fatal disease. The results suggest that no virus quasispecies bearing virulence-conferring mutations in the HA, PB2 and NS1 predominated. However issues of sampling bias, and bias due to uncontrolled confounders such as comorbidities, and viral and bacterial coinfection, should be born in mind. Influenza A viruses should continue to be monitored for the occurrence of virulence-conferring mutations in HA, PB2 and NS1. There are suggestions that respiratory virus coinfections also affect virus virulence. Studies investigating the role of genetic mutations on disease outcome should make efforts to also investigate the role of respiratory virus coinfections.
Collapse
Affiliation(s)
- E A Goka
- Institute of Inflammation and Repair, Faculty of Medical and Human Sciences, University of Manchester, 1st Floor Stopford building, Oxford Road, Manchester, M13 9PL, UK,
| | | | | | | |
Collapse
|
33
|
Silvent J, Gasse B, Mornet E, Sire JY. Molecular evolution of the tissue-nonspecific alkaline phosphatase allows prediction and validation of missense mutations responsible for hypophosphatasia. J Biol Chem 2014; 289:24168-79. [PMID: 25023282 DOI: 10.1074/jbc.m114.576843] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
ALPL encodes the tissue nonspecific alkaline phosphatase (TNSALP), which removes phosphate groups from various substrates. Its function is essential for bone and tooth mineralization. In humans, ALPL mutations lead to hypophosphatasia, a genetic disorder characterized by defective bone and/or tooth mineralization. To date, 275 ALPL mutations have been reported to cause hypophosphatasia, of which 204 were simple missense mutations. Molecular evolutionary analysis has proved to be an efficient method to highlight residues important for the protein function and to predict or validate sensitive positions for genetic disease. Here we analyzed 58 mammalian TNSALP to identify amino acids unchanged, or only substituted by residues sharing similar properties, through 220 millions years of mammalian evolution. We found 469 sensitive positions of the 524 residues of human TNSALP, which indicates a highly constrained protein. Any substitution occurring at one of these positions is predicted to lead to hypophosphatasia. We tested the 204 missense mutations resulting in hypophosphatasia against our predictive chart, and validated 99% of them. Most sensitive positions were located in functionally important regions of TNSALP (active site, homodimeric interface, crown domain, calcium site, …). However, some important positions are located in regions, the structure and/or biological function of which are still unknown. Our chart of sensitive positions in human TNSALP (i) enables to validate or invalidate at low cost any ALPL mutation, which would be suspected to be responsible for hypophosphatasia, by contrast with time consuming and expensive functional tests, and (ii) displays higher predictive power than in silico models of prediction.
Collapse
Affiliation(s)
- Jérémie Silvent
- From the Université Pierre & Marie Curie, IBPS, Evolution Paris Seine, 7 quai St-Bernard, Case 05, 75005 Paris and
| | - Barbara Gasse
- From the Université Pierre & Marie Curie, IBPS, Evolution Paris Seine, 7 quai St-Bernard, Case 05, 75005 Paris and
| | - Etienne Mornet
- the Unité de Pathologie Cellulaire et Génétique, EA2493, Université de Versailles-Saint Quentin en Yvelines, Versailles & Unité de Génétique Constitutionnelle, Centre Hospitalier de Versailles, 78150 Le Chesnay, France
| | - Jean-Yves Sire
- From the Université Pierre & Marie Curie, IBPS, Evolution Paris Seine, 7 quai St-Bernard, Case 05, 75005 Paris and
| |
Collapse
|
34
|
Grueber CE, Wallis GP, Jamieson IG. Episodic positive selection in the evolution of avian toll-like receptor innate immunity genes. PLoS One 2014; 9:e89632. [PMID: 24595315 PMCID: PMC3940441 DOI: 10.1371/journal.pone.0089632] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Accepted: 01/21/2014] [Indexed: 01/12/2023] Open
Abstract
Toll-like receptors (TLRs) are a family of conserved pattern-recognition molecules responsible for initiating innate and acquired immune responses. Because they play a key role in host defence, these genes have received increasing interest in the evolutionary and population genetics literature, as their variation represents a potential target of adaptive evolution. However, the role of pathogen-mediated selection (i.e. episodic positive selection) in the evolution of these genes remains poorly known and has not been examined outside of mammals. A recent increase in the number of bird species for which TLR sequences are available has enabled us to examine the selective processes that have influenced evolution of the 10 known avian TLR genes. Specifically, we tested for episodic positive selection to identify codons that experience purifying selection for the majority of their evolution, interspersed with bursts of positive selection that may occur only in restricted lineages. We included up to 23 species per gene (mean = 16.0) and observed that, although purifying selection was evident, an average of 4.5% of codons experienced episodic positive selection across all loci. For four genes in which sequence coverage traversed both the extracellular leucine-rich repeat region (LRR) and transmembrane/intracellular domains of the proteins, increased positive selection was observed at the extracellular domain, consistent with theoretical predictions. Our results provide evidence that episodic positive selection has played an important role in the evolution of most avian TLRs, consistent with the role of these loci in pathogen recognition and a mechanism of host-pathogen coevolution.
Collapse
Affiliation(s)
- Catherine E Grueber
- Allan Wilson Centre for Molecular Ecology and Evolution, Dunedin, New Zealand; Department of Zoology, University of Otago, Dunedin, New Zealand
| | - Graham P Wallis
- Department of Zoology, University of Otago, Dunedin, New Zealand
| | - Ian G Jamieson
- Allan Wilson Centre for Molecular Ecology and Evolution, Dunedin, New Zealand; Department of Zoology, University of Otago, Dunedin, New Zealand
| |
Collapse
|
35
|
Chaw RC, Zhao Y, Wei J, Ayoub NA, Allen R, Atrushi K, Hayashi CY. Intragenic homogenization and multiple copies of prey-wrapping silk genes in Argiope garden spiders. BMC Evol Biol 2014; 14:31. [PMID: 24552485 PMCID: PMC3933166 DOI: 10.1186/1471-2148-14-31] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 02/14/2014] [Indexed: 11/15/2022] Open
Abstract
Background Spider silks are spectacular examples of phenotypic diversity arising from adaptive molecular evolution. An individual spider can produce an array of specialized silks, with the majority of constituent silk proteins encoded by members of the spidroin gene family. Spidroins are dominated by tandem repeats flanked by short, non-repetitive N- and C-terminal coding regions. The remarkable mechanical properties of spider silks have been largely attributed to the repeat sequences. However, the molecular evolutionary processes acting on spidroin terminal and repetitive regions remain unclear due to a paucity of complete gene sequences and sampling of genetic variation among individuals. To better understand spider silk evolution, we characterize a complete aciniform spidroin gene from an Argiope orb-weaving spider and survey aciniform gene fragments from congeneric individuals. Results We present the complete aciniform spidroin (AcSp1) gene from the silver garden spider Argiope argentata (Aar_AcSp1), and document multiple AcSp1 loci in individual genomes of A. argentata and the congeneric A. trifasciata and A. aurantia. We find that Aar_AcSp1 repeats have >98% pairwise nucleotide identity. By comparing AcSp1 repeat amino acid sequences between Argiope species and with other genera, we identify regions of conservation over vast amounts of evolutionary time. Through a PCR survey of individual A. argentata, A. trifasciata, and A. aurantia genomes, we ascertain that AcSp1 repeats show limited variation between species whereas terminal regions are more divergent. We also find that average dN/dS across codons in the N-terminal, repetitive, and C-terminal encoding regions indicate purifying selection that is strongest in the N-terminal region. Conclusions Using the complete A. argentata AcSp1 gene and spidroin genetic variation between individuals, this study clarifies some of the molecular evolutionary processes underlying the spectacular mechanical attributes of aciniform silk. It is likely that intragenic concerted evolution and functional constraints on A. argentata AcSp1 repeats result in extreme repeat homogeneity. The maintenance of multiple AcSp1 encoding loci in Argiope genomes supports the hypothesis that Argiope spiders require rapid and efficient protein production to support their prolific use of aciniform silk for prey-wrapping and web-decorating. In addition, multiple gene copies may represent the early stages of spidroin diversification.
Collapse
Affiliation(s)
- R Crystal Chaw
- Department of Biology, University of California, 900 University Avenue, Riverside 92507, Riverside, CA, USA.
| | | | | | | | | | | | | |
Collapse
|
36
|
Novel gammaherpesviruses in North American domestic cats, bobcats, and pumas: identification, prevalence, and risk factors. J Virol 2014; 88:3914-24. [PMID: 24453374 DOI: 10.1128/jvi.03405-13] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED Gammaherpesviruses (GHVs) are a diverse and rapidly expanding group of viruses associated with a variety of disease conditions in humans and animals. To identify felid GHVs, we screened domestic cat (Felis catus), bobcat (Lynx rufus), and puma (Puma concolor) blood cell DNA samples from California, Colorado, and Florida using a degenerate pan-GHV PCR. Additional pan-GHV and long-distance PCRs were used to sequence a contiguous 3.4-kb region of each putative virus species, including partial glycoprotein B and DNA polymerase genes. We identified three novel GHVs, each present predominantly in one felid species: Felis catus GHV 1 (FcaGHV1) in domestic cats, Lynx rufus GHV 1 (LruGHV1) in bobcats, and Puma concolor GHV 1 (PcoGHV1) in pumas. To estimate infection prevalence, we developed real-time quantitative PCR assays for each virus and screened additional DNA samples from all three species (n = 282). FcaGHV1 was detected in 16% of domestic cats across all study sites. LruGHV1 was detected in 47% of bobcats and 13% of pumas across all study sites, suggesting relatively common interspecific transmission. PcoGHV1 was detected in 6% of pumas, all from a specific region of Southern California. The risk of infection for each host varied with geographic location. Age was a positive risk factor for bobcat LruGHV1 infection, and age and being male were risk factors for domestic cat FcaGHV1 infection. Further characterization of these viruses may have significant health implications for domestic cats and may aid studies of free-ranging felid ecology. IMPORTANCE Gammaherpesviruses (GHVs) establish lifelong infection in many animal species and can cause cancer and other diseases in humans and animals. In this study, we identified the DNA sequences of three GHVs present in the blood of domestic cats (Felis catus), bobcats (Lynx rufus), and pumas (Puma concolor; also known as mountain lions, cougars, and panthers). We found that these viruses were closely related to, but distinct from, other known GHVs of animals and represent the first GHVs identified to be native to these feline species. We developed techniques to rapidly and specifically detect the DNA of these viruses in feline blood and found that the domestic cat and bobcat viruses were widespread across the United States. In contrast, puma virus was found only in a specific region of Southern California. Surprisingly, the bobcat virus was also detected in some pumas, suggesting relatively common virus transmission between these species. Adult domestic cats and bobcats were at greater risk for infection than juveniles. Male domestic cats were at greater risk for infection than females. This study identifies three new viruses that are widespread in three feline species, indicates risk factors for infection that may relate to the route of infection, and demonstrates cross-species transmission between bobcats and pumas. These newly identified viruses may have important effects on feline health and ecology.
Collapse
|
37
|
Phillips CJ, Phillips CD, Goecks J, Lessa EP, Sotero-Caio CG, Tandler B, Gannon MR, Baker RJ. Dietary and flight energetic adaptations in a salivary gland transcriptome of an insectivorous bat. PLoS One 2014; 9:e83512. [PMID: 24454705 PMCID: PMC3891661 DOI: 10.1371/journal.pone.0083512] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2013] [Accepted: 11/04/2013] [Indexed: 12/12/2022] Open
Abstract
We hypothesized that evolution of salivary gland secretory proteome has been important in adaptation to insectivory, the most common dietary strategy among Chiroptera. A submandibular salivary gland (SMG) transcriptome was sequenced for the little brown bat, Myotis lucifugus. The likely secretory proteome of 23 genes included seven (RETNLB, PSAP, CLU, APOE, LCN2, C3, CEL) related to M. lucifugus insectivorous diet and metabolism. Six of the secretory proteins probably are endocrine, whereas one (CEL) most likely is exocrine. The encoded proteins are associated with lipid hydrolysis, regulation of lipid metabolism, lipid transport, and insulin resistance. They are capable of processing exogenous lipids for flight metabolism while foraging. Salivary carboxyl ester lipase (CEL) is thought to hydrolyze insect lipophorins, which probably are absorbed across the gastric mucosa during feeding. The other six proteins are predicted either to maintain these lipids at high blood concentrations or to facilitate transport and uptake by flight muscles. Expression of these seven genes and coordinated secretion from a single organ is novel to this insectivorous bat, and apparently has evolved through instances of gene duplication, gene recruitment, and nucleotide selection. Four of the recruited genes are single-copy in the Myotis genome, whereas three have undergone duplication(s) with two of these genes exhibiting evolutionary 'bursts' of duplication resulting in multiple paralogs. Evidence for episodic directional selection was found for six of seven genes, reinforcing the conclusion that the recruited genes have important roles in adaptation to insectivory and the metabolic demands of flight. Intragenic frequencies of mobile- element-like sequences differed from frequencies in the whole M. lucifugus genome. Differences among recruited genes imply separate evolutionary trajectories and that adaptation was not a single, coordinated event.
Collapse
Affiliation(s)
- Carleton J. Phillips
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Caleb D. Phillips
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Jeremy Goecks
- Department of Biology, Emory University, Atlanta, Georgia, United States of America
- Department of Math and Computer Science, Emory University, Atlanta, Georgia, United States of America
| | - Enrique P. Lessa
- Departamento de Ecología y Evolución, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Cibele G. Sotero-Caio
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Bernard Tandler
- Department of Biological Sciences, School of Dental Medicine, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Michael R. Gannon
- Department of Biology, Pennsylvania State University, Altoona College, Altoona, Pennsylvania, United States of America
| | - Robert J. Baker
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| |
Collapse
|
38
|
Debiec R, Christofidou P, Denniff M, Bloomer LD, Bogdanski P, Wojnar L, Musialik K, Charchar FJ, Thompson JR, Waterworth D, Song K, Vollenweider P, Waeber G, Zukowska-Szczechowska E, Samani NJ, Lambert D, Tomaszewski M. Urotensin-II system in genetic control of blood pressure and renal function. PLoS One 2013; 8:e83137. [PMID: 24391740 PMCID: PMC3877024 DOI: 10.1371/journal.pone.0083137] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2013] [Accepted: 10/30/2013] [Indexed: 02/07/2023] Open
Abstract
Urotensin-II controls ion/water homeostasis in fish and vascular tone in rodents. We hypothesised that common genetic variants in urotensin-II pathway genes are associated with human blood pressure or renal function. We performed family-based analysis of association between blood pressure, glomerular filtration and genes of the urotensin-II pathway (urotensin-II, urotensin-II related peptide, urotensin-II receptor) saturated with 28 tagging single nucleotide polymorphisms in 2024 individuals from 520 families; followed by an independent replication in 420 families and 7545 unrelated subjects. The expression studies of the urotensin-II pathway were carried out in 97 human kidneys. Phylogenetic evolutionary analysis was conducted in 17 vertebrate species. One single nucleotide polymorphism (rs531485 in urotensin-II gene) was associated with adjusted estimated glomerular filtration rate in the discovery cohort (p = 0.0005). It showed no association with estimated glomerular filtration rate in the combined replication resource of 8724 subjects from 6 populations. Expression of urotensin-II and its receptor showed strong linear correlation (r = 0.86, p<0.0001). There was no difference in renal expression of urotensin-II system between hypertensive and normotensive subjects. Evolutionary analysis revealed accumulation of mutations in urotensin-II since the divergence of primates and weaker conservation of urotensin-II receptor in primates than in lower vertebrates. Our data suggest that urotensin-II system genes are unlikely to play a major role in genetic control of human blood pressure or renal function. The signatures of evolutionary forces acting on urotensin-II system indicate that it may have evolved towards loss of function since the divergence of primates.
Collapse
Affiliation(s)
- Radoslaw Debiec
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Paraskevi Christofidou
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Matthew Denniff
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Lisa D. Bloomer
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Pawel Bogdanski
- Department of Internal Medicine, Metabolic Disorders and Hypertension, Medical University of Poznan, Poznan, Poland
| | - Lukasz Wojnar
- Department of Urology and Oncological Urology, Medical University of Poznan, Poznan, Poland
| | - Katarzyna Musialik
- Department of Internal Medicine, Metabolic Disorders and Hypertension, Medical University of Poznan, Poznan, Poland
| | - Fadi J. Charchar
- School of Science and Engineering, University of Ballarat, Ballarat, Australia
| | - John R. Thompson
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Dawn Waterworth
- GlaxoSmithKline, Philadelphia, Pennsylvania, United States of America
| | - Kijoung Song
- GlaxoSmithKline, Philadelphia, Pennsylvania, United States of America
| | - Peter Vollenweider
- Department of Medicine, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Gerard Waeber
- Department of Medicine, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Ewa Zukowska-Szczechowska
- Department of Internal Medicine, Diabetology and Nephrology, Medical University of Silesia, Zabrze, Poland
| | - Nilesh J. Samani
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - David Lambert
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
| | - Maciej Tomaszewski
- Department of Cardiovascular Sciences, University of Leicester, Leicester, United Kingdom
- * E-mail:
| |
Collapse
|
39
|
Miyazawa S. Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis. BMC Evol Biol 2013; 13:257. [PMID: 24256155 PMCID: PMC4225520 DOI: 10.1186/1471-2148-13-257] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2013] [Accepted: 11/14/2013] [Indexed: 11/25/2022] Open
Abstract
Background Nucleotide and amino acid substitution tendencies are characteristic of each species, organelle, and protein family. Hence, various empirical amino acid substitution rate matrices have needed to be estimated for phylogenetic analysis: JTT, WAG, and LG for nuclear proteins, mtREV for mitochondrial proteins, cpREV10 and cpREV64 for chloroplast-encoded proteins, and FLU for influenza proteins. On the other hand, in a mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the ratio of fixation depending on the type of amino acid replacement, mutation rates and the strength of selective constraint on amino acids can be tailored to each protein family with additional 11 parameters. As a result, in the evolutionary analysis of codon sequences it outperforms codon substitution models equivalent to empirical amino acid substitution matrices. Is it superior even for amino acid sequences, among which synonymous substitutions cannot be identified? Results Nucleotide mutations are assumed to occur independently of codon positions but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene with a linear function of a given estimate of selective constraints, which were estimated by maximizing the likelihood of an empirical amino acid or codon substitution frequency matrix, each of JTT, WAG, LG, and KHG. It is shown that the mechanistic codon substitution model with the assumption of equal codon usage yields better values of Akaike and Bayesian information criteria for all three phylogenetic trees of mitochondrial, chloroplast, and influenza-A hemagglutinin proteins than the empirical amino acid substitution models with mtREV, cpREV64, and FLU, which were designed specifically for those protein families, respectively. The variation of selective constraint across sites fits the datasets significantly better than variable codon mutation rates, confirming that substitution rate variations across sites detected by amino acid substitution models are caused primarily by the variation of selective constraint against amino acid substitutions rather than the variation of codon mutation rate. Conclusions The mechanistic codon substitution model is superior to amino acid substitution models even in the evolutionary analysis of protein sequences.
Collapse
|
40
|
Lewitus E, Kalinka AT. Neocortical development as an evolutionary platform for intragenomic conflict. Front Neuroanat 2013; 7:2. [PMID: 23576960 PMCID: PMC3620502 DOI: 10.3389/fnana.2013.00002] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Accepted: 03/18/2013] [Indexed: 12/21/2022] Open
Abstract
Embryonic development in mammals has evolved a platform for genomic conflict between mothers and embryos and, by extension, between maternal and paternal genomes. The evolutionary interests of the mother and embryo may be maximized through the promotion of sex-chromosome genes and imprinted alleles, resulting in the rapid evolution of postzygotic phenotypes preferential to either the maternal or paternal genome. In eutherian mammals, extraordinary in utero maternal investment in the brain, and neocortex especially, suggests that convergent evolution of an expanded mammalian neocortex along divergent lineages may be explained, in part, by parent-of-origin-linked gene expression arising from parent-offspring conflict. The influence of this conflict on neocortical development and evolution, however, has not been investigated at the genomic level. In this hypothesis and theory article, we provide preliminary evidence for positive selection in humans in the regions of two platforms of intragenomic conflict—chromosomes 15q11-q13 and X—and explore the potential relevance of cis-regulated imprinted domains to neocortical expansion in mammalian evolution. We present the hypothesis that maternal- and paternal-specific pressures on the developing neocortex compete intragenomically to influence neocortical expansion in mammalian evolution.
Collapse
Affiliation(s)
- Eric Lewitus
- Max Planck Institute of Molecular Cell Biology and Genetics Dresden, Germany
| | | |
Collapse
|
41
|
Wisotzkey RG, Konikoff CE, Newfeld SJ. Hippo pathway phylogenetics predicts monoubiquitylation of Salvador and Merlin/Nf2. PLoS One 2012; 7:e51599. [PMID: 23272121 PMCID: PMC3522738 DOI: 10.1371/journal.pone.0051599] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2012] [Accepted: 11/08/2012] [Indexed: 01/20/2023] Open
Abstract
Recently we employed phylogenetics to predict that the cellular interpretation of TGF-β signals is modulated by monoubiquitylation cycles affecting the Smad4 signal transducer/tumor suppressor. This prediction was subsequently validated by experiments in flies, frogs and mammalian cells. Here we apply a phylogenetic approach to the Hippo pathway and predict that two of its signal transducers, Salvador and Merlin/Nf2 (also a tumor suppressor) are regulated by monoubiquitylation. This regulatory mechanism does not lead to protein degradation but instead serves as a highly efficient “off/on” switch when the protein is subsequently deubiquitylated. Overall, our study shows that the creative application of phylogenetics can predict new roles for pathway components and new mechanisms for regulating intercellular signaling pathways.
Collapse
Affiliation(s)
| | - Charlotte E. Konikoff
- Department of Biology, University of Washington, Seattle, Washington, United States of America
| | - Stuart J. Newfeld
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
- * E-mail:
| |
Collapse
|
42
|
Abstract
Empirical codon models (ECMs) estimated from a large number of globular protein families outperformed mechanistic codon models in their description of the general process of protein evolution. Among other factors, ECMs implicitly model the influence of amino acid properties and multiple nucleotide substitutions (MNS). However, the estimation of ECMs requires large quantities of data, and until recently, only few suitable data sets were available. Here, we take advantage of several new Drosophila species genomes to estimate codon models from genome-wide data. The availability of large numbers of genomes over varying phylogenetic depths in the Drosophila genus allows us to explore various divergence levels. In consequence, we can use these data to determine the appropriate level of divergence for the estimation of ECMs, avoiding overestimation of MNS rates caused by saturation. To account for variation in evolutionary rates along the genome, we develop new empirical codon hidden Markov models (ecHMMs). These models significantly outperform previous ones with respect to maximum likelihood values, suggesting that they provide a better fit to the evolutionary process. Using ECMs and ecHMMs derived from genome-wide data sets, we devise new likelihood ratio tests (LRTs) of positive selection. We found classical LRTs very sensitive to the presence of MNSs, showing high false-positive rates, especially with small phylogenies. The new LRTs are more conservative than the classical ones, having acceptable false-positive rates and reduced power.
Collapse
Affiliation(s)
- Nicola De Maio
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.
| | | | | | | |
Collapse
|
43
|
Liu X, Liu H, Guo W, Yu K. Codon substitution models based on residue similarity and their applications. Gene 2012; 509:136-41. [PMID: 22902303 DOI: 10.1016/j.gene.2012.07.075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 07/31/2012] [Indexed: 10/28/2022]
Abstract
Codon models are now widely used to draw evolutionary inferences from alignments of homologous sequence data. Incorporating physicochemical properties of amino acids into codon models, two novel codon substitution models describing the evolution of protein-coding DNA sequences are presented based on the similarity scores of amino acids. To describe substitutions between codons a continue-time Markov process is used. Transition/transversion rate bias and nonsynonymous codon usage bias are allowed in the models. In our implementation, the parameters are estimated by maximum-likelihood (ML) method as in previous studies. Furthermore, instantaneous mutations involving more than one nucleotide position of a codon are considered in the second model. Then the two suggested models are applied to five real data sets. The analytic results indicate that the new codon models considering physicochemical properties of amino acids can provide a better fit to the data comparing with existing codon models, and then produce more reliable estimates of certain biologically important measures than existing methods.
Collapse
Affiliation(s)
- Xinsheng Liu
- Institute of Nano Science, State Key Laboratory of Mechanics and Control of Mechanical Structures, and College of Science, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China.
| | | | | | | |
Collapse
|
44
|
Scherrer MP, Meyer AG, Wilke CO. Modeling coding-sequence evolution within the context of residue solvent accessibility. BMC Evol Biol 2012; 12:179. [PMID: 22967129 PMCID: PMC3527230 DOI: 10.1186/1471-2148-12-179] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Accepted: 09/03/2012] [Indexed: 11/30/2022] Open
Abstract
Background Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues). Results Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Conclusions Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
Collapse
Affiliation(s)
- Michael P Scherrer
- Center for Computational Biology and Bioinformatics, Institute for Cellular and Molecular Biology, and Section of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA
| | | | | |
Collapse
|
45
|
Gasse B, Silvent J, Sire JY. Evolutionary analysis suggests that AMTN is enamel-specific and a candidate for AI. J Dent Res 2012; 91:1085-9. [PMID: 22968158 DOI: 10.1177/0022034512460551] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Molecular evolutionary analysis is an efficient method to predict and/or validate amino acid substitutions that could lead to a genetic disease and to highlight residues and motifs that could play an important role in the protein structure and/or function. We have applied such analysis to amelotin (AMTN), a recently identified enamel protein in the rat, mouse, and humans. An in silico search for AMTN provided 42 new mammalian sequences that were added to the 3 published sequences with which we performed the analysis using a dataset representative of all lineages (circa 220 million years of evolution), including 2 enamel-less species, sloth and armadillo. During evolution, of the 209 residues of human AMTN, 17 were unchanged and 34 had conserved their chemical properties. Substituting these important residues could lead to amelogenesis imperfecta (AI). Also, AMTN possesses a well-conserved signal peptide, 2 conserved motifs whose function is certainly important but unknown, and a putative phosphorylation site (SXE). In addition, the sequences of the 2 enamel-less species display mutations revealing that AMTN underwent pseudogenization, which suggests that AMTN is an enamel-specific protein.
Collapse
Affiliation(s)
- B Gasse
- Université Pierre et Marie Curie, UMR 7138, EDS research group, 7 quai St-Bernard, Case 5, 75005 Paris, France
| | | | | |
Collapse
|
46
|
Murrell B, de Oliveira T, Seebregts C, Kosakovsky Pond SL, Scheffler K. Modeling HIV-1 drug resistance as episodic directional selection. PLoS Comput Biol 2012; 8:e1002507. [PMID: 22589711 PMCID: PMC3349733 DOI: 10.1371/journal.pcbi.1002507] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Accepted: 03/16/2012] [Indexed: 12/02/2022] Open
Abstract
The evolution of substitutions conferring drug resistance to HIV-1 is both episodic, occurring when patients are on antiretroviral therapy, and strongly directional, with site-specific resistant residues increasing in frequency over time. While methods exist to detect episodic diversifying selection and continuous directional selection, no evolutionary model combining these two properties has been proposed. We present two models of episodic directional selection (MEDS and EDEPS) which allow the a priori specification of lineages expected to have undergone directional selection. The models infer the sites and target residues that were likely subject to directional selection, using either codon or protein sequences. Compared to its null model of episodic diversifying selection, MEDS provides a superior fit to most sites known to be involved in drug resistance, and neither one test for episodic diversifying selection nor another for constant directional selection are able to detect as many true positives as MEDS and EDEPS while maintaining acceptable levels of false positives. This suggests that episodic directional selection is a better description of the process driving the evolution of drug resistance. When exposed to treatment, HIV-1 and other rapidly evolving viruses have the capacity to acquire drug resistance mutations (DRAMs), which limit the efficacy of antivirals. There are a number of experimentally well characterized HIV-1 DRAMs, but many mutations whose roles are not fully understood have also been reported. In this manuscript we construct evolutionary models that identify the locations and targets of mutations conferring resistance to antiretrovirals from viral sequences sampled from treated and untreated individuals. While the evolution of drug resistance is a classic example of natural selection, existing analyses fail to detect the majority of DRAMs. We show that, in order to identify resistance mutations from sequence data, it is necessary to recognize that in this case natural selection is both episodic (it only operates when the virus is exposed to the drugs) and directional (only mutations to a particular amino-acid confer resistance while allowing the virus to continue replicating). The new class of models that allow for the episodic and directional nature of adaptive evolution performs very well at recovering known DRAMs, can be useful at identifying unknown resistance-associated mutations, and is generally applicable to a variety of biological scenarios where similar selective forces are at play.
Collapse
Affiliation(s)
- Ben Murrell
- Biomedical Informatics Research Division, eHealth Research and Innovation Platform, Medical Research Council, Tygerberg, South Africa
- Computer Science Division, Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa
| | - Tulio de Oliveira
- Africa Centre for Health and Population Studies, Nelson R. Mandela School of Medicine, University of KwaZulu-Natal, Durban, South Africa
- Research Department of Infection and Population Health, University College London, London, United Kingdom
| | - Chris Seebregts
- Biomedical Informatics Research Division, eHealth Research and Innovation Platform, Medical Research Council, Tygerberg, South Africa
- School of Computer Science, University of KwaZulu-Natal, Durban, South Africa
| | - Sergei L. Kosakovsky Pond
- Department of Medicine, University of California San Diego, San Diego, California, United States of America
| | - Konrad Scheffler
- Computer Science Division, Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa
- Department of Medicine, University of California San Diego, San Diego, California, United States of America
- * E-mail:
| | | |
Collapse
|
47
|
Pond SLK, Murrell B, Poon AFY. Evolution of viral genomes: interplay between selection, recombination, and other forces. Methods Mol Biol 2012; 856:239-72. [PMID: 22399462 DOI: 10.1007/978-1-61779-585-5_10] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
RNA viruses evolve very rapidly, often recombine, and are subject to strong host (immune response) and anthropogenic (antiretroviral drugs) selective forces. Given their compact and extensively sequenced genomes, comparative analysis of RNA viral data can provide important insights into the molecular mechanisms of adaptation, pathogenicity, immune evasion, and drug resistance. In this chapter, we present an example-based overview of recent advances in evolutionary models and statistical approaches that enable screening viral alignments for evidence of adaptive change in the presence of recombination, detecting bursts of directional adaptive evolution associated with the phenotypic changes, and detecting of coevolving sites in viral genes.
Collapse
|
48
|
Miyazawa S. Advantages of a mechanistic codon substitution model for evolutionary analysis of protein-coding sequences. PLoS One 2011; 6:e28892. [PMID: 22220197 PMCID: PMC3248416 DOI: 10.1371/journal.pone.0028892] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2011] [Accepted: 11/16/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND A mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the average fixation probability depending on the type of amino acid replacement, has advantages over nucleotide, amino acid, and empirical codon substitution models in evolutionary analysis of protein-coding sequences. It can approximate a wide range of codon substitution processes. If no selection pressure on amino acids is taken into account, it will become equivalent to a nucleotide substitution model. If mutation rates are assumed not to depend on the codon type, then it will become essentially equivalent to an amino acid substitution model. Mutation at the nucleotide level and selection at the amino acid level can be separately evaluated. RESULTS The present scheme for single nucleotide mutations is equivalent to the general time-reversible model, but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene in a linear function of a given estimate of selective constraints. Their good estimates are those calculated by maximizing the respective likelihoods of empirical amino acid or codon substitution frequency matrices. Akaike and Bayesian information criteria indicate that the present model performs far better than the other substitution models for all five phylogenetic trees of highly-divergent to highly-homologous sequences of chloroplast, mitochondrial, and nuclear genes. It is also shown that multiple nucleotide changes in infinitesimal time are significant in long branches, although they may be caused by compensatory substitutions or other mechanisms. The variation of selective constraint over sites fits the datasets significantly better than variable mutation rates, except for 10 slow-evolving nuclear genes of 10 mammals. An critical finding for phylogenetic analysis is that assuming variable mutation rates over sites lead to the overestimation of branch lengths.
Collapse
Affiliation(s)
- Sanzo Miyazawa
- Graduate School of Engineering, Gunma University, Kiryu, Gunma, Japan
- * E-mail:
| |
Collapse
|
49
|
Murrell B, Weighill T, Buys J, Ketteringham R, Moola S, Benade G, du Buisson L, Kaliski D, Hands T, Scheffler K. Non-negative matrix factorization for learning alignment-specific models of protein evolution. PLoS One 2011; 6:e28898. [PMID: 22216138 PMCID: PMC3245233 DOI: 10.1371/journal.pone.0028898] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Accepted: 11/16/2011] [Indexed: 12/05/2022] Open
Abstract
Models of protein evolution currently come in two flavors: generalist and specialist. Generalist models (e.g. PAM, JTT, WAG) adopt a one-size-fits-all approach, where a single model is estimated from a number of different protein alignments. Specialist models (e.g. mtREV, rtREV, HIVbetween) can be estimated when a large quantity of data are available for a single organism or gene, and are intended for use on that organism or gene only. Unsurprisingly, specialist models outperform generalist models, but in most instances there simply are not enough data available to estimate them. We propose a method for estimating alignment-specific models of protein evolution in which the complexity of the model is adapted to suit the richness of the data. Our method uses non-negative matrix factorization (NNMF) to learn a set of basis matrices from a general dataset containing a large number of alignments of different proteins, thus capturing the dimensions of important variation. It then learns a set of weights that are specific to the organism or gene of interest and for which only a smaller dataset is available. Thus the alignment-specific model is obtained as a weighted sum of the basis matrices. Having been constrained to vary along only as many dimensions as the data justify, the model has far fewer parameters than would be required to estimate a specialist model. We show that our NNMF procedure produces models that outperform existing methods on all but one of 50 test alignments. The basis matrices we obtain confirm the expectation that amino acid properties tend to be conserved, and allow us to quantify, on specific alignments, how the strength of conservation varies across different properties. We also apply our new models to phylogeny inference and show that the resulting phylogenies are different from, and have improved likelihood over, those inferred under standard models.
Collapse
Affiliation(s)
- Ben Murrell
- Biomedical Informatics Research Division, eHealth Research and Innovation Platform, Medical Research Council, Cape Town, Western Cape, South Africa
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
| | - Thomas Weighill
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
| | - Jan Buys
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
| | | | - Sasha Moola
- University of Cape Town, Cape Town, Western Cape, South Africa
| | - Gerdus Benade
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
| | - Lise du Buisson
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
| | - Daniel Kaliski
- University of Cape Town, Cape Town, Western Cape, South Africa
| | - Tristan Hands
- University of Cape Town, Cape Town, Western Cape, South Africa
| | - Konrad Scheffler
- Stellenbosch University, Stellenbosch, Western Cape, South Africa
- * E-mail:
| |
Collapse
|
50
|
Potent autologous and heterologous neutralizing antibody responses occur in HIV-2 infection across a broad range of infection outcomes. J Virol 2011; 86:930-46. [PMID: 22072758 DOI: 10.1128/jvi.06126-11] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Few studies have explored the role of neutralizing antibody (NAb) responses in controlling HIV-2 viremia and disease progression. Using a TZM-bl neutralization assay, we assessed heterologous and autologous NAb responses from a community cohort of HIV-2-infected individuals with a broad range of disease outcomes in rural Guinea-Bissau. All subjects (n = 40) displayed exceptionally high heterologous NAb titers (50% inhibitory plasma dilution or 50% inhibitory concentration [IC(50)], 1:7,000 to 1:1,000,000) against 5 novel primary HIV-2 envelopes and HIV-2 7312A, whereas ROD A and 3 primary envelopes were relatively resistant to neutralization. Most individuals also showed high autologous NAb against contemporaneous envelopes (78% of plasma-envelope combinations in 69 envelopes from 21 subjects), with IC(50)s above 1:10,000. No association between heterologous or autologous NAb titer and greater control of HIV-2 was found. A subset of envelopes was found to be more resistant to neutralization (by plasma and HIV-2 monoclonal antibodies). These envelopes were isolated from individuals with greater intrapatient sequence diversity and were associated with changes in potential N-linked glycosylation sites but not CD4 independence or CXCR4 use. Plasma collected from up to 15 years previously was able to potently neutralize recent autologous envelopes, suggesting a lack of escape from NAb and the persistence of neutralization-sensitive variants over time, despite significant NAb pressure. We conclude that despite the presence of broad and potent NAb responses in HIV-2-infected individuals, these are not the primary forces behind the dichotomous outcomes observed but reveal a limited capacity for adaptive selection and escape from host immunity in HIV-2 infection.
Collapse
|