1
|
Gupta MK, Vadde R. Next-generation development and application of codon model in evolution. Front Genet 2023; 14:1091575. [PMID: 36777719 PMCID: PMC9911445 DOI: 10.3389/fgene.2023.1091575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open
Abstract
To date, numerous nucleotide, amino acid, and codon substitution models have been developed to estimate the evolutionary history of any sequence/organism in a more comprehensive way. Out of these three, the codon substitution model is the most powerful. These models have been utilized extensively to detect selective pressure on a protein, codon usage bias, ancestral reconstruction and phylogenetic reconstruction. However, due to more computational demanding, in comparison to nucleotide and amino acid substitution models, only a few studies have employed the codon substitution model to understand the heterogeneity of the evolutionary process in a genome-scale analysis. Hence, there is always a question of how to develop more robust but less computationally demanding codon substitution models to get more accurate results. In this review article, the authors attempted to understand the basis of the development of different types of codon-substitution models and how this information can be utilized to develop more robust but less computationally demanding codon substitution models. The codon substitution model enables to detect selection regime under which any gene or gene region is evolving, codon usage bias in any organism or tissue-specific region and phylogenetic relationship between different lineages more accurately than nucleotide and amino acid substitution models. Thus, in the near future, these codon models can be utilized in the field of conservation, breeding and medicine.
Collapse
|
2
|
Pettie N, Llopart A, Comeron JM. Meiotic, genomic and evolutionary properties of crossover distribution in Drosophila yakuba. PLoS Genet 2022; 18:e1010087. [PMID: 35320272 PMCID: PMC8979470 DOI: 10.1371/journal.pgen.1010087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 04/04/2022] [Accepted: 02/09/2022] [Indexed: 12/14/2022] Open
Abstract
The number and location of crossovers across genomes are highly regulated during meiosis, yet the key components controlling them are fast evolving, hindering our understanding of the mechanistic causes and evolutionary consequences of changes in crossover rates. Drosophila melanogaster has been a model species to study meiosis for more than a century, with an available high-resolution crossover map that is, nonetheless, missing for closely related species, thus preventing evolutionary context. Here, we applied a novel and highly efficient approach to generate whole-genome high-resolution crossover maps in D. yakuba to tackle multiple questions that benefit from being addressed collectively within an appropriate phylogenetic framework, in our case the D. melanogaster species subgroup. The genotyping of more than 1,600 individual meiotic events allowed us to identify several key distinct properties relative to D. melanogaster. We show that D. yakuba, in addition to higher crossover rates than D. melanogaster, has a stronger centromere effect and crossover assurance than any Drosophila species analyzed to date. We also report the presence of an active crossover-associated meiotic drive mechanism for the X chromosome that results in the preferential inclusion in oocytes of chromatids with crossovers. Our evolutionary and genomic analyses suggest that the genome-wide landscape of crossover rates in D. yakuba has been fairly stable and captures a significant signal of the ancestral crossover landscape for the whole D. melanogaster subgroup, even informative for the D. melanogaster lineage. Contemporary crossover rates in D. melanogaster, on the other hand, do not recapitulate ancestral crossovers landscapes. As a result, the temporal stability of crossover landscapes observed in D. yakuba makes this species an ideal system for applying population genetic models of selection and linkage, given that these models assume temporal constancy in linkage effects. Our studies emphasize the importance of generating multiple high-resolution crossover rate maps within a coherent phylogenetic context to broaden our understanding of crossover control during meiosis and to improve studies on the evolutionary consequences of variable crossover rates across genomes and time.
Collapse
Affiliation(s)
- Nikale Pettie
- Interdisciplinary Program in Genetics, University of Iowa, Iowa City, Iowa, United States of America
| | - Ana Llopart
- Interdisciplinary Program in Genetics, University of Iowa, Iowa City, Iowa, United States of America
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
| | - Josep M. Comeron
- Interdisciplinary Program in Genetics, University of Iowa, Iowa City, Iowa, United States of America
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
3
|
Jackson B, Charlesworth B. Evidence for a force favoring GC over AT at short intronic sites in Drosophila simulans and Drosophila melanogaster. G3 GENES|GENOMES|GENETICS 2021; 11:6321237. [PMID: 34544137 PMCID: PMC8496279 DOI: 10.1093/g3journal/jkab240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 07/06/2021] [Indexed: 11/13/2022]
Abstract
Population genetics studies often make use of a class of nucleotide site free from selective pressures, in order to make inferences about population size changes or natural selection at other sites. If such neutral sites can be identified, they offer the opportunity to avoid any confounding effects of selection. Here, we investigate evolution at putatively neutrally evolving short intronic sites in natural populations of Drosophila melanogaster and Drosophila simulans, in order to understand the properties of spontaneous mutations and the extent of GC-biased gene conversion in these species. Use of data on the genetics of natural populations is advantageous because it integrates information from large numbers of individuals over long timescales. In agreement with direct evidence from observations of spontaneous mutations in Drosophila, we find a bias in the spectrum of mutations toward AT basepairs. In addition, we find that this bias is stronger in the D. melanogaster lineage than in the D. simulans lineage. The evidence for GC-biased gene conversion in Drosophila has been equivocal. Here, we provide evidence for a weak force favoring GC in both species, which is correlated with the GC content of introns and is stronger in D. simulans than in D. melanogaster.
Collapse
Affiliation(s)
- Ben Jackson
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Brian Charlesworth
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
4
|
Hemmer LW, Dias GB, Smith B, Van Vaerenberghe K, Howard A, Bergman CM, Blumenstiel JP. Hybrid dysgenesis in Drosophila virilis results in clusters of mitotic recombination and loss-of-heterozygosity but leaves meiotic recombination unaltered. Mob DNA 2020; 11:10. [PMID: 32082426 PMCID: PMC7023781 DOI: 10.1186/s13100-020-0205-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 01/28/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) are endogenous mutagens and their harmful effects are especially evident in syndromes of hybrid dysgenesis. In Drosophila virilis, hybrid dysgenesis is a syndrome of incomplete gonadal atrophy that occurs when males with multiple active TE families fertilize females that lack active copies of the same families. This has been demonstrated to cause the transposition of paternally inherited TE families, with gonadal atrophy driven by the death of germline stem cells. Because there are abundant, active TEs in the male inducer genome, that are not present in the female reactive genome, the D. virilis syndrome serves as an excellent model for understanding the effects of hybridization between individuals with asymmetric TE profiles. RESULTS Using the D. virilis syndrome of hybrid dysgenesis as a model, we sought to determine how the landscape of germline recombination is affected by parental TE asymmetry. Using a genotyping-by-sequencing approach, we generated a high-resolution genetic map of D. virilis and show that recombination rate and TE density are negatively correlated in this species. We then contrast recombination events in the germline of dysgenic versus non-dysgenic F1 females to show that the landscape of meiotic recombination is hardly perturbed during hybrid dysgenesis. In contrast, hybrid dysgenesis in the female germline increases transmission of chromosomes with mitotic recombination. Using a de novo PacBio assembly of the D. virilis inducer genome we show that clusters of mitotic recombination events in dysgenic females are associated with genomic regions with transposons implicated in hybrid dysgenesis. CONCLUSIONS Overall, we conclude that increased mitotic recombination is likely the result of early TE activation in dysgenic progeny, but a stable landscape of meiotic recombination indicates that either transposition is ameliorated in the adult female germline or that regulation of meiotic recombination is robust to ongoing transposition. These results indicate that the effects of parental TE asymmetry on recombination are likely sensitive to the timing of transposition.
Collapse
Affiliation(s)
- Lucas W. Hemmer
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045 USA
- Present Address: Department of Biology, University of Rochester, Rochester, NY 14627 USA
| | - Guilherme B. Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602 USA
| | - Brittny Smith
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045 USA
| | - Kelley Van Vaerenberghe
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045 USA
| | - Ashley Howard
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045 USA
| | - Casey M. Bergman
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602 USA
| | - Justin P. Blumenstiel
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045 USA
| |
Collapse
|
5
|
Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations. G3-GENES GENOMES GENETICS 2018; 8:1755-1769. [PMID: 29588382 PMCID: PMC5940166 DOI: 10.1534/g3.117.300512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis show that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution.
Collapse
|
6
|
Jackson BC, Campos JL, Haddrill PR, Charlesworth B, Zeng K. Variation in the Intensity of Selection on Codon Bias over Time Causes Contrasting Patterns of Base Composition Evolution in Drosophila. Genome Biol Evol 2017; 9:102-123. [PMID: 28082609 PMCID: PMC5381600 DOI: 10.1093/gbe/evw291] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2016] [Indexed: 12/11/2022] Open
Abstract
Four-fold degenerate coding sites form a major component of the genome, and are often used to make inferences about selection and demography, so that understanding their evolution is important. Despite previous efforts, many questions regarding the causes of base composition changes at these sites in Drosophila remain unanswered. To shed further light on this issue, we obtained a new whole-genome polymorphism data set from D. simulans. We analyzed samples from the putatively ancestral range of D. simulans, as well as an existing polymorphism data set from an African population of D. melanogaster. By using D. yakuba as an outgroup, we found clear evidence for selection on 4-fold sites along both lineages over a substantial period, with the intensity of selection increasing with GC content. Based on an explicit model of base composition evolution, we suggest that the observed AT-biased substitution pattern in both lineages is probably due to an ancestral reduction in selection intensity, and is unlikely to be the result of an increase in mutational bias towards AT alone. By using two polymorphism-based methods for estimating selection coefficients over different timescales, we show that the selection intensity on codon usage has been rather stable in D. simulans in the recent past, but the long-term estimates in D. melanogaster are much higher than the short-term ones, indicating a continuing decline in selection intensity, to such an extent that the short-term estimates suggest that selection is only active in the most GC-rich parts of the genome. Finally, we provide evidence for complex evolutionary patterns in the putatively neutral short introns, which cannot be explained by the standard GC-biased gene conversion model. These results reveal a dynamic picture of base composition evolution.
Collapse
Affiliation(s)
- Benjamin C Jackson
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Penelope R Haddrill
- Centre for Forensic Science, Department of Pure and Applied Chemistry, University of Strathclyde, Glasgow, United Kingdom
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
7
|
Transition and Transversion Mutations Are Biased towards GC in Transposons of Chilo suppressalis (Lepidoptera: Pyralidae). Genes (Basel) 2016; 7:genes7100072. [PMID: 27669309 PMCID: PMC5083911 DOI: 10.3390/genes7100072] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Revised: 09/13/2016] [Accepted: 09/18/2016] [Indexed: 12/04/2022] Open
Abstract
Transposons are often regulated by their hosts, and as a result, there are transposons with several mutations within their host organisms. To gain insight into the patterns of the variations, nucleotide substitutions and indels of transposons were analysed in Chilo suppressalis Walker. The CsuPLE1.1 is a member of the piggyBac-like element (PLE) family, which belongs to the DNA transposons, and the Csu-Ty3 is a member of the Ty3/gypsy family, which belongs to the RNA transposons. Copies of CsuPLE1.1 and Csu-Ty3 were cloned separately from different C. suppressalis individuals, and then multiple sequence alignments were performed. There were numerous single-base substitutions in CsuPLE1.1 and Csu-Ty3, but only a few insertion and deletion mutations. Similarly, in both transposons, the occurring frequencies of transitions were significantly higher than transversions (p ≤ 0.01). In the single-base substitutions, the most frequently occurring base changes were A→G and T→C in both types of transposons. Additionally, single-base substitution frequencies occurring at positions 1, 2 or 3 (pos1, pos2 or pos3) of a given codon in the element transposase were not significantly different. Both in CsuPLE1.1 and Csu-Ty3, the patterns of nucleotide substitution had the same characteristics and nucleotide mutations were biased toward GC. This research provides a perspective on the understanding of transposon mutation patterns.
Collapse
|
8
|
Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution. Genetics 2015; 200:873-90. [PMID: 25948563 DOI: 10.1534/genetics.115.177386] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 04/28/2015] [Indexed: 01/07/2023] Open
Abstract
Inference of gene sequences in ancestral species has been widely used to test hypotheses concerning the process of molecular sequence evolution. However, the approach may produce spurious results, mainly because using the single best reconstruction while ignoring the suboptimal ones creates systematic biases. Here we implement methods to correct for such biases and use computer simulation to evaluate their performance when the substitution process is nonstationary. The methods we evaluated include parsimony and likelihood using the single best reconstruction (SBR), averaging over reconstructions weighted by the posterior probabilities (AWP), and a new method called expected Markov counting (EMC) that produces maximum-likelihood estimates of substitution counts for any branch under a nonstationary Markov model. We simulated base composition evolution on a phylogeny for six species, with different selective pressures on G+C content among lineages, and compared the counts of nucleotide substitutions recorded during simulation with the inference by different methods. We found that large systematic biases resulted from (i) the use of parsimony or likelihood with SBR, (ii) the use of a stationary model when the substitution process is nonstationary, and (iii) the use of the Hasegawa-Kishino-Yano (HKY) model, which is too simple to adequately describe the substitution process. The nonstationary general time reversible (GTR) model, used with AWP or EMC, accurately recovered the substitution counts, even in cases of complex parameter fluctuations. We discuss model complexity and the compromise between bias and variance and suggest that the new methods may be useful for studying complex patterns of nucleotide substitution in large genomic data sets.
Collapse
|
9
|
Zhou D, Zhang D, Ding G, Shi L, Hou Q, Ye Y, Xu Y, Zhou H, Xiong C, Li S, Yu J, Hong S, Yu X, Zou P, Chen C, Chang X, Wang W, Lv Y, Sun Y, Ma L, Shen B, Zhu C. Genome sequence of Anopheles sinensis provides insight into genetics basis of mosquito competence for malaria parasites. BMC Genomics 2014; 15:42. [PMID: 24438588 PMCID: PMC3901762 DOI: 10.1186/1471-2164-15-42] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 01/16/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Anopheles sinensis is an important mosquito vector of Plasmodium vivax, which is the most frequent and widely distributed cause of recurring malaria throughout Asia, and particularly in China, Korea, and Japan. RESULTS We performed 454 next-generation sequencing and obtained a draft sequence of A. sinensis assembled into scaffolds spanning 220.8 million base pairs. Analysis of this genome sequence, we observed expansion and contraction of several immune-related gene families in anopheline relative to culicine mosquito species. These differences suggest that species-specific immune responses to Plasmodium invasion underpin the biological differences in susceptibility to Plasmodium infection that characterize these two mosquito subfamilies. CONCLUSIONS The A. sinensis genome produced in this study, provides an important resource for analyzing the genetic basis of susceptibility and resistance of mosquitoes to Plasmodium parasites research which will ultimately facilitate the design of urgently needed interventions against this debilitating mosquito-borne disease.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Bo Shen
- Department of Pathogen Biology, Nanjing Medical University, Nanjing, Jiangsu 210029, P,R, China.
| | | |
Collapse
|
10
|
Poh YP, Ting CT, Fu HW, Langley CH, Begun DJ. Population genomic analysis of base composition evolution in Drosophila melanogaster. Genome Biol Evol 2013; 4:1245-55. [PMID: 23160062 PMCID: PMC3542573 DOI: 10.1093/gbe/evs097] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The relative importance of mutation, selection, and biased gene conversion to patterns of base composition variation in Drosophila melanogaster, and to a lesser extent, D. simulans, has been investigated for many years. However, genomic data from sufficiently large samples to thoroughly characterize patterns of base composition polymorphism within species have been lacking. Here, we report a genome-wide analysis of coding and noncoding polymorphism in a large sample of inbred D. melanogaster strains from Raleigh, North Carolina. Consistent with previous results, we observed that AT mutations fix more frequently than GC mutations in D. melanogaster. Contrary to predictions of previous models of codon usage in D. melanogaster, we found that synonymous sites segregating for derived AT polymorphisms were less skewed toward low frequencies compared with sites segregating a derived GC polymorphism. However, no such pattern was observed for comparable base composition polymorphisms in noncoding DNA. These results suggest that AT-ending codons could currently be favored by natural selection in the D. melanogaster lineage.
Collapse
Affiliation(s)
- Yu-Ping Poh
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Taiwan, Republic of China.
| | | | | | | | | |
Collapse
|
11
|
Clemente F, Vogl C. Evidence for complex selection on four-fold degenerate sites in Drosophila melanogaster. J Evol Biol 2012; 25:2582-95. [PMID: 23020078 DOI: 10.1111/jeb.12003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Revised: 08/31/2012] [Accepted: 08/31/2012] [Indexed: 01/04/2023]
Abstract
We considered genome-wide four-fold degenerate sites from an African Drosophila melanogaster population and compared them to short introns. To include divergence and to polarize the data, we used its close relatives Drosophila simulans, Drosophila sechellia, Drosophila erecta and Drosophila yakuba as outgroups. In D. melanogaster, the GC content at four-fold degenerate sites is higher than in short introns; compared to its relatives, more AT than GC is fixed. The former has been explained by codon usage bias (CUB) favouring GC; the latter by decreased intensity of directional selection or by increased mutation bias towards AT. With a biallelic equilibrium model, evidence for directional selection comes mostly from the GC-rich ancestral base composition. Together with a slight mutation bias, it leads to an asymmetry of the unpolarized allele frequency spectrum, from which directional selection is inferred. Using a quasi-equilibrium model and polarized spectra, however, only purifying and no directional selection is detected. Furthermore, polarized spectra are proportional to those of the presumably unselected short introns. As we have no evidence for a decrease in effective population size, relaxed CUB must be due to a reduction in the selection coefficient. Going beyond the biallelic model and considering all four bases, signs of directional selection are stronger. In contrast to short introns, complementary bases show strand specificity and allele frequency spectra depend on mutation directions. Hence, the traditional biallelic model to describe the evolution of four-fold degenerate sites should be replaced by more complex models assuming only quasi-equilibrium and accounting for all four bases.
Collapse
Affiliation(s)
- F Clemente
- Institute of Population Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| | | |
Collapse
|
12
|
Lee Y, Seifert SN, Nieman CC, McAbee RD, Goodell P, Fryxell RT, Lanzaro GC, Cornel AJ. High degree of single nucleotide polymorphisms in California Culex pipiens (Diptera: Culicidae) sensu lato. JOURNAL OF MEDICAL ENTOMOLOGY 2012; 49:299-306. [PMID: 22493847 PMCID: PMC3553656 DOI: 10.1603/me11108] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Resolution of systematic relationships among members of the Culex pipiens (L.) complex has important implications for public health as well as for studies on the evolution of sibling species. Currently held views contend that in California considerable genetic introgression occurs between Cx. pipiens and Cx. quinquefasciatus Say, and as such, these taxa behave as if they are a single species. Development of high throughput SNP genotyping tools for the analysis of Cx. pipiens complex population structure is therefore desirable. As a first step toward this goal, we sequenced 12 gene fragments from specimens collected in Marin and Fresno counties. On average, we found a higher single nucleotide polymorphism (SNP) density than any other mosquito species reported thus far. Coding regions contained significantly higher GC content (median 54.7%) than noncoding regions (42.4%; Wilcoxon rank sum test, P = 5.29 x 10(-5)). Differences in SNP allele frequencies observed between mosquitoes from Marin and Fresno counties indicated significant genetic divergence and suggest that SNP markers will be useful for future detailed population genetic studies of this group. The high density of SNPs highlights the difficulty in identifying species within the complex and may be associated with the large degree of phenotypic variation observed in this group of mosquitoes.
Collapse
Affiliation(s)
- Yoosook Lee
- School of Veterinary Medicine, Department of Pathology, Microbiology and Immunology, University of California-Davis, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Weber CC, Hurst LD. Intronic AT skew is a defendable proxy for germline transcription but does not predict crossing-over or protein evolution rates in Drosophila melanogaster. J Mol Evol 2010; 71:415-26. [PMID: 20938653 DOI: 10.1007/s00239-010-9395-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 09/17/2010] [Indexed: 01/28/2023]
Abstract
Recent evidence suggests that germline transcription may affect both protein evolutionary rates, possibly mediated by repair processes, and recombination rates, possibly mediated by chromatin and epigenetic modification. Here, we test these propositions in Drosophila melanogaster. The challenge for such analyses is to provide defendable measures of germline gene expression. Intronic AT skew is a good candidate measure as it is thought to be a consequence, at least in part, of transcription-coupled repair. Prior evidence suggests that intronic AT skew in D. melanogaster is not affected by proximity to intron extremities and differs between transcribed DNA and flanking sequence. We now also establish that intronic AT skew is a defendable proxy for germline expression as (a) it is more similar than expected by chance between introns of the same gene (which is not accounted for by physical proximity), (b) is correlated with male germline expression, and (c) is more pronounced in broadly expressed genes. Furthermore, (d) a trend for intronic skew to differ between 3' and 5' ends of genes is particular to broadly expressed genes. Finally, (e) controlling for physical distance, introns of proximate genes are most different in skew if they have different tissue specificity. We find that intronic AT skew, employed as a proxy for germline transcription, correlates neither with recombination rates nor with the rate of protein evolution. We conclude that there is no prima facie evidence that germline expression modulates recombination rates or monotonically affects protein evolution rates in D. melanogaster.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, UK
| | | |
Collapse
|
14
|
Zeng K, Charlesworth B. Studying patterns of recent evolution at synonymous sites and intronic sites in Drosophila melanogaster. J Mol Evol 2009; 70:116-28. [PMID: 20041239 DOI: 10.1007/s00239-009-9314-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2009] [Accepted: 12/07/2009] [Indexed: 10/20/2022]
Abstract
Most previous studies of the evolution of codon usage bias (CUB) and intronic GC content (iGC) in Drosophila melanogaster were based on between-species comparisons, reflecting long-term evolutionary events. However, a complete picture of the evolution of CUB and iGC cannot be drawn without knowledge of their more recent evolutionary history. Here, we used a polymorphism dataset collected from Zimbabwe to study patterns of the recent evolution of CUB and iGC. Analyzing coding and intronic data jointly with a model which can simultaneously estimate selection, mutational, and demographic parameters, we have found that: (1) natural selection is probably acting on synonymous codons; (2) a constant population size model seems to be sufficient to explain most of the observed synonymous polymorphism patterns; (3) GC is favored over AT in introns. In agreement with the long-term evolutionary patterns, ongoing selection acting on X-linked synonymous codons is stronger than that acting on autosomal codons. The selective differences between preferred and unpreferred codons tend to be greater than the differences between GC and AT in introns, suggesting that natural selection, not just biased gene conversion, may have influenced the evolution of CUB. Interestingly, evidence for non-equilibrium evolution comes exclusively from the intronic data. However, three different models, an equilibrium model with two classes of selected sites and two non-equilibrium models with changes in either population size or mutational parameters, fit the intronic data equally well. These results show that using inadequate selection (or demographic) models can result in incorrect estimates of demographic (or selection) parameters.
Collapse
Affiliation(s)
- Kai Zeng
- Ashworth Laboratories, School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK.
| | | |
Collapse
|
15
|
Estimating selection intensity on synonymous codon usage in a nonequilibrium population. Genetics 2009; 183:651-62, 1SI-23SI. [PMID: 19620398 DOI: 10.1534/genetics.109.101782] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Codon usage bias is the nonrandom use of synonymous codons for the same amino acid. Most population genetic models of codon usage evolution assume that the population is at mutation-selection-drift equilibrium. Natural populations, however, frequently deviate from equilibrium, often because of recent demographic changes. Here, we construct a matrix model that includes the effects of a recent change in population size on estimates of selection on preferred vs. unpreferred codons. Our results suggest that patterns of synonymous polymorphisms affecting codon usage can be quite erratic after such a change; statistical methods that fail to take demographic effects into account can then give incorrect estimates of important parameters. We propose a new method that can accurately estimate both demographic and codon usage parameters. The method also provides a simple way of testing for the effects of covariates such as gene length and level of gene expression on the intensity of selection, which we apply to a large Drosophila melanogaster polymorphism data set. Our analyses of twofold degenerate codons reveal that (i) selection acts in favor of preferred codons, (ii) there is mutational bias in favor of unpreferred codons, (iii) shorter genes and genes with higher expression levels are under stronger selection, and (iv) there is little evidence for a recent change in population size in the Zimbabwe population of D. melanogaster.
Collapse
|
16
|
Singh ND, Arndt PF, Clark AG, Aquadro CF. Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila. Mol Biol Evol 2009; 26:1591-605. [PMID: 19351792 DOI: 10.1093/molbev/msp071] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Rates of single nucleotide substitution in Drosophila are highly variable within the genome, and several examples illustrate that evolutionary rates differ among Drosophila species as well. Here, we use a maximum likelihood method to quantify lineage-specific substitutional patterns and apply this method to 4-fold degenerate synonymous sites and introns from more than 8,000 genes aligned in the Drosophila melanogaster group. We find that within species, different classes of sequence evolve at different rates, with long introns evolving most slowly and short introns evolving most rapidly. Relative rates of individual single nucleotide substitutions vary approximately 3-fold among lineages, yielding patterns of substitution that are comparatively less GC-biased in the melanogaster species complex relative to Drosophila yakuba and Drosophila erecta. These results are consistent with a model coupling a mutational shift toward reduced GC content, or a shift in mutation-selection balance, in the D. melanogaster species complex, with variation in selective constraint among different classes of DNA sequence. Finally, base composition of coding and intronic sequences is not at equilibrium with respect to substitutional patterns, which primarily reflects the slow rate of the substitutional process. These results thus support the view that mutational and/or selective processes are labile on an evolutionary timescale and that if the process is indeed selection driven, then the distribution of selective constraint is variable across the genome.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Molecular Biology and Genetics, Cornell University.
| | | | | | | |
Collapse
|
17
|
LI MK, GU L, CHEN SS, DAI JQ, TAO SH. Evolution of the isochore structure in the scale of chromosome: insight from the mutation bias and fixation bias. J Evol Biol 2007; 21:173-182. [DOI: 10.1111/j.1420-9101.2007.01455.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Akashi H, Goel P, John A. Ancestral inference and the study of codon bias evolution: implications for molecular evolutionary analyses of the Drosophila melanogaster subgroup. PLoS One 2007; 2:e1065. [PMID: 17957249 PMCID: PMC2020436 DOI: 10.1371/journal.pone.0001065] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2007] [Accepted: 09/21/2007] [Indexed: 11/18/2022] Open
Abstract
Reliable inference of ancestral sequences can be critical to identifying both patterns and causes of molecular evolution. Robustness of ancestral inference is often assumed among closely related species, but tests of this assumption have been limited. Here, we examine the performance of inference methods for data simulated under scenarios of codon bias evolution within the Drosophila melanogaster subgroup. Genome sequence data for multiple, closely related species within this subgroup make it an important system for studying molecular evolutionary genetics. The effects of asymmetric and lineage-specific substitution rates (i.e., varying levels of codon usage bias and departures from equilibrium) on the reliability of ancestral codon usage was investigated. Maximum parsimony inference, which has been widely employed in analyses of Drosophila codon bias evolution, was compared to an approach that attempts to account for uncertainty in ancestral inference by weighting ancestral reconstructions by their posterior probabilities. The latter approach employs maximum likelihood estimation of rate and base composition parameters. For equilibrium and most non-equilibrium scenarios that were investigated, the probabilistic method appears to generate reliable ancestral codon bias inferences for molecular evolutionary studies within the D. melanogaster subgroup. These reconstructions are more reliable than parsimony inference, especially when codon usage is strongly skewed. However, inference biases are considerable for both methods under particular departures from stationarity (i.e., when adaptive evolution is prevalent). Reliability of inference can be sensitive to branch lengths, asymmetry in substitution rates, and the locations and nature of lineage-specific processes within a gene tree. Inference reliability, even among closely related species, can be strongly affected by (potentially unknown) patterns of molecular evolution in lineages ancestral to those of interest.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Institute of Molecular Evolutionary Genetics, Department of Biology, Pennsylvania State University, State College, Pennsylvania, United States of America.
| | | | | |
Collapse
|
19
|
Díaz-Castillo C, Golic KG. Evolution of gene sequence in response to chromosomal location. Genetics 2007; 177:359-74. [PMID: 17890366 PMCID: PMC2013720 DOI: 10.1534/genetics.107.077081] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2007] [Accepted: 06/06/2007] [Indexed: 12/26/2022] Open
Abstract
Evolutionary forces acting on the repetitive DNA of heterochromatin are not constrained by the same considerations that apply to protein-coding genes. Consequently, such sequences are subject to rapid evolutionary change. By examining the Troponin C gene family of Drosophila melanogaster, which has euchromatic and heterochromatic members, we find that protein-coding genes also evolve in response to their chromosomal location. The heterochromatic members of the family show a reduced CG content and increased variation in DNA sequence. We show that the CG reduction applies broadly to the protein-coding sequences of genes located at the heterochromatin:euchromatin interface, with a very strong correlation between CG content and the distance from centric heterochromatin. We also observe a similar trend in the transition from telomeric heterochromatin to euchromatin. We propose that the methylation of DNA is one of the forces driving this sequence evolution.
Collapse
|
20
|
Keller I, Bensasson D, Nichols RA. Transition-transversion bias is not universal: a counter example from grasshopper pseudogenes. PLoS Genet 2007; 3:e22. [PMID: 17274688 PMCID: PMC1790724 DOI: 10.1371/journal.pgen.0030022] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2006] [Accepted: 12/18/2006] [Indexed: 12/02/2022] Open
Abstract
Comparisons of the DNA sequences of metazoa show an excess of transitional over transversional substitutions. Part of this bias is due to the relatively high rate of mutation of methylated cytosines to thymine. Postmutation processes also introduce a bias, particularly selection for codon-usage bias in coding regions. It is generally assumed, however, that there is a universal bias in favour of transitions over transversions, possibly as a result of the underlying chemistry of mutation. Surprisingly, this underlying trend has been evaluated only in two types of metazoan, namely Drosophila and the Mammalia. Here, we investigate a third group, and find no such bias. We characterize the point substitution spectrum in Podisma pedestris, a grasshopper species with a very large genome. The accumulation of mutations was surveyed in two pseudogene families, nuclear mitochondrial and ribosomal DNA sequences. The cytosine-guanine (CpG) dinucleotides exhibit the high transition frequencies expected of methylated sites. The transition rate at other cytosine residues is significantly lower. After accounting for this methylation effect, there is no significant difference between transition and transversion rates. These results contrast with reports from other taxa and lead us to reject the hypothesis of a universal transition/transversion bias. Instead we suggest fundamental interspecific differences in point substitution processes. Some mutations occur more frequently than others. We need to understand these biases if we are to interpret the differences that have accumulated between species and individuals. Applications include estimating the time since evolutionary lineages diverged and detecting the signature of natural selection in DNA sequences. However, mutational biases have been obscured because, since mutations arose, natural selection has eliminated some whilst allowing others to persist to the present. We therefore study mutations that have accumulated in regions of the genome that are free from selection in a grasshopper with a gigantic genome. All other animal studies using this approach find an excess of mutations between DNA bases having similar biochemical properties (transitions rather than transversions). This bias has been widely interpreted as a consequence of the fundamental biochemical basis of mutation. However, once we exclude mutations associated with DNA methylation, we find no evidence of a transition bias, unlike the few comparable animal studies that make the same correction. We propose that this result indicates previously unanticipated differences between species in the selection on or mutation of their DNA.
Collapse
Affiliation(s)
- Irene Keller
- School of Biological and Chemical Sciences, Queen Mary, University of London, London, United Kingdom.
| | | | | |
Collapse
|
21
|
Cirulli ET, Kliman RM, Noor MAF. Fine-scale crossover rate heterogeneity in Drosophila pseudoobscura. J Mol Evol 2006; 64:129-35. [PMID: 17160365 DOI: 10.1007/s00239-006-0142-7] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2006] [Accepted: 10/03/2006] [Indexed: 12/26/2022]
Abstract
Broad-scale differences in crossover rate across the genome have been characterized in most genomes studied. Fine-scale differences, however, have only been examined in a few taxa, such as Arabidopsis, yeast, humans, and mice. No prior studies have directly looked for fine-scale recombination rate heterogeneity in Drosophila. We produced 370 Drosophila pseudoobscura containing a crossover event within the 2-megabase (MB) region between the genes yellow and white. We then examined 19 intervals within this region and determined where the crossovers occurred. We found that recombination events occur nonrandomly on a small scale and that mild "hotspots" of a few kilobases exist in Drosophila. Among the regions studied, recombination rates varied from 1.4 to 52 cM/MB. We also observed a trend toward high codon bias in regions of high recombination. Finally, we identified a significantly positive correlation between recombination rate and simple repeats, as well as the motif CACAC. These sequence features may contribute to broad-scale variation in crossover rate and, thus, shed light on features associated with crossover rate heterogeneity at a genome-wide scale.
Collapse
|
22
|
Chen JF, Lu F, Chen SS, Tao SH. Significant positive correlation between the recombination rate and GC content in the human pseudoautosomal region. Genome 2006; 49:413-9. [PMID: 16767166 DOI: 10.1139/g05-124] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
This paper establishes that recombination drives the evolution of GC content in a significant way. Because the human P-arm pseudoautosomal region (PAR1) has been shown to have a high recombination rate, at least 20-fold more frequent than the genomic average of approximately 1 cM/Mb, this region provides an ideal system to study the role of recombination in the evolution of base composition. Nine non-coding regions of PAR1 are analyzed in this study. We have observed a highly significant positive correlation between the recombination rate and GC content (rho = 0.837, p < or = 0.005). Five regions that lie in the distal part of PAR1 are shown to be significantly higher than genomic average divergence. By comparing the intra- and inter-specific AT->GC -GC->AT ratios, we have detected no fixation bias toward GC alleles except for L254915, which has excessive AT-->GC changes in the human lineage. Thus, we conclude that the high GC content of the PAR1 genes better fits the biased gene conversion (BGC) model.
Collapse
Affiliation(s)
- Jin-Feng Chen
- Institute of Bioinformatics, Northwest Agriculture and Forest University, Yangling, Shaanxi, China
| | | | | | | |
Collapse
|
23
|
Bierne N, Eyre-Walker A. Variation in synonymous codon use and DNA polymorphism within the Drosophila genome. J Evol Biol 2006; 19:1-11. [PMID: 16405571 DOI: 10.1111/j.1420-9101.2005.00996.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A strong negative correlation between the rate of amino-acid substitution and codon usage bias in Drosophila has been attributed to interference between positive selection at nonsynonymous sites and weak selection on codon usage. To further explore this possibility we have investigated polymorphism and divergence at three kinds of sites: synonymous, nonsynonymous and intronic in relation to codon bias in D. melanogaster and D. simulans. We confirmed that protein evolution is one of the main explicative parameters for interlocus codon bias variation (r(2) approximately 40%). However, intron or synonymous diversities, which could have been expected to be good indicators of local interference [here defined as the additional increase of drift due to selection on tightly linked sites, also called 'genetic draft' by Gillespie (2000)] did not covary significantly with codon bias or with protein evolution. Concurrently, levels of polymorphism were reduced in regions of low recombination rates whereas codon bias was not. Finally, while nonsynonymous diversities were very well correlated between species, neither synonymous nor intron diversities observed in D. melanogaster were correlated with those observed in D. simulans. All together, our results suggest that the selective constraint on the protein is a stable component of gene evolution while local interference is not. The pattern of variation in genetic draft along the genome therefore seems to be instable through evolutionary times and should therefore be considered as a minor determinant of codon bias variance. We argue that selective constraints for optimal codon usage are likely to be correlated with selective constraints on the protein, both between codons within a gene, as previously suggested, and also between genes within a genome.
Collapse
Affiliation(s)
- N Bierne
- Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton, UK.
| | | |
Collapse
|
24
|
Ko WY, Piao S, Akashi H. Strong regional heterogeneity in base composition evolution on the Drosophila X chromosome. Genetics 2006; 174:349-62. [PMID: 16547109 PMCID: PMC1569809 DOI: 10.1534/genetics.105.054346] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2005] [Accepted: 05/08/2006] [Indexed: 11/18/2022] Open
Abstract
Fluctuations in base composition appear to be prevalent in Drosophila and mammal genome evolution, but their timescale, genomic breadth, and causes remain obscure. Here, we study base composition evolution within the X chromosomes of Drosophila melanogaster and five of its close relatives. Substitutions were inferred on six extant and two ancestral lineages for 14 near-telomeric and 9 nontelomeric genes. GC content evolution is highly variable both within the genome and within the phylogenetic tree. In the lineages leading to D. yakuba and D. orena, GC content at silent sites has increased rapidly near telomeres, but has decreased in more proximal (nontelomeric) regions. D. orena shows a 17-fold excess of GC-increasing vs. AT-increasing synonymous changes within a small (approximately 130-kb) region close to the telomeric end. Base composition changes within introns are consistent with changes in mutation patterns, but stronger GC elevation at synonymous sites suggests contributions of natural selection or biased gene conversion. The Drosophila yakuba lineage shows a less extreme elevation of GC content distributed over a wider genetic region (approximately 1.2 Mb). A lack of change in GC content for most introns within this region suggests a role of natural selection in localized base composition fluctuations.
Collapse
Affiliation(s)
- Wen-Ya Ko
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
25
|
Pollard DA, Iyer VN, Moses AM, Eisen MB. Widespread discordance of gene trees with species tree in Drosophila: evidence for incomplete lineage sorting. PLoS Genet 2006; 2:e173. [PMID: 17132051 PMCID: PMC1626107 DOI: 10.1371/journal.pgen.0020173] [Citation(s) in RCA: 254] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2005] [Accepted: 08/28/2006] [Indexed: 11/19/2022] Open
Abstract
The phylogenetic relationship of the now fully sequenced species Drosophila erecta and D. yakuba with respect to the D. melanogaster species complex has been a subject of controversy. All three possible groupings of the species have been reported in the past, though recent multi-gene studies suggest that D. erecta and D. yakuba are sister species. Using the whole genomes of each of these species as well as the four other fully sequenced species in the subgenus Sophophora, we set out to investigate the placement of D. erecta and D. yakuba in the D. melanogaster species group and to understand the cause of the past incongruence. Though we find that the phylogeny grouping D. erecta and D. yakuba together is the best supported, we also find widespread incongruence in nucleotide and amino acid substitutions, insertions and deletions, and gene trees. The time inferred to span the two key speciation events is short enough that under the coalescent model, the incongruence could be the result of incomplete lineage sorting. Consistent with the lineage-sorting hypothesis, substitutions supporting the same tree were spatially clustered. Support for the different trees was found to be linked to recombination such that adjacent genes support the same tree most often in regions of low recombination and substitutions supporting the same tree are most enriched roughly on the same scale as linkage disequilibrium, also consistent with lineage sorting. The incongruence was found to be statistically significant and robust to model and species choice. No systematic biases were found. We conclude that phylogenetic incongruence in the D. melanogaster species complex is the result, at least in part, of incomplete lineage sorting. Incomplete lineage sorting will likely cause phylogenetic incongruence in many comparative genomics datasets. Methods to infer the correct species tree, the history of every base in the genome, and comparative methods that control for and/or utilize this information will be valuable advancements for the field of comparative genomics. To take full advantage of the growing number of genome sequences from different organisms, it is necessary to understand the evolutionary relationships (phylogeny) between organisms. Unfortunately, phylogenies inferred from individual genes often conflict, reflecting either poor inferences or real variation in the history of genes. In this study, the authors examine relationships within the Drosophila melanogaster species subgroup, a group of flies with three fully sequenced species in which phylogeny has been a source of controversy. Although the bulk of the data support a phylogeny with Drosophila melanogaster as an outgroup to sister species Drosophila erecta and Drosophila yakuba, large portions of their genes support alternative phylogenies. According to the authors, the most plausible explanation for these observations is that polymorphisms in the ancestral population were maintained during the two rapid speciation events that led to these species. Subsequent to speciation, polymorphisms were randomly fixed in each species, and in some cases non-sister species fixed the same ancestral polymorphisms, while sister species did not. In these cases the genes are correctly inferred to have conflicting phylogenies. The authors note that rapid speciation events will often lead to such conflict, which needs to be accounted for in evolutionary analyses.
Collapse
Affiliation(s)
- Daniel A Pollard
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
| | - Venky N Iyer
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Alan M Moses
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
| | - Michael B Eisen
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- Department of Genome Sciences, Genomics Division, Ernest Orlando Lawrence Berkeley National Lab, Berkeley, California, United States of America
- Center for Integrative Genomics, University of California Berkeley, Berkeley, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
26
|
Akashi H, Ko WY, Piao S, John A, Goel P, Lin CF, Vitins AP. Molecular evolution in the Drosophila melanogaster species subgroup: frequent parameter fluctuations on the timescale of molecular divergence. Genetics 2005; 172:1711-26. [PMID: 16387879 PMCID: PMC1456288 DOI: 10.1534/genetics.105.049676] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although mutation, genetic drift, and natural selection are well established as determinants of genome evolution, the importance (frequency and magnitude) of parameter fluctuations in molecular evolution is less understood. DNA sequence comparisons among closely related species allow specific substitutions to be assigned to lineages on a phylogenetic tree. In this study, we compare patterns of codon usage and protein evolution in 22 genes (>11,000 codons) among Drosophila melanogaster and five relatives within the D. melanogaster subgroup. We assign changes to eight lineages using a maximum-likelihood approach to infer ancestral states. Uncertainty in ancestral reconstructions is taken into account, at least to some extent, by weighting reconstructions by their posterior probabilities. Four of the eight lineages show potentially genomewide departures from equilibrium synonymous codon usage; three are decreasing and one is increasing in major codon usage. Several of these departures are consistent with lineage-specific changes in selection intensity (selection coefficients scaled to effective population size) at silent sites. Intron base composition and rates and patterns of protein evolution are also heterogeneous among these lineages. The magnitude of forces governing silent, intron, and protein evolution appears to have varied frequently, and in a lineage-specific manner, within the D. melanogaster subgroup.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| | | | | | | | | | | | | |
Collapse
|
27
|
Drouaud J, Camilleri C, Bourguignon PY, Canaguier A, Bérard A, Vezon D, Giancola S, Brunel D, Colot V, Prum B, Quesneville H, Mézard C. Variation in crossing-over rates across chromosome 4 of Arabidopsis thaliana reveals the presence of meiotic recombination "hot spots". Genome Res 2005; 16:106-14. [PMID: 16344568 PMCID: PMC1356134 DOI: 10.1101/gr.4319006] [Citation(s) in RCA: 144] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Crossover (CO) is a key process for the accurate segregation of homologous chromosomes during the first meiotic division. In most eukaryotes, meiotic recombination is not homogeneous along the chromosomes, suggesting a tight control of the location of recombination events. We genotyped 71 single nucleotide polymorphisms (SNPs) covering the entire chromosome 4 of Arabidopsis thaliana on 702 F2 plants, representing 1404 meioses and allowing the detection of 1171 COs, to study CO localization in a higher plant. The genetic recombination rates varied along the chromosome from 0 cM/Mb near the centromere to 20 cM/Mb on the short arm next to the NOR region, with a chromosome average of 4.6 cM/Mb. Principal component analysis showed that CO rates negatively correlate with the G+C content (P = 3x10(-4)), in contrast to that reported in other eukaryotes. COs also significantly correlate with the density of single repeats and the CpG ratio, but not with genes, pseudogenes, transposable elements, or dispersed repeats. Chromosome 4 has, on average, 1.6 COs per meiosis, and these COs are subjected to interference. A detailed analysis of several regions having high CO rates revealed "hot spots" of meiotic recombination contained in small fragments of a few kilobases. Both the intensity and the density of these hot spots explain the variation of CO rates along the chromosome.
Collapse
Affiliation(s)
- Jan Drouaud
- Station de Génétique et d'Amélioration des Plantes, Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique, 78026, Versailles cedex, France
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Abstract
The study of base composition evolution in Drosophila has been achieved mostly through the analysis of coding sequences. Third codon position GC content, however, is influenced by both neutral forces (e.g., mutation bias) and natural selection for codon usage optimization. In this article, large data sets of noncoding DNA sequence polymorphism in D. melanogaster and D. simulans were gathered from public databases to try to disentangle these two factors-noncoding sequences are not affected by selection for codon usage. Allele frequency analyses revealed an asymmetric pattern of AT vs. GC noncoding polymorphisms: AT --> GC mutations are less numerous, and tend to segregate at a higher frequency, than GC --> AT ones, especially at GC-rich loci. This is indicative of nonstationary evolution of base composition and/or of GC-biased allele transmission. Fitting population genetics models to the allele frequency spectra confirmed this result and favored the hypothesis of a biased transmission. These results, together with previous reports, suggest that GC-biased gene conversion has influenced base composition evolution in Drosophila and explain the correlation between intron and exon GC content.
Collapse
Affiliation(s)
- Nicolas Galtier
- UMR 5171, "Génome, Populations, Interactions, Adaptation," CNRS, Université Montpellier 2, IFREMER, 34095 Montpellier, France.
| | | | | |
Collapse
|
29
|
Benovoy D, Morris RT, Morin A, Drouin G. Ectopic Gene Conversions Increase the G + C Content of Duplicated Yeast and Arabidopsis Genes. Mol Biol Evol 2005; 22:1865-8. [PMID: 15917495 DOI: 10.1093/molbev/msi176] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Allelic recombination has previously been shown to increase the GC-content of the sequences of a wide variety of eukaryotic species. Ectopic recombination between clustered tandemly repeated genes has also been shown to increase their GC-content. Here we show that gene conversions between the dispersed genes found in the duplicated regions of the yeast and Arabidopsis genomes also increase their GC-content when these genes are more than 88% similar.
Collapse
Affiliation(s)
- David Benovoy
- Département de biologie, Université d'Ottawa, Ottawa, Ontario, Canada
| | | | | | | |
Collapse
|
30
|
Wernegreen JJ, Funk DJ. Mutation exposed: a neutral explanation for extreme base composition of an endosymbiont genome. J Mol Evol 2005; 59:849-58. [PMID: 15599516 DOI: 10.1007/s00239-003-0192-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2003] [Accepted: 06/29/2004] [Indexed: 10/26/2022]
Abstract
The influence of neutral mutation pressure versus selection on base composition evolution is a subject of considerable controversy. Yet the present study represents the first explicit population genetic analysis of this issue in prokaryotes, the group in which base composition variation is most dramatic. Here, we explore the impact of mutation and selection on the dynamics of synonymous changes in Buchnera aphidicola, the AT-rich bacterial endosymbiont of aphids. Specifically, we evaluated three forms of evidence. (i) We compared the frequencies of directional base changes (AT-->GC vs. GC-->AT) at synonymous sites within and between Buchnera species, to test for selective preference versus effective neutrality of these mutational categories. Reconstructed mutational changes across a robust intraspecific phylogeny showed a nearly 1:1 AT-->GC:GC-->AT ratio. Likewise, stationarity of base composition among Buchnera species indicated equal rates of AT-->GC and GC-->AT substitutions. The similarity of these patterns within and between species supported the neutral model. (ii) We observed an equivalence of relative per-site AT mutation rate and current AT content at synonymous sites, indicating that base composition is at mutational equilibrium. (iii) We demonstrated statistically greater equality in the frequency of mutational categories in Buchnera than in parallel mammalian studies that documented selection on synonymous sites. Our results indicate that effectively neutral mutational pressure, rather than selection, represents the major force driving base composition evolution in Buchnera. Thus they further corroborate recent evidence for the critical role of reduced N(e) in the molecular evolution of bacterial endosymbionts.
Collapse
Affiliation(s)
- Jennifer J Wernegreen
- Josephine Bay Paul Center for Comparative Molecular Biology & Evolution, The Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.
| | | |
Collapse
|
31
|
DuMont VB, Fay JC, Calabrese PP, Aquadro CF. DNA variability and divergence at the notch locus in Drosophila melanogaster and D. simulans: a case of accelerated synonymous site divergence. Genetics 2005; 167:171-85. [PMID: 15166145 PMCID: PMC1470868 DOI: 10.1534/genetics.167.1.171] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
DNA diversity in two segments of the Notch locus was surveyed in four populations of Drosophila melanogaster and two of D. simulans. In both species we observed evidence of non-steady-state evolution. In D. simulans we observed a significant excess of intermediate frequency variants in a non-African population. In D. melanogaster we observed a disparity between levels of sequence polymorphism and divergence between one of the Notch regions sequenced and other neutral X chromosome loci. The striking feature of the data is the high level of synonymous site divergence at Notch, which is the highest reported to date. To more thoroughly investigate the pattern of synonymous site evolution between these species, we developed a method for calibrating preferred, unpreferred, and equal synonymous substitutions by the effective (potential) number of such changes. In D. simulans, we find that preferred changes per "site" are evolving significantly faster than unpreferred changes at Notch. In contrast we observe a significantly faster per site substitution rate of unpreferred changes in D. melanogaster at this locus. These results suggest that positive selection, and not simply relaxation of constraint on codon bias, has contributed to the higher levels of unpreferred divergence along the D. melanogaster lineage at Notch.
Collapse
Affiliation(s)
- Vanessa Bauer DuMont
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | | | | | | |
Collapse
|
32
|
Zhang Z, Kishino H. Genomic background predicts the fate of duplicated genes: evidence from the yeast genome. Genetics 2005; 166:1995-9. [PMID: 15126414 PMCID: PMC1470803 DOI: 10.1534/genetics.166.4.1995] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Gene duplication with subsequent divergence plays a central role in the acquisition of genes with novel function and complexity during the course of evolution. With reduced functional constraints or through positive selection, these duplicated genes may experience accelerated evolution. Under the model of subfunctionalization, loss of subfunctions leads to complementary acceleration at sites with two copies, and the difference in average rate between the sequences may not be obvious. On the other hand, the classical model of neofunctionalization predicts that the evolutionary rate in one of the two duplicates is accelerated. However, the classical model does not tell which of the duplicates experiences the acceleration in evolutionary rate. Here, we present evidence from the Saccharomyces cerevisiae genome that a duplicate located in a genomic region with a low-recombination rate is likely to evolve faster than a duplicate in an area of high recombination. This observation is consistent with population genetics theory that predicts that purifying selection is less effective in genomic regions of low recombination (Hill-Robertson effect). Together with previous studies, our results suggest the genomic background (e.g., local recombination rate) as a potential force to drive the divergence between nontandemly duplicated genes. This implies the importance of structure and complexity of genomes in the diversification of organisms via gene duplications.
Collapse
Affiliation(s)
- Ze Zhang
- Laboratory of Biometrics and Bioinformatics, Graduate School of Agriculture and Life Sciences, University of Tokyo, Tokyo 113-8657, Japan
| | | |
Collapse
|
33
|
Kern AD, Begun DJ. Patterns of Polymorphism and Divergence from Noncoding Sequences of Drosophila melanogaster and D. simulans: Evidence for Nonequilibrium Processes. Mol Biol Evol 2004; 22:51-62. [PMID: 15456897 DOI: 10.1093/molbev/msh269] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Despite the fact that D. melanogaster and D. simulans have been the central model system for molecular population genetics, few data are available for noncoding regions. Here, we present an analysis of population genetic data from intergenic regions and comparisons of these data to previously collected data from introns and exons. Polymorphisms and fixations were categorized as A/T to G/C or G/C to A/T changes and were polarized by inferring the ancestral state using both parsimony and maximum likelihood. Noncoding fixations in both D. melanogaster and D. simulans were consistent with equilibrium base-composition evolution. However, polarized noncoding polymorphisms, revealed a different pattern. Although A/T to G/C and G/C to A/T polymorphisms in D. simulans were consistent with equilibrium, we observed a highly significant dearth of A/T to G/C polymorphisms in D. melanogaster introns but not in intergenic sequences. Such data could be explained by recent evolution of mutational biases associated with transcription or by lineage-specific selection on base composition. These data reveal the complexity of evolutionary processes acting even on noncoding DNA in Drosophila.
Collapse
Affiliation(s)
- Andrew D Kern
- Center for Population Biology, University of California, Davis, USA.
| | | |
Collapse
|
34
|
Ko WY, David RM, Akashi H. Molecular phylogeny of the Drosophila melanogaster species subgroup. J Mol Evol 2004; 57:562-73. [PMID: 14738315 DOI: 10.1007/s00239-003-2510-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2002] [Accepted: 06/02/2003] [Indexed: 11/30/2022]
Abstract
Although molecular and phenotypic evolution have been studied extensively in Drosophila melanogaster and its close relatives, phylogenetic relationships within the D. melanogaster species subgroup remain unresolved. In particular, recent molecular studies have not converged on the branching orders of the D. yakuba-D. teissieri and D. erecta-D. orena species pairs relative to the D. melanogaster-D. simulans-D. mauritiana-D. sechellia species complex. Here, we reconstruct the phylogeny of the melanogaster species subgroup using DNA sequence data from four nuclear genes. We have employed "vectorette PCR" to obtain sequence data for orthologous regions of the Alcohol dehydrogenase (Adh), Alcohol dehydrogenase related (Adhr), Glucose dehydrogenase (Gld), and rosy (ry) genes (totaling 7164 bp) from six melanogaster subgroup species (D. melanogaster, D. simulans, D. teissieri, D. yakuba, D. erecta, and D. orena) and three species from subgroups outside the melanogaster species subgroup [D. eugracilis (eugracilis subgroup), D. mimetica (suzukii subgroup), and D. lutescens (takahashii subgroup)]. Relationships within the D. simulans complex are not addressed. Phylogenetic analyses employing maximum parsimony, neighbor-joining, and maximum likelihood methods strongly support a D. yakuba-D. teissieri and D. erecta-D. orena clade within the melanogaster species subgroup. D. eugracilis is grouped closer to the melanogaster subgroup than a D. mimetica-D. lutescens clade. This tree topology is supported by reconstructions employing simple (single parameter) and more complex (nonreversible) substitution models.
Collapse
Affiliation(s)
- Wen-Ya Ko
- Institute of Molecular Evolutionary Genetics and Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
35
|
Zhang Z, Kishino H. Genomic Background Predicts the Fate of Duplicated Genes: Evidence From the Yeast Genome. Genetics 2004. [DOI: 10.1093/genetics/166.4.1995] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Gene duplication with subsequent divergence plays a central role in the acquisition of genes with novel function and complexity during the course of evolution. With reduced functional constraints or through positive selection, these duplicated genes may experience accelerated evolution. Under the model of subfunctionalization, loss of subfunctions leads to complementary acceleration at sites with two copies, and the difference in average rate between the sequences may not be obvious. On the other hand, the classical model of neofunctionalization predicts that the evolutionary rate in one of the two duplicates is accelerated. However, the classical model does not tell which of the duplicates experiences the acceleration in evolutionary rate. Here, we present evidence from the Saccharomyces cerevisiae genome that a duplicate located in a genomic region with a low-recombination rate is likely to evolve faster than a duplicate in an area of high recombination. This observation is consistent with population genetics theory that predicts that purifying selection is less effective in genomic regions of low recombination (Hill-Robertson effect). Together with previous studies, our results suggest the genomic background (e.g., local recombination rate) as a potential force to drive the divergence between nontandemly duplicated genes. This implies the importance of structure and complexity of genomes in the diversification of organisms via gene duplications.
Collapse
Affiliation(s)
- Ze Zhang
- Laboratory of Biometrics and Bioinformatics, Graduate School of Agriculture and Life Sciences, University of Tokyo, Tokyo 113-8657, Japan
- Institute for Bioinformatics Research and Development, Japan Science and Technology Agency, Tokyo 102-0081, Japan
| | - Hirohisa Kishino
- Laboratory of Biometrics and Bioinformatics, Graduate School of Agriculture and Life Sciences, University of Tokyo, Tokyo 113-8657, Japan
| |
Collapse
|
36
|
Bachtrog D. Protein Evolution and Codon Usage Bias on the Neo-Sex Chromosomes of Drosophila miranda. Genetics 2003; 165:1221-32. [PMID: 14668377 PMCID: PMC1462847 DOI: 10.1093/genetics/165.3.1221] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Abstract
The neo-sex chromosomes of Drosophila miranda constitute an ideal system to study the effects of recombination on patterns of genome evolution. Due to a fusion of an autosome with the Y chromosome, one homolog is transmitted clonally. Here, I compare patterns of molecular evolution of 18 protein-coding genes located on the recombining neo-X and their homologs on the nonrecombining neo-Y chromosome. The rate of protein evolution has significantly increased on the neo-Y lineage since its formation. Amino acid substitutions are accumulating uniformly among neo-Y-linked genes, as expected if all loci on the neo-Y chromosome suffer from a reduced effectiveness of natural selection. In contrast, there is significant heterogeneity in the rate of protein evolution among neo-X-linked genes, with most loci being under strong purifying selection and two genes showing evidence for adaptive evolution. This observation agrees with theory predicting that linkage limits adaptive protein evolution. Both the neo-X and the neo-Y chromosome show an excess of unpreferred codon substitutions over preferred ones and no difference in this pattern was observed between the chromosomes. This suggests that there has been little or no selection maintaining codon bias in the D. miranda lineage. A change in mutational bias toward AT substitutions also contributes to the decline in codon bias. The contrast in patterns of molecular evolution between amino acid mutations and synonymous mutations on the neo-sex-linked genes can be understood in terms of chromosome-specific differences in effective population size and the distribution of selective effects of mutations.
Collapse
Affiliation(s)
- Doris Bachtrog
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| |
Collapse
|
37
|
Abstract
Classical genetic studies show that gene conversion can favour some alleles over others. Molecular experiments suggest that gene conversion could favour GC over AT basepairs, leading to the concept of biased gene conversion towards GC (BGC(GC)). The expected consequence of such a process is the GC-enrichment of DNA sequences under gene conversion. Recent genomic work suggests that BGC(GC) affects the base composition of yeast, invertebrate and mammalian genomes. Hypotheses for the mechanisms and evolutionary origin of such a strange phenomenon have been proposed. Most BGC(GC) events probably occur during meiosis, which has implications for our understanding of the evolution of sex and recombination.
Collapse
Affiliation(s)
- Gabriel Marais
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, Scotland, UK.
| |
Collapse
|
38
|
Abstract
Changes in technology in the past decade have had such an impact on the way that molecular evolution research is done that it is difficult now to imagine working in a world without genomics or the Internet. In 1992, GenBank was less than a hundredth of its current size and was updated every three months on a huge spool of tape. Homology searches took 30 minutes and rarely found a hit. Now it is difficult to find sequences with only a few homologs to use as examples for teaching bioinformatics. For molecular evolution researchers, the genomics revolution has showered us with raw data and the information revolution has given us the wherewithal to analyze it. In broad terms, the most significant outcome from these changes has been our newfound ability to examine the evolution of genomes as a whole, enabling us to infer genome-wide evolutionary patterns and to identify subsets of genes whose evolution has been in some way atypical.
Collapse
Affiliation(s)
- Kenneth H Wolfe
- Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland.
| | | |
Collapse
|
39
|
Begun DJ, Whitley P. Molecular population genetics of Xdh and the evolution of base composition in Drosophila. Genetics 2002; 162:1725-35. [PMID: 12524344 PMCID: PMC1462376 DOI: 10.1093/genetics/162.4.1725] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Few loci have been measured for DNA polymorphism and divergence in several species. Here we report such data from the protein-coding region of xanthine dehydrogenase (Xdh) in 22 species of Drosophila. Many of our samples were from closely related species, allowing us to confidently assign substitutions to individual lineages. Surprisingly, Xdh appears to be fixing more A/T mutations than G/C mutations in most lineages, leading to evolution of higher A/T content in the recent past. We found no compelling evidence for selection on protein variation, though some aspects of the data support the notion that a significant fraction of amino acid polymorphisms are slightly deleterious. Finally, we found no convincing evidence that levels of silent heterozygosity are associated with rates of protein evolution.
Collapse
Affiliation(s)
- David J Begun
- Section of Integrative Biology, University of Texas, Austin, Texas 78712, USA.
| | | |
Collapse
|
40
|
Llopart A, Elwyn S, Lachaise D, Coyne JA. Genetics of a difference in pigmentation between Drosophila yakuba and Drosophila santomea. Evolution 2002; 56:2262-77. [PMID: 12487356 DOI: 10.1111/j.0014-3820.2002.tb00150.x] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Drosophila yakuba is a species widespread in Africa, whereas D. santomea, its newly discovered sister species, is endemic to the volcanic island of São Tomé in the Gulf of Guinea. Drosophila santomea probably formed after colonization of the island by its common ancestor with D. yakuba. The two species differ strikingly in pigmentation: D. santomea, unlike the other eight species in the D. melanogaster subgroup, almost completely lacks dark abdominal pigmentation. D. yakuba shows the sexually dimorphic pigmentation typical of the group: both sexes have melanic patterns on the abdomen, but males are much darker than females. A genetic analysis of this species difference using morphological markers shows that the X chromosome accounts for nearly 90% of the species difference in the area of abdomen that is pigmented and that at least three genes (one on each major chromosome) are involved in each sex. The order of chromosome effects on pigmentation area are the same in males and females, suggesting that loss of pigmentation in D. santomea may have involved the same genes in both sexes. Further genetic analysis of the interspecific difference between males in pigmentation area and intensity using molecular markers shows that at least five genes are responsible, with no single locus having an overwhelming effect on the trait. The species difference is thus oligogenic or polygenic. Different chromosomal regions from each of the two species influenced pigmentation in the same direction, suggesting that the species difference (at least in males) is due to natural or sexual selection and not genetic drift. Measurements of sexual isolation between the species in both light and dark conditions show no difference, suggesting that the pigmentation difference is not an important cue for interspecific mate discrimination. Using DNA sequence differences in nine noncoding regions, we estimate that D. santomea and D. yakuba diverged about 400,000 years ago, a time similar to the divergences between two other well-studied pair of species in the subgroup, both of which also involved island colonization.
Collapse
Affiliation(s)
- Ana Llopart
- Department of Ecology and Evolution, The University of Chicago, 1101 East 57 Street, Chicago, Illinois 60637, USA.
| | | | | | | |
Collapse
|
41
|
Marais G, Piganeau G. Hill-Robertson interference is a minor determinant of variations in codon bias across Drosophila melanogaster and Caenorhabditis elegans genomes. Mol Biol Evol 2002; 19:1399-406. [PMID: 12200468 DOI: 10.1093/oxfordjournals.molbev.a004203] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
According to population genetics models, genomic regions with lower crossing-over rates are expected to experience less effective selection because of Hill-Robertson interference (HRi). The effect of genetic linkage is thought to be particularly important for a selection of weak intensity such as selection affecting codon usage. Consistent with this model, codon bias correlates positively with recombination rate in Drosophila melanogaster and Caenorhabditis elegans. However, in these species, the G+C content of both noncoding DNA and synonymous sites correlates positively with recombination, which suggests that mutation patterns and recombination are associated. To remove this effect of mutation patterns on codon bias, we used the synonymous sites of lowly expressed genes that are expected to be effectively neutral sites. We measured the differences between codon biases of highly expressed genes and their lowly expressed neighbors. In D. melanogaster we find that HRi weakly reduces selection on codon usage of genes located in regions of very low recombination; but these genes only comprise 4% of the total. In C. elegans we do not find any evidence for the effect of recombination on selection for codon bias. Computer simulations indicate that HRi poorly enhances codon bias if the local recombination rate is greater than the mutation rate. This prediction of the model is consistent with our data and with the current estimate of the mutation rate in D. melanogaster. The case of C. elegans, which is highly self-fertilizing, is discussed. Our results suggest that HRi is a minor determinant of variations in codon bias across the genome.
Collapse
Affiliation(s)
- Gabriel Marais
- Laboratoire Biométrie et biologie évolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, Villeurbanne, France.
| | | |
Collapse
|
42
|
Birdsell JA. Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution. Mol Biol Evol 2002; 19:1181-97. [PMID: 12082137 DOI: 10.1093/oxfordjournals.molbev.a004176] [Citation(s) in RCA: 180] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
This study presents compelling evidence that recombination significantly increases the silent GC content of a genome in a selectively neutral manner, resulting in a highly significant positive correlation between recombination and "GC3s" in the yeast Saccharomyces cerevisiae. Neither selection nor mutation can explain this relationship. A highly significant GC-biased mismatch repair system is documented for the first time in any member of the Kingdom Fungi. Much of the variation in the GC3s within yeast appears to result from GC-biased gene conversion. Evidence suggests that GC-biased mismatch repair exists in numerous organisms spanning six kingdoms. This transkingdom GC mismatch repair bias may have evolved in response to a ubiquitous AT mutational bias. A significant positive correlation between recombination and GC content is found in many of these same organisms, suggesting that the processes influencing the evolution of the yeast genome may be a general phenomenon. Nonrecombining regions of the genome and nonrecombining genomes would not be subject to this type of molecular drive. It is suggested that the low GC content characteristic of many nonrecombining genomes may be the result of three processes (1) a prevailing AT mutational bias, (2) random fixation of the most common types of mutation, and (3) the absence of the GC-biased gene conversion which, in recombining organisms, permits the reversal of the most common types of mutation. A model is proposed to explain the observation that introns, intergenic regions, and pseudogenes typically have lower GC content than the silent sites of corresponding open reading frames. This model is based on the observation that the greater the heterology between two sequences, the less likely it is that recombination will occur between them. According to this "Constraint" hypothesis, the formation and propagation of heteroduplex DNA is expected to occur, on average, more frequently within conserved coding and regulatory regions of the genome. In organisms possessing GC-biased mismatch repair, this would enhance the GC content of these regions through biased gene conversion. These findings have a number of important implications for the way we view genome evolution and suggest a new model for the evolution of sex.
Collapse
Affiliation(s)
- John A Birdsell
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85121, USA.
| |
Collapse
|
43
|
Abstract
Weakly selected mutations are most likely to be physically clustered across genomes and, when sufficiently linked, they alter each others' fixation probability, a process we call interference selection (IS). Here we study population genetics and evolutionary consequences of IS on the selected mutations themselves and on adjacent selectively neutral variation. We show that IS reduces levels of polymorphism and increases low-frequency variants and linkage disequilibrium, in both selected and adjacent neutral mutations. IS can account for several well-documented patterns of variation and composition in genomic regions with low rates of crossing over in Drosophila. IS cannot be described simply as a reduction in the efficacy of selection and effective population size in standard models of selection and drift. Rather, IS can be better understood with models that incorporate a constant "traffic" of competing alleles. Our simulations also allow us to make genome-wide predictions that are specific to IS. We show that IS will be more severe at sites in the center of a region containing weakly selected mutations than at sites located close to the edge of the region. Drosophila melanogaster genomic data strongly support this prediction, with genes without introns showing significantly reduced codon bias in the center of coding regions. As expected, if introns relieve IS, genes with centrally located introns do not show reduced codon bias in the center of the coding region. We also show that reasonably small differences in the length of intermediate "neutral" sequences embedded in a region under selection increase the effectiveness of selection on the adjacent selected sequences. Hence, the presence and length of sequences such as introns or intergenic regions can be a trait subject to selection in recombining genomes. In support of this prediction, intron presence is positively correlated with a gene's codon bias in D. melanogaster. Finally, the study of temporal dynamics of IS after a change of recombination rate shows that nonequilibrium codon usage may be the norm rather than the exception.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA.
| | | |
Collapse
|
44
|
Barker FK, Barrowclough GF, Groth JG. A phylogenetic hypothesis for passerine birds: taxonomic and biogeographic implications of an analysis of nuclear DNA sequence data. Proc Biol Sci 2002; 269:295-308. [PMID: 11839199 PMCID: PMC1690884 DOI: 10.1098/rspb.2001.1883] [Citation(s) in RCA: 289] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Passerine birds comprise over half of avian diversity, but have proved difficult to classify. Despite a long history of work on this group, no comprehensive hypothesis of passerine family-level relationships was available until recent analyses of DNA-DNA hybridization data. Unfortunately, given the value of such a hypothesis in comparative studies of passerine ecology and behaviour, the DNA-hybridization results have not been well tested using independent data and analytical approaches. Therefore, we analysed nucleotide sequence variation at the nuclear RAG-1 and c-mos genes from 69 passerine taxa, including representatives of most currently recognized families. In contradiction to previous DNA-hybridization studies, our analyses suggest paraphyly of suboscine passerines because the suboscine New Zealand wren Acanthisitta was found to be sister to all other passerines. Additionally, we reconstructed the parvorder Corvida as a basal paraphyletic grade within the oscine passerines. Finally, we found strong evidence that several family-level taxa are misplaced in the hybridization results, including the Alaudidae, Irenidae, and Melanocharitidae. The hypothesis of relationships we present here suggests that the oscine passerines arose on the Australian continental plate while it was isolated by oceanic barriers and that a major northern radiation of oscines (i.e. the parvorder Passerida) originated subsequent to dispersal from the south.
Collapse
Affiliation(s)
- F Keith Barker
- Department of Ornithology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA.
| | | | | |
Collapse
|
45
|
Llopart A, Elwyn S, Lachaise D, Coyne JA. GENETICS OF A DIFFERENCE IN PIGMENTATION BETWEEN DROSOPHILA YAKUBA AND DROSOPHILA SANTOMEA. Evolution 2002. [DOI: 10.1554/0014-3820(2002)056[2262:goadip]2.0.co;2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
46
|
Abstract
Several recent studies of genome evolution indicate that the rate of DNA loss exceeds that of DNA gain, leading to an underlying mutational pressure towards collapsing the length of noncoding DNA. That such a collapse is not observed suggests opposing mechanisms favoring longer noncoding regions. The presence of transposable elements alone also does not explain observed features of noncoding DNA. At present, a multidisciplinary approach--using population genetics techniques, large-scale genomic analyses, and in silico evolution--is beginning to provide new and valuable insights into the forces that shape the length of noncoding DNA and, ultimately, genome size. Recombination, in a broad sense, might be the missing key parameter for understanding the observed variation in length of noncoding DNA in eukaryotes.
Collapse
Affiliation(s)
- J M Comeron
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, Illinois 60637, USA.
| |
Collapse
|