1
|
Balakirev ES, Anisimova M, Pavlyuchkov VA, Ayala FJ. DNA polymorphism and selection at the bindin locus in three Strongylocentrotus sp. (Echinoidea). BMC Genet 2016; 17:66. [PMID: 27176219 PMCID: PMC4866015 DOI: 10.1186/s12863-016-0374-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 05/02/2016] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The sperm gene bindin encodes a gamete recognition protein, which plays an important role in conspecific fertilization and reproductive isolation of sea urchins. Molecular evolution of the gene has been extensively investigated with the attention focused on the protein coding regions. Intron evolution has been investigated to a much lesser extent. We have studied nucleotide variability in the complete bindin locus, including two exons and one intron, in the sea urchin Strongylocentrotus intermedius represented by two morphological forms. We have also analyzed all available bindin sequences for two other sea urchin species, S. pallidus and S. droebachiensis. RESULTS The results show that the bindin sequences from the two forms of S. intermedius are intermingled with no evidence of genetic divergence; however, the forms exhibit slightly different patterns in bindin variability. The level of the bindin nucleotide diversity is close for S. intermedius and S. droebachiensis, but noticeably higher for S. pallidus. The distribution of variability is non-uniform along the gene; however there are striking similarities among the species, indicating similar evolutionary trends in this gene engaged in reproductive function. The patterns of nucleotide variability and divergence are radically different in the bindin coding and intron regions. Positive selection is detected in the bindin coding region. The neutrality tests as well as the maximum likelihood approaches suggest the action of diversifying selection in the bindin intron. CONCLUSIONS Significant deviation from neutrality has been detected in the bindin coding region and suggested in the intron, indicating the possible functional importance of the bindin intron variability. To clarify the question concerning possible involvement of diversifying selection in the bindin intron evolution more data combining population genetic and functional approaches are necessary.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- A. V. Zhirmunsky Institute of Marine Biology, Far Eastern Branch of the Russian Academy of Science, Vladivostok, 690041, Russia.
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA.
- Far Eastern Federal University, Vladivostok, 690950, Russia.
| | - Maria Anisimova
- Institute of Applied Simulation, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, 8820, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland
| | | | - Francisco J Ayala
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA
| |
Collapse
|
2
|
Yu ZX, Wang LJ, Zhao B, Shan CM, Zhang YH, Chen DF, Chen XY. Progressive regulation of sesquiterpene biosynthesis in Arabidopsis and Patchouli (Pogostemon cablin) by the miR156-targeted SPL transcription factors. MOLECULAR PLANT 2015; 8:98-110. [PMID: 25578275 DOI: 10.1016/j.molp.2014.11.002] [Citation(s) in RCA: 102] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2014] [Accepted: 10/25/2014] [Indexed: 05/18/2023]
Abstract
Plant metabolites vary at different stages of their life cycle. Although it is well documented that environmental factors stimulate biosynthesis of secondary metabolites, the regulation by endogenous developmental cues remains poorly understood. The microRNA156 (miR156)-targeted squamosa promoter binding protein-like (SPL) factors function as a major age cue in regulating developmental phase transition and flowering. We show here that the miR156-targeted SPL transcription factor plays an important role in the spatiotemporal regulation of sesquiterpene biosynthesis. In Arabidopsis thaliana, the miR156-SPL module regulates the formation of (E)-β-caryophyllene in the flowering stage through modulating expression of the sesquiterpene synthase gene TPS21. We demonstrated that SPL9 directly binds to TPS21 promoter and activates its expression. In the perennial fragrant herb Pogostemon cablin, the accumulation of patchouli oil, largely composed of sesquiterpenes dominated by (-)-patchoulol, is also age-regulated, and the SPL promotes biosynthesis of sesquiterpenes in elder plants by upregulating patchoulol synthase (PatPTS) gene expression. As miR156-SPLs are highly conserved in plants, our finding not only uncovers a molecular link between developmental timing and sesquiterpene production but also suggests a new strategy to engineer plants for accelerated growth with enhanced production of terpenoids.
Collapse
Affiliation(s)
- Zong-Xia Yu
- Plant Science Research Center, Shanghai Chenshan Botanical Garden, Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai 201602, China; National Key Laboratory of Plant Molecular Genetics, National Plant Gene Research Center, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China; University of CAS, Beijing 100049, China; College of Bioengineering, Dalian University, Dalian 116622, China
| | - Ling-Jian Wang
- National Key Laboratory of Plant Molecular Genetics, National Plant Gene Research Center, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China
| | - Bo Zhao
- National Key Laboratory of Plant Molecular Genetics, National Plant Gene Research Center, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China; University of CAS, Beijing 100049, China
| | - Chun-Min Shan
- National Key Laboratory of Plant Molecular Genetics, National Plant Gene Research Center, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China; University of CAS, Beijing 100049, China
| | - Yu-Hua Zhang
- Firmenich Aromatics (China) Co. Ltd., Shanghai 201108, China
| | - Dong-Fang Chen
- Firmenich Aromatics (China) Co. Ltd., Shanghai 201108, China
| | - Xiao-Ya Chen
- Plant Science Research Center, Shanghai Chenshan Botanical Garden, Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai 201602, China; National Key Laboratory of Plant Molecular Genetics, National Plant Gene Research Center, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China.
| |
Collapse
|
3
|
Gene conversion occurs within the mating-type locus of Cryptococcus neoformans during sexual reproduction. PLoS Genet 2012; 8:e1002810. [PMID: 22792079 PMCID: PMC3390403 DOI: 10.1371/journal.pgen.1002810] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 05/23/2012] [Indexed: 12/30/2022] Open
Abstract
Meiotic recombination of sex chromosomes is thought to be repressed in organisms with heterogametic sex determination (e.g. mammalian X/Y chromosomes), due to extensive divergence and chromosomal rearrangements between the two chromosomes. However, proper segregation of sex chromosomes during meiosis requires crossing-over occurring within the pseudoautosomal regions (PAR). Recent studies reveal that recombination, in the form of gene conversion, is widely distributed within and may have played important roles in the evolution of some chromosomal regions within which recombination was thought to be repressed, such as the centromere cores of maize. Cryptococcus neoformans, a major human pathogenic fungus, has an unusually large mating-type locus (MAT, >100 kb), and the MAT alleles from the two opposite mating-types show extensive nucleotide sequence divergence and chromosomal rearrangements, mirroring characteristics of sex chromosomes. Meiotic recombination was assumed to be repressed within the C. neoformans MAT locus. A previous study identified recombination hot spots flanking the C. neoformans MAT, and these hot spots are associated with high GC content. Here, we investigated a GC-rich intergenic region located within the MAT locus of C. neoformans to establish if this region also exhibits unique recombination behavior during meiosis. Population genetics analysis of natural C. neoformans isolates revealed signals of homogenization spanning this GC-rich intergenic region within different C. neoformans lineages, consistent with a model in which gene conversion of this region during meiosis prevents it from diversifying within each lineage. By analyzing meiotic progeny from laboratory crosses, we found that meiotic recombination (gene conversion) occurs around the GC-rich intergenic region at a frequency equal to or greater than the meiotic recombination frequency observed in other genomic regions. We discuss the implications of these findings with regards to the possible functional and evolutionary importance of gene conversion within the C. neoformans MAT locus and, more generally, in fungi. Recombination has been thought to be repressed within sex chromosomes, as well as within the mating-type (MAT) loci in many fungi, due to the highly diverged and rearranged nature between alleles defining opposite sexes or mating-types. However, it has long been appreciated that recombination can occur within these presumptive recombinational “cold spots,” and recent studies reveal that recombination, including gene conversion, can occur at a frequency higher than previously appreciated and could play important roles in shaping evolution of these chromosomal regions. Here, we provide evidence that, during sexual reproduction of the human pathogenic fungus Cryptococcus neoformans, recombination (gene conversion) occurs across a GC-rich intergenic region within the MAT locus. The frequency of this gene conversion is comparable to those of typical meiotic recombination events observed in other chromosomal regions. This is in accord with population genetics analyses, which indicate homogenization between alleles of opposite mating-types within the intergenic region. Gene conversion within these highly rearranged chromosomal regions may serve to ensure proper meiosis and/or rejuvenate genes/chromosomal regions within MAT that are otherwise facing irreversible evolutionary decay. In conclusion, our study provides further experimental evidence that at least some recombinational “cold spots” are not that cold, after all.
Collapse
|
4
|
Lu P, Han X, Qi J, Yang J, Wijeratne AJ, Li T, Ma H. Analysis of Arabidopsis genome-wide variations before and after meiosis and meiotic recombination by resequencing Landsberg erecta and all four products of a single meiosis. Genome Res 2012; 22:508-18. [PMID: 22106370 PMCID: PMC3290786 DOI: 10.1101/gr.127522.111] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 11/17/2011] [Indexed: 11/24/2022]
Abstract
Meiotic recombination, including crossovers (COs) and gene conversions (GCs), impacts natural variation and is an important evolutionary force. COs increase genetic diversity by redistributing existing variation, whereas GCs can alter allelic frequency. Here, we sequenced Arabidopsis Landsberg erecta (Ler) and two sets of all four meiotic products from a Columbia (Col)/Ler hybrid to investigate genome-wide variation and meiotic recombination at nucleotide resolution. Comparing Ler and Col sequences uncovered 349,171 Single Nucleotide Polymorphisms (SNPs), 58,085 small and 2315 large insertions/deletions (indels), with highly correlated genome-wide distributions of SNPs, and small indels. A total of 443 genes have at least 10 nonsynonymous substitutions in protein-coding regions, with enrichment for disease-resistance genes. Another 316 genes are affected by large indels, including 130 genes with complete deletion of coding regions in Ler. Using the Arabidopsis qrt1 mutant, two sets of four meiotic products were generated and analyzed by sequencing for meiotic recombination, representing the first tetrad analysis with whole-genome sequencing in a nonfungal species. We detected 18 COs, six of which had an associated GC event, and four GCs without COs (NCOs), and revealed that Arabidopsis GCs are likely fewer and with shorter tracts than those in yeast. Meiotic recombination and chromosome assortment events dramatically redistributed genome variation in meiotic products, contributing to population diversity. In particular, meiosis provides a rapid mechanism to generate copy-number variation (CNV) of sequences that have different chromosomal positions in Col and Ler.
Collapse
Affiliation(s)
- Pingli Lu
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Xinwei Han
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Intercollege Graduate Program in Genetics, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ji Qi
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Jiange Yang
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Asela J. Wijeratne
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Intercollege Graduate Program in Plant Biology, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tao Li
- Institute of Hydrobiology, Chinese Academy of Science, Wuhan 430072, China
| | - Hong Ma
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| |
Collapse
|
5
|
Kupriyanova EV, Ezhova TA, Shestakov SV. Duplicated peroxidase genes AtPrx53 and AtPrx54 of Arabidopsis thaliana are a recombination hotspot. DOKL BIOCHEM BIOPHYS 2010; 431:90-3. [PMID: 20514871 DOI: 10.1134/s1607672910020109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Affiliation(s)
- E V Kupriyanova
- Faculty of Biology, Moscow State University, Moscow 119991, Russia
| | | | | |
Collapse
|
6
|
Gaeta RT, Chris Pires J. Homoeologous recombination in allopolyploids: the polyploid ratchet. THE NEW PHYTOLOGIST 2010; 186:18-28. [PMID: 20002315 DOI: 10.1111/j.1469-8137.2009.03089.x] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Polyploidization and recombination are two important processes driving evolution through the building and reshaping of genomes. Allopolyploids arise from hybridization and chromosome doubling among distinct, yet related species. Polyploids may display novel variation relative to their progenitors, and the sources of this variation lie not only in the acquisition of extra gene dosages, but also in the genomic changes that occur after divergent genomes unite. Genomic changes (deletions, duplications, and translocations) have been detected in both recently formed natural polyploids and resynthesized polyploids. In resynthesized Brassica napus allopolyploids, there is evidence that many genetic changes are the consequence of homoeologous recombination. Homoeologous recombination can generate novel gene combinations and phenotypes, but may also destabilize the karyotype and lead to aberrant meiotic behavior and reduced fertility. Thus, natural selection plays a role in the establishment and maintenance of fertile natural allopolyploids that have stabilized chromosome inheritance and a few advantageous chromosomal rearrangements. We discuss the evidence for genome rearrangements that result from homoeologous recombination in resynthesized B. napus and how these observations may inform phenomena such as chromosome replacement, aneuploidy, non-reciprocal translocations and gene conversion seen in other polyploids.
Collapse
Affiliation(s)
- Robert T Gaeta
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211-7310, USA
| | | |
Collapse
|
7
|
Understanding the evolution of defense metabolites in Arabidopsis thaliana using genome-wide association mapping. Genetics 2009; 185:991-1007. [PMID: 19737743 DOI: 10.1534/genetics.109.108522] [Citation(s) in RCA: 135] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
With the improvement and decline in cost of high-throughput genotyping and phenotyping technologies, genome-wide association (GWA) studies are fast becoming a preferred approach for dissecting complex quantitative traits. Glucosinolate (GSL) secondary metabolites within Arabidopsis spp. can serve as a model system to understand the genomic architecture of quantitative traits. GSLs are key defenses against insects in the wild and the relatively large number of cloned quantitative trait locus (QTL) controlling GSL traits allows comparison of GWA to previous QTL analyses. To better understand the specieswide genomic architecture controlling plant-insect interactions and the relative strengths of GWA and QTL studies, we conducted a GWA mapping study using 96 A. thaliana accessions, 43 GSL phenotypes, and approximately 230,000 SNPs. Our GWA analysis identified the two major polymorphic loci controlling GSL variation (AOP and MAM) in natural populations within large blocks of positive associations encompassing dozens of genes. These blocks of positive associations showed extended linkage disequilibrium (LD) that we hypothesize to have arisen from balancing or fluctuating selective sweeps at both the AOP and MAM loci. These potential sweep blocks are likely linked with the formation of new defensive chemistries that alter plant fitness in natural environments. Interestingly, this GWA analysis did not identify the majority of previously identified QTL even though these polymorphisms were present in the GWA population. This may be partly explained by a nonrandom distribution of phenotypic variation across population subgroups that links population structure and GSL variation, suggesting that natural selection can hinder the detection of phenotype-genotype associations in natural populations.
Collapse
|
8
|
Waugh R, Jannink JL, Muehlbauer GJ, Ramsay L. The emergence of whole genome association scans in barley. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:218-22. [PMID: 19185530 DOI: 10.1016/j.pbi.2008.12.007] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2008] [Revised: 12/15/2008] [Accepted: 12/15/2008] [Indexed: 05/19/2023]
Abstract
Barley geneticists are currently using association genetics to identify and fine map traits directly in elite plant breeding material. This has been made possible by the development of a highly parallel SNP assay platform that provides sufficient marker density for genome-wide scans and linkage disequilibrium-led gene identification. By leveraging the combined resources of the barley research and breeding sectors, marker-trait associations are being identified and a renewed interest has emerged in novel strategies for barley improvement. New database and visualization tools have been developed and statistical methods adapted from human genetics to account for complexities in the datasets. Exciting early results suggest that association genetics will assume a central role in establishing genotype-to-phenotype relationships.
Collapse
Affiliation(s)
- Robbie Waugh
- Genetics, SCRI, Invergowrie, Dundee DD2 5DA, Scotland.
| | | | | | | |
Collapse
|
9
|
Berchowitz LE, Copenhaver GP. Visual markers for detecting gene conversion directly in the gametes of Arabidopsis thaliana. Methods Mol Biol 2009; 557:99-114. [PMID: 19799179 DOI: 10.1007/978-1-59745-527-5_8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Measuring meiotic gene conversion is important both because of its role in the fundamental mechanisms of meiotic recombination and because of its influence on linkage relationships and allelic diversity in the genome. Historically, gene conversion has been most thoroughly examined in fungal organisms through the use of tetrad analysis. Here we describe a method for using tetrad analysis in the model plant Arabidopsis thaliana to detect and quantify gene conversion events - a resource unavailable in most other higher eukaryotic model systems.
Collapse
Affiliation(s)
- Luke E Berchowitz
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
10
|
Multilocus patterns of nucleotide diversity, population structure and linkage disequilibrium in Boechera stricta, a wild relative of Arabidopsis. Genetics 2008; 181:1021-33. [PMID: 19104077 DOI: 10.1534/genetics.108.095364] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Information about polymorphism, population structure, and linkage disequilibrium (LD) is crucial for association studies of complex trait variation. However, most genomewide studies have focused on model systems, with very few analyses of undisturbed natural populations. Here, we sequenced 86 mapped nuclear loci for a sample of 46 genotypes of Boechera stricta and two individuals of B. holboellii, both wild relatives of Arabidopsis. Isolation by distance was significant across the species range of B. stricta, and three geographic groups were identified by structure analysis, principal coordinates analysis, and distance-based phylogeny analyses. The allele frequency spectrum indicated a genomewide deviation from an equilibrium neutral model, with silent nucleotide diversity averaging 0.004. LD decayed rapidly, declining to background levels in approximately 10 kb or less. For tightly linked SNPs separated by <1 kb, LD was dependent on the reference population. LD was lower in the specieswide sample than within populations, suggesting that low levels of LD found in inbreeding species such as B. stricta, Arabidopsis thaliana, and barley may result from broad geographic sampling that spans heterogeneous genetic groups. Finally, analyses also showed that inbreeding B. stricta and A. thaliana have approximately 45% higher recombination per kilobase than outcrossing A. lyrata.
Collapse
|
11
|
Grouped nucleotide polymorphism: a major contributor to genetic variation in Arabidopsis. Gene 2008; 426:1-6. [PMID: 18835338 DOI: 10.1016/j.gene.2008.09.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2008] [Revised: 09/03/2008] [Accepted: 09/03/2008] [Indexed: 11/24/2022]
Abstract
While information for single nucleotide polymorphism is accumulating in many organisms, little is known about the magnitude and the occurrence of nucleotide dimorphism or higher order structural polymorphisms (grouped nucleotide polymorphisms or GNPs). To address these questions, we systematically investigated the genetic variations of 996 loci in 96 Arabidopsis accessions. Our data suggest that GNP loci are highly frequent in the genomes (40.4% of 996 loci), and that 66.7% of genetic variation in 996 loci is attributed to GNPs. The frequency distribution and the linkage analysis of GNP loci demonstrate that GNPs occurred randomly, locally, ceaselessly and independently in general, although they are particularly abundant in the region near centromeres. The age distribution of GNP loci shows that the majority of these loci are under a transient phase of neutral evolution. The characteristics of GNPs imply that a molecular isolation exists in GNP loci, and most likely the indels caused isolation, which could explain how GNPs are generated and maintained.
Collapse
|
12
|
Xu S, Clark T, Zheng H, Vang S, Li R, Wong GKS, Wang J, Zheng X. Gene conversion in the rice genome. BMC Genomics 2008; 9:93. [PMID: 18298833 PMCID: PMC2277409 DOI: 10.1186/1471-2164-9-93] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2007] [Accepted: 02/25/2008] [Indexed: 02/01/2023] Open
Abstract
Background Gene conversion causes a non-reciprocal transfer of genetic information between similar sequences. Gene conversion can both homogenize genes and recruit point mutations thereby shaping the evolution of multigene families. In the rice genome, the large number of duplicated genes increases opportunities for gene conversion. Results To characterize gene conversion in rice, we have defined 626 multigene families in which 377 gene conversions were detected using the GENECONV program. Over 60% of the conversions we detected were between chromosomes. We found that the inter-chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P < 0.05). The frequencies of gene conversion on the same chromosome decreased with the physical distance between gene conversion partners. Ka/Ks analysis indicates that gene conversion is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less than ten percent. Pseudogenes in the rice genome with low similarity to Arabidopsis genes showed greater likelihood for gene conversion than those with high similarity to Arabidopsis genes. Functional annotations suggest that at least 14 multigene families related to disease or bacteria resistance were involved in conversion events. Conclusion The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes.
Collapse
Affiliation(s)
- Shuqing Xu
- Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing Proteomics Institute, Beijing 101300, China.
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Moore RC, Stevens MHH. Local patterns of nucleotide polymorphism are highly variable in the selfing species Arabidopsis thaliana. J Mol Evol 2008; 66:116-29. [PMID: 18273534 DOI: 10.1007/s00239-007-9063-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2007] [Revised: 09/04/2007] [Accepted: 11/21/2007] [Indexed: 01/07/2023]
Abstract
Neighboring genes predictably share similar evolutionary histories to an extent delineated by recombination. This correlation should extend across multiple linked genes in a selfing species such as Arabidopsis thaliana due to its low effective recombination rate. To test this prediction, we performed a molecular population genetics analysis of nucleotide polymorphism and divergence in chromosomal regions surrounding four low-diversity loci. Three of these loci, At1g67140, At3g03700, and TERMINAL FLOWER1 (TFL1), have been previously implicated as targets of selection and we would predict stronger correlations in polymorphism between neighboring loci due to genetic hitchhiking around these loci. The remaining locus, At1g04300, was identified in a study of linkage disequilibrium surrounding the CRYPTOCHROME2 (CRY2) locus. Although we found broad valleys of reduced nucleotide variation around two of our focal genes, At1g67140 and At3g03700, all chromosomal regions exhibited extreme variation in the patterns of polymorphism and evolution between neighboring loci. Although three of our four regions contained potential targets of selection, application of the composite-likelihood-ratio test of selection in conjunction with a goodness-of-fit test supports the selection hypothesis only for the region containing At3g03700. The degree of discordance in evolutionary histories between linked loci within each region generally correlated with estimates of recombination and linkage disequilibrium for that region, with the exception of the region containing At1g04300. We discuss the implications of these data for future population genetics analyses and genomics studies in A. thaliana.
Collapse
Affiliation(s)
- Richard C Moore
- Department of Botany, Miami University, 316 Pearson Hall, Oxford, OH 45056, USA.
| | | |
Collapse
|
14
|
Du J, Wang X, Zhang M, Tian D, Yang YH. Unique nucleotide polymorphism of ankyrin gene cluster in Arabidopsis. J Genet 2007; 86:27-35. [PMID: 17656846 DOI: 10.1007/s12041-007-0004-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The ankyrin (ANK) gene cluster is a part of a multigene family encoding ANK transmembrane proteins in Arabidopsis thaliana, and plays an important role in protein-protein interactions and in signal pathways. In contrast to other regions of a genome, the ANK gene cluster exhibits an extremely high level of DNA polymorphism in an approximately 5-kb region, without apparent decay. Phylogenetic analysis detects two clear, deeply differentiated haplotypes (dimorphism). The divergence between haplotypes of accession Col-0 and Ler-0 (Hap-C and Hap-L) is estimated to be 10.7%, approximately equal to the 10.5% average divergence between A. thaliana and A. lyrata. Sequence comparisons for the ANK gene cluster homologues in Col-0 indicate that the members evolve independently, and that the similarity among paralogues is lower than between alleles. Very little intralocus recombination or gene conversion is detected in ANK regions. All these characteristics of the ANK gene cluster are consistent with a tandem gene duplication and birth-and-death process. The possible mechanisms for and implications of this elevated nucleotide variation are also discussed, including the suggestion of balancing selection.
Collapse
Affiliation(s)
- Jianchang Du
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing 210093, China.
| | | | | | | | | |
Collapse
|
15
|
Kim S, Plagnol V, Hu TT, Toomajian C, Clark RM, Ossowski S, Ecker JR, Weigel D, Nordborg M. Recombination and linkage disequilibrium in Arabidopsis thaliana. Nat Genet 2007; 39:1151-5. [PMID: 17676040 DOI: 10.1038/ng2115] [Citation(s) in RCA: 368] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2007] [Accepted: 07/16/2007] [Indexed: 01/09/2023]
Abstract
Linkage disequilibrium (LD) is a major aspect of the organization of genetic variation in natural populations. Here we describe the genome-wide pattern of LD in a sample of 19 Arabidopsis thaliana accessions using 341,602 non-singleton SNPs. LD decays within 10 kb on average, considerably faster than previously estimated. Tag SNP selection algorithms and 'hide-the-SNP' simulations suggest that genome-wide association mapping will require only 40%-50% of the observed SNPs, a reduction similar to estimates in a sample of African Americans. An Affymetrix genotyping array containing 250,000 SNPs has been designed based on these results; we demonstrate that it should have more than adequate coverage for genome-wide association mapping. The extent of LD is highly variable, and we find clear evidence of recombination hotspots, which seem to occur preferentially in intergenic regions. LD also reflects the action of selection, and it is more extensive between nonsynonymous polymorphisms than between synonymous polymorphisms.
Collapse
Affiliation(s)
- Sung Kim
- Molecular and Computational Biology, University of Southern California, Los Angeles, California 90089, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Barrière A, Félix MA. Temporal dynamics and linkage disequilibrium in natural Caenorhabditis elegans populations. Genetics 2007; 176:999-1011. [PMID: 17409084 PMCID: PMC1894625 DOI: 10.1534/genetics.106.067223] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2006] [Accepted: 03/13/2007] [Indexed: 11/18/2022] Open
Abstract
Caenorhabditis elegans is a major laboratory model system yet a newcomer to the field of population genetics, and relatively little is known of its biology in the wild. Recent studies of natural populations at a single time point revealed strong spatial population structure and suggested that these populations may be very dynamic. We have therefore studied several natural C. elegans populations over time and genotyped them at polymorphic microsatellite loci. While some populations appear to be genetically stable over the course of observation, others seem to go extinct, with full replacement of multilocus genotypes upon regrowth. The frequency of heterozygotes indicates that outcrossing occurs at a mean frequency of 1.7% and is variable between populations. However, in genetically stable populations, linkage disequilibrium between different chromosomes can be maintained over several years at a level much higher than expected from the heterozygote frequency. C. elegans seems to follow metapopulation dynamics, and the maintenance of linkage disequilibrium despite a low yet significant level of outcrossing suggests that selection may act against the progeny of outcrossings.
Collapse
Affiliation(s)
| | - Marie-Anne Félix
- Institut Jacques Monod, CNRS–Universities of Paris 6 and 7, 75251 Paris Cedex 05, France
| |
Collapse
|
17
|
Chan CX, Beiko RG, Ragan MA. Detecting recombination in evolving nucleotide sequences. BMC Bioinformatics 2006; 7:412. [PMID: 16978423 PMCID: PMC1592127 DOI: 10.1186/1471-2105-7-412] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2006] [Accepted: 09/18/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation.
Collapse
Affiliation(s)
- Cheong Xin Chan
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| | - Robert G Beiko
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| | - Mark A Ragan
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
18
|
Schaefer I, Domes K, Heethoff M, Schneider K, Schön I, Norton RA, Scheu S, Maraun M. No evidence for the 'Meselson effect' in parthenogenetic oribatid mites (Oribatida, Acari). J Evol Biol 2006; 19:184-93. [PMID: 16405590 DOI: 10.1111/j.1420-9101.2005.00975.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
It has been hypothesized that in ancient apomictic, nonrecombining lineages the two alleles of a single copy gene will become highly divergent as a result of the independent accumulation of mutations (Meselson effect). We used a partial sequence of the elongation factor-1alpha (ef-1alpha) and the heat shock protein 82 (hsp82) genes to test this hypothesis for putative ancient parthenogenetic oribatid mite lineages. In addition, we tested if the hsp82 gene is fully transcribed by sequencing the cDNA and we also tested if there is evidence for recombination and gene conversion in sexual and parthenogenetic oribatid mite species. The average maximum intra-specific divergence in the ef-1alpha was 2.7% in three parthenogenetic species and 8.6% in three sexual species; the average maximum intra-individual genetic divergence was 0.9% in the parthenogenetic and 6.0% in the sexual species. In the hsp82 gene the average maximum intra-individual genetic divergence in the sexual species Steganacarus magnus and in the parthenogenetic species Platynothrus peltifer was 1.1% and 1.2%, respectively. None of the differences were statistically significant. The cDNA data indicated that the hsp82 sequence is transcribed and intron-free. Likelihood permutation tests indicate that ef-1alpha has undergone recombination in all three studied sexual species and gene conversion in two of the sexual species, but neither process has occurred in any of the parthenogenetic species. No evidence for recombination or gene conversion was found for sexual or parthenogenetic oribatid mite species in the hsp 82 gene. There appears to be no Meselson effect in parthenogenetic oribatid mite species. Presumably, their low genetic divergence is due to automixis, other homogenizing mechanisms or strong selection to keep both the ef-1alpha and the hsp82 gene functioning.
Collapse
Affiliation(s)
- I Schaefer
- Technische Universität Darmstadt, Institut für Zoologie, Darmstadt, Germany
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Wenzl P, Li H, Carling J, Zhou M, Raman H, Paul E, Hearnden P, Maier C, Xia L, Caig V, Ovesná J, Cakir M, Poulsen D, Wang J, Raman R, Smith KP, Muehlbauer GJ, Chalmers KJ, Kleinhofs A, Huttner E, Kilian A. A high-density consensus map of barley linking DArT markers to SSR, RFLP and STS loci and agricultural traits. BMC Genomics 2006. [PMID: 16904008 DOI: 10.1186/1471‐2164‐7‐206] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Molecular marker technologies are undergoing a transition from largely serial assays measuring DNA fragment sizes to hybridization-based technologies with high multiplexing levels. Diversity Arrays Technology (DArT) is a hybridization-based technology that is increasingly being adopted by barley researchers. There is a need to integrate the information generated by DArT with previous data produced with gel-based marker technologies. The goal of this study was to build a high-density consensus linkage map from the combined datasets of ten populations, most of which were simultaneously typed with DArT and Simple Sequence Repeat (SSR), Restriction Enzyme Fragment Polymorphism (RFLP) and/or Sequence Tagged Site (STS) markers. RESULTS The consensus map, built using a combination of JoinMap 3.0 software and several purpose-built perl scripts, comprised 2,935 loci (2,085 DArT, 850 other loci) and spanned 1,161 cM. It contained a total of 1,629 'bins' (unique loci), with an average inter-bin distance of 0.7 +/- 1.0 cM (median = 0.3 cM). More than 98% of the map could be covered with a single DArT assay. The arrangement of loci was very similar to, and almost as optimal as, the arrangement of loci in component maps built for individual populations. The locus order of a synthetic map derived from merging the component maps without considering the segregation data was only slightly inferior. The distribution of loci along chromosomes indicated centromeric suppression of recombination in all chromosomes except 5H. DArT markers appeared to have a moderate tendency toward hypomethylated, gene-rich regions in distal chromosome areas. On the average, 14 +/- 9 DArT loci were identified within 5 cM on either side of SSR, RFLP or STS loci previously identified as linked to agricultural traits. CONCLUSION Our barley consensus map provides a framework for transferring genetic information between different marker systems and for deploying DArT markers in molecular breeding schemes. The study also highlights the need for improved software for building consensus maps from high-density segregation data of multiple populations.
Collapse
Affiliation(s)
- Peter Wenzl
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Wenzl P, Li H, Carling J, Zhou M, Raman H, Paul E, Hearnden P, Maier C, Xia L, Caig V, Ovesná J, Cakir M, Poulsen D, Wang J, Raman R, Smith KP, Muehlbauer GJ, Chalmers KJ, Kleinhofs A, Huttner E, Kilian A. A high-density consensus map of barley linking DArT markers to SSR, RFLP and STS loci and agricultural traits. BMC Genomics 2006; 7:206. [PMID: 16904008 PMCID: PMC1564146 DOI: 10.1186/1471-2164-7-206] [Citation(s) in RCA: 196] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2006] [Accepted: 08/12/2006] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Molecular marker technologies are undergoing a transition from largely serial assays measuring DNA fragment sizes to hybridization-based technologies with high multiplexing levels. Diversity Arrays Technology (DArT) is a hybridization-based technology that is increasingly being adopted by barley researchers. There is a need to integrate the information generated by DArT with previous data produced with gel-based marker technologies. The goal of this study was to build a high-density consensus linkage map from the combined datasets of ten populations, most of which were simultaneously typed with DArT and Simple Sequence Repeat (SSR), Restriction Enzyme Fragment Polymorphism (RFLP) and/or Sequence Tagged Site (STS) markers. RESULTS The consensus map, built using a combination of JoinMap 3.0 software and several purpose-built perl scripts, comprised 2,935 loci (2,085 DArT, 850 other loci) and spanned 1,161 cM. It contained a total of 1,629 'bins' (unique loci), with an average inter-bin distance of 0.7 +/- 1.0 cM (median = 0.3 cM). More than 98% of the map could be covered with a single DArT assay. The arrangement of loci was very similar to, and almost as optimal as, the arrangement of loci in component maps built for individual populations. The locus order of a synthetic map derived from merging the component maps without considering the segregation data was only slightly inferior. The distribution of loci along chromosomes indicated centromeric suppression of recombination in all chromosomes except 5H. DArT markers appeared to have a moderate tendency toward hypomethylated, gene-rich regions in distal chromosome areas. On the average, 14 +/- 9 DArT loci were identified within 5 cM on either side of SSR, RFLP or STS loci previously identified as linked to agricultural traits. CONCLUSION Our barley consensus map provides a framework for transferring genetic information between different marker systems and for deploying DArT markers in molecular breeding schemes. The study also highlights the need for improved software for building consensus maps from high-density segregation data of multiple populations.
Collapse
Affiliation(s)
- Peter Wenzl
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| | - Haobing Li
- School of Agricultural Science, University of Tasmania, PO Box 252-54, Hobart TAS 7001, Australia
| | - Jason Carling
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| | - Meixue Zhou
- Tasmanian Institute of Agricultural Research, PO Box 46, Kings Meadows TAS 7249, Australia
| | - Harsh Raman
- NSW Agricultural Genomics Centre and NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, PMB, Wagga Wagga NSW 2650, Australia
| | - Edie Paul
- GeneFlow Inc., 14582 Olde Kent Rd., Centreville VA 20120, USA
| | - Phillippa Hearnden
- School of Agriculture, Food and Wine, Plant Genomics Centre, The University of Adelaide, PMB1, Glen Osmond SA 5064, Australia
| | - Christina Maier
- Dept. Crop and Soil Sciences and School of Molecular Biosciences, Washington State University, Pullman WA 99164-6420, USA
| | - Ling Xia
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| | - Vanessa Caig
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| | - Jaroslava Ovesná
- Research Institute of Crop Production, Drnovská 507, 161 06 Prague 6, Czech Republic
| | - Mehmet Cakir
- Molecular Plant Breeding CRC, WA State Agricultural Biotechnology Centre, Murdoch University, Murdoch, WA 6150, Australia
| | - David Poulsen
- Department of Primary Industries & Fisheries, Plant Science, MS 508 Warwick, QLD 4370, Australia
| | - Junping Wang
- NSW Agricultural Genomics Centre and NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, PMB, Wagga Wagga NSW 2650, Australia
| | - Rosy Raman
- NSW Agricultural Genomics Centre and NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, PMB, Wagga Wagga NSW 2650, Australia
| | - Kevin P Smith
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Ken J Chalmers
- School of Agriculture, Food and Wine, Plant Genomics Centre, The University of Adelaide, PMB1, Glen Osmond SA 5064, Australia
| | - Andris Kleinhofs
- Dept. Crop and Soil Sciences and School of Molecular Biosciences, Washington State University, Pullman WA 99164-6420, USA
| | - Eric Huttner
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| | - Andrzej Kilian
- Triticarte P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
- DArT P/L, PO Box 7141 Yarralumla, Canberra, ACT 2600, Australia
| |
Collapse
|
21
|
Morrell PL, Toleno DM, Lundy KE, Clegg MT. Estimating the contribution of mutation, recombination and gene conversion in the generation of haplotypic diversity. Genetics 2006; 173:1705-23. [PMID: 16624913 PMCID: PMC1526701 DOI: 10.1534/genetics.105.054502] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2005] [Accepted: 04/11/2006] [Indexed: 11/18/2022] Open
Abstract
Recombination occurs through both homologous crossing over and homologous gene conversion during meiosis. The contribution of recombination relative to mutation is expected to be dramatically reduced in inbreeding organisms. We report coalescent-based estimates of the recombination parameter (rho) relative to estimates of the mutation parameter (theta) for 18 genes from the highly self-fertilizing grass, wild barley, Hordeum vulgare ssp. spontaneum. Estimates of rho/theta are much greater than expected, with a mean rho/theta approximately 1.5, similar to estimates from outcrossing species. We also estimate rho with and without the contribution of gene conversion. Genotyping errors can mimic the effect of gene conversion, upwardly biasing estimates of the role of conversion. Thus we report a novel method for identifying genotyping errors in nucleotide sequence data sets. We show that there is evidence for gene conversion in many large nucleotide sequence data sets including our data that have been purged of all detectable sequencing errors and in data sets from Drosophila melanogaster, D. simulans, and Zea mays. In total, 13 of 27 loci show evidence of gene conversion. For these loci, gene conversion is estimated to contribute an average of twice as much as crossing over to total recombination.
Collapse
Affiliation(s)
- Peter L Morrell
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697, USA
| | | | | | | |
Collapse
|
22
|
Ostrowski MF, David J, Santoni S, McKhann H, Reboud X, Le Corre V, Camilleri C, Brunel D, Bouchez D, Faure B, Bataillon T. Evidence for a large-scale population structure among accessions of Arabidopsis thaliana: possible causes and consequences for the distribution of linkage disequilibrium. Mol Ecol 2006; 15:1507-17. [PMID: 16629807 DOI: 10.1111/j.1365-294x.2006.02865.x] [Citation(s) in RCA: 116] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The existence of a large-scale population structure was investigated in Arabidopsis thaliana by studying patterns of polymorphism in a set of 71 European accessions. We used sequence polymorphism surveyed in 10 fragments of approximately 600 nucleotides and a set of nine microsatellite markers. Population structure was investigated using a model-based inference framework. Among the accessions studied, the presence of four groups was inferred using genetic data, without using prior information on the geographical origin of the accessions. Significant genetic isolation by geographical distance was detected at the group level, together with a geographical gradient in allelic richness across groups. These results are discussed with respect to the previously proposed scenario of postglacial colonization of Europe from putative glacial refugia. Finally, the contribution of the inferred structure to linkage disequilibrium among 171 pairs of essentially unlinked markers was also investigated. Linkage disequilibrium analysis revealed that significant associations detected in the whole sample were mainly due to genetic differentiation among the inferred groups. We discuss the implication of this finding for future association studies in A. thaliana.
Collapse
Affiliation(s)
- Marie-France Ostrowski
- UMR 1097 Diversité et Génomes des Plantes Cultivées, INRA, Domaine de Melgueil, Mauguio, France.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Plagnol V, Padhukasahasram B, Wall JD, Marjoram P, Nordborg M. Relative influences of crossing over and gene conversion on the pattern of linkage disequilibrium in Arabidopsis thaliana. Genetics 2006; 172:2441-8. [PMID: 16299388 PMCID: PMC1456404 DOI: 10.1534/genetics.104.040311] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2004] [Accepted: 11/01/2005] [Indexed: 01/16/2023] Open
Abstract
In this article we infer the rates of gene conversion and crossing over in Arabidopsis thaliana from population genetic data. Our data set is a genomewide survey consisting of 1347 fragments of length 600 bp sequenced in 96 accessions. It has several orders of magnitude more markers than any previous nonhuman study. This allows for more accurate inference as well as a detailed comparison between theoretical expectations and observations. Our methodology is specifically set to account for deviations such as recurrent mutations or a skewed frequency spectrum. We found that even if some components of the model clearly do not fit, the pattern of LD conforms to theoretical expectations quite well. The ratio of gene conversion to crossing over is estimated to be around one. We also find evidence for fine-scale variations of the crossing-over rate.
Collapse
Affiliation(s)
- Vincent Plagnol
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California 90089-2910, USA.
| | | | | | | | | |
Collapse
|
24
|
Caldwell KS, Russell J, Langridge P, Powell W. Extreme population-dependent linkage disequilibrium detected in an inbreeding plant species, Hordeum vulgare. Genetics 2005; 172:557-67. [PMID: 16219791 PMCID: PMC1456183 DOI: 10.1534/genetics.104.038489] [Citation(s) in RCA: 205] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In human genetics a detailed knowledge of linkage disequilibrium (LD) is considered a prerequisite for effective population-based, high-resolution gene mapping and cloning. Similar opportunities exist for plants; however, differences in breeding system and population history need to be considered. Here we report a detailed study of localized LD in different populations of an inbreeding crop species. We measured LD between and within four gene loci within the region surrounding the hardness locus in three different gene pools of barley (Hordeum vulgare). We demonstrate that LD extends to at least 212 kb in elite barley cultivars but is rapidly eroded in related inbreeding ancestral populations. Our results indicate that haplotype-based sequence analysis in multiple populations will provide new opportunities to adjust the resolution of association studies in inbreeding crop species.
Collapse
|
25
|
Hamblin MT, Salas Fernandez MG, Casa AM, Mitchell SE, Paterson AH, Kresovich S. Equilibrium processes cannot explain high levels of short- and medium-range linkage disequilibrium in the domesticated grass Sorghum bicolor. Genetics 2005; 171:1247-56. [PMID: 16157678 PMCID: PMC1456844 DOI: 10.1534/genetics.105.041566] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Patterns of linkage disequilibrium (LD) are of interest because they provide evidence of both equilibrium (e.g., mating system or long-term population structure) and nonequilibrium (e.g., demographic or selective) processes, as well as because of their importance in strategies for identifying the genetic basis of complex phenotypes. We report patterns of short and medium range (up to 100 kb) LD in six unlinked genomic regions in the partially selfing domesticated grass, Sorghum bicolor. The extent of allelic associations in S. bicolor, as assessed by pairwise measures of LD, is higher than in maize but lower than in Arabidopsis, in qualitative agreement with expectations based on mating system. Quantitative analyses of the population recombination parameter, rho, however, based on empirical estimates of rates of recombination, mutation, and self-pollination, show that LD is more extensive than expected under a neutral equilibrium model. The disparity between rho and the population mutation parameter, , is similar to that observed in other species whose population history appears to be complex. From a practical standpoint, these results suggest that S. bicolor is well suited for association studies using reasonable numbers of markers, since LD typically extends at least several kilobases but has largely decayed by 15 kb.
Collapse
Affiliation(s)
- Martha T Hamblin
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | | | |
Collapse
|
26
|
Tan YY, Xu HH, Tao WJ, Hoffmann MH, Wang XF, Lu YT. Transgenic GFP as a molecular marker for approaches to quantify pollination mechanism and gene flow in Arabidopsis thaliana. PLANT BIOLOGY (STUTTGART, GERMANY) 2005; 7:405-10. [PMID: 16025413 DOI: 10.1055/s-2005-837734] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Arabidopsis thaliana is commonly regarded as a self-pollinated plant. We observed that the stigma in each flower of A. thaliana cannot be pollinated by its own pollen in the early phases of the flowering process, when the anthers had dehisced but the filaments were still too short for the pollen to be deposited on the stigma. In the later stages, after elongation of the filaments, self-pollination can occur. After artificial pollination of the flower of a wild plant with GFP transgenic pollen grains in earlier stages of flowering, GFP expressed within epidermal cells was detected in some of the offspring (26.1-57.1 %). Wind-mediated pollen dispersal was poor but is likely to exist in natural habitats, while insects were observed visiting flowers of A. thaliana in natural and experimental populations. We constructed an experimental population consisting of 28 GFP transgenic plants and 240 wild plants and examined gene flow in the population. The result was that the distance of gene flow was limited to 0.5 m. 22 offspring with expressed GFP were found in 28,299 filial individuals examined, which suggested a relatively low outcrossing rate (0.74%). We conclude that outcrossing in populations of A. thaliana is mainly due to insect pollination. The data on gene flow could be useful to assess the ecological hazards of experimental transgene combinations.
Collapse
Affiliation(s)
- Y-Y Tan
- Key Laboratory of MOE for Developmental Biology, College of Life Sciences, Wuhan University, Wuhan, P.R. China
| | | | | | | | | | | |
Collapse
|
27
|
Benovoy D, Morris RT, Morin A, Drouin G. Ectopic Gene Conversions Increase the G + C Content of Duplicated Yeast and Arabidopsis Genes. Mol Biol Evol 2005; 22:1865-8. [PMID: 15917495 DOI: 10.1093/molbev/msi176] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Allelic recombination has previously been shown to increase the GC-content of the sequences of a wide variety of eukaryotic species. Ectopic recombination between clustered tandemly repeated genes has also been shown to increase their GC-content. Here we show that gene conversions between the dispersed genes found in the duplicated regions of the yeast and Arabidopsis genomes also increase their GC-content when these genes are more than 88% similar.
Collapse
Affiliation(s)
- David Benovoy
- Département de biologie, Université d'Ottawa, Ottawa, Ontario, Canada
| | | | | | | |
Collapse
|
28
|
Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, Zheng H, Bakker E, Calabrese P, Gladstone J, Goyal R, Jakobsson M, Kim S, Morozov Y, Padhukasahasram B, Plagnol V, Rosenberg NA, Shah C, Wall JD, Wang J, Zhao K, Kalbfleisch T, Schulz V, Kreitman M, Bergelson J. The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol 2005; 3:e196. [PMID: 15907155 PMCID: PMC1135296 DOI: 10.1371/journal.pbio.0030196] [Citation(s) in RCA: 677] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2004] [Accepted: 03/31/2005] [Indexed: 11/28/2022] Open
Abstract
We resequenced 876 short fragments in a sample of 96 individuals of Arabidopsis thaliana that included stock center accessions as well as a hierarchical sample from natural populations. Although A. thaliana is a selfing weed, the pattern of polymorphism in general agrees with what is expected for a widely distributed, sexually reproducing species. Linkage disequilibrium decays rapidly, within 50 kb. Variation is shared worldwide, although population structure and isolation by distance are evident. The data fail to fit standard neutral models in several ways. There is a genome-wide excess of rare alleles, at least partially due to selection. There is too much variation between genomic regions in the level of polymorphism. The local level of polymorphism is negatively correlated with gene density and positively correlated with segmental duplications. Because the data do not fit theoretical null distributions, attempts to infer natural selection from polymorphism data will require genome-wide surveys of polymorphism in order to identify anomalous regions. Despite this, our data support the utility of A. thaliana as a model for evolutionary functional genomics. A systematic global survey of genomic DNA sequence polymorphism in Arabidopsis thaliana reveals that standard genetic tests for selection do not apply to this species but supports its status as a model organism.
Collapse
Affiliation(s)
- Magnus Nordborg
- Molecular and Computational Biology, University of Southern California, Los Angeles, California, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Kroymann J, Mitchell-Olds T. Epistasis and balanced polymorphism influencing complex trait variation. Nature 2005; 435:95-8. [PMID: 15875023 DOI: 10.1038/nature03480] [Citation(s) in RCA: 235] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2004] [Accepted: 02/17/2005] [Indexed: 01/13/2023]
Abstract
Complex traits such as human disease, growth rate, or crop yield are polygenic, or determined by the contributions from numerous genes in a quantitative manner. Although progress has been made in identifying major quantitative trait loci (QTL), experimental constraints have limited our knowledge of small-effect QTL, which may be responsible for a large proportion of trait variation. Here, we identified and dissected a one-centimorgan chromosome interval in Arabidopsis thaliana without regard to its effect on growth rate, and examined the signature of historical sequence polymorphism among Arabidopsis accessions. We found that the interval contained two growth rate QTL within 210 kilobases. Both QTL showed epistasis; that is, their phenotypic effects depended on the genetic background. This amount of complexity in such a small area suggests a highly polygenic architecture of quantitative variation, much more than previously documented. One QTL was limited to a single gene. The gene in question displayed a nucleotide signature indicative of balancing selection, and its phenotypic effects are reversed depending on genetic background. If this region typifies many complex trait loci, then non-neutral epistatic polymorphism may be an important contributor to genetic variation in complex traits.
Collapse
Affiliation(s)
- Juergen Kroymann
- Max Planck Institute for Chemical Ecology, Department of Genetics & Evolution, Hans-Knoell-Str. 8, D-07745 Jena, Germany.
| | | |
Collapse
|
30
|
Gupta PK, Rustgi S, Kulwal PL. Linkage disequilibrium and association studies in higher plants: present status and future prospects. PLANT MOLECULAR BIOLOGY 2005; 57:461-85. [PMID: 15821975 DOI: 10.1007/s11103-005-0257-z] [Citation(s) in RCA: 284] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2004] [Accepted: 01/04/2005] [Indexed: 05/19/2023]
Abstract
During the last two decades, DNA-based molecular markers have been extensively utilized for a variety of studies in both plant and animal systems. One of the major uses of these markers is the construction of genome-wide molecular maps and the genetic analysis of simple and complex traits. However, these studies are generally based on linkage analysis in mapping populations, thus placing serious limitations in using molecular markers for genetic analysis in a variety of plant systems. Therefore, alternative approaches have been suggested, and one of these approaches makes use of linkage disequilibrium (LD)-based association analysis. Although this approach of association analysis has already been used for studies on genetics of complex traits (including different diseases) in humans, its use in plants has just started. In the present review, we first define and distinguish between LD and association mapping, and then briefly describe various measures of LD and the two methods of its depiction. We then give a list of different factors that affect LD without discussing them, and also discuss the current issues of LD research in plants. Later, we also describe the various uses of LD in plant genomics research and summarize the present status of LD research in different plant genomes. In the end, we discuss briefly the future prospects of LD research in plants, and give a list of softwares that are useful in LD research, which is available as electronic supplementary material (ESM).
Collapse
Affiliation(s)
- Pushpendra K Gupta
- Molecular Biology Laboratory, Department of Genetics & Plant Breeding, Ch. Charan Singh University, Meerut 250 004 (UP), India.
| | | | | |
Collapse
|
31
|
Abstract
We introduce a new method for jointly estimating crossing-over and gene conversion rates using sequence polymorphism data. The method calculates probabilities for subsets of the data consisting of three segregating sites and then forms a composite likelihood by multiplying together the probabilities of many subsets. Simulations show that this new method performs better than previously proposed methods for estimating gene conversion rates, but that all methods require large amounts of data to provide reliable estimates. While existing methods can easily estimate an "average" gene conversion rate over many loci, they cannot reliably estimate gene conversion rates for a single region of the genome.
Collapse
Affiliation(s)
- Jeffrey D Wall
- Program in Molecular and Computational Biology, University of Southern California, Los Angeles, California 90089, USA.
| |
Collapse
|
32
|
Schmid KJ, Ramos-Onsins S, Ringys-Beckstein H, Weisshaar B, Mitchell-Olds T. A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 2005; 169:1601-15. [PMID: 15654111 PMCID: PMC1449538 DOI: 10.1534/genetics.104.033795] [Citation(s) in RCA: 174] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The simultaneous analysis of multiple genomic loci is a powerful approach to studying the effects of population history and natural selection on patterns of genetic variation of a species. By surveying nucleotide sequence polymorphism at 334 randomly distributed genomic regions in 12 accessions of Arabidopsis thaliana, we examined whether a standard neutral model of nucleotide sequence polymorphism is consistent with observed data. The average nucleotide diversity was 0.0071 for total sites and 0.0083 for silent sites. Although levels of diversity are variable among loci, no correlation with local recombination rate was observed, but polymorphism levels were correlated for physically linked loci (<250 kb). We found that observed distributions of Tajima's D- and D/D(min)- and of Fu and Li's D-, D*- and F-, F*-statistics differed significantly from the expected distributions under a standard neutral model due to an excess of rare polymorphisms and high variances. Observed and expected distributions of Fay and Wu's H were not different, suggesting that demographic processes and not selection at multiple loci are responsible for the deviation from a neutral model. Maximum-likelihood comparisons of alternative demographic models like logistic population growth, glacial refugia, or past bottlenecks did not produce parameter estimates that were more consistent with observed patterns. However, exclusion of highly polymorphic "outlier loci" resulted in a fit to the logistic growth model. Various tests of neutrality revealed a set of candidate loci that may evolve under selection.
Collapse
Affiliation(s)
- Karl J Schmid
- Department of Genetics and Evolution, Max-Planck Institute of Chemical Ecology, Jena, Germany.
| | | | | | | | | |
Collapse
|
33
|
Stranger BE, Mitchell-Olds T. Nucleotide variation at the myrosinase-encoding locus, TGG1, and quantitative myrosinase enzyme activity variation in Arabidopsis thaliana. Mol Ecol 2004; 14:295-309. [PMID: 15643972 DOI: 10.1111/j.1365-294x.2004.02403.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The Arabidopsis thaliana TGG1 gene encodes thioglucoside glucohydrolase (myrosinase), an enzyme catalysing the hydrolysis of glucosinolate compounds. The enzyme is involved in plant defence against some insect herbivores, and is present in species of the order Capparales (Brassicales). Nucleotide variation was surveyed by sequencing c. 2.4 kb of the TGG1 locus in a sample of 28 worldwide A. thaliana accessions, and one Arabidopsis lyrata ssp. lyrata individual. Myrosinase activity was quantified for 27 of these same A. thaliana accessions, plus five additional others. Overall, estimated nucleotide diversity in A. thaliana was low compared to other published A. thaliana surveys, and the frequency distribution was skewed toward an excess of low-frequency variants. Furthermore, comparison to the outgroup species A. lyrata demonstrated that A. thaliana exhibited an excess of high-frequency derived variants relative to a neutral equilibrium model, suggesting a selective sweep. A. thaliana accessions differed significantly in total myrosinase activity, but analysis of variance detected no statistical evidence for an association between quantitative enzyme activity and alleles at the TGG1 myrosinase-encoding locus. We thus conclude that other, unsurveyed factors primarily affect the observed myrosinase activity levels in this species. The pattern of nucleotide variation was consistent with a model of positive selection but might also be compatible with a completely neutral model that takes into account the metapopulation behaviour of this highly inbreeding species which experienced a relatively recent worldwide expansion.
Collapse
Affiliation(s)
- Barbara E Stranger
- Department of Genetics and Evolution, Max Planck Institute of Chemical Ecology, Jena, Germany.
| | | |
Collapse
|
34
|
Clauss MJ, Mitchell-Olds T. Functional divergence in tandemly duplicated Arabidopsis thaliana trypsin inhibitor genes. Genetics 2004; 166:1419-36. [PMID: 15082560 PMCID: PMC1470761 DOI: 10.1534/genetics.166.3.1419] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
In multigene families, variation among loci and alleles can contribute to trait evolution. We explored patterns of functional and genetic variation in six duplicated Arabidopsis thaliana trypsin inhibitor (ATTI) loci. We demonstrate significant variation in constitutive and herbivore-induced transcription among ATTI loci that show, on average, 65% sequence divergence. Significant variation in ATTI expression was also found between two molecularly defined haplotype classes. Population genetic analyses for 17 accessions of A. thaliana showed that six ATTI loci arranged in tandem within 10 kb varied 10-fold in nucleotide diversity, from 0.0009 to 0.0110, and identified a minimum of six recombination events throughout the tandem array. We observed a significant peak in nucleotide and indel polymorphism spanning ATTI loci in the interior of the array, due primarily to divergence between the two haplotype classes. Significant deviation from the neutral equilibrium model for individual genes was interpreted within the context of intergene linkage disequilibrium and correlated patterns of functional differentiation. In contrast to the outcrosser Arabidopsis lyrata for which recombination is observed even within ATTI loci, our data suggest that response to selection was slowed in the inbreeding, annual A. thaliana because of interference among functionally divergent ATTI loci.
Collapse
Affiliation(s)
- M J Clauss
- Department of Genetics and Evolution, Max Planck Institute of Chemical Ecology, 07745 Jena, Germany.
| | | |
Collapse
|
35
|
Wright SI, Gaut BS. Molecular Population Genetics and the Search for Adaptive Evolution in Plants. Mol Biol Evol 2004; 22:506-19. [PMID: 15525701 DOI: 10.1093/molbev/msi035] [Citation(s) in RCA: 201] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The first papers on plant molecular population genetics were published approximately 10 years ago. Since that time, well over 50 additional studies of plant nucleotide polymorphism have been published, and many of these studies focused on detecting the signature of balancing or positive selection at a locus. In this review, we discuss some of the theoretical and statistical issues surrounding the detection of selection, with focus on plant populations, and we also summarize the empirical plant molecular population genetics literature. At face value, the literature suggests that a history of balancing or positive selection in plant genes is rampant. In two well-studied taxa (maize and Arabidopsis) over 20% of studied genes have been interpreted as containing the signature of selection. We argue that this is probably an overstatement of the prevalence of natural selection in plant genomes, for two reasons. First, demographic effects are difficult to incorporate and have generally not been well integrated into the plant population genetics literature. Second, the genes studied to date are not a random sample, so selected genes may be overrepresented. The next generation of studies in plant molecular population genetics requires additional sampling of local populations, explicit comparisons among loci, and improved theoretical methods to control for demography. Eventually, candidate loci should be confirmed by explicit consideration of phenotypic effects.
Collapse
Affiliation(s)
- Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of California, Irvine, USA
| | | |
Collapse
|
36
|
Kuang H, Woo SS, Meyers BC, Nevo E, Michelmore RW. Multiple genetic processes result in heterogeneous rates of evolution within the major cluster disease resistance genes in lettuce. THE PLANT CELL 2004; 16:2870-94. [PMID: 15494555 PMCID: PMC527186 DOI: 10.1105/tpc.104.025502] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2004] [Accepted: 09/02/2004] [Indexed: 05/05/2023]
Abstract
Resistance Gene Candidate2 (RGC2) genes belong to a large, highly duplicated family of nucleotide binding site-leucine rich repeat (NBS-LRR) encoding disease resistance genes located at a single locus in lettuce (Lactuca sativa). To investigate the genetic events occurring during the evolution of this locus, approximately 1.5- to 2-kb 3' fragments of 126 RGC2 genes from seven genotypes were sequenced from three species of Lactuca, and 107 additional RGC2 sequences were obtained from 40 wild accessions of Lactuca spp. The copy number of RGC2 genes varied from 12 to 32 per genome in the seven genotypes studied extensively. LRR number varied from 40 to 47; most of this variation had resulted from 13 events duplicating two to five LRRs because of unequal crossing-over within or between RGC2 genes at one of two recombination hot spots. Two types of RGC2 genes (Type I and Type II) were initially distinguished based on the pattern of sequence identities between their 3' regions. The existence of two types of RGC2 genes was further supported by intron similarities, the frequency of sequence exchange, and their prevalence in natural populations. Type I genes are extensive chimeras caused by frequent sequence exchanges. Frequent sequence exchanges between Type I genes homogenized intron sequences, but not coding sequences, and obscured allelic/orthologous relationships. Sequencing of Type I genes from additional wild accessions confirmed the high frequency of sequence exchange and the presence of numerous chimeric RGC2 genes in nature. Unlike Type I genes, Type II genes exhibited infrequent sequence exchange between paralogous sequences. Type II genes from different genotype/species within the genus Lactuca showed obvious allelic/orthologous relationships. Trans-specific polymorphism was observed for different groups of orthologs, suggesting balancing selection. Unequal crossover, insertion/deletion, and point mutation events were distributed unequally through the gene. Different evolutionary forces have impacted different parts of the LRR.
Collapse
Affiliation(s)
- Hanhui Kuang
- Department of Vegetable Crops, University of California, Davis, California 95616, USA
| | | | | | | | | |
Collapse
|
37
|
Abstract
The genomics tools available for studying Arabidopsis thaliana are a great resource for researchers trying to characterize and understand the genetic basis of natural variation. Abundant polymorphic markers aid quantitative trait locus (QTL) mapping, the fully sequenced genome provides rapid identification of candidate loci, and extensive knockout collections allow those candidate loci to be tested. Combining QTL mapping of classic phenotypic traits with biochemical or expression analysis is providing mechanistic insight into the traits of interest. Conversely, natural variation studies are now being done on genomic traits such as methylation or chiasma frequency.
Collapse
Affiliation(s)
- Julin N Maloof
- Division of Biological Sciences, One Shields Ave, University of California, Davis, CA 95616, USA.
| |
Collapse
|
38
|
Wenzl P, Carling J, Kudrna D, Jaccoud D, Huttner E, Kleinhofs A, Kilian A. Diversity Arrays Technology (DArT) for whole-genome profiling of barley. Proc Natl Acad Sci U S A 2004; 101:9915-20. [PMID: 15192146 PMCID: PMC470773 DOI: 10.1073/pnas.0401076101] [Citation(s) in RCA: 286] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2004] [Indexed: 11/18/2022] Open
Abstract
Diversity Arrays Technology (DArT) can detect and type DNA variation at several hundred genomic loci in parallel without relying on sequence information. Here we show that it can be effectively applied to genetic mapping and diversity analyses of barley, a species with a 5,000-Mbp genome. We tested several complexity reduction methods and selected two that generated the most polymorphic genomic representations. Arrays containing individual fragments from these representations generated DArT fingerprints with a genotype call rate of 98.0% and a scoring reproducibility of at least 99.8%. The fingerprints grouped barley lines according to known genetic relationships. To validate the Mendelian behavior of DArT markers, we constructed a genetic map for a cross between cultivars Steptoe and Morex. Nearly all polymorphic array features could be incorporated into one of seven linkage groups (98.8%). The resulting map comprised approximately 385 unique DArT markers and spanned 1,137 centimorgans. A comparison with the restriction fragment length polymorphism-based framework map indicated that the quality of the DArT map was equivalent, if not superior, to that of the framework map. These results highlight the potential of DArT as a generic technique for genome profiling in the context of molecular breeding and genomics.
Collapse
Affiliation(s)
- Peter Wenzl
- Center for the Application of Molecular Biology to International Agriculture, G.P.O. Box 3200, Canberra, ACT 2601, Australia
| | | | | | | | | | | | | |
Collapse
|
39
|
Abstract
Interpreting the functional content of a given genomic sequence is one of the central challenges of biology today. Perhaps the most promising approach to this problem is based on the comparative method of classic biology in the modern guise of sequence comparison. For instance, protein-coding regions tend to be conserved between species. Hence, a simple method for distinguishing a functional exon from the chance absence of stop codons is to investigate its homologue from closely related species. Predicting regulatory elements is even more difficult than exon prediction, but again, comparisons pinpointing conserved sequence motifs upstream of translation start sites are helping to unravel gene regulatory networks. In addition to interspecific studies, intraspecific sequence comparison yields insights into the evolutionary forces that have acted on a species in the past. Of particular interest here is the identification of selection events such as selective sweeps. Both intra- and interspecific sequence comparisons are based on a variety of computational methods, including alignment, phylogenetic reconstruction, and coalescent theory. This article surveys the biology and the central computational ideas applied in recent comparative genomics projects. We argue that the most fruitful method of understanding the functional content of genomes is to study them in the context of related genomic sequences. In particular, such a study may reveal selection, a fundamental pointer to biological relevance.
Collapse
Affiliation(s)
- Bernhard Haubold
- Fachbereich Biotechnologie & Bioinformatik, Fachhochschule Weihenstephan, 85350 Freising, Germany.
| | | |
Collapse
|
40
|
McKhann HI, Camilleri C, Bérard A, Bataillon T, David JL, Reboud X, Le Corre V, Caloustian C, Gut IG, Brunel D. Nested core collections maximizing genetic diversity in Arabidopsis thaliana. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2004; 38:193-202. [PMID: 15053772 DOI: 10.1111/j.1365-313x.2004.02034.x] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The successful exploitation of natural genetic diversity requires a basic knowledge of the extent of the variation present in a species. To study natural variation in Arabidopsis thaliana, we defined nested core collections maximizing the diversity present among a worldwide set of 265 accessions. The core collections were generated based on DNA sequence data from a limited number of fragments evenly distributed in the genome and were shown to successfully capture the molecular diversity in other loci as well as the morphological diversity. The core collections are available to the scientific community and thus provide an important resource for the study of genetic variation and its functional consequences in Arabidopsis. Moreover, this strategy can be used in other species to provide a rational framework for undertaking diversity surveys, including single nucleotide polymorphism (SNP) discovery and phenotyping, allowing the utilization of genetic variation for the study of complex traits.
Collapse
Affiliation(s)
- Heather I McKhann
- Centre National de Génotypage, 2 rue Gaston Crémieux, 91057 Evry Cedex, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Schmuths H, Hoffmann MH, Bachmann K. Geographic distribution and recombination of genomic fragments on the short arm of chromosome 2 of Arabidopsis thaliana. PLANT BIOLOGY (STUTTGART, GERMANY) 2004; 6:128-139. [PMID: 15045663 DOI: 10.1055/s-2004-817837] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Range expansion from Pleistocene refugia and anthropogenic influences contribute to the present distribution pattern of Arabidopsis thaliana. We scored a genome-wide set of CAPSs and found two markers with an east-west geographic distribution across the Eurasian range of the species. Regions around the two SNPs were sequenced in 98 accessions, including newly collected plants from Middle Asia and Western Siberia. These regions correspond to a gene ( approximately 1500 bp) and a non-coding region ( approximately 500 bp) 300 kbp apart on chromosome 2. Nucleotide diversities, pi, of the two sequenced fragments were 0.0032 and 0.0130. The haplotypes of both sequences belonged to one of two groups: a rather uniform "Asian" and a more variable "European" haplotype group, on the basis of non-disjunct clusters of SNPs. Recombination between "Asian" and "European" haplotypes occurs where they meet. Especially in the "European" haplotype, many rare SNP variants representing independent mutations are scattered among the shared haplotype-specific SNPs. This agrees with previous suggestions of two large haplotype groups in A. thaliana and the post-glacial colonization of central Europe from the east and the west. A clear correlation between climatic factors and the haplotype distribution may reflect the dispersal history rather than local climate adaptation. The pattern of SNP variation within the contiguous sequences explains why only a minority of SNPs selected across the genome show evidence of this geographic pattern.
Collapse
Affiliation(s)
- H Schmuths
- Department of Taxonomy, Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany.
| | | | | |
Collapse
|
42
|
Molinier J, Ries G, Bonhoeffer S, Hohn B. Interchromatid and interhomolog recombination in Arabidopsis thaliana. THE PLANT CELL 2004; 16:342-52. [PMID: 14729918 PMCID: PMC341908 DOI: 10.1105/tpc.019042] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2003] [Accepted: 12/03/2003] [Indexed: 05/18/2023]
Abstract
Intermolecular recombination events were monitored in Arabidopsis thaliana lines using specially designed recombination traps consisting of tandem disrupted beta-glucuronidase or luciferase reporter genes in direct repeat orientation. Recombination frequencies (RFs) varied between the different lines, indicating possible position effects influencing intermolecular recombination processes. The RFs between sister chromatids and between homologous chromosomes were measured in plants either hemizygous or homozygous for a transgene locus. The RFs in homozygous plants exceeded those of hemizygous plants by a factor of >2, implying that in somatic plant cells both sister chromatid recombination and recombination between homologous chromosomes exist for recombinational DNA repair. In addition, different DNA-damaging agents stimulated recombination in homozygous and hemizygous plants to different extents in a manner dependent on the type of DNA damage and on the genomic region. The genetic and molecular analysis of recombination events showed that most of the somatic recombination events result from gene conversion, although a pop-out event has also been characterized.
Collapse
Affiliation(s)
- Jean Molinier
- Friedrich Miescher Institute, CH-4058 Basel, Switzerland.
| | | | | | | |
Collapse
|
43
|
Udupa SM, Malhotra RS, Baum M. Tightly linked di- and tri-nucleotide microsatellites do not evolve in complete independence: evidence from linked (TA)n and (TAA)n microsatellites of chickpea (Cicer arietinum L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:550-557. [PMID: 14564394 DOI: 10.1007/s00122-003-1458-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2003] [Accepted: 08/13/2003] [Indexed: 05/24/2023]
Abstract
In order to understand the dynamics of microsatellite evolution, we have studied allelic variation at a closely linked (TA)(n) and (TAA)(n) microsatellite loci in 114 land races of chickpea ( Cicer arietinum L.), sampled worldwide. These two loci are separated by 27 bp. The two loci showed a very high degree of polymorphism and hence the combined length with the genetic diversity of 0.93, 0.90 and 0.98 for (TAA)(n), (TA)(n) and the combined length, respectively. Using the variation data at the linked loci, a standardized index of linkage disequilibrium was also computed ( I(S)(A)=0.092), which tests the null hypothesis of no linkage and was significant, indicating the presence of linkage disequilibrium. Furthermore, the dynamics of allelic variation showed that there is a threshold combined length, below which both (TAA)(n) and (TA)(n) loci evolve independently, and above which, if one locus increase in size, the other closely linked locus has a tendency to decrease its size and vice versa, without change in the overall ratio of (TAA)(n) and (TA)(n) allele sizes at the region. This result indicates that there are processes in the cell, which 'read' the combined size of the two loci both for proportion and length and determine the direction of tightly linked di- and tri-nucleotide repeat evolution.
Collapse
Affiliation(s)
- S M Udupa
- International Center for Agricultural Research in the Dry Areas (ICARDA), P.O. Box 5466, Aleppo, Syria
| | | | | |
Collapse
|
44
|
Kroymann J, Donnerhacke S, Schnabelrauch D, Mitchell-Olds T. Evolutionary dynamics of an Arabidopsis insect resistance quantitative trait locus. Proc Natl Acad Sci U S A 2003; 100 Suppl 2:14587-92. [PMID: 14506289 PMCID: PMC304123 DOI: 10.1073/pnas.1734046100] [Citation(s) in RCA: 177] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Glucosinolate profiles differ among Arabidopsis thaliana ecotypes, caused by the composition of alleles at several glucosinolate biosynthetic loci. One of these, GS-Elong, harbors a family of methylthioalkylmalate synthase (MAM) genes that determine the side chain length of aliphatic glucosinolate structures. Fine mapping reveals that GS-Elong constitutes an insect resistance quantitative trait locus, caused by variation in glucosinolate profiles conferred by polymorphism of MAM alleles in this region. A sequence survey of randomly chosen ecotypes indicates that GS-Elong is highly variable among A. thaliana ecotypes: indel polymorphisms are frequent, as well as gene conversion events between gene copies arranged in tandem. Furthermore, statistical methods of molecular population genetics suggest that one of the genes, MAM2, is subject to balancing selection. This may be caused by ecological tradeoffs, i.e., by contrasting physiological effects of glucosinolates on generalist vs. specialist insects.
Collapse
Affiliation(s)
- Juergen Kroymann
- Department of Genetics and Evolution, Max Planck Institute for Chemical Ecology, Winzerlaer Strasse 10, 07745 Jena, Germany
| | | | | | | |
Collapse
|
45
|
Törjék O, Berger D, Meyer RC, Müssig C, Schmid KJ, Rosleff Sörensen T, Weisshaar B, Mitchell-Olds T, Altmann T. Establishment of a high-efficiency SNP-based framework marker set for Arabidopsis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2003; 36:122-140. [PMID: 12974817 DOI: 10.1046/j.1365-313x.2003.01861.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The major goal of this project was the establishment of a tool for rapid mapping of new mutations and genotyping in Arabidopsis consisting of at least 100 evenly spaced framework markers. We assembled a single nucleotide polymorphism (SNP)-based marker set consisting of 112 polymorphic sites with average spacing of 1.15 Mbp derived from an SNP database that we recently developed. This information was used to set up efficient SNP detection reactions based on multiplexed primer extension assays. The 112 Columbia (Col-0)/C24 framework markers were used to assemble 18 multiplexed SNaPshot assays with which up to eight separate loci can be genotyped in a single-tube/single-capillary format. In addition, for 110 framework markers matrix-assisted laser desorption/ionization time of flight (MALDI-ToF) assays have been established for high throughput analyses. We demonstrated the usefulness and the robustness of both procedures of this tool by genotyping 48 BC3F1 individuals created between the accessions Col-0 and C24. Subsets of 10-62 of the established markers discriminate between various combinations of the accessions Col-0, C24, Landsberg erecta (Ler), Cape Verdi Islands (Cvi) and Niederzenz (Nd). Using a subset of 17 evenly distributed and established SNP markers that are also polymorphic between Ler and Col-0, we were able to rapidly map a mutant gene (tbr1) to an interval of 2.3 Mbp in an Ler (tbr1) x Col-0 cross.
Collapse
Affiliation(s)
- O Törjék
- University of Potsdam, Institute of Biochemistry and Biology-Genetics, Golm, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Huang L, Brooks SA, Li W, Fellers JP, Trick HN, Gill BS. Map-based cloning of leaf rust resistance gene Lr21 from the large and polyploid genome of bread wheat. Genetics 2003; 164:655-64. [PMID: 12807786 PMCID: PMC1462593 DOI: 10.1093/genetics/164.2.655] [Citation(s) in RCA: 235] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We report the map-based cloning of the leaf rust resistance gene Lr21, previously mapped to a gene-rich region at the distal end of chromosome arm 1DS of bread wheat (Triticum aestivum L.). Molecular cloning of Lr21 was facilitated by diploid/polyploid shuttle mapping strategy. Cloning of Lr21 was confirmed by genetic transformation and by a stably inherited resistance phenotype in transgenic plants. Lr21 spans 4318 bp and encodes a 1080-amino-acid protein containing a conserved nucleotide-binding site (NBS) domain, 13 imperfect leucine-rich repeats (LRRs), and a unique 151-amino-acid sequence missing from known NBS-LRR proteins at the N terminus. Fine-structure genetic analysis at the Lr21 locus detected a noncrossover (recombination without exchange of flanking markers) within a 1415-bp region resulting from either a gene conversion tract of at least 191 bp or a double crossover. The successful map-based cloning approach as demonstrated here now opens the door for cloning of many crop-specific agronomic traits located in the gene-rich regions of bread wheat.
Collapse
Affiliation(s)
- Li Huang
- Wheat Genetics Resource Center, Department of Plant Pathology, Kansas State University, Manhattan 66506-5502, USA
| | | | | | | | | | | |
Collapse
|
47
|
Schmid KJ, Sorensen TR, Stracke R, Torjek O, Altmann T, Mitchell-Olds T, Weisshaar B. Large-scale identification and analysis of genome-wide single-nucleotide polymorphisms for mapping in Arabidopsis thaliana. Genome Res 2003; 13:1250-7. [PMID: 12799357 PMCID: PMC403656 DOI: 10.1101/gr.728603] [Citation(s) in RCA: 169] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2002] [Accepted: 03/19/2003] [Indexed: 01/19/2023]
Abstract
Genetic markers such as single nucleotide polymorphisms (SNPs) are essential tools for positional cloning, association, or quantitative trait locus mapping and the determination of genetic relationships between individuals. We identified and characterized a genome-wide set of SNP markers by generating 10,706 expressed sequence tags (ESTs) from cDNA libraries derived from 6 different accessions, and by analysis of 606 sequence tagged sites (STS) from up to 12 accessions of the model flowering plant Arabidopsis thaliana. The cDNA libraries for EST sequencing were made from individuals that were stressed by various means to enrich for transcripts from genes expressed under such conditions. SNPs discovered in these sequences may be useful markers for mapping genes involved in interactions with the biotic and abiotic environment. The STS loci are distributed randomly over the genome. By comparison with the Col-0 genome sequence, we identified a total of 8051 SNPs and 637 insertion/deletion polymorphisms (InDel). Analysis of STS-derived SNPs shows that most SNPs are rare, but that it is possible to identify intermediate frequency framework markers that can be used for genetic mapping in many different combinations of accessions. A substantial proportion of SNPs located in ORFs caused a change of the encoded amino acid. A comparison of the density of our SNP markers among accessions in both the EST and STS datasets, revealed that Cvi-0 is the most divergent accession from Col-0 among the 12 accessions studied. All of these markers are freely available via the internet.
Collapse
Affiliation(s)
- Karl J Schmid
- Max-Planck-Institute of Chemical Ecology, Jena, Germany
| | | | | | | | | | | | | |
Collapse
|
48
|
Borevitz JO, Nordborg M. The impact of genomics on the study of natural variation in Arabidopsis. PLANT PHYSIOLOGY 2003; 132:718-25. [PMID: 12805600 PMCID: PMC523862 DOI: 10.1104/pp.103.023549] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2003] [Revised: 03/18/2003] [Accepted: 03/19/2003] [Indexed: 05/18/2023]
Affiliation(s)
- Justin O Borevitz
- Plant Biology, Salk Institute, 10010 North Torrey Pines Rd, La Jolla, California 92037, USA
| | | |
Collapse
|
49
|
Abstract
Classical genetic studies show that gene conversion can favour some alleles over others. Molecular experiments suggest that gene conversion could favour GC over AT basepairs, leading to the concept of biased gene conversion towards GC (BGC(GC)). The expected consequence of such a process is the GC-enrichment of DNA sequences under gene conversion. Recent genomic work suggests that BGC(GC) affects the base composition of yeast, invertebrate and mammalian genomes. Hypotheses for the mechanisms and evolutionary origin of such a strange phenomenon have been proposed. Most BGC(GC) events probably occur during meiosis, which has implications for our understanding of the evolution of sex and recombination.
Collapse
Affiliation(s)
- Gabriel Marais
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, Scotland, UK.
| |
Collapse
|
50
|
Clauss MJ, Mitchell-Olds T. Population genetics of tandem trypsin inhibitor genes in Arabidopsis species with contrasting ecology and life history. Mol Ecol 2003; 12:1287-99. [PMID: 12694291 DOI: 10.1046/j.1365-294x.2003.01832.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Duplicated genes are important in the evolution and ecology of plant-defences because herbivore and pathogen attack can be countered via functional diversification at two levels: among duplicated loci and within loci. We explore molecular sequence variation for two members of a defence-related gene family, Arabidopsis thaliana trypsin inhibitors (ATTI), in A. thaliana and a closely related species, A. lyrata subspp. petraea. A worldwide sample of the inbreeding annual A. thaliana had less genetic variation at two ATTI loci (piTOTAL <or= 0.0006) than observed previously at other functional loci. A significant excess of high frequency derived alleles in the signal sequence and 5' UTR of ATTI2 was consistent with a model of positive selection. However, demographic processes such as population subdivision and expansion, both likely to have occurred in A. thaliana during the last 10 000 years, can also give rise to similar deviations from neutrality. A single population of A. lyrata subspp. petraea in Germany had up to an order of magnitude more standing genetic variation at ATTI loci than the species-wide sample of A. thaliana. Although the level of variability for ATTI1 and ATTI2 within this single population was similar to, or even greater than, observed species-wide diversity for other loci in A. lyrata, there was little evidence to reject an equilibrium neutral model. A spatially explicit sample of 87 A. lyrata subspp. petraea individuals detected outbreeding (FIS = -0.16; FIT = -0.15) but little population subdivision (FST = 0.006) in this self-incompatible perennial herb. Genetic differences between Arabidopsis species were consistent with, but not fully explained by, divergence in ecology and life history. Diversification appears to have occurred in different functional domains for the tandemly duplicated ATTI1 and ATTI2 genes; the majority of fixed replacements in ATTI1 surround the enzyme binding site of the mature protein, whereas in ATTI2 most functional evolutionary change is located in the signal peptide. This pattern is consistent with a hypothesis of subfunctionalization in trypsin inhibitory function.
Collapse
Affiliation(s)
- M J Clauss
- Department of Genetics and Evolution, Max Planck Institute of Chemical Ecology, Beutenberg Campus, Winzerlaer Str. 10, 07745 Jena, Germany.
| | | |
Collapse
|