551
|
Hunter B, Wright KM, Bomblies K. Short read sequencing in studies of natural variation and adaptation. CURRENT OPINION IN PLANT BIOLOGY 2013. [PMID: 23177206 DOI: 10.1016/j.pbi.2012.10.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Today's high throughput sequencing approaches, coupled with equally revolutionary advances in bioinformatics, allow us to describe and analyze genomes in unprecedented detail. Short Read Sequencing (SRS) approaches have been especially instrumental in bringing genomic analysis to a wide range of questions and species in plant biology. We can now connect genotypes and phenotypes with greater efficiency, and investigate the molecular basis of natural variation and adaptation in a genomic framework. New and creative applications of SRS and other genomic approaches are not only reshaping how we study natural variation, but also our overall understanding of gene and genome evolution. Here we discuss examples of the application of SRS technologies to the characterization of genetic diversity, genome evolution and adaptation in plants.
Collapse
Affiliation(s)
- Ben Hunter
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | | | | |
Collapse
|
552
|
Olson-Manning CF, Lee CR, Rausher MD, Mitchell-Olds T. Evolution of flux control in the glucosinolate pathway in Arabidopsis thaliana. Mol Biol Evol 2013; 30:14-23. [PMID: 22923463 PMCID: PMC3525143 DOI: 10.1093/molbev/mss204] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Network characteristics of biochemical pathways are believed to influence the rate of evolutionary change in constituent enzymes. One characteristic that may affect rate heterogeneity is control of the amount of product produced by a biochemical pathway or flux control. In particular, theoretical analyses suggest that adaptive substitutions should be concentrated in the enzyme(s) that exert the greatest control over flux. Although a handful of studies have found a correlation between position in a pathway and evolutionary rate, these investigations have not examined the relationship between evolutionary rate and flux control. Given that genes with greater control will experience stronger selection and that the probability of fixation is proportional to the selective advantage, we ask the following: 1) do upstream enzymes have majority flux control, 2) do enzymes with majority flux control accumulate adaptive substitutions, and 3) are upstream enzymes under higher selective constraint? First, by perturbing the enzymes in the aliphatic glucosinolate pathway in Arabidopsis thaliana with gene insertion lines, we show that flux control is focused in the first enzyme in the pathway. Next, by analyzing several sequence signatures of selection, we also show that this enzyme is the only one in the pathway that shows convincing evidence of selection. Our results support the hypothesis that natural selection preferentially acts on enzymes with high flux control.
Collapse
|
553
|
Moghe GD, Lehti-Shiu MD, Seddon AE, Yin S, Chen Y, Juntawong P, Brandizzi F, Bailey-Serres J, Shiu SH. Characteristics and significance of intergenic polyadenylated RNA transcription in Arabidopsis. PLANT PHYSIOLOGY 2013; 161:210-24. [PMID: 23132786 PMCID: PMC3532253 DOI: 10.1104/pp.112.205245] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2012] [Accepted: 10/21/2012] [Indexed: 05/23/2023]
Abstract
The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.
Collapse
MESH Headings
- Arabidopsis/genetics
- Arabidopsis/metabolism
- Base Sequence
- Conserved Sequence
- DNA, Intergenic/genetics
- DNA, Intergenic/metabolism
- DNA, Plant/genetics
- DNA, Plant/metabolism
- Evolution, Molecular
- Gene Expression Regulation, Plant
- Genes, Plant
- Molecular Sequence Annotation
- Plants, Genetically Modified/genetics
- Plants, Genetically Modified/metabolism
- Protein Biosynthesis
- Pseudogenes
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- RNA, Plant/genetics
- RNA, Plant/metabolism
- Ribosomes/genetics
- Ribosomes/metabolism
- Selection, Genetic
- Sequence Analysis, RNA
- Transcription, Genetic
Collapse
Affiliation(s)
- Gaurav D. Moghe
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Melissa D. Lehti-Shiu
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Alex E. Seddon
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Shan Yin
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Yani Chen
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Piyada Juntawong
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Federica Brandizzi
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Julia Bailey-Serres
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| | - Shin-Han Shiu
- Department of Plant Biology (G.D.M., M.D.L.-S., A.E.S., S.Y., Y.C., F.B., S.-H.S.), Programs in Genetics and Quantitative Biology (G.D.M., S.-H.S.), and Plant Research Laboratory (Y.C., F.B.), Michigan State University, East Lansing, Michigan 48824; and Center for Plant Cell Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 (P.J., J.B.-S.)
| |
Collapse
|
554
|
Yang R, Wang X. Organ evolution in angiosperms driven by correlated divergences of gene sequences and expression patterns. THE PLANT CELL 2013; 25:71-82. [PMID: 23341336 PMCID: PMC3584551 DOI: 10.1105/tpc.112.106716] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The evolution of a species involves changes in its genome and its transcriptome. Divergence in expression patterns may be more important than divergence in sequences for determining phenotypic changes, particularly among closely related species. We examined the relationships between organ evolution, sequence evolution, and expression evolution in Arabidopsis thaliana, rice (Oryza sativa), and maize (Zea mays). We found correlated divergence of gene sequences and expression patterns, with distinct divergence rates that depend on the organ types in which a gene is expressed. For instance, genes specifically expressed in reproductive organs (i.e., stamen) evolve more quickly than those specifically expressed in vegetative organs (e.g., root). The different rates in organ evolution may be due to different degrees of functional constraint associated with the different physiological functions of plant organs. Additionally, the evolutionary rate of a gene sequence is correlated with the breadth of its expression in terms of the number of tissues, the number of coregulation modules, and the number of species in which the gene is expressed, as well as the number of genes with which it may interact. This linkage supports the hypothesis that constitutively expressed genes may experience higher levels of functional constraint accumulated from multiple tissues than do tissue-specific genes.
Collapse
|
555
|
Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. PLANT METHODS 2013; 9:29. [PMID: 23876160 PMCID: PMC3750305 DOI: 10.1186/1746-4811-9-29] [Citation(s) in RCA: 814] [Impact Index Per Article: 74.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 06/13/2013] [Indexed: 05/17/2023]
Abstract
Over the last 10 years, high-density SNP arrays and DNA re-sequencing have illuminated the majority of the genotypic space for a number of organisms, including humans, maize, rice and Arabidopsis. For any researcher willing to define and score a phenotype across many individuals, Genome Wide Association Studies (GWAS) present a powerful tool to reconnect this trait back to its underlying genetics. In this review we discuss the biological and statistical considerations that underpin a successful analysis or otherwise. The relevance of biological factors including effect size, sample size, genetic heterogeneity, genomic confounding, linkage disequilibrium and spurious association, and statistical tools to account for these are presented. GWAS can offer a valuable first insight into trait architecture or candidate loci for subsequent validation.
Collapse
Affiliation(s)
- Arthur Korte
- Gregor Mendel Institute of Molecular Plant Biology, Vienna, Austria
| | - Ashley Farlow
- Gregor Mendel Institute of Molecular Plant Biology, Vienna, Austria
| |
Collapse
|
556
|
Elbaidouri M, Chaparro C, Panaud O. Use of Next Generation Sequencing (NGS) technologies for the genome-wide detection of transposition. Methods Mol Biol 2013; 1057:265-74. [PMID: 23918435 DOI: 10.1007/978-1-62703-568-2_19] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Plant transposable elements are ubiquitous in eukaryotes. Their propensity to densely populate the genomes of many plants and animal species has put them in the focus of both structural and functional genomics. Although a number of bioinformatic software have been recently developed for the annotation of TEs in sequenced genomes, there are very few computational tools strictly dedicated to the identification of active TEs using genome-wide approaches. In this paper, we describe SearchTESV, a pipeline that we have developed to detect Transposable Elements-associated structural variants (TEASVs) using Next Generation Sequencing (NGS) technologies.
Collapse
Affiliation(s)
- Moaine Elbaidouri
- Laboratoire Génome et développement des plantes, Université de Perpignan Via Domitia, Perpignan, Cedex, France
| | | | | |
Collapse
|
557
|
O'Rourke JA, Iniguez LP, Bucciarelli B, Roessler J, Schmutz J, McClean PE, Jackson SA, Hernandez G, Graham MA, Stupar RM, Vance CP. A re-sequencing based assessment of genomic heterogeneity and fast neutron-induced deletions in a common bean cultivar. FRONTIERS IN PLANT SCIENCE 2013; 4:210. [PMID: 23805147 PMCID: PMC3691542 DOI: 10.3389/fpls.2013.00210] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 06/03/2013] [Indexed: 05/22/2023]
Abstract
A small fast neutron (FN) mutant population has been established from Phaseolus vulgaris cv. Red Hawk. We leveraged the available P. vulgaris genome sequence and high throughput next generation DNA sequencing to examine the genomic structure of five P. vulgaris cv. Red Hawk FN mutants with striking visual phenotypes. Analysis of these genomes identified three classes of structural variation (SV); between cultivar variation, natural variation within the FN mutant population, and FN induced mutagenesis. Our analyses focused on the latter two classes. We identified 23 large deletions (>40 bp) common to multiple individuals, illustrating residual heterogeneity and regions of SV within the common bean cv. Red Hawk. An additional 18 large deletions were identified in individual mutant plants. These deletions, ranging in size from 40 bp to 43,000 bp, are potentially the result of FN mutagenesis. Six of the 18 deletions lie near or within gene coding regions, identifying potential candidate genes causing the mutant phenotype.
Collapse
Affiliation(s)
- Jamie A. O'Rourke
- Plant Science Research Unit, USDA-Agricultural Research ServiceSt. Paul, MN, USA
- Department of Agronomy and Plant Genetics, University of MinnesotaSt. Paul, MN, USA
- *Correspondence: Jamie A. O'Rourke, Plant Science Research Unit, USDA-Agricultural Research Service, 495 Borlaug Hall, 1991 Upper Buford Circle, University of Minnesota, St. Paul, MN 55108, USA e-mail:
| | - Luis P. Iniguez
- Centro de Ciencias Genomicas-Universidad Nacional Autonoma de MexicoCuernavaca, Mexico
| | - Bruna Bucciarelli
- Plant Science Research Unit, USDA-Agricultural Research ServiceSt. Paul, MN, USA
- Department of Agronomy and Plant Genetics, University of MinnesotaSt. Paul, MN, USA
| | - Jeffrey Roessler
- Department of Agronomy and Plant Genetics, University of MinnesotaSt. Paul, MN, USA
| | - Jeremy Schmutz
- Hudson Alpha Institute for BiotechnologyHuntsville, AL, USA
| | - Phillip E. McClean
- Department of Plant Sciences, North Dakota State UniversityFargo, ND, USA
| | - Scott A. Jackson
- Department of Crop and Soil Sciences, University of GeorgiaAthens, GA, USA
| | - Georgina Hernandez
- Centro de Ciencias Genomicas-Universidad Nacional Autonoma de MexicoCuernavaca, Mexico
| | - Michelle A. Graham
- Corn Insects and Crop Genetics Research Unit, USDA-Agricultural Research ServiceAmes, IA, USA
| | - Robert M. Stupar
- Department of Agronomy and Plant Genetics, University of MinnesotaSt. Paul, MN, USA
| | - Carroll P. Vance
- Plant Science Research Unit, USDA-Agricultural Research ServiceSt. Paul, MN, USA
- Department of Agronomy and Plant Genetics, University of MinnesotaSt. Paul, MN, USA
| |
Collapse
|
558
|
Capron A, Chang XF, Hall H, Ellis B, Beatson RP, Berleth T. Identification of quantitative trait loci controlling fibre length and lignin content in Arabidopsis thaliana stems. JOURNAL OF EXPERIMENTAL BOTANY 2013; 64:185-97. [PMID: 23136168 PMCID: PMC3528028 DOI: 10.1093/jxb/ers319] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Fibre properties and the biochemical composition of cell walls are important traits in many applications. For example, the lengths of fibres define the strength and quality of paper, and lignin content is a critical parameter for the use of biomass in biofuel production. Identifying genes controlling these traits is comparatively difficult in woody species, because of long generation times and limited amenability to high-resolution genetic mapping. To address this problem, this study mapped quantitative trait loci (QTLs) defining fibre length and lignin content in the Arabidopsis recombinant inbred line population Col-4 × Ler-0. Adapting high-throughput phenotyping techniques for both traits for measurements in Arabidopsis inflorescence stems identified significant QTLs for fibre length on chromosomes 2 and 5, as well as one significant QTL affecting lignin content on chromosome 2. For fibre length, total variation within the population was 208% higher than between parental lines and the identified QTLs explained 50.58% of the observed variation. For lignin content, the values were 261 and 26.51%, respectively. Bioinformatics analysis of the associated intervals identified a number of candidate genes for fibre length and lignin content. This study demonstrates that molecular mapping of QTLs pertaining to wood and fibre properties is possible in Arabidopsis, which substantially broadens the use of Arabidopsis as a model species for the functional characterization of plant genes.
Collapse
Affiliation(s)
- Arnaud Capron
- University of Toronto-CSB, 25 Willcocks Street, Toronto, ON, Canada, M5S 3B2
| | - Xue Feng Chang
- British Columbia Institute of Technology, 3700 Willingdon Avenue, Burnaby, BC, Canada, V5G 3H2
| | - Hardy Hall
- University of British Columbia – Michael Smith Laboratories, #301–2185 East Mall, Vancouver, BC, V6T 1Z4, Canada
| | - Brian Ellis
- University of British Columbia – Michael Smith Laboratories, #301–2185 East Mall, Vancouver, BC, V6T 1Z4, Canada
| | - Rodger P. Beatson
- British Columbia Institute of Technology, 3700 Willingdon Avenue, Burnaby, BC, Canada, V5G 3H2
| | - Thomas Berleth
- University of Toronto-CSB, 25 Willcocks Street, Toronto, ON, Canada, M5S 3B2
| |
Collapse
|
559
|
Michelmore RW, Christopoulou M, Caldwell KS. Impacts of resistance gene genetics, function, and evolution on a durable future. ANNUAL REVIEW OF PHYTOPATHOLOGY 2013; 51:291-319. [PMID: 23682913 DOI: 10.1146/annurev-phyto-082712-102334] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Studies on resistance gene function and evolution lie at the confluence of structural and molecular biology, genetics, and plant breeding. However, knowledge from these disparate fields has yet to be extensively integrated. This review draws on ideas and information from these different fields to elucidate the influences driving the evolution of different types of resistance genes in plants and the concurrent evolution of virulence in pathogens. It provides an overview of the factors shaping the evolution of recognition, signaling, and response genes in the context of emerging functional information along with a consideration of the new opportunities for durable resistance enabled by high-throughput DNA sequencing technologies.
Collapse
|
560
|
Structured patterns in geographic variability of metabolic phenotypes in Arabidopsis thaliana. Nat Commun 2012; 3:1319. [DOI: 10.1038/ncomms2333] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Accepted: 11/26/2012] [Indexed: 11/08/2022] Open
|
561
|
Bellos E, Johnson MR, Coin LJM. cnvHiTSeq: integrative models for high-resolution copy number variation detection and genotyping using population sequencing data. Genome Biol 2012; 13:R120. [PMID: 23259578 PMCID: PMC4056371 DOI: 10.1186/gb-2012-13-12-r120] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 12/22/2012] [Indexed: 02/08/2023] Open
Abstract
Recent advances in sequencing technologies provide the means for identifying copy number variation (CNV) at an unprecedented resolution. A single next-generation sequencing experiment offers several features that can be used to detect CNV, yet current methods do not incorporate all available signatures into a unified model. cnvHiTSeq is an integrative probabilistic method for CNV discovery and genotyping that jointly analyzes multiple features at the population level. By combining evidence from complementary sources, cnvHiTSeq achieves high genotyping accuracy and a substantial improvement in CNV detection sensitivity over existing methods, while maintaining a low false discovery rate. cnvHiTSeq is available at http://sourceforge.net/projects/cnvhitseq.
Collapse
|
562
|
dela Paz JS, Stronghill PE, Douglas SJ, Saravia S, Hasenkampf CA, Riggs CD. Chromosome fragile sites in Arabidopsis harbor matrix attachment regions that may be associated with ancestral chromosome rearrangement events. PLoS Genet 2012; 8:e1003136. [PMID: 23284301 PMCID: PMC3527283 DOI: 10.1371/journal.pgen.1003136] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2012] [Accepted: 10/17/2012] [Indexed: 11/18/2022] Open
Abstract
Mutations in the BREVIPEDICELLUS (BP) gene of Arabidopsis thaliana condition a pleiotropic phenotype featuring defects in internode elongation, the homeotic conversion of internode to node tissue, and downward pointing flowers and pedicels. We have characterized five mutant alleles of BP, generated by EMS, fast neutrons, x-rays, and aberrant T–DNA insertion events. Curiously, all of these mutagens resulted in large deletions that range from 140 kbp to over 900 kbp just south of the centromere of chromosome 4. The breakpoints of these mutants were identified by employing inverse PCR and DNA sequencing. The south breakpoints of all alleles cluster in BAC T12G13, while the north breakpoint locations are scattered. With the exception of a microhomology at the bp-5 breakpoint, there is no homology in the junction regions, suggesting that double-stranded breaks are repaired via non-homologous end joining. Southwestern blotting demonstrated the presence of nuclear matrix binding sites in the south breakpoint cluster (SBC), which is A/T rich and possesses a variety of repeat sequences. In situ hybridization on pachytene chromosome spreads complemented the molecular analyses and revealed heretofore unrecognized structural variation between the Columbia and Landsberg erecta genomes. Data mining was employed to localize other large deletions around the HY4 locus to the SBC region and to show that chromatin modifications in the region shift from a heterochromatic to euchromatic profile. Comparisons between the BP/HY4 regions of A. lyrata and A. thaliana revealed that several chromosome rearrangement events have occurred during the evolution of these two genomes. Collectively, the features of the region are strikingly similar to the features of characterized metazoan chromosome fragile sites, some of which are associated with karyotype evolution. Chromosome evolution involves both small-scale (e.g. single nucleotide) changes, as well as large-scale rearrangements such as inversions, translocations, and fusion events. We investigated mutations of the BREVIPEDICELLUS gene of Arabidopsis, which is a master regulator of inflorescence architecture. These mutations are not due to single nucleotide changes, but rather to large deletions, some spanning nearly one million base pairs. Molecular and biochemical analyses reveal that the chromosome breakpoints cluster in an area that is rich in repetitive elements and harbor multiple binding sites for nuclear matrix proteins. Data mining revealed intriguing correlations between the breakpoint cluster and hotspots of genetic recombination, regions of the chromosome that have undergone several rearrangement events during evolution, and changes in histone protein modifications. We propose that these unstable regions are chromosome fragile sites that assist in marking a boundary between gene-poor, transcriptionally repressed centromeric chromatin and a more relaxed chromatin domain that is gene-rich.
Collapse
Affiliation(s)
- Joelle S dela Paz
- Department of Biological Sciences and Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Canada
| | | | | | | | | | | |
Collapse
|
563
|
Takeuchi H, Higashiyama T. A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis. PLoS Biol 2012; 10:e1001449. [PMID: 23271953 PMCID: PMC3525529 DOI: 10.1371/journal.pbio.1001449] [Citation(s) in RCA: 205] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2012] [Accepted: 11/02/2012] [Indexed: 11/19/2022] Open
Abstract
AtLURE1 defensin-like peptides, which show species-specific evolution, are essential in Arabidopsis for attracting pollen tubes and can function in the breakdown of reproductive isolation barriers. Genes directly involved in male/female and host/parasite interactions are believed to be under positive selection. The flowering plant Arabidopsis thaliana has more than 300 defensin-like (DEFL) genes, which are likely to be involved in both natural immunity and cell-to-cell communication including pollen–pistil interactions. However, little is known of the relationship between the molecular evolution of DEFL genes and their functions. Here, we identified a recently evolved cluster of DEFL genes in A. thaliana and demonstrated that these DEFL (cysteine-rich peptide [CRP810_1]) peptides, named AtLURE1 peptides, are pollen tube attractants guiding pollen tubes to the ovular micropyle. The AtLURE1 genes formed the sole species-specific cluster among DEFL genes compared to its close relative, A. lyrata. No evidence for positive selection was detected in AtLURE1 genes and their orthologs, implying neutral evolution of AtLURE1 genes. AtLURE1 peptides were specifically expressed in egg-accompanying synergid cells and secreted toward the funicular surface through the micropyle. Genetic analyses showed that gametophytic mutants defective in micropylar guidance (myb98, magatama3, and central cell guidance) do not express AtLURE1 peptides. Downregulation of the expression of these peptides impaired precise pollen tube attraction to the micropylar opening of some populations of ovules. Recombinant AtLURE1 peptides attracted A. thaliana pollen tubes at a higher frequency compared to A. lyrata pollen tubes, suggesting that these peptides are species-preferential attractants in micropylar guidance. In support of this idea, the heterologous expression of a single AtLURE1 peptide in the synergid cell of Torenia fournieri was sufficient to guide A. thaliana pollen tubes to the T. fournieri embryo sac and to permit entry into it. Our results suggest the unique evolution of AtLURE1 genes, which are directly involved in male–female interaction among the DEFL multigene family, and furthermore suggest that these peptides are sufficient to overcome interspecific barriers in gametophytic attraction and penetration. Defensin-like (DEFL) peptides commonly function as effector peptides and are involved in male-female and host-parasite interactions in eukaryotes. In higher plants, DEFL genes belong to a large multigene family and are highly variable between species. However, little is known about the relationship between the molecular evolution of DEFL genes and their functions. By comparing multiply duplicated DEFL genes between A. thaliana and its close relative A. lyrata, we have now identified pollen tube attractant peptides called AtLURE1 peptides, in A. thaliana. We find that AtLURE1 genes form a species-specific gene cluster and that the AtLURE1 peptides these genes encode are specifically expressed in the synergid (egg-accompanying) cells and are secreted along the path down which the pollen tube elongates to reach the female gametophyte. AtLURE1 peptides attract pollen tubes in a species-preferential manner and their downregulation impairs pollen tube guidance. Interestingly, the genetic introduction of a single AtLURE1 gene from A. thaliana into another plant, T. fournieri, is sufficient to breakdown reproductive isolation barriers in pollen tube guidance and penetration. These results suggest that AtLURE1 peptides, which show species-specific evolution, are key molecules that attract pollen tubes from a plant's own species to the embryo sac to enable successful reproduction.
Collapse
Affiliation(s)
- Hidenori Takeuchi
- Division of Biological Science, Graduate School of Science, Nagoya University, Nagoya, Aichi, Japan
| | - Tetsuya Higashiyama
- Division of Biological Science, Graduate School of Science, Nagoya University, Nagoya, Aichi, Japan
- JST ERATO Higashiyama Live-Holonics Project, Nagoya University, Nagoya, Aichi, Japan
- * E-mail:
| |
Collapse
|
564
|
Wang L, Si W, Yao Y, Tian D, Araki H, Yang S. Genome-wide survey of pseudogenes in 80 fully re-sequenced Arabidopsis thaliana accessions. PLoS One 2012; 7:e51769. [PMID: 23272162 PMCID: PMC3521719 DOI: 10.1371/journal.pone.0051769] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2012] [Accepted: 11/07/2012] [Indexed: 11/18/2022] Open
Abstract
Pseudogenes (Ψs), including processed and non-processed Ψs, are ubiquitous genetic elements derived from originally functional genes in all studied genomes within the three kingdoms of life. However, systematic surveys of non-processed Ψs utilizing genomic information from multiple samples within a species are still rare. Here a systematic comparative analysis was conducted of Ψs within 80 fully re-sequenced Arabidopsis thaliana accessions, and 7546 genes, representing ∼28% of the genomic annotated open reading frames (ORFs), were found with disruptive mutations in at least one accession. The distribution of these Ψs on chromosomes showed a significantly negative correlation between Ψs/ORFs and their local gene densities, suggesting a higher proportion of Ψs in gene desert regions, e.g. near centromeres. On the other hand, compared with the non-Ψ loci, even the intact coding sequences (CDSs) in the Ψ loci were found to have shorter CDS length, fewer exon number and lower GC content. In addition, a significant functional bias against the null hypothesis was detected in the Ψs mainly involved in responses to environmental stimuli and biotic stress as reported, suggesting that they are likely important for adaptive evolution to rapidly changing environments by pseudogenization to accumulate successive mutations.
Collapse
Affiliation(s)
- Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Weina Si
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Yongfang Yao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Hitoshi Araki
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, Center of Ecology, Evolution and Biogeochemistry, Kastanienbaum, Switzerland
- * E-mail: (SY); (HA)
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- * E-mail: (SY); (HA)
| |
Collapse
|
565
|
Shao H, Bellos E, Yin H, Liu X, Zou J, Li Y, Wang J, Coin LJM. A population model for genotyping indels from next-generation sequence data. Nucleic Acids Res 2012; 41:e46. [PMID: 23221639 PMCID: PMC3562001 DOI: 10.1093/nar/gks1143] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Insertion and deletion polymorphisms (indels) are an important source of genomic variation in plant and animal genomes, but accurate genotyping from low-coverage and exome next-generation sequence data remains challenging. We introduce an efficient population clustering algorithm for diploids and polyploids which was tested on a dataset of 2000 exomes. Compared with existing methods, we report a 4-fold reduction in overall indel genotype error rates with a 9-fold reduction in low coverage regions.
Collapse
|
566
|
O'Malley RC, Ecker JR. Epiallelic variation in Arabidopsis thaliana. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2012; 77:135-45. [PMID: 23223383 PMCID: PMC5241134 DOI: 10.1101/sqb.2012.77.014571] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Genotype is the primary determinate of phenotype. During the past two decades, however, there has been an emergent recognition of the epigenotype, a separate layer of heredity distinct from the primary DNA sequence that can have profound effects on phenotype. The epigenotype is a collection of chemical modifications to the DNA and nucleosomes in conjunction with noncoding RNA transcripts, and together these epigenetic marks act as a potent and expansive regulatory system for controlling gene expression. In this review, we discuss our current understanding of variation in epigenotype in the model plant Arabidopsis and how allelic differences attributable to epigenetic changes, or epialleles, can affect phenotype. We discuss examples of epialleles that have been created in the laboratory and others that have been identified in natural populations, because these two models provide complementary information regarding the genetic pathways, mechanisms of transmission, and biological and evolutionary context for the role of the epigenotype in phenotypic variation.
Collapse
Affiliation(s)
- R C O'Malley
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | | |
Collapse
|
567
|
Olson-Manning CF, Wagner MR, Mitchell-Olds T. Adaptive evolution: evaluating empirical support for theoretical predictions. Nat Rev Genet 2012; 13:867-77. [PMID: 23154809 PMCID: PMC3748133 DOI: 10.1038/nrg3322] [Citation(s) in RCA: 148] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Adaptive evolution is shaped by the interaction of population genetics, natural selection and underlying network and biochemical constraints. Variation created by mutation, the raw material for evolutionary change, is translated into phenotypes by flux through metabolic pathways and by the topography and dynamics of molecular networks. Finally, the retention of genetic variation and the efficacy of selection depend on population genetics and demographic history. Emergent high-throughput experimental methods and sequencing technologies allow us to gather more evidence and to move beyond the theory in different systems and populations. Here we review the extent to which recent evidence supports long-established theoretical principles of adaptation.
Collapse
Affiliation(s)
- Carrie F. Olson-Manning
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| | - Maggie R. Wagner
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| | - Thomas Mitchell-Olds
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| |
Collapse
|
568
|
Seren Ü, Vilhjálmsson BJ, Horton MW, Meng D, Forai P, Huang YS, Long Q, Segura V, Nordborg M. GWAPP: a web application for genome-wide association mapping in Arabidopsis. THE PLANT CELL 2012; 24:4793-805. [PMID: 23277364 PMCID: PMC3556958 DOI: 10.1105/tpc.112.108068] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Arabidopsis thaliana is an important model organism for understanding the genetics and molecular biology of plants. Its highly selfing nature, small size, short generation time, small genome size, and wide geographic distribution make it an ideal model organism for understanding natural variation. Genome-wide association studies (GWAS) have proven a useful technique for identifying genetic loci responsible for natural variation in A. thaliana. Previously genotyped accessions (natural inbred lines) can be grown in replicate under different conditions and phenotyped for different traits. These important features greatly simplify association mapping of traits and allow for systematic dissection of the genetics of natural variation by the entire A. thaliana community. To facilitate this, we present GWAPP, an interactive Web-based application for conducting GWAS in A. thaliana. Using an efficient implementation of a linear mixed model, traits measured for a subset of 1386 publicly available ecotypes can be uploaded and mapped with a mixed model and other methods in just a couple of minutes. GWAPP features an extensive, interactive, and user-friendly interface that includes interactive Manhattan plots and linkage disequilibrium plots. It also facilitates exploratory data analysis by implementing features such as the inclusion of candidate polymorphisms in the model as cofactors.
Collapse
Affiliation(s)
- Ümit Seren
- Gregor Mendel Institute, Austrian Academy of Sciences, 1030, Viena, Austria
| | | | | | | | | | | | | | | | | |
Collapse
|
569
|
Alex Buerkle C, Gompert Z. Population genomics based on low coverage sequencing: how low should we go? Mol Ecol 2012; 22:3028-35. [DOI: 10.1111/mec.12105] [Citation(s) in RCA: 157] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Revised: 09/17/2012] [Accepted: 09/26/2012] [Indexed: 12/17/2022]
Affiliation(s)
- C. Alex Buerkle
- Department of Botany and Program in Ecology; University of Wyoming; Laramie WY USA
| | | |
Collapse
|
570
|
Massouras A, Waszak SM, Albarca-Aguilera M, Hens K, Holcombe W, Ayroles JF, Dermitzakis ET, Stone EA, Jensen JD, Mackay TFC, Deplancke B. Genomic variation and its impact on gene expression in Drosophila melanogaster. PLoS Genet 2012. [PMID: 23189034 PMCID: PMC3499359 DOI: 10.1371/journal.pgen.1003055] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Understanding the relationship between genetic and phenotypic variation is one of the great outstanding challenges in biology. To meet this challenge, comprehensive genomic variation maps of human as well as of model organism populations are required. Here, we present a nucleotide resolution catalog of single-nucleotide, multi-nucleotide, and structural variants in 39 Drosophila melanogaster Genetic Reference Panel inbred lines. Using an integrative, local assembly-based approach for variant discovery, we identify more than 3.6 million distinct variants, among which were more than 800,000 unique insertions, deletions (indels), and complex variants (1 to 6,000 bp). While the SNP density is higher near other variants, we find that variants themselves are not mutagenic, nor are regions with high variant density particularly mutation-prone. Rather, our data suggest that the elevated SNP density around variants is mainly due to population-level processes. We also provide insights into the regulatory architecture of gene expression variation in adult flies by mapping cis-expression quantitative trait loci (cis-eQTLs) for more than 2,000 genes. Indels comprise around 10% of all cis-eQTLs and show larger effects than SNP cis-eQTLs. In addition, we identified two-fold more gene associations in males as compared to females and found that most cis-eQTLs are sex-specific, revealing a partial decoupling of the genomic architecture between the sexes as well as the importance of genetic factors in mediating sex-biased gene expression. Finally, we performed RNA-seq-based allelic expression imbalance analyses in the offspring of crosses between sequenced lines, which revealed that the majority of strong cis-eQTLs can be validated in heterozygous individuals. One of the principal challenges in current biology is to understand the relationship between genetic and phenotypic variation. The increasing availability of genomic variation maps of human as well as of model organism populations (mouse and Arabidopsis) constitutes an important step towards meeting this challenge. However, despite its excellent track record as a premier model to understand genome function, no genome-wide variation data beyond single-nucleotide variants and microsatellites are currently available for D. melanogaster. Here, we present a comprehensive, nucleotide-resolution catalogue of variants of various types (single-nucleotide, multi-nucleotide, and structural variants) for 39 wild-derived inbred D. melanogaster lines based on high-throughput sequencing. This catalogue confirms that non–SNP variants account for more than half of genomic variation, allowing us to provide new insights into the non-random distribution of variants in the Drosophila genome. We further present genome-wide cis-associations with gene expression based on whole adult fly microarray data, revealing significant associations for about 2,000 genes. Most associations are sex-specific, providing evidence for a decoupling of the genomic, regulatory architecture between males and females.
Collapse
Affiliation(s)
- Andreas Massouras
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Sebastian M. Waszak
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Monica Albarca-Aguilera
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Korneel Hens
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Wiebke Holcombe
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Julien F. Ayroles
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Emmanouil T. Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Eric A. Stone
- Department of Genetics, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Jeffrey D. Jensen
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Trudy F. C. Mackay
- Department of Genetics, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
571
|
Ness RW, Siol M, Barrett SCH. Genomic consequences of transitions from cross- to self-fertilization on the efficacy of selection in three independently derived selfing plants. BMC Genomics 2012; 13:611. [PMID: 23145563 PMCID: PMC3533693 DOI: 10.1186/1471-2164-13-611] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2012] [Accepted: 10/05/2012] [Indexed: 11/10/2022] Open
Abstract
Background Transitions from cross- to self-fertilization are associated with increased genetic drift rendering weakly selected mutations effectively neutral. The effect of drift is predicted to reduce selective constraints on amino acid sequences of proteins and relax biased codon usage. We investigated patterns of nucleotide variation to assess the effect of inbreeding on the accumulation of deleterious mutations in three independently evolved selfing plants. Using high-throughput sequencing, we assembled the floral transcriptomes of four individuals of Eichhornia (Pontederiaceae); these included one outcrosser and two independently derived selfers of E. paniculata, and E. paradoxa, a selfing outgroup. The dataset included ~8000 loci totalling ~3.5 Mb of coding DNA. Results Tests of selection were consistent with purifying selection constraining evolution of the transcriptome. However, we found an elevation in the proportion of non-synonymous sites that were potentially deleterious in the E. paniculata selfers relative to the outcrosser. Measurements of codon usage in high versus low expression genes demonstrated reduced bias in both E. paniculata selfers. Conclusions Our findings are consistent with a small reduction in the efficacy of selection on protein sequences associated with transitions to selfing, and reduced selection in selfers on synonymous changes that influence codon usage.
Collapse
Affiliation(s)
- Rob W Ness
- Department of Ecology and Evolutionary Biology, University of Toronto, ON, Canada.
| | | | | |
Collapse
|
572
|
Henderson IR. Control of meiotic recombination frequency in plant genomes. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:556-561. [PMID: 23017241 DOI: 10.1016/j.pbi.2012.09.002] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2012] [Revised: 07/18/2012] [Accepted: 09/05/2012] [Indexed: 05/27/2023]
Abstract
Sexual eukaryotes reproduce via the meiotic cell division, where ploidy is halved and homologous chromosomes undergo reciprocal genetic exchange, termed crossover (CO). CO frequency has a profound effect on patterns of genetic variation and species evolution. Relative CO rates vary extensively both within and between plant genomes. Plant genome size varies by over 1000-fold, largely due to differential expansion of repetitive sequences, and increased genome size is associated with reduced CO frequency. Gene versus repeat sequences associate with distinct chromatin modifications, and evidence from plant genomes indicates that this epigenetic information influences CO patterns. This is consistent with data from diverse eukaryotes that demonstrate the importance of chromatin structure for control of meiotic recombination. In this review I will discuss CO frequency patterns in plant genomes and recent advances in understanding recombination distributions.
Collapse
Affiliation(s)
- Ian R Henderson
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom.
| |
Collapse
|
573
|
Jaquiéry J, Stoeckel S, Nouhaud P, Mieuzet L, Mahéo F, Legeai F, Bernard N, Bonvoisin A, Vitalis R, Simon JC. Genome scans reveal candidate regions involved in the adaptation to host plant in the pea aphid complex. Mol Ecol 2012; 21:5251-64. [PMID: 23017212 DOI: 10.1111/mec.12048] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 08/06/2012] [Indexed: 11/26/2022]
Abstract
A major goal in evolutionary biology is to uncover the genetic basis of adaptation. Divergent selection exerted on ecological traits may result in adaptive population differentiation and reproductive isolation and affect differentially the level of genetic divergence along the genome. Genome-wide scan of large sets of individuals from multiple populations is a powerful approach to identify loci or genomic regions under ecologically divergent selection. Here, we focused on the pea aphid, a species complex of divergent host races, to explore the organization of the genomic divergence associated with host plant adaptation and ecological speciation. We analysed 390 microsatellite markers located at variable distances from predicted genes in replicate samples of sympatric populations of the pea aphid collected on alfalfa, red clover and pea, which correspond to three common host-adapted races reported in this species complex. Using a method that accounts for the hierarchical structure of our data set, we found a set of 11 outlier loci that show higher genetic differentiation between host races than expected under the null hypothesis of neutral evolution. Two of the outliers are close to olfactory receptor genes and three other nearby genes encoding salivary proteins. The remaining outliers are located in regions with genes of unknown functions, or which functions are unlikely to be involved in interactions with the host plant. This study reveals genetic signatures of divergent selection across the genome and provides an inventory of candidate genes responsible for plant specialization in the pea aphid, thereby setting the stage for future functional studies.
Collapse
Affiliation(s)
- J Jaquiéry
- INRA, UMR 1349, Institute of Genetics, Environment and Plant Protection, Domaine de la Motte, BP 35327, 35653, Le Rheu Cedex, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
574
|
Rodriguez-Leal D, Vielle-Calzada JP. Regulation of apomixis: learning from sexual experience. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:549-55. [PMID: 23000434 DOI: 10.1016/j.pbi.2012.09.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Revised: 07/17/2012] [Accepted: 09/05/2012] [Indexed: 05/10/2023]
Abstract
Apomixis is a natural form of asexual reproduction through seeds that leads to viable offspring genetically identical to the mother plant. New evidence from sexual model species indicates that the regulation of female gametogenesis and seed formation is also directed by epigenetic mechanisms that are crucial to control events that distinguish sexuality from apomixis, with important implications for our understanding of the evolutionary forces that shape structural variation and diversity in plant reproduction.
Collapse
Affiliation(s)
- Daniel Rodriguez-Leal
- Group of Reproductive Development and Apomixis, Laboratorio Nacional de Genómica para la Biodiversidad, CINVESTAV, Irapuato, Guanajuato, Mexico
| | | |
Collapse
|
575
|
Pagny G, Paulstephenraj PS, Poque S, Sicard O, Cosson P, Eyquard JP, Caballero M, Chague A, Gourdon G, Negrel L, Candresse T, Mariette S, Decroocq V. Family-based linkage and association mapping reveals novel genes affecting Plum pox virus infection in Arabidopsis thaliana. THE NEW PHYTOLOGIST 2012; 196:873-886. [PMID: 22943366 DOI: 10.1111/j.1469-8137.2012.04289.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Accepted: 07/21/2012] [Indexed: 05/03/2023]
Abstract
Sharka is a devastating viral disease caused by the Plum pox virus (PPV) in stone fruit trees and few sources of resistance are known in its natural hosts. Since any knowledge gained from Arabidopsis on plant virus susceptibility factors is likely to be transferable to crop species, Arabidopsis's natural variation was searched for host factors essential for PPV infection. To locate regions of the genome associated with susceptibility to PPV, linkage analysis was performed on six biparental populations as well as on multiparental lines. To refine quantitative trait locus (QTL) mapping, a genome-wide association analysis was carried out using 147 Arabidopsis accessions. Evidence was found for linkage on chromosomes 1, 3 and 5 with restriction of PPV long-distance movement. The most relevant signals occurred within a region at the bottom of chromosome 3, which comprises seven RTM3-like TRAF domain-containing genes. Since the resistance mechanism analyzed here is recessive and the rtm3 knockout mutant is susceptible to PPV infection, it suggests that other gene(s) present in the small identified region encompassing RTM3 are necessary for PPV long-distance movement. In consequence, we report here the occurrence of host factor(s) that are indispensable for virus long-distance movement.
Collapse
Affiliation(s)
- Gaëlle Pagny
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | | | - Sylvain Poque
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Ophélie Sicard
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Patrick Cosson
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Jean-Philippe Eyquard
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Mélodie Caballero
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Aurélie Chague
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Germain Gourdon
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Lise Negrel
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Thierry Candresse
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Stéphanie Mariette
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| | - Véronique Decroocq
- INRA, Université de Bordeaux, UMR 1332 BFP, BP81, 33883, Villenave d'Ornon Cedex, France
| |
Collapse
|
576
|
The inheritance pattern of 24 nt siRNA clusters in arabidopsis hybrids is influenced by proximity to transposable elements. PLoS One 2012; 7:e47043. [PMID: 23118865 PMCID: PMC3485269 DOI: 10.1371/journal.pone.0047043] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2012] [Accepted: 09/07/2012] [Indexed: 11/19/2022] Open
Abstract
Hybrids often display increased size and growth, and thus are widely cultivated in agriculture and horticulture. Recent discoveries demonstrating the important regulatory roles of small RNAs have greatly improved our understanding of many basic biological questions, and could illuminate the molecular basis for the enhanced growth and size of hybrid plants. We profiled small RNAs by deep sequencing to characterize the inheritance patterns of small RNA levels in reciprocal hybrids of two Arabidopsis thaliana accessions, Columbia and Landsberg erecta. We find 24-nt siRNAs predominate among those small RNAs that are differentially expressed between the parents. Following hybridization, the transposable element (TE)-derived siRNAs are often inherited in an additive manner, whereas siRNAs associated with protein-coding genes are often down-regulated in hybrids to the levels observed for the parent with lower relative siRNA levels. Among the protein-coding genes that exhibit this pattern, genes that function in pathogen defense, abiotic stress tolerance, and secondary metabolism are significantly enriched. Small RNA clusters from protein-coding genes where a TE is present within one kilobase show a different predominant inheritance pattern (additive) from those that do not (low-parent dominance). Thus, down-regulation in the form of low-parent dominance is likely the default pattern of inheritance for genic siRNA, and a different inheritance mechanism for TE siRNA is suggested.
Collapse
|
577
|
Günther T, Lampei C, Schmid KJ. Mutational bias and gene conversion affect the intraspecific nitrogen stoichiometry of the Arabidopsis thaliana transcriptome. Mol Biol Evol 2012; 30:561-8. [PMID: 23115321 DOI: 10.1093/molbev/mss249] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The transcriptome and proteome of Arabidopsis thaliana are reduced in nitrogen content when compared with other taxa, which may result from ecological nitrogen limitation. We hypothesized that if the A. thaliana transcriptome is selected for a low nitrogen content, nitrogen-reducing derived alleles of single nucleotide polymorphisms (SNPs) should segregate at higher frequencies than nitrogen-increasing alleles. This pattern should be stronger in populations with a larger effective population size (N(e)) if natural selection is more efficient in large than in small populations. We analyzed variation in the nitrogen content in the transcriptome of 80 natural accessions of A. thaliana. In contrast to our expectations, derived alleles increase the nitrogen content in all accessions, and there is a positive correlation between nitrogen difference and derived allele frequency, which is strongest with nonsynonymous SNPs (nsSNPs). Also, there is a positive correlation between nitrogen difference and N(e) that was mainly caused by nsSNPs. These observations led us to reject the hypothesis that the transcriptome of A. thaliana is currently under selection to reduce nitrogen content. Instead, we show that a change in nitrogen content is a side effect of interacting evolutionary factors that influence base composition and include mutational bias, purifying selection of functionally deleterious alleles, and GC-biased gene conversion. We provide strong evidence that GC-biased gene conversion may play an important role for base composition in the highly selfing plant A. thaliana.
Collapse
Affiliation(s)
- Torsten Günther
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
| | | | | |
Collapse
|
578
|
Leal Valentim F, Neven F, Boyen P, van Dijk ADJ. Interactome-wide prediction of protein-protein binding sites reveals effects of protein sequence variation in Arabidopsis thaliana. PLoS One 2012; 7:e47022. [PMID: 23077539 PMCID: PMC3471968 DOI: 10.1371/journal.pone.0047022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2012] [Accepted: 09/07/2012] [Indexed: 11/18/2022] Open
Abstract
The specificity of protein-protein interactions is encoded in those parts of the sequence that compose the binding interface. Therefore, understanding how changes in protein sequence influence interaction specificity, and possibly the phenotype, requires knowing the location of binding sites in those sequences. However, large-scale detection of protein interfaces remains a challenge. Here, we present a sequence- and interactome-based approach to mine interaction motifs from the recently published Arabidopsis thaliana interactome. The resultant proteome-wide predictions are available via www.ab.wur.nl/sliderbio and set the stage for further investigations of protein-protein binding sites. To assess our method, we first show that, by using a priori information calculated from protein sequences, such as evolutionary conservation and residue surface accessibility, we improve the performance of interface prediction compared to using only interactome data. Next, we present evidence for the functional importance of the predicted sites, which are under stronger selective pressure than the rest of protein sequence. We also observe a tendency for compensatory mutations in the binding sites of interacting proteins. Subsequently, we interrogated the interactome data to formulate testable hypotheses for the molecular mechanisms underlying effects of protein sequence mutations. Examples include proteins relevant for various developmental processes. Finally, we observed, by analysing pairs of paralogs, a correlation between functional divergence and sequence divergence in interaction sites. This analysis suggests that large-scale prediction of binding sites can cast light on evolutionary processes that shape protein-protein interaction networks.
Collapse
Affiliation(s)
| | - Frank Neven
- Hasselt University and Transnational University of Limburg, Hasselt, Belgium
| | - Peter Boyen
- Hasselt University and Transnational University of Limburg, Hasselt, Belgium
| | - Aalt D. J. van Dijk
- Plant Research International, Bioscience, Wageningen, The Netherlands
- * E-mail:
| |
Collapse
|
579
|
Oh DH, Dassanayake M, Bohnert HJ, Cheeseman JM. Life at the extreme: lessons from the genome. Genome Biol 2012; 13:241. [PMID: 22390828 DOI: 10.1186/gb-2012-13-3-241] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Extremophile plants thrive in places where most plant species cannot survive. Recent developments in high-throughput technologies and comparative genomics are shedding light on the evolutionary mechanisms leading to their adaptation.
Collapse
Affiliation(s)
- Dong-Ha Oh
- Department of Plant Biology, University of Illinois at Urbana-Champaign, 61801, USA
| | | | | | | |
Collapse
|
580
|
Kritsas K, Wuest SE, Hupalo D, Kern AD, Wicker T, Grossniklaus U. Computational analysis and characterization of UCE-like elements (ULEs) in plant genomes. Genome Res 2012; 22:2455-66. [PMID: 22987666 PMCID: PMC3514675 DOI: 10.1101/gr.129346.111] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Ultraconserved elements (UCEs), stretches of DNA that are identical between distantly related species, are enigmatic genomic features whose function is not well understood. First identified and characterized in mammals, UCEs have been proposed to play important roles in gene regulation, RNA processing, and maintaining genome integrity. However, because all of these functions can tolerate some sequence variation, their ultraconserved and ultraselected nature is not explained. We investigated whether there are highly conserved DNA elements without genic function in distantly related plant genomes. We compared the genomes of Arabidopsis thaliana and Vitis vinifera; species that diverged ∼115 million years ago (Mya). We identified 36 highly conserved elements with at least 85% similarity that are longer than 55 bp. Interestingly, these elements exhibit properties similar to mammalian UCEs, such that we named them UCE-like elements (ULEs). ULEs are located in intergenic or intronic regions and are depleted from segmental duplications. Like UCEs, ULEs are under strong purifying selection, suggesting a functional role for these elements. As their mammalian counterparts, ULEs show a sharp drop of A+T content at their borders and are enriched close to genes encoding transcription factors and genes involved in development, the latter showing preferential expression in undifferentiated tissues. By comparing the genomes of Brachypodium distachyon and Oryza sativa, species that diverged ∼50 Mya, we identified a different set of ULEs with similar properties in monocots. The identification of ULEs in plant genomes offers new opportunities to study their possible roles in genome function, integrity, and regulation.
Collapse
Affiliation(s)
- Konstantinos Kritsas
- Institute of Plant Biology & Zürich-Basel Plant Science Center, University Zürich, CH-8008 Zürich, Switzerland
| | | | | | | | | | | |
Collapse
|
581
|
Genomic characterization of DArT markers based on high-density linkage analysis and physical mapping to the Eucalyptus genome. PLoS One 2012; 7:e44684. [PMID: 22984541 PMCID: PMC3439404 DOI: 10.1371/journal.pone.0044684] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2012] [Accepted: 08/06/2012] [Indexed: 01/07/2023] Open
Abstract
Diversity Arrays Technology (DArT) provides a robust, high throughput, cost-effective method to query thousands of sequence polymorphisms in a single assay. Despite the extensive use of this genotyping platform for numerous plant species, little is known regarding the sequence attributes and genome-wide distribution of DArT markers. We investigated the genomic properties of the 7,680 DArT marker probes of a Eucalyptus array, by sequencing them, constructing a high density linkage map and carrying out detailed physical mapping analyses to the Eucalyptus grandis reference genome. A consensus linkage map with 2,274 DArT markers anchored to 210 microsatellites and a framework map, with improved support for ordering, displayed extensive collinearity with the genome sequence. Only 1.4 Mbp of the 75 Mbp of still unplaced scaffold sequence was captured by 45 linkage mapped but physically unaligned markers to the 11 main Eucalyptus pseudochromosomes, providing compelling evidence for the quality and completeness of the current Eucalyptus genome assembly. A highly significant correspondence was found between the locations of DArT markers and predicted gene models, while most of the 89 DArT probes unaligned to the genome correspond to sequences likely absent in E. grandis, consistent with the pan-genomic feature of this multi-Eucalyptus species DArT array. These comprehensive linkage-to-physical mapping analyses provide novel data regarding the genomic attributes of DArT markers in plant genomes in general and for Eucalyptus in particular. DArT markers preferentially target the gene space and display a largely homogeneous distribution across the genome, thereby providing superb coverage for mapping and genome-wide applications in breeding and diversity studies. Data reported on these ubiquitous properties of DArT markers will be particularly valuable to researchers working on less-studied crop species who already count on DArT genotyping arrays but for which no reference genome is yet available to allow such detailed characterization.
Collapse
|
582
|
Abstract
Targeting induced local lesions in genomes (TILLING), initially a functional genomics tool in model plants, has been extended to many plant species and become of paramount importance to reverse genetics in crops species. Because it is readily applicable to most plants, it remains a dominant non-transgenic method for obtaining mutations in known genes. The process has seen many technological changes over the last 10 years; a major recent change has been the application of next-generation sequencing (NGS) to the process, which permits multiplexing of gene targets and genomes. NGS will ultimately lead to TILLING becoming an in silico procedure. We review here the history and technology in brief, but focus more importantly on recent developments in polyploids, vegetatively propagated crops and the future of TILLING for plant breeding.
Collapse
Affiliation(s)
- Trevor L Wang
- John Innes Centre, Norwich Research Park, Norwich, UK.
| | | | | | | |
Collapse
|
583
|
Hu TT, Eisen MB, Thornton KR, Andolfatto P. A second-generation assembly of the Drosophila simulans genome provides new insights into patterns of lineage-specific divergence. Genome Res 2012; 23:89-98. [PMID: 22936249 PMCID: PMC3530686 DOI: 10.1101/gr.141689.112] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
We create a new assembly of the Drosophila simulans genome using 142 million paired short-read sequences and previously published data for strain w501. Our assembly represents a higher-quality genomic sequence with greater coverage, fewer misassemblies, and, by several indexes, fewer sequence errors. Evolutionary analysis of this genome reference sequence reveals interesting patterns of lineage-specific divergence that are different from those previously reported. Specifically, we find that Drosophila melanogaster evolves faster than D. simulans at all annotated classes of sites, including putatively neutrally evolving sites found in minimal introns. While this may be partly explained by a higher mutation rate in D. melanogaster, we also find significant heterogeneity in rates of evolution across classes of sites, consistent with historical differences in the effective population size for the two species. Also contrary to previous findings, we find that the X chromosome is evolving significantly faster than autosomes for nonsynonymous and most noncoding DNA sites and significantly slower for synonymous sites. The absence of a X/A difference for putatively neutral sites and the robustness of the pattern to Gene Ontology and sex-biased expression suggest that partly recessive beneficial mutations may comprise a substantial fraction of noncoding DNA divergence observed between species. Our results have more general implications for the interpretation of evolutionary analyses of genomes of different quality.
Collapse
Affiliation(s)
- Tina T Hu
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA.
| | | | | | | |
Collapse
|
584
|
Wu J, Wei K, Cheng F, Li S, Wang Q, Zhao J, Bonnema G, Wang X. A naturally occurring InDel variation in BraA.FLC.b (BrFLC2) associated with flowering time variation in Brassica rapa. BMC PLANT BIOLOGY 2012; 12:151. [PMID: 22925611 PMCID: PMC3487953 DOI: 10.1186/1471-2229-12-151] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2012] [Accepted: 08/14/2012] [Indexed: 05/18/2023]
Abstract
BACKGROUND Flowering time is an important trait in Brassica rapa crops. FLOWERING LOCUS C (FLC) is a MADS-box transcription factor that acts as a potent repressor of flowering. Expression of FLC is silenced when plants are exposed to low temperature, which activates flowering. There are four copies of FLC in B. rapa. Analyses of different segregating populations have suggested that BraA.FLC.a (BrFLC1) and BraA.FLC.b (BrFLC2) play major roles in controlling flowering time in B. rapa. RESULTS We analyzed the BrFLC2 sequence in nine B. rapa accessions, and identified a 57-bp insertion/deletion (InDel) across exon 4 and intron 4 resulting in a non-functional allele. In total, three types of transcripts were identified for this mutated BrFLC2 allele. The InDel was used to develop a PCR-based marker, which was used to screen a collection of 159 B. rapa accessions. The deletion genotype was present only in oil-type B. rapa, including ssp. oleifera and ssp. tricolaris, and not in other subspecies. The deletion genotype was significantly correlated with variation in flowering time. In contrast, the reported splicing site variation in BrFLC1, which also leads to a non-functional locus, was detected but not correlated with variation in flowering time in oil-type B. rapa, although it was correlated with variation in flowering time in vegetable-type B. rapa. CONCLUSIONS Our results suggest that the naturally occurring deletion mutation across exon 4 and intron 4 in BrFLC2 gene contributes greatly to variation in flowering time in oil-type B. rapa. The observed different relationship between BrFLC1 or BrFLC2 and flowering time variation indicates that the control of flowering time has evolved separately between oil-type and vegetable-type B. rapa groups.
Collapse
Affiliation(s)
- Jian Wu
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Keyun Wei
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Feng Cheng
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shikai Li
- Institute of Horticultural Science, Yunnan Academy of Agricultural Sciences, Kunming, 650205, China
| | - Qian Wang
- Institute of Horticultural Science, Yunnan Academy of Agricultural Sciences, Kunming, 650205, China
| | - Jianjun Zhao
- Laboratory of Plant Breeding, Wageningen University, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Guusje Bonnema
- Laboratory of Plant Breeding, Wageningen University, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Xiaowu Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| |
Collapse
|
585
|
Independent FLC mutations as causes of flowering-time variation in Arabidopsis thaliana and Capsella rubella. Genetics 2012; 192:729-39. [PMID: 22865739 PMCID: PMC3454893 DOI: 10.1534/genetics.112.143958] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Capsella rubella is an inbreeding annual forb closely related to Arabidopsis thaliana, a model species widely used for studying natural variation in adaptive traits such as flowering time. Although mutations in dozens of genes can affect flowering of A. thaliana in the laboratory, only a handful of such genes vary in natural populations. Chief among these are FRIGIDA (FRI) and FLOWERING LOCUS C (FLC). Common and rare FRI mutations along with rare FLC mutations explain a large fraction of flowering-time variation in A. thaliana. Here we document flowering time under different conditions in 20 C. rubella accessions from across the species’ range. Similar to A. thaliana, vernalization, long photoperiods and elevated ambient temperature generally promote flowering. In this collection of C. rubella accessions, we did not find any obvious loss-of-function FRI alleles. Using mapping-by-sequencing with two strains that have contrasting flowering behaviors, we identified a splice-site mutation in FLC as the likely cause of early flowering in accession 1408. However, other similarly early C. rubella accessions did not share this mutation. We conclude that the genetic basis of flowering-time variation in C. rubella is complex, despite this very young species having undergone an extreme genetic bottleneck when it split from C. grandiflora a few tens of thousands of years ago.
Collapse
|
586
|
de Bruijn S, Angenent GC, Kaufmann K. Plant 'evo-devo' goes genomic: from candidate genes to regulatory networks. TRENDS IN PLANT SCIENCE 2012; 17:441-7. [PMID: 22698378 DOI: 10.1016/j.tplants.2012.05.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2012] [Revised: 04/23/2012] [Accepted: 05/01/2012] [Indexed: 05/21/2023]
Abstract
Plant development gives rise to a staggering complexity of morphological structures with different shapes, colors, and functions. Understanding the evolution of control mechanisms that underlie developmental processes provides insights into causes of morphological diversity and, therefore, is of great interest to biologists. New genomic resources and techniques enable biologists to assess for the first time the evolution of developmental regulatory networks at a global scale. Here, we address the question of how comparative regulatory genomics can be used to reveal the evolutionary dynamics of control networks linked to morphological evolution in plants.
Collapse
Affiliation(s)
- Suzanne de Bruijn
- Wageningen University, Laboratory for Molecular Biology, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands
| | | | | |
Collapse
|
587
|
Eckert AJ, Shahi H, Datwyler SL, Neale DB. Spatially variable natural selection and the divergence between parapatric subspecies of lodgepole pine (Pinus contorta, Pinaceae). AMERICAN JOURNAL OF BOTANY 2012; 99:1323-1334. [PMID: 22837407 DOI: 10.3732/ajb.1200055] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
PREMISE OF THE STUDY Plant populations arrayed across sharp environmental gradients are ideal systems for identifying the genetic basis of ecologically relevant phenotypes. A series of five uplifted marine terraces along the northern coast of California represents one such system where morphologically distinct populations of lodgepole pine (Pinus contorta) are distributed across sharp soil gradients ranging from fertile soils near the coast to podzolic soils ca. 5 km inland. METHODS A total of 92 trees was sampled across four coastal marine terraces (N = 10-46 trees/terrace) located in Mendocino County, California and sequenced for a set of 24 candidate genes for growth and responses to various soil chemistry variables. Statistical analyses relying on patterns of nucleotide diversity were employed to identify genes whose diversity patterns were inconsistent with three null models. KEY RESULTS Most genes displayed patterns of nucleotide diversity that were consistent with null models (N = 19) or with the presence of paralogs (N = 3). Two genes, however, were exceptional: an aluminum responsive ABC-transporter with F(ST) = 0.664 and an inorganic phosphate transporter characterized by divergent haplotypes segregating at intermediate frequencies in most populations. CONCLUSIONS Spatially variable natural selection along gradients of aluminum and phosphate ion concentrations likely accounted for both outliers. These results shed light on some of the genetic components comprising the extended phenotype of this ecosystem, as well as highlight ecotones as fruitful study systems for the detection of adaptive genetic variants.
Collapse
Affiliation(s)
- Andrew J Eckert
- Department of Biology, Virginia Commonwealth University, Richmond, 23284, USA.
| | | | | | | |
Collapse
|
588
|
McHale LK, Haun WJ, Xu WW, Bhaskar PB, Anderson JE, Hyten DL, Gerhardt DJ, Jeddeloh JA, Stupar RM. Structural variants in the soybean genome localize to clusters of biotic stress-response genes. PLANT PHYSIOLOGY 2012; 159:1295-308. [PMID: 22696021 PMCID: PMC3425179 DOI: 10.1104/pp.112.194605] [Citation(s) in RCA: 128] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2012] [Accepted: 06/12/2012] [Indexed: 05/19/2023]
Abstract
Genome-wide structural and gene content variations are hypothesized to drive important phenotypic variation within a species. Structural and gene content variations were assessed among four soybean (Glycine max) genotypes using array hybridization and targeted resequencing. Many chromosomes exhibited relatively low rates of structural variation (SV) among genotypes. However, several regions exhibited both copy number and presence-absence variation, the most prominent found on chromosomes 3, 6, 7, 16, and 18. Interestingly, the regions most enriched for SV were specifically localized to gene-rich regions that harbor clustered multigene families. The most abundant classes of gene families associated with these regions were the nucleotide-binding and receptor-like protein classes, both of which are important for plant biotic defense. The colocalization of SV with plant defense response signal transduction pathways provides insight into the mechanisms of soybean resistance gene evolution and may inform the development of new approaches to resistance gene cloning.
Collapse
Affiliation(s)
- Leah K. McHale
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Wayne W. Xu
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Justin E. Anderson
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Daniel J. Gerhardt
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | - Jeffrey A. Jeddeloh
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | |
Collapse
|
589
|
You FM, Deal KR, Wang J, Britton MT, Fass JN, Lin D, Dandekar AM, Leslie CA, Aradhya M, Luo MC, Dvorak J. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms. BMC Genomics 2012; 13:354. [PMID: 22849334 PMCID: PMC3527177 DOI: 10.1186/1471-2164-13-354] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 07/05/2012] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). RESULTS The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar 'Chandler' were mapped to 48,661 'Chandler' bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium BeadChip, which was used to genotype a walnut mapping population having 'Chandler' as one of the parents. Genotyping results were used to adjust the filtering parameters of the updated AGSNP pipeline. With the adjusted filtering criteria, 69.6% of SNPs discovered with the updated pipeline were real and could be mapped on the walnut genetic map. A total of 13,439 SNPs were discovered by BES re-sequencing. BESs harboring SNPs were in 677 FPC contigs covering 98% of the physical map of the walnut genome. CONCLUSION The updated AGSNP pipeline is a versatile SNP discovery tool for a high-throughput, genome-wide SNP discovery in both autogamous and allogamous species. With this pipeline, a large set of SNPs were identified in a single walnut cultivar.
Collapse
Affiliation(s)
- Frank M You
- Department of Plant Sciences, University of California, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
590
|
Lee CR, Mitchell-Olds T. Environmental adaptation contributes to gene polymorphism across the Arabidopsis thaliana genome. Mol Biol Evol 2012; 29:3721-8. [PMID: 22798389 DOI: 10.1093/molbev/mss174] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
The level of within-species polymorphism differs greatly among genes in a genome. Many genomic studies have investigated the relationship between gene polymorphism and factors such as recombination rate or expression pattern. However, the polymorphism of a gene is affected not only by its physical properties or functional constraints but also by natural selection on organisms in their environments. Specifically, if functionally divergent alleles enable adaptation to different environments, locus-specific polymorphism may be maintained by spatially heterogeneous natural selection. To test this hypothesis and estimate the extent to which environmental selection shapes the pattern of genome-wide polymorphism, we define the "environmental relevance" of a gene as the proportion of genetic variation explained by environmental factors, after controlling for population structure. We found substantial effects of environmental relevance on patterns of polymorphism among genes. In addition, the correlation between environmental relevance and gene polymorphism is positive, consistent with the expectation that balancing selection among heterogeneous environments maintains genetic variation at ecologically important genes. Comparison of the gene ontology annotations shows that genes with high environmental relevance are enriched in unknown function categories. These results suggest an important role for environmental factors in shaping genome-wide patterns of polymorphism and indicate another direction of genomic study.
Collapse
|
591
|
Zhang Z, Wu Y, Gao M, Zhang J, Kong Q, Liu Y, Ba H, Zhou J, Zhang Y. Disruption of PAMP-induced MAP kinase cascade by a Pseudomonas syringae effector activates plant immunity mediated by the NB-LRR protein SUMM2. Cell Host Microbe 2012; 11:253-63. [PMID: 22423965 DOI: 10.1016/j.chom.2012.01.015] [Citation(s) in RCA: 235] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Revised: 12/06/2011] [Accepted: 01/20/2012] [Indexed: 01/01/2023]
Abstract
Pathogen-associated molecular pattern (PAMP)-triggered immunity (PTI) serves as a primary plant defense response against microbial pathogens, with MEKK1, MKK1/MKK2, and MPK4 functioning as a MAP kinase cascade downstream of PAMP receptors. Plant Resistance (R) proteins sense specific pathogen effectors to initiate a second defense mechanism, termed effector-triggered immunity (ETI). In a screen for suppressors of the mkk1 mkk2 autoimmune phenotype, we identify the nucleotide-binding leucine-rich repeat (NB-LRR) protein SUMM2 and find that the MEKK1-MKK1/MKK2-MPK4 cascade negatively regulates SUMM2-mediated immunity. Further, the MEKK1-MKK1/MKK2-MPK4 cascade positively regulates basal defense targeted by the Pseudomonas syringae pathogenic effector HopAI1, which inhibits MPK4 kinase activity. Inactivation of MPK4 by HopAI1 results in activation of SUMM2-mediated defense responses. Our data suggest that SUMM2 is an R protein that becomes active when the MEKK1-MKK1/MKK2-MPK4 cascade is disrupted by pathogens, supporting the hypothesis that R proteins evolved to protect plants when microbial effectors suppress basal resistance.
Collapse
Affiliation(s)
- Zhibin Zhang
- National Institute of Biological Sciences, Zhongguancun Life Science Park, Beijing, People's Republic of China
| | | | | | | | | | | | | | | | | |
Collapse
|
592
|
Feulner PGD, Chain FJJ, Panchal M, Eizaguirre C, Kalbe M, Lenz TL, Mundry M, Samonte IE, Stoll M, Milinski M, Reusch TBH, Bornberg-Bauer E. Genome-wide patterns of standing genetic variation in a marine population of three-spined sticklebacks. Mol Ecol 2012; 22:635-49. [PMID: 22747593 DOI: 10.1111/j.1365-294x.2012.05680.x] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Since the end of the Pleistocene, the three-spined stickleback (Gasterosteus aculeatus) has repeatedly colonized and adapted to various freshwater habitats probably originating from ancestral marine populations. Standing genetic variation and the underlying genomic architecture both have been speculated to contribute to recent adaptive radiations of sticklebacks. Here, we expand on the current genomic resources of this fish by providing extensive genome-wide variation data from six individuals from a marine (North Sea) stickleback population. Using next-generation sequencing and a combination of paired-end and mate-pair libraries, we detected a wide size range of genetic variation. Among the six individuals, we found more than 7% of the genome is polymorphic, consisting of 2599111 SNPs, 233464 indels and structural variation (SV) (>50 bp) such as 1054 copy-number variable regions (deletions and duplications) and 48 inversions. Many of these polymorphisms affect gene and coding sequences. Based on SNP diversity, we determined outlier regions concordant with signatures expected under adaptive evolution. As some of these outliers overlap with pronounced regions of copy-number variation, we propose the consideration of such SV when analysing SNP data from re-sequencing approaches. We further discuss the value of this resource on genome-wide variation for further investigation upon the relative contribution of standing variation on the parallel evolution of sticklebacks and the importance of the genomic architecture in adaptive radiation.
Collapse
Affiliation(s)
- Philine G D Feulner
- Institute for Evolution and Biodiversity, Evolutionary Bioinformatics, Westfaelische Wilhelms University, Muenster, Germany.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
593
|
Gene discovery using mutagen-induced polymorphisms and deep sequencing: application to plant disease resistance. Genetics 2012; 192:139-46. [PMID: 22714407 DOI: 10.1534/genetics.112.141986] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Next-generation sequencing technologies are accelerating gene discovery by combining multiple steps of mapping and cloning used in the traditional map-based approach into one step using DNA sequence polymorphisms existing between two different accessions/strains/backgrounds of the same species. The existing next-generation sequencing method, like the traditional one, requires the use of a segregating population from a cross of a mutant organism in one accession with a wild-type (WT) organism in a different accession. It therefore could potentially be limited by modification of mutant phenotypes in different accessions and/or by the lengthy process required to construct a particular mapping parent in a second accession. Here we present mapping and cloning of an enhancer mutation with next-generation sequencing on bulked segregants in the same accession using sequence polymorphisms induced by a chemical mutagen. This method complements the conventional cloning approach and makes forward genetics more feasible and powerful in molecularly dissecting biological processes in any organisms. The pipeline developed in this study can be used to clone causal genes in background of single mutants or higher order of mutants and in species with or without sequence information on multiple accessions.
Collapse
|
594
|
Tan S, Zhong Y, Hou H, Yang S, Tian D. Variation of presence/absence genes among Arabidopsis populations. BMC Evol Biol 2012; 12:86. [PMID: 22697058 PMCID: PMC3433342 DOI: 10.1186/1471-2148-12-86] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 06/14/2012] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Gene presence/absence (P/A) polymorphisms are commonly observed in plants and are important in individual adaptation and species differentiation. Detecting their abundance, distribution and variation among individuals would help to understand the role played by these polymorphisms in a given species. The recently sequenced 80 Arabidopsis genomes provide an opportunity to address these questions. RESULTS By systematically investigating these accessions, we identified 2,407 P/A genes (or 8.9%) absent in one or more genomes, averaging 444 absent genes per accession. 50.6% of P/A genes belonged to multi-copy gene families, or 31.0% to clustered genes. However, the highest proportion of P/A genes, outnumbered in singleton genes, was observed in the regions near centromeres. In addition, a significant correlation was observed between the P/A gene frequency among the 80 accessions and the diversity level at P/A loci. Furthermore, the proportion of P/A genes was different among functional gene categories. Finally, a P/A gene tree showed a diversified population structure in the worldwide Arabidopsis accessions. CONCLUSIONS An estimate of P/A genes and their frequency distribution in the worldwide Arabidopsis accessions was obtained. Our results suggest that there are diverse mechanisms to generate or maintain P/A genes, by which individuals and functionally different genes can selectively maintain P/A polymorphisms for a specific adaptation.
Collapse
Affiliation(s)
- Shengjun Tan
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Yan Zhong
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Huan Hou
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| |
Collapse
|
595
|
Intron-mediated alternative splicing of Arabidopsis P5CS1 and its association with natural variation in proline and climate adaptation. Proc Natl Acad Sci U S A 2012; 109:9197-202. [PMID: 22615385 DOI: 10.1073/pnas.1203433109] [Citation(s) in RCA: 92] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Drought-induced proline accumulation is widely observed in plants but its regulation and adaptive value are not as well understood. Proline accumulation of the Arabidopsis accession Shakdara (Sha) was threefold less than that of Landsberg erecta (Ler) and quantitative trait loci mapping identified a reduced function allele of the proline synthesis enzyme Δ(1)-pyrroline-5-carboxylate synthetase1 (P5CS1) as a basis for the lower proline of Sha. Sha P5CS1 had additional TA repeats in intron 2 and a G-to-T transversion in intron 3 that were sufficient to promote alternative splicing and production of a nonfunctional transcript lacking exon 3 (exon 3-skip P5CS1). In Sha, and additional accessions with the same intron polymorphisms, the nonfunctional exon 3-skip P5CS1 splice variant constituted as much as half of the total P5CS1 transcript. In a larger panel of Arabidopsis accessions, low water potential-induced proline accumulation varied by 10-fold and variable production of exon 3-skip P5CS1 among accessions was an important, but not the sole, factor underlying variation in proline accumulation. Population genetic analyses suggest that P5CS1 may have evolved under positive selection, and more extensive correlation of exon 3-skip P5CS1 production than proline abundance with climate conditions of natural accessions also suggest a role of P5CS1 in local adaptation to the environment. These data identify a unique source of alternative splicing in plants, demonstrate a role of exon 3-skip P5CS1 in natural variation of proline metabolism, and suggest an association of P5CS1 and its alternative splicing with environmental adaptation.
Collapse
|
596
|
Stone EA. Joint genotyping on the fly: identifying variation among a sequenced panel of inbred lines. Genome Res 2012; 22:966-74. [PMID: 22367192 PMCID: PMC3337441 DOI: 10.1101/gr.129122.111] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Accepted: 02/21/2012] [Indexed: 02/03/2023]
Abstract
High-throughput sequencing is enabling remarkably deep surveys of genomic variation. It is now possible to completely sequence multiple individuals from a single species, yet the identification of variation among them remains an evolving computational challenge. This challenge is compounded for experimental organisms when strains are studied instead of individuals. In response, we present the Joint Genotyper for Inbred Lines (JGIL) as a method for obtaining genotypes and identifying variation among a large panel of inbred strains or lines. JGIL inputs the sequence reads from each line after their alignment to a common reference. Its probabilistic model includes site-specific parameters common to all lines that describe the frequency of nucleotides segregating in the population from which the inbred panel was derived. The distribution of line genotypes is conditional on these parameters and reflects the experimental design. Site-specific error probabilities, also common to all lines, parameterize the distribution of reads conditional on line genotype and realized coverage. Both sets of parameters are estimated per site from the aggregate read data, and posterior probabilities are calculated to decode the genotype of each line. We present an application of JGIL to 162 inbred Drosophila melanogaster lines from the Drosophila Genetic Reference Panel. We explore by simulation the effect of varying coverage, sequencing error, mapping error, and the number of lines. In doing so, we illustrate how JGIL is robust to moderate levels of error. Supported by these analyses, we advocate the importance of modeling the data and the experimental design when possible.
Collapse
Affiliation(s)
- Eric A Stone
- Department of Genetics and Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27603, USA.
| |
Collapse
|
597
|
Richards CL, Rosas U, Banta J, Bhambhra N, Purugganan MD. Genome-wide patterns of Arabidopsis gene expression in nature. PLoS Genet 2012; 8:e1002662. [PMID: 22532807 PMCID: PMC3330097 DOI: 10.1371/journal.pgen.1002662] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2011] [Accepted: 03/05/2012] [Indexed: 02/01/2023] Open
Abstract
Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering) were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg)) correlate to temperature and precipitation occurrence in the field. The largest PC(veg) axes included thermoregulatory genes while the second major PC(veg) was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.
Collapse
Affiliation(s)
- Christina L. Richards
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
- Department of Integrative Biology, University of South Florida, Tampa, Florida, United States of America
| | - Ulises Rosas
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Joshua Banta
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
- Department of Biology, University of Texas at Tyler, Tyler, Texas, United States of America
| | - Naeha Bhambhra
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Michael D. Purugganan
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| |
Collapse
|
598
|
Abstract
Selective sweeps are typically associated with a local reduction of genetic diversity around the adaptive site. However, selective sweeps can also quickly carry neutral mutations to observable population frequencies if they arise early in a sweep and hitchhike with the adaptive allele. We show that the interplay between mutation and exponential amplification through hitchhiking results in a characteristic frequency spectrum of the resulting novel haplotype variation that depends only on the ratio of the mutation rate and the selection coefficient of the sweep. On the basis of this result, we develop an estimator for the selection coefficient driving a sweep. Since this estimator utilizes the novel variation arising from mutations during a sweep, it does not rely on preexisting variation and can also be applied to loci that lack recombination. Compared with standard approaches that infer selection coefficients from the size of dips in genetic diversity around the adaptive site, our estimator requires much shorter sequences but sampled at high population depth to capture low-frequency variants; given such data, it consistently outperforms standard approaches. We investigate analytically and numerically how the accuracy of our estimator is affected by the decay of the sweep pattern over time as a consequence of random genetic drift and discuss potential effects of recombination, soft sweeps, and demography. As an example for its use, we apply our estimator to deep sequencing data from human immunodeficiency virus populations.
Collapse
|
599
|
Colautti RI, Lee CR, Mitchell-Olds T. Origin, fate, and architecture of ecologically relevant genetic variation. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:199-204. [PMID: 22341792 PMCID: PMC3413448 DOI: 10.1016/j.pbi.2012.01.016] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2011] [Revised: 01/11/2012] [Accepted: 01/23/2012] [Indexed: 05/08/2023]
Abstract
Recent advances in molecular genetics combined with field manipulations are yielding new insight into the origin, evolutionary fate, and genetic architecture of phenotypic variation in natural plant populations, with two surprising implications for the evolution of plant genomes. First, genetic loci exhibiting antagonistic pleiotropy across natural environments appear rare relative to loci that are adaptive in one or more environments and neutral elsewhere. These 'conditionally neutral' alleles should sweep to fixation when they arise, yet genome comparisons find little evidence for such selective sweeps. Second, genes under biotic selection tend to be of larger effect than genes under abiotic selection. Recent theory suggests this may be a consequence of high gene flow among populations under selection for local adaptation.
Collapse
Affiliation(s)
- Robert I Colautti
- Biology Department, Duke University, Durham, NC 27708, United States.
| | | | | |
Collapse
|
600
|
Kover PX, Mott R. Mapping the genetic basis of ecologically and evolutionarily relevant traits in Arabidopsis thaliana. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:212-7. [PMID: 22401882 DOI: 10.1016/j.pbi.2012.02.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2011] [Revised: 01/11/2012] [Accepted: 02/09/2012] [Indexed: 05/04/2023]
Abstract
There has been a long standing interest in the relationship between genetic and phenotypic variation in natural populations, in order to understand the genetic basis of adaptation and to discover natural alleles to improve crops. Here we review recent developments in mapping approaches that have significantly improved our ability to identify causal polymorphism explaining natural variation in ecological and evolutionarily relevant traits. However, challenges in interpreting these discoveries remain. In particular, we need more detailed transcriptomic, epigenomic, and gene network data to help understand the mechanisms behind identified associations. Also, more studies need to be performed under field conditions or using experimental evolution to determine whether polymorphisms identified in the lab are relevant for adaptation and improvement under natural conditions.
Collapse
Affiliation(s)
- Paula X Kover
- University of Bath, Department of Biology and Biochemistry, Claverton Down, BA2 7AY, UK.
| | | |
Collapse
|