1
Zhang T, Zhou L, Pu Y, Tang Y, Liu J, Yang L, Zhou T, Feng L, Wang X. A chromosome-level genome reveals genome evolution and molecular basis of anthraquinone biosynthesis in Rheum palmatum. BMC Plant Biology 2024; 24:261. [PMID: 38594606 PMCID: PMC11005207 DOI: 10.1186/s12870-024-04972-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2024] [Accepted: 04/01/2024] [Indexed: 04/11/2024]
Abstract
BACKGROUND Rhubarb is one of the common traditional Chinese medicines, with a diverse array of therapeutic efficacies. Despite its widespread use, molecular research into rhubarb remains limited, constraining our comprehension of its geoherbalism. RESULTS We assembled the genome of Rheum palmatum L., one of the source plants of rhubarb, using a combination of PacBio HiFi, Oxford Nanopore, Illumina, and Hi-C scaffolding approaches, to elucidate its genome evolution and unpack the biosynthetic pathways of its bioactive compounds. A genome of approximately 2.8 Gb was obtained after assembly, with more than 99.9% of sequences anchored to 11 pseudochromosomes (scaffold N50 = 259.19 Mb). Transposable elements (TEs), with a continuous expansion of long terminal repeat retrotransposons (LTRs), account for most of the genome and contributed to the genome expansion of R. palmatum. In total, 30,480 protein-coding genes were predicted, with 473 significantly expanded gene families enriched in diverse pathways associated with high-altitude adaptation in this species. Two successive rounds of whole-genome duplication (WGD) shared by Fagopyrum tataricum and R. palmatum were confirmed. We also identified 54 genes involved in anthraquinone biosynthesis and another 97 genes involved in flavonoid biosynthesis. Notably, RpALS emerged as a compelling candidate gene for octaketide biosynthesis after screening of key residues. CONCLUSION Overall, our findings not only offer an enhanced understanding of this remarkable medicinal plant but also pave the way for future innovations in its genetic breeding, molecular design, and functional genomic studies.
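The scaffold N50 quoted in this abstract is a standard contiguity statistic: the length of the scaffold at which the cumulative length, counting from the longest scaffold down, first reaches half of the total assembly. A minimal sketch of that calculation follows; the function name `n50` and the toy lengths are illustrative assumptions, not data or code from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Hypothetical toy assembly (arbitrary units), not the R. palmatum scaffolds.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 70: 80 + 70 = 150 reaches half of the total (300)
```

A high N50 relative to chromosome size, as reported here, indicates that most of the assembly sits in chromosome-scale scaffolds.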
Affiliation(s)
- Tianyi Zhang
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Lipan Zhou
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Yang Pu
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Yadi Tang
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Jie Liu
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Li Yang
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Tao Zhou
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Li Feng
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
- Xumei Wang
  - School of Pharmacy, Xi'an Jiaotong University, Xi'an, 710061, China
2
Liu Y, Liang N, Xian Q, Zhang W. GC heterogeneity reveals sequence-structures evolution of angiosperm ITS2. BMC Plant Biology 2023; 23:608. [PMID: 38036992 PMCID: PMC10691020 DOI: 10.1186/s12870-023-04634-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/21/2023] [Accepted: 11/26/2023] [Indexed: 12/02/2023]
Abstract
BACKGROUND Although GC variation constitutes a fundamental element of genome and species diversity, the precise mechanisms driving it remain unclear. The abundant sequence data available for ITS2, a commonly employed phylogenetic marker in plants, offer an exceptional resource for exploring GC variation across angiosperms. RESULTS A comprehensive selection of 8666 species, comprising 165 genera, 63 families, and 30 orders, was used for the analyses. The alignment of ITS2 sequence-structures and the partitioning of secondary structures into paired and unpaired regions were performed using 4SALE. Substitution rates and frequencies among GC base-pairs in the paired regions of ITS2 were calculated using RNA-specific models in the PHASE package. The results showed that the distribution of ITS2 GC contents on the angiosperm phylogeny was heterogeneous, but their increase was generally associated with ITS2 sequence homogenization, thereby supporting the occurrence of GC-biased gene conversion (gBGC) during the concerted evolution of ITS2. Additionally, the GC content in the paired regions of the ITS2 secondary structure was significantly higher than that of the unpaired regions, indicating selection of GC for thermodynamic stability. Furthermore, the RNA substitution models demonstrated that base-pair transformations favored both the elevation and fixation of GC in the paired regions, providing further support for gBGC. CONCLUSIONS Our findings highlight the significance of secondary structure in GC investigation and demonstrate that both gBGC and structure-based selection are influential factors driving angiosperm ITS2 GC content.
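The comparison of GC content between paired and unpaired regions can be illustrated with a small sketch. It assumes the secondary structure is available in dot-bracket notation, where `.` marks an unpaired base and any other symbol marks a paired one. This is not the 4SALE workflow used in the study; the function name `gc_by_pairing` and the toy hairpin are hypothetical.

```python
def gc_by_pairing(seq, structure):
    """Return GC fractions for paired and unpaired positions.

    seq        RNA or DNA sequence
    structure  dot-bracket string of the same length ('.' = unpaired)
    """
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    counts = {"paired": [0, 0], "unpaired": [0, 0]}  # [GC count, total]
    for base, symbol in zip(seq.upper(), structure):
        region = "unpaired" if symbol == "." else "paired"
        counts[region][1] += 1
        if base in "GC":
            counts[region][0] += 1
    return {
        region: (gc / total if total else float("nan"))
        for region, (gc, total) in counts.items()
    }


# Hypothetical toy hairpin: a GC-only stem closing an AU-only loop.
result = gc_by_pairing("GGCAUUGCC", "(((...)))")
print(result)  # {'paired': 1.0, 'unpaired': 0.0}
```

Applied per species, the two fractions give the paired versus unpaired contrast that the abstract describes.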
Affiliation(s)
- Yubo Liu
  - Marine College, Shandong University, Weihai, 264209, China
  - Division of Physical Biology, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 201800, China
- Nan Liang
  - Marine College, Shandong University, Weihai, 264209, China
  - Allergy Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
- Qing Xian
  - Marine College, Shandong University, Weihai, 264209, China
- Wei Zhang
  - Marine College, Shandong University, Weihai, 264209, China
3
Serrano-León IM, Prieto P, Aguilar M. Telomere and subtelomere high polymorphism might contribute to the specificity of homologous recognition and pairing during meiosis in barley in the context of breeding. BMC Genomics 2023; 24:642. [PMID: 37884878 PMCID: PMC10601145 DOI: 10.1186/s12864-023-09738-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/26/2023] [Accepted: 10/12/2023] [Indexed: 10/28/2023]
Abstract
Barley (Hordeum vulgare) is one of the most popular cereal crops globally. Although it is a diploid species (2n = 2x = 14), the study of its genome organization is necessary in the framework of plant breeding, since barley is often used in crosses with other cereals such as wheat to provide them with advantageous characters. We already have extensive knowledge of different stages of meiosis, the cell division that generates gametes in species with sexual reproduction, such as the formation of the synaptonemal complex, recombination, and chromosome segregation. But meiosis really starts with the identification of homologous chromosomes and pairing initiation, and it is still unclear exactly how chromosomes choose a partner with which to pair appropriately for subsequent recombination and segregation. In this work we present an exhaustive molecular analysis of both telomeres and subtelomeres of barley chromosome arms 2H-L, 3H-L and 5H-L. As expected, the analysis of multiple features, including transposable elements, repeats, GC content, predicted CpG islands, recombination hotspots, G4 quadruplexes, genes and targeted sequence motifs for key DNA-binding proteins, revealed a high degree of variability in both telomeres and subtelomeres. These polymorphisms may provide the molecular basis for the specificity of homologous recognition and pairing in the early chromosomal interactions at the start of meiosis in barley. A more relevant role of telomeres and the most distal part of subtelomeres is suggested.
Affiliation(s)
- I M Serrano-León
  - Plant Breeding Department, Institute for Sustainable Agriculture, Agencia Estatal Consejo Superior de Investigaciones Científicas (CSIC), Avenida Menéndez Pidal S/N., Campus Alameda del Obispo, 14004, Córdoba, Spain
- P Prieto
  - Plant Breeding Department, Institute for Sustainable Agriculture, Agencia Estatal Consejo Superior de Investigaciones Científicas (CSIC), Avenida Menéndez Pidal S/N., Campus Alameda del Obispo, 14004, Córdoba, Spain
- M Aguilar
  - Área de Fisiología Vegetal, Universidad de Córdoba, Campus de Rabanales, Edif. C4, 3ª Planta, Córdoba, Spain
4
Smith SA, Walker-Hale N, Parins-Fukuchi CT. Compositional shifts associated with major evolutionary transitions in plants. The New Phytologist 2023; 239:2404-2415. [PMID: 37381083 DOI: 10.1111/nph.19099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2023] [Accepted: 06/04/2023] [Indexed: 06/30/2023]
Abstract
Heterogeneity in gene trees, morphological characters, and composition has been associated with several major plant clades. Here, we examine heterogeneity in composition across a large transcriptomic dataset of plants to better understand whether locations of shifts in composition are shared across gene regions and whether directions of shifts within clades are shared across gene regions. We estimate mixed models of composition for both nucleotide and amino acids across a recent large-scale transcriptomic dataset for plants. We find shifts in composition across both nucleotide and amino acid datasets, with more shifts detected in nucleotides. We find that Chlorophytes and lineages within experience the most shifts. However, many shifts occur at the origins of land, vascular, and seed plants. While genes in these clades do not typically share the same composition, they tend to shift in the same direction. We discuss potential causes of these patterns. Compositional heterogeneity has been highlighted as a potential problem for phylogenetic analysis, but the variation presented here highlights the need to further investigate these patterns for the signal of biological processes.
Affiliation(s)
- Stephen A Smith
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48103, USA
5
Näsvall K, Boman J, Talla V, Backström N. Base Composition, Codon Usage, and Patterns of Gene Sequence Evolution in Butterflies. Genome Biol Evol 2023; 15:evad150. [PMID: 37565492 PMCID: PMC10462419 DOI: 10.1093/gbe/evad150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2022] [Revised: 07/17/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023]
Abstract
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
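GC content at third codon positions (often abbreviated GC3) is the quantity this abstract relates to codon usage bias. A minimal sketch of how it can be computed from an in-frame coding sequence follows; the function name `gc3` and the toy sequence are illustrative assumptions and do not reproduce the authors' pipeline.

```python
def gc3(cds):
    """Return the GC fraction at third codon positions of an in-frame CDS.

    Trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # drop any incomplete final codon
    third_positions = cds[2:usable:3]  # bases at indices 2, 5, 8, ...
    if not third_positions:
        raise ValueError("sequence contains no complete codon")
    gc_count = sum(base in "GC" for base in third_positions)
    return gc_count / len(third_positions)


# Hypothetical toy CDS: codons ATG GCC AAA TTC have third bases G, C, A, C.
print(gc3("ATGGCCAAATTC"))  # 0.75
```

A genome in which A/T-ending codons are overrepresented, as reported for most species here, would show GC3 values below 0.5 for most genes.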
Affiliation(s)
- Karin Näsvall
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Jesper Boman
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Venkat Talla
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Niclas Backström
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
6
Deb SK, Edger PP, Pires JC, McKain MR. Patterns, mechanisms, and consequences of homoeologous exchange in allopolyploid angiosperms: a genomic and epigenomic perspective. The New Phytologist 2023; 238:2284-2304. [PMID: 37010081 DOI: 10.1111/nph.18927] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 08/31/2022] [Accepted: 03/16/2023] [Indexed: 05/19/2023]
Abstract
Allopolyploids result from hybridization between different evolutionary lineages coupled with genome doubling. Homoeologous chromosomes (chromosomes with common shared ancestry) may undergo recombination immediately after allopolyploid formation and continue over successive generations. The outcome of this meiotic pairing behavior is dynamic and complex. Homoeologous exchanges (HEs) may lead to the formation of unbalanced gametes, reduced fertility, and selective disadvantage. By contrast, HEs could act as sources of novel evolutionary substrates, shifting the relative dosage of parental gene copies, generating novel phenotypic diversity, and helping the establishment of neo-allopolyploids. However, HE patterns vary among lineages, across generations, and even within individual genomes and chromosomes. The causes and consequences of this variation are not fully understood, though interest in this evolutionary phenomenon has increased in the last decade. Recent technological advances show promise in uncovering the mechanistic basis of HEs. Here, we describe recent observations of the common patterns among allopolyploid angiosperm lineages, underlying genomic and epigenomic features, and consequences of HEs. We identify critical research gaps and discuss future directions with far-reaching implications in understanding allopolyploid evolution and applying them to the development of important phenotypic traits of polyploid crops.
Affiliation(s)
- Sontosh K Deb
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, 35487, USA
  - Department of Forestry and Environmental Science, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
- Patrick P Edger
  - Department of Horticulture, Michigan State University, East Lansing, MI, 48823, USA
  - Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI, 48823, USA
- J Chris Pires
  - Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, 80523, USA
- Michael R McKain
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, 35487, USA
7
Palahí I Torres A, Höök L, Näsvall K, Shipilina D, Wiklund C, Vila R, Pruisscher P, Backström N. The fine-scale recombination rate variation and associations with genomic features in a butterfly. Genome Res 2023; 33:810-823. [PMID: 37308293 PMCID: PMC10317125 DOI: 10.1101/gr.277414.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/19/2022] [Accepted: 05/03/2023] [Indexed: 06/14/2023]
Abstract
Recombination is a key molecular mechanism that has profound implications on both micro- and macroevolutionary processes. However, the determinants of recombination rate variation in holocentric organisms are poorly understood, in particular in Lepidoptera (moths and butterflies). The wood white butterfly (Leptidea sinapis) shows considerable intraspecific variation in chromosome numbers and is a suitable system for studying regional recombination rate variation and its potential molecular underpinnings. Here, we developed a large whole-genome resequencing data set from a population of wood whites to obtain high-resolution recombination maps using linkage disequilibrium information. The analyses revealed that larger chromosomes had a bimodal recombination landscape, potentially caused by interference between simultaneous chiasmata. The recombination rate was significantly lower in subtelomeric regions, with exceptions associated with segregating chromosome rearrangements, showing that fissions and fusions can have considerable effects on the recombination landscape. There was no association between the inferred recombination rate and base composition, supporting a limited influence of GC-biased gene conversion in butterflies. We found significant but variable associations between the recombination rate and the density of different classes of transposable elements, most notably a significant enrichment of short interspersed nucleotide elements in genomic regions with higher recombination rate. Finally, the analyses unveiled significant enrichment of genes involved in farnesyltranstransferase activity in recombination coldspots, potentially indicating that expression of transferases can inhibit formation of chiasmata during meiotic division. Our results provide novel information about recombination rate variation in holocentric organisms and have particular implications for forthcoming research in population genetics, molecular/genome evolution, and speciation.
Affiliation(s)
- Aleix Palahí I Torres
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
- Lars Höök
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
- Karin Näsvall
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
- Daria Shipilina
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
- Christer Wiklund
  - Department of Zoology: Division of Ecology, Stockholm University, SE-106 91 Stockholm, Sweden
- Roger Vila
  - Butterfly Diversity and Evolution Lab, Institut de Biologia Evolutiva (CSIC-UPF), 08003 Barcelona, Spain
- Peter Pruisscher
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
- Niclas Backström
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, SE-752 36 Uppsala, Sweden
8
Xian Q, Wang S, Liu Y, Kan S, Zhang W. Structure-Based GC Investigation Sheds New Light on ITS2 Evolution in Corydalis Species. Int J Mol Sci 2023; 24:ijms24097716. [PMID: 37175423 PMCID: PMC10178233 DOI: 10.3390/ijms24097716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2023] [Revised: 04/20/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023]
Abstract
Guanine and cytosine (GC) content is a fundamental component of genetic diversity and essential for phylogenetic analyses. However, the GC content of the ribosomal internal transcribed spacer 2 (ITS2) remains unknown, despite the fact that ITS2 is a widely used phylogenetic marker. Here, the ITS2 was high-throughput sequenced from 29 Corydalis species, and their GC contents were comparatively investigated in the context of ITS2's characteristic secondary structure and concerted evolution. Our results showed that the GC contents of ITS2 were 131% higher than those of their adjacent 5.8S regions, suggesting that ITS2 underwent GC-biased evolution. These GCs were distributed in a heterogeneous manner in the ITS2 secondary structure, with the paired regions being 130% larger than the unpaired regions, indicating that GC is chosen for thermodynamic stability. In addition, species with homogeneous ITS2 sequences were always GC-rich, supporting GC-biased gene conversion (gBGC), which occurred with ITS2's concerted evolution. The RNA substitution model inferred also showed a GC preference among base pair transformations, which again supports gBGC. Overall, structurally based GC investigation reveals that ITS2 evolves under structural stability and gBGC selection, significantly increasing its GC content.
Affiliation(s)
- Qing Xian
  - Marine College, Shandong University, Weihai 264209, China
- Suyin Wang
  - Marine College, Shandong University, Weihai 264209, China
- Yanyan Liu
  - College of Plant Protection, Henan Agricultural University, Zhengzhou 450002, China
- Shenglong Kan
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- Wei Zhang
  - Marine College, Shandong University, Weihai 264209, China
9
Liu C, Chen HH, Tang LZ, Khine PK, Han LH, Song Y, Tan YH. Plastid genome evolution of a monophyletic group in the subtribe Lauriineae (Laureae, Lauraceae). Plant Diversity 2022; 44:377-388. [PMID: 35967258 PMCID: PMC9363652 DOI: 10.1016/j.pld.2021.11.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 05/20/2021] [Revised: 11/28/2021] [Accepted: 11/29/2021] [Indexed: 06/15/2023]
Abstract
Litsea, a non-monophyletic group of the tribe Laureae (Lauraceae), plays important roles in the tropical and subtropical forests of Asia, Australia, Central and North America, and the islands of the Pacific. However, intergeneric relationships between Litsea and Laurus, Lindera, Parasassafras and Sinosassafras of the tribe Laureae remain unresolved. In this study, we present phylogenetic analyses of seven newly sequenced Litsea plastomes, together with 47 Laureae plastomes obtained from public databases, representing six genera of the Laureae. Our results highlight two highly supported monophyletic groups of Litsea taxa. One is composed of 16 Litsea taxa and two Lindera taxa. The 18 plastomes of these taxa were further compared for their gene structure, codon usage, contraction and expansion of inverted repeats, sequence repeats, divergence hotspots, and gene evolution. The complete plastome size of newly sequenced taxa varied between 152,377 bp (Litsea auriculata) and 154,117 bp (Litsea pierrei). Seven of the 16 Litsea plastomes have a pair of insertions in the IRa (trnL-trnH) and IRb (ycf2) regions. The 18 plastomes of Litsea and Lindera taxa exhibit similar gene features, codon usage, oligonucleotide repeats, and inverted repeat dynamics. The codons with the highest frequency among these taxa favored A/T endings and each of these plastomes had nine divergence hotspots, which are located in the same regions. We also identified six protein coding genes (accD, ndhJ, rbcL, rpoC2, ycf1 and ycf2) under positive selection in Litsea; these genes may play important roles in adaptation of Litsea species to various environments.
Affiliation(s)
- Chao Liu
  - College of Biological Resource and Food Engineering, Yunnan Engineering Research Center of Fruit Wine, Qujing Normal University, Qujing, Yunnan, 655011, China
- Huan-Huan Chen
  - College of Biological Resource and Food Engineering, Yunnan Engineering Research Center of Fruit Wine, Qujing Normal University, Qujing, Yunnan, 655011, China
- Li-Zhou Tang
  - College of Biological Resource and Food Engineering, Yunnan Engineering Research Center of Fruit Wine, Qujing Normal University, Qujing, Yunnan, 655011, China
- Phyo Kay Khine
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
- Li-Hong Han
  - College of Biological Resource and Food Engineering, Yunnan Engineering Research Center of Fruit Wine, Qujing Normal University, Qujing, Yunnan, 655011, China
- Yu Song
  - Key Laboratory of Ecology of Rare and Endangered Species and Environmental Protection (Ministry of Education), Guangxi Key Laboratory of Landscape Resources Conservation and Sustainable Utilization in Lijiang River Basin, Guangxi Normal University, Guilin, Guangxi, 541004, China
- Yun-Hong Tan
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Southeast Asia Biodiversity Research Institute, Chinese Academy of Sciences, Yezin, Nay Pyi Taw, 05282, Myanmar
10
GC content of plant genes is linked to past gene duplications. PLoS One 2022; 17:e0261748. [PMID: 35025913 PMCID: PMC8758071 DOI: 10.1371/journal.pone.0261748] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/02/2021] [Accepted: 12/09/2021] [Indexed: 11/24/2022]
Abstract
The frequency of G and C nucleotides in genomes varies from species to species, and sometimes even between different genes in the same genome. The monocot grasses have a bimodal distribution of genic GC content absent in dicots. We categorized plant genes from 5 dicots and 4 monocot grasses by synteny to related species and determined that syntenic genes have significantly higher GC content than non-syntenic genes at their 5′-end in the third position within codons for all 9 species. Lower GC content is correlated with gene duplication, as lack of synteny to distantly related genomes is associated with past interspersed gene duplications. Two mutation types can account for biased GC content: mutation of methylated C to T and gene conversion from A to G. Gene conversion involves non-reciprocal exchanges between homologous alleles and is not detectable when the alleles are identical or heterozygous for presence-absence variation, both likely situations for genes duplicated to new loci. Gene duplication can cause production of siRNA, which can induce targeted methylation, elevating mC→T mutations. Recently duplicated plant genes are more frequently methylated and less likely to undergo gene conversion, each of these factors synergistically creating a mutational environment favoring AT nucleotides. The syntenic genes with high GC content in the grasses compose a subset that have undergone few duplications, or for which duplicate copies were purged by selection. We propose a “biased gene duplication / biased mutation” (BDBM) model that may explain the origin and trajectory of the observed link between duplication and genic GC bias. The BDBM model is supported by empirical data based on joint analyses of 9 angiosperm species with their genes categorized by duplication status, GC content, methylation levels and functional classes.
11
Jackson EK, Bellott DW, Skaletsky H, Page DC. GC-biased gene conversion in X-chromosome palindromes conserved in human, chimpanzee, and rhesus macaque. G3 Genes|Genomes|Genetics 2021; 11:6317831. [PMID: 34849781 PMCID: PMC8981503 DOI: 10.1093/g3journal/jkab224] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/21/2021] [Accepted: 06/28/2021] [Indexed: 12/03/2022]
Abstract
Gene conversion is GC-biased across a wide range of taxa. Large palindromes on mammalian sex chromosomes undergo frequent gene conversion that maintains arm-to-arm sequence identity greater than 99%, which may increase their susceptibility to the effects of GC-biased gene conversion. Here, we demonstrate a striking history of GC-biased gene conversion in 12 palindromes conserved on the X chromosomes of human, chimpanzee, and rhesus macaque. Primate X-chromosome palindrome arms have significantly higher GC content than flanking single-copy sequences. Nucleotide replacements that occurred in human and chimpanzee palindrome arms over the past 7 million years are one-and-a-half times as GC-rich as the ancestral bases they replaced. Using simulations, we show that our observed pattern of nucleotide replacements is consistent with GC-biased gene conversion with a magnitude of 70%, similar to previously reported values based on analyses of human meioses. However, GC-biased gene conversion since the divergence of human and rhesus macaque explains only a fraction of the observed difference in GC content between palindrome arms and flanking sequence, suggesting that palindromes are older than 29 million years and/or had elevated GC content at the time of their formation. This work supports a greater than 2:1 preference for GC bases over AT bases during gene conversion and demonstrates that the evolution and composition of mammalian sex chromosome palindromes is strongly influenced by GC-biased gene conversion.
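The link between a conversion bias "magnitude of 70%" and a "greater than 2:1 preference" is a short piece of arithmetic. The sketch below assumes the magnitude means the fraction of conversion events at GC/AT mismatches that resolve in favor of the GC base; that reading is an interpretation of the abstract, and the function name `gc_to_at_odds` is hypothetical.

```python
def gc_to_at_odds(gc_bias):
    """Convert a GC conversion bias (fraction of events favoring GC)
    into odds of GC over AT outcomes."""
    if not 0 < gc_bias < 1:
        raise ValueError("gc_bias must lie strictly between 0 and 1")
    return gc_bias / (1 - gc_bias)


# Under the stated assumption, a 70% bias gives 0.70 / 0.30, about 2.33 to 1.
odds = gc_to_at_odds(0.70)
print(round(odds, 2))  # 2.33
```

Under this reading, an unbiased process corresponds to a magnitude of 50% and odds of exactly 1:1.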
Affiliation(s)
- Emily K Jackson
  - Whitehead Institute, Cambridge, MA 02142, USA
  - Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
  - Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Helen Skaletsky
  - Whitehead Institute, Cambridge, MA 02142, USA
  - Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
- David C Page
  - Whitehead Institute, Cambridge, MA 02142, USA
  - Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
  - Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
12
Singh K, Sharmila P, Kumar PA, Pardha-Saradhi P. Successful expression of the synthetic merBps gene in tobacco. Plant Physiology and Biochemistry : PPB 2021; 167:874-883. [PMID: 34537577 DOI: 10.1016/j.plaphy.2021.09.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/08/2021] [Revised: 09/10/2021] [Accepted: 09/12/2021] [Indexed: 06/13/2023]
Abstract
Organomercury is the most toxic biomagnifiable state of mercury, and to date, no natural organomercurial detoxification mechanism has been encountered in plants. The bacterial merB gene encoding organomercury lyase shows low expression in transgenic plants. For ideal expression, a synthetic merBps gene, in which 143 out of 213 codons differ from the native merB gene of Escherichia coli, was fabricated based on codon usage in tobacco. Through Agrobacterium-mediated transformation, the merBps gene was successfully integrated into tobacco. Of several putative merBps transformants selected with 200 μg ml⁻¹ kanamycin, only ∼45% were PCR positive for both the nptII and merBps genes. Healthy and vigorously growing shoots of a few PCR-positive putative transgenic lines were multiplied and rooted. After transplantation and acclimatization, the resultant plants flowered and fruited in pots. Southern analysis revealed the presence of a single copy of the merBps gene in four lines. RT-PCR and Western investigations established successful transcription and translation, respectively, of the merBps gene in these transgenic lines. Fabrication of fully functional organomercury lyase in merBps transgenic lines was established based on the potential of their (i) seeds to germinate; (ii) shoots to grow and multiply; and (iii) leaf discs to remain green, even in the presence of 4 nmol ml⁻¹ phenylmercuryacetate (PMA), while the wild type was susceptible to even 1 nmol ml⁻¹ PMA. These findings confirmed that the synthetic merBps gene could be effectively expressed in plants and exploited for remediation of organomercurial-contaminated sites.
Collapse
Affiliation(s)
- Kavita Singh
- Department of Environmental Studies, University of Delhi, Delhi, 110007, India; National Research Center on Plant Biotechnology, Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Peddisetty Sharmila
- Department of Environmental Studies, University of Delhi, Delhi, 110007, India
| | - P Ananda Kumar
- National Research Center on Plant Biotechnology, Indian Agricultural Research Institute, New Delhi, 110012, India
| | - P Pardha-Saradhi
- Department of Environmental Studies, University of Delhi, Delhi, 110007, India.
| |
|
13
|
Wang J, Lin Y, Xi M. Analysis of Codon Usage Patterns of Six Sequenced Brachypodium distachyon Lines Reveals a Declining CG Skew of the CDSs from the 5'-ends to the 3'-ends. Genes (Basel) 2021; 12:1467. [PMID: 34680862 PMCID: PMC8535453 DOI: 10.3390/genes12101467] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/09/2021] [Accepted: 09/20/2021] [Indexed: 02/01/2023] Open
Abstract
Brachypodium distachyon, a new monocotyledonous model plant, has received wide attention in biological research due to its small genome and numerous genetic resources. Codon usage bias is an important feature of genes and genomes, and it can be used in transgenic and evolutionary studies. In this study, the nucleotide compositions and patterns of codon usage bias were calculated using Codon W. Additionally, an ENC plot, Parity rule 2 and correspondence analyses were used to explore the major factors influencing codon usage bias patterns. The numbers of hydrogen bonds and skews were used to analyze the GC trend in the 5'-ends of the coding sequences. The results showed that minor differences in the codon usage bias patterns were revealed by the ENC plot, Parity rule 2 and correspondence analyses. The analyses of the CG-skew and the number of hydrogen bonds showed a declining trend in the number of cytosines at the 5'-ends of the CDSs (from the 5'-ends to the 3'-ends), indicating that GC may play a major role in codon usage bias. In addition, our results laid a foundation for the study of codon usage bias patterns in the Brachypodium genus and suggested that GC content plays a major role in determining these patterns.
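The declining CG skew described in this abstract can be illustrated with a minimal sketch. It assumes the skew is defined as (C − G)/(C + G) per window, which the abstract does not state explicitly; the function names, window size, and toy sequence are hypothetical and are not taken from the paper or from Codon W.

```python
from __future__ import annotations


def cg_skew(seq: str) -> float:
    """Return (C - G) / (C + G) for a sequence; 0.0 if it contains no C or G.

    Assumption: this sign convention (C minus G) is chosen for illustration.
    """
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    return (c - g) / (c + g) if (c + g) else 0.0


def skew_profile(cds: str, window: int = 30, step: int = 30) -> list[float]:
    """CG skew in consecutive windows, ordered from the 5'-end to the 3'-end."""
    return [
        cg_skew(cds[i:i + window])
        for i in range(0, len(cds) - window + 1, step)
    ]


# Toy sequence (made up): C-rich near the 5'-end, G-rich near the 3'-end.
toy_cds = "ATGCCCCACCTC" + "ATGCGCATGCGA" + "ATGGGGTGGAGG"
profile = skew_profile(toy_cds, window=12, step=12)
print(profile)
```

On this toy input the skew falls from window to window, which is the kind of 5'-to-3' decline the abstract reports; real analyses would average such profiles over many aligned CDSs.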
Affiliation(s)
- Jianyong Wang
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| | - Yujing Lin
- Shanghai Center for Plant Stress Biology and Center for Excellence in Molecular Plant Sciences, University of Chinese Academy of Sciences, Shanghai 200032, China;
| | - Mengli Xi
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| |
|
14
|
Daron J, Bravo IG. Variability in Codon Usage in Coronaviruses Is Mainly Driven by Mutational Bias and Selective Constraints on CpG Dinucleotide. Viruses 2021; 13:v13091800. [PMID: 34578381 PMCID: PMC8473333 DOI: 10.3390/v13091800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/18/2022] Open
Abstract
The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.
Affiliation(s)
- Josquin Daron
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
| | - Ignacio G. Bravo
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Center for Research on the Ecology and Evolution of Diseases (CREES), 34394 Montpellier, France
| |
|
15
|
Yang C, Zhao Q, Wang Y, Zhao J, Qiao L, Wu B, Yan S, Zheng J, Zheng X. Comparative Analysis of Genomic and Transcriptome Sequences Reveals Divergent Patterns of Codon Bias in Wheat and Its Ancestor Species. Front Genet 2021; 12:732432. [PMID: 34490050 PMCID: PMC8417831 DOI: 10.3389/fgene.2021.732432] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 07/29/2021] [Indexed: 11/29/2022] Open
Abstract
Synonymous codon usage shows a characteristic pattern of preference in each organism. This codon usage bias is thought to have evolved for efficient protein synthesis. Synonymous codon usage was studied in genes of the hexaploid wheat Triticum aestivum (AABBDD) and its progenitor species, Triticum urartu (AA), Aegilops tauschii (DD), and Triticum turgidum (AABB). Triticum aestivum exhibited stronger usage bias for G/C-ending codons than did the three progenitor species, and this bias was especially pronounced relative to T. turgidum and Ae. tauschii. High GC content is a primary factor influencing codon usage in T. aestivum. Neutrality analysis showed a significant positive correlation (p<0.001) between GC12 and GC3 in the four species with regression line slopes near zero (0.16–0.20), suggesting that the effect of mutation on codon usage was only 16–20%. The GC3s values of genes were associated with gene length and distribution density within chromosomes. tRNA abundance data indicated that codon preference corresponded to the relative abundance of isoaccepting tRNAs in the four species. Both mutation and selection have affected synonymous codon usage in hexaploid wheat and its progenitor species. GO enrichment showed that GC-biased genes were commonly enriched in physiological processes such as photosynthesis and response to acid chemical. In certain gene families with important functions, the codon usage of a small proportion of genes has changed during the evolution of T. aestivum.
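The neutrality analysis mentioned in this abstract regresses GC12 (GC content at first and second codon positions) on GC3 (GC content at third positions) across genes. The following is a minimal sketch of that calculation using ordinary least squares; the helper names and the two toy sequences are hypothetical, and the authors' actual pipeline is not described in the abstract.

```python
def gc_at_positions(cds, positions):
    """GC fraction at the given codon positions (0, 1, 2) of an in-frame CDS."""
    cds = cds.upper()
    # Walk complete codons only; a trailing partial codon is ignored.
    bases = [cds[i + p] for i in range(0, len(cds) - 2, 3) for p in positions]
    return sum(b in "GC" for b in bases) / len(bases)


def neutrality_slope(cds_list):
    """Least-squares slope of GC12 (y) regressed on GC3 (x) across genes."""
    gc3 = [gc_at_positions(s, (2,)) for s in cds_list]
    gc12 = [gc_at_positions(s, (0, 1)) for s in cds_list]
    mean_x = sum(gc3) / len(gc3)
    mean_y = sum(gc12) / len(gc12)
    sxx = sum((x - mean_x) ** 2 for x in gc3)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(gc3, gc12))
    return sxy / sxx


# Toy genes (made up): GC3 = 0 and GC12 = 0 versus GC3 = 1 and GC12 = 0.5.
toy_genes = ["ATAATAATAATA", "GCGGCGATGATG"]
slope = neutrality_slope(toy_genes)
print(slope)
```

Under the usual reading of a neutrality plot, a slope close to 1 would mean GC12 tracks GC3 (mutation pressure dominates), while a slope close to 0, such as the 0.16–0.20 reported here, is taken to indicate that mutation accounts for only a small share of the pattern.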
Affiliation(s)
- Chenkang Yang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Qi Zhao
- School of Life Science, Shanxi University, Taiyuan, China
| | - Ying Wang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Jiajia Zhao
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Ling Qiao
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Bangbang Wu
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Suxian Yan
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Jun Zheng
- School of Life Science, Shanxi University, Taiyuan, China.,State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Xingwei Zheng
- School of Life Science, Shanxi University, Taiyuan, China.,State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| |
|
16
|
Boman J, Mugal CF, Backström N. The Effects of GC-Biased Gene Conversion on Patterns of Genetic Diversity among and across Butterfly Genomes. Genome Biol Evol 2021; 13:evab064. [PMID: 33760095 PMCID: PMC8175052 DOI: 10.1093/gbe/evab064] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2021] [Indexed: 12/28/2022] Open
Abstract
Recombination reshuffles the alleles of a population through crossover and gene conversion. These mechanisms have considerable consequences on the evolution and maintenance of genetic diversity. Crossover, for example, can increase genetic diversity by breaking the linkage between selected and nearby neutral variants. Bias in favor of G or C alleles during gene conversion may instead promote the fixation of one allele over the other, thus decreasing diversity. Mutation bias from G or C to A and T opposes GC-biased gene conversion (gBGC). Less recognized is that these two processes may, when balanced, promote genetic diversity. Here, we investigate how gBGC and mutation bias shape genetic diversity patterns in wood white butterflies (Leptidea sp.). This constitutes the first in-depth investigation of gBGC in butterflies. Using 60 resequenced genomes from six populations of three species, we find substantial variation in the strength of gBGC across lineages. When modeling the balance of gBGC and mutation bias and comparing analytical results with empirical data, we reject gBGC as the main determinant of genetic diversity in these butterfly species. As alternatives, we consider linked selection and GC content. We find evidence that high values of both reduce diversity. We also show that the joint effects of gBGC and mutation bias can give rise to a diversity pattern which resembles the signature of linked selection. Consequently, gBGC should be considered when interpreting the effects of linked selection on levels of genetic diversity.
Affiliation(s)
- Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| | - Carina F Mugal
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| |
|
17
|
Jia X, Zhang Q, Jiang M, Huang J, Yu L, Traw MB, Tian D, Hurst LD, Yang S. Mitotic gene conversion can be as important as meiotic conversion in driving genetic variability in plants and other species without early germline segregation. PLoS Biol 2021; 19:e3001164. [PMID: 33750968 PMCID: PMC8016264 DOI: 10.1371/journal.pbio.3001164] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 04/01/2021] [Accepted: 03/02/2021] [Indexed: 12/24/2022] Open
Abstract
In contrast to common meiotic gene conversion, mitotic gene conversion, because it is so rare, is often ignored as a process influencing allelic diversity. We show that if there is a large enough number of premeiotic cell divisions, as seen in many organisms without early germline sequestration, such as plants, this is an unsafe position. From examination of 1.1 million rice plants, we determined that the rate of mitotic gene conversion events, per mitosis, is 2 orders of magnitude lower than the meiotic rate. However, owing to the large number of mitoses between zygote and gamete and because of long mitotic tract lengths, meiotic and mitotic gene conversion can be of approximately equivalent importance in terms of numbers of markers converted from zygote to gamete. This holds even if we assume a low number of premeiotic cell divisions (approximately 40) as witnessed in Arabidopsis. A low mitotic rate associated with long tracts is also seen in yeast, suggesting generality of results. For species with many mitoses between each meiotic event, mitotic gene conversion should not be overlooked. Gene conversion associated with meiosis has long been a focus of attention in population genomics, but mitotic conversion has been relatively overlooked as it was thought to be rare. Analysis in plants suggests that this could be a mistake; long tract lengths and multiple mitoses in species lacking germline sequestration suggest that mitotic conversion, although rare per mitosis, should not be ignored.
Affiliation(s)
- Xianqing Jia
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China.,State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Qijun Zhang
- Institute of Food Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Mengmeng Jiang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Ju Huang
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Luyao Yu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Milton Brian Traw
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Dacheng Tian
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China.,State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | - Sihai Yang
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China.,State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| |
|
18
|
Yu Y, Li HT, Wu YH, Li DZ. Correlation Analysis Reveals an Important Role of GC Content in Accumulation of Deletion Mutations in the Coding Region of Angiosperm Plastomes. J Mol Evol 2021; 89:73-80. [PMID: 33433638 DOI: 10.1007/s00239-020-09987-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 12/21/2020] [Indexed: 10/22/2022]
Abstract
Variation in GC content is assumed to correlate with various processes, including mutation biases, recombination, and environmental parameters. To date, most genomic studies exploring the evolution of GC content have focused on nuclear genomes, but relatively few have concentrated on organelle genomes. We explored the mechanisms maintaining the GC content in angiosperm plastomes, with a particular focus on the hypothesis of phylogenetic dependence and the correlation with deletion mutations. We measured three genetic traits, namely, GC content, A/T tracts, and G/C tracts, in the coding region of plastid genomes for 1382 angiosperm species representing 350 families and 64 orders, and tested the phylogenetic signal. Then, we performed correlation analyses and revealed the variation in evolutionary rate of selected traits using RRphylo. The plastid GC content in the coding region varied from 28.10% to 43.20% across angiosperms, with a few non-photosynthetic species showing highly reduced values, highlighting the significance of functional constraints. We found strong phylogenetic signal in A/T tracts, but weak ones in GC content and G/C tracts, indicating adaptive potential. GC content was positively and negatively correlated with G/C and A/T tracts, respectively, suggesting a trade-off between these two deletion events. GC content evolved at various rates across the phylogeny, with significant increases in monocots and Lamiids, and a decrease in Fabids, implying the effects of some other factors. We hypothesize that variation in plastid GC content might be a mixed strategy of species to optimize fitness in fluctuating climates, partly through influencing the trade-off between AT → GC and GC → AT mutations.
Affiliation(s)
- Ying Yu
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, 311121, China
| | - Hong-Tao Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Yu-Huan Wu
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, 311121, China.
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China.
| |
|
19
|
Aguilar M, Prieto P. Sequence analysis of wheat subtelomeres reveals a high polymorphism among homoeologous chromosomes. THE PLANT GENOME 2020; 13:e20065. [PMID: 33029942 DOI: 10.1002/tpg2.20065] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 07/20/2020] [Accepted: 09/08/2020] [Indexed: 05/23/2023]
Abstract
Bread wheat, Triticum aestivum L., is one of the most important crops in the world. Understanding its genome organization (allohexaploid; AABBDD; 2n = 6x = 42) is essential for geneticists and plant breeders. Particularly, the knowledge of how homologous chromosomes (equivalent chromosomes from the same genome) specifically recognize each other to pair at the beginning of meiosis, the cellular process to generate gametes in sexually reproducing organisms, is fundamental for plant breeding and has a big influence on the fertility of wheat plants. Initial homologous chromosome interactions contribute to specific recognition and pairing between homologues at the onset of meiosis. Understanding the molecular basis of these critical processes can help to develop genetic tools in a breeding context to promote interspecific chromosome associations in hybrids or interspecific genetic crosses to facilitate the transfer of desirable agronomic traits from related species into a crop like wheat. The terminal regions of chromosomes, which include telomeres and subtelomeres, participate in chromosome recognition and pairing. We present a detailed molecular analysis of subtelomeres of wheat chromosome arms 1AS, 4AS, 7AS, 7BS and 7DS. Results showed a high polymorphism in the subtelomeric region among homoeologues (equivalent chromosomes from related genomes) for all the features analyzed, including genes, transposable elements, repeats, GC content, predicted CpG islands, recombination hotspots and targeted sequence motifs for relevant DNA-binding proteins. These polymorphisms might be the molecular basis for the specificity of homologous recognition and pairing in initial chromosome interactions at the beginning of meiosis in wheat.
Affiliation(s)
- Miguel Aguilar
- Área de Fisiología Vegetal. Universidad de Córdoba. Campus de Rabanales, edif. C4, 3a planta, Córdoba, Spain
| | - Pilar Prieto
- Plant Breeding Department, Institute for Sustainable Agriculture, Agencia Estatal Consejo Superior de Investigaciones Científicas (CSIC), Alameda del Obispo s/n, Apartado 4084, Córdoba, 14080, Spain
| |
|
20
|
Hämälä T, Tiffin P. Biased Gene Conversion Constrains Adaptation in Arabidopsis thaliana. Genetics 2020; 215:831-846. [PMID: 32414868 PMCID: PMC7337087 DOI: 10.1534/genetics.120.303335] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Accepted: 05/14/2020] [Indexed: 02/01/2023] Open
Abstract
Reduction of fitness due to deleterious mutations imposes a limit to adaptive evolution. By characterizing features that influence this genetic load we may better understand constraints on responses to both natural and human-mediated selection. Here, using whole-genome, transcriptome, and methylome data from >600 Arabidopsis thaliana individuals, we set out to identify important features influencing selective constraint. Our analyses reveal that multiple factors underlie the accumulation of maladaptive mutations, including gene expression level, gene network connectivity, and gene-body methylation. We then focus on a feature with major effect, nucleotide composition. The ancestral vs. derived status of segregating alleles suggests that GC-biased gene conversion, a recombination-associated process that increases the frequency of G and C nucleotides regardless of their fitness effects, shapes sequence patterns in A. thaliana. Through estimation of mutational effects, we present evidence that biased gene conversion hinders the purging of deleterious mutations and contributes to a genome-wide signal of decreased efficacy of selection. By comparing these results to two outcrossing relatives, Arabidopsis lyrata and Capsella grandiflora, we find that protein evolution in A. thaliana is as strongly affected by biased gene conversion as in the outcrossing species. Last, we perform simulations to show that natural levels of outcrossing in A. thaliana are sufficient to facilitate biased gene conversion despite increased homozygosity due to selfing. Together, our results show that even predominantly selfing taxa are susceptible to biased gene conversion, suggesting that it may constitute an important constraint to adaptation among plant species.
Affiliation(s)
- Tuomas Hämälä
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
| |
|
21
|
Borges R, Szöllősi GJ, Kosiol C. Quantifying GC-Biased Gene Conversion in Great Ape Genomes Using Polymorphism-Aware Models. Genetics 2019; 212:1321-1336. [PMID: 31147380 PMCID: PMC6707462 DOI: 10.1534/genetics.119.302074] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Accepted: 05/20/2019] [Indexed: 11/18/2022] Open
Abstract
As multi-individual population-scale data become available, more complex modeling strategies are needed to quantify genome-wide patterns of nucleotide usage and associated mechanisms of evolution. Recently, the multivariate neutral Moran model was proposed. However, it was shown to be insufficient to explain the distribution of alleles in great apes. Here, we propose a new model that includes allelic selection. Our theoretical results constitute the basis of a new Bayesian framework to estimate mutation rates and selection coefficients from population data. We apply the new framework to a great ape dataset, where we found patterns of allelic selection that match those of genome-wide GC-biased gene conversion (gBGC). In particular, we show that great apes have patterns of allelic selection that vary in intensity, a feature that we correlated with great apes' distinct demographies. We also demonstrate that the AT/GC toggling effect decreases the probability of a substitution, promoting more polymorphisms in the base composition of great ape genomes. We further assess the impact of GC-bias in molecular analysis, and find that mutation rates and genetic distances are estimated under bias when gBGC is not properly accounted for. Our results contribute to the discussion on the tempo and mode of gBGC evolution, while stressing the need for gBGC-aware models in population genetics and phylogenetics.
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
| | - Gergely J Szöllősi
- Department of Biological Physics, MTA-ELTE "Lendulet" Evolutionary Genomics Research Group, Eötvös University, Pázmány P. stny. 1A, Budapest 1117, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
|
22
|
Barton HJ, Zeng K. New Methods for Inferring the Distribution of Fitness Effects for INDELs and SNPs. Mol Biol Evol 2019; 35:1536-1546. [PMID: 29635416 PMCID: PMC5967470 DOI: 10.1093/molbev/msy054] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Small insertions and deletions (INDELs; ≤50 bp) are the most common type of variability after single nucleotide polymorphism (SNP). However, compared with SNPs, we know little about the distribution of fitness effects (DFE) of new INDEL mutations and how prevalent adaptive INDEL substitutions are. Studying INDELs has been difficult partly because identifying ancestral states at these sites is error-prone and misidentification can lead to severely biased estimates of the strength of selection. To solve these problems, we develop new maximum likelihood methods, which use polymorphism data to simultaneously estimate the DFE, the mutation rate, and the misidentification rate. These methods are applicable to both INDELs and SNPs. Simulations show that they can provide highly accurate results. We applied the methods to an INDEL polymorphism data set in Drosophila melanogaster. We found that the DFE for polymorphic INDELs in protein-coding regions is bimodal, with the variants being either nearly neutral or strongly deleterious. Based on the DFE, we estimated that 71.5–83.7% of the INDEL substitutions that took place along the D. melanogaster lineage were fixed by positive selection, which is comparable with the prevalence of adaptive substitutions at nonsynonymous sites. The new methods have been implemented in the software package anavar.
Affiliation(s)
- Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
|
23
|
Choi JY, Purugganan MD. Evolutionary Epigenomics of Retrotransposon-Mediated Methylation Spreading in Rice. Mol Biol Evol 2019; 35:365-382. [PMID: 29126199 PMCID: PMC5850837 DOI: 10.1093/molbev/msx284] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Plant genomes contain numerous transposable elements (TEs), and many hypotheses on the evolutionary drivers that restrict TE activity have been postulated. Few models, however, have focused on the evolutionary epigenomic interaction between the plant host and its TEs. The host genome recruits epigenetic factors, such as methylation, to silence TEs, but methylation can spread beyond the TE sequence and influence the expression of nearby host genes. In this study, we investigated this epigenetic trade-off between TE and proximal host gene silencing by studying the epigenomic regulation of repressing long terminal repeat (LTR) retrotransposons (RTs) in Oryza sativa. Results showed significant evidence of methylation spreading originating from the LTR-RT sequences, and the extent of spreading was dependent on five factors: 1) LTR-RT family, 2) time since the LTR-RT insertion, 3) recombination rate of the LTR-RT region, 4) level of LTR-RT sequence methylation, and 5) chromosomal location. Methylation spreading had negative effects by reducing host gene expression, but only on host genes with an LTR-RT inserted in their introns. Our results also suggested that high levels of LTR-RT methylation might have a role in suppressing TE-mediated deleterious ectopic recombination. In the end, despite the methylation spreading, no strong epigenetic trade-off was detected, and the majority of LTR-RTs may have only minor epigenetic effects on nearby host genes.
Affiliation(s)
- Jae Young Choi
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY
| | - Michael D Purugganan
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY.,Center for Genomics and Systems Biology, New York University Abu Dhabi, Saadiyat Island, Abu Dhabi, United Arab Emirates
| |
|
24
|
Abstract
A major current challenge in molecular evolution is to link comparative genomic patterns to species' biology and ecology. Breeding systems are pivotal because they affect many population genetic processes and thus genome evolution. We review theoretical predictions and empirical evidence about molecular evolutionary processes under three distinct breeding systems: outcrossing, selfing, and asexuality. Breeding systems may have a profound impact on genome evolution, including molecular evolutionary rates, base composition, genomic conflict, and possibly genome size. We present and discuss the similarities and differences between the effects of selfing and clonality. Conversely, comparative and population genomic data and approaches help in revisiting old questions on the long-term evolution of breeding systems.
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France
| | - Clémentine M François
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France.
| |
|
25
|
Fine-Grained Analysis of Spontaneous Mutation Spectrum and Frequency in Arabidopsis thaliana. Genetics 2018; 211:703-714. [PMID: 30514707 PMCID: PMC6366913 DOI: 10.1534/genetics.118.301721] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 11/29/2018] [Indexed: 01/17/2023] Open
Abstract
Mutations are the ultimate source of all genetic variation. However, few direct estimates of the contribution of mutation to molecular genetic variation are available. To address this issue, we first analyzed the rate and spectrum of mutations in the Arabidopsis thaliana reference accession after 25 generations of single-seed descent. We then compared the mutation profile in these mutation accumulation (MA) lines against genetic variation observed in the 1001 Genomes Project. The estimated haploid single nucleotide mutation (SNM) rate for A. thaliana is 6.95 × 10−9 (SE ± 2.68 × 10−10) per site per generation, with SNMs having higher frequency in transposable elements (TEs) and centromeric regions. The estimated indel mutation rate is 1.30 × 10−9 (±1.07 × 10−10) per site per generation, with deletions being more frequent and larger than insertions. Among the 1694 unique SNMs identified in the MA lines, the positions of 389 SNMs (23%) coincide with biallelic SNPs from the 1001 Genomes population, and in 289 (17%) cases the changes are identical. Of the 329 unique indels identified in the MA lines, 96 (29%) overlap with indels from the 1001 Genomes dataset, and 16 indels (5% of the total) are identical. These overlap frequencies are significantly higher than expected, suggesting that de novo mutations are not uniformly distributed and arise at polymorphic sites more frequently than assumed. These results suggest that high mutation rate potentially contributes to high polymorphism and low mutation rate to reduced polymorphism in natural populations providing insights of mutational inputs in generating natural genetic diversity.
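As a quick arithmetic check, the overlap percentages quoted in this abstract can be recomputed from the reported counts. This is only a sketch: the counts come from the abstract, while the dictionary labels and the `percent` helper are hypothetical names introduced here.

```python
# Counts reported in the abstract: (overlapping, total).
reported = {
    "SNM positions shared with 1001 Genomes SNPs": (389, 1694),
    "SNMs with identical changes": (289, 1694),
    "indels overlapping 1001 Genomes indels": (96, 329),
    "identical indels": (16, 329),
}


def percent(part: int, total: int) -> int:
    """Percentage of `part` in `total`, rounded to the nearest integer."""
    return round(100 * part / total)


percentages = {label: percent(*pair) for label, pair in reported.items()}
for label, value in percentages.items():
    print(f"{label}: {value}%")
```

The rounded values agree with the 23%, 17%, 29%, and 5% stated in the text.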
|
26
|
Corcoran P, Gossmann TI, Barton HJ, Slate J, Zeng K. Determinants of the Efficacy of Natural Selection on Coding and Noncoding Variability in Two Passerine Species. Genome Biol Evol 2018; 9:2987-3007. [PMID: 29045655 PMCID: PMC5714183 DOI: 10.1093/gbe/evx213] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Accepted: 10/16/2017] [Indexed: 02/06/2023] Open
Abstract
Population genetic theory predicts that selection should be more effective when the effective population size (Ne) is larger, and that the efficacy of selection should correlate positively with recombination rate. Here, we analyzed the genomes of ten great tits and ten zebra finches. Nucleotide diversity at 4-fold degenerate sites indicates that zebra finches have a 2.83-fold larger Ne. We obtained clear evidence that purifying selection is more effective in zebra finches. The proportion of substitutions at 0-fold degenerate sites fixed by positive selection (α) is high in both species (great tit 48%; zebra finch 64%) and is significantly higher in zebra finches. When α was estimated on GC-conservative changes (i.e., between A and T and between G and C), the estimates reduced in both species (great tit 22%; zebra finch 53%). A theoretical model presented herein suggests that failing to control for the effects of GC-biased gene conversion (gBGC) is potentially a contributor to the overestimation of α, and that this effect cannot be alleviated by first fitting a demographic model to neutral variants. We present the first estimates in birds for α in the untranslated regions, and found evidence for substantial adaptive changes. Finally, although purifying selection is stronger in high-recombination regions, we obtained mixed evidence for α increasing with recombination rate, especially after accounting for gBGC. These results highlight that it is important to consider the potential confounding effects of gBGC when quantifying selection and that our understanding of what determines the efficacy of selection is incomplete.
Affiliation(s)
- Pádraic Corcoran
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
|
27
|
Rife TW, Graybosch RA, Poland JA. Genomic Analysis and Prediction within a US Public Collaborative Winter Wheat Regional Testing Nursery. THE PLANT GENOME 2018; 11:180012. [PMID: 30512033 DOI: 10.3835/plantgenome2018.02.0012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Indexed: 05/26/2023]
Abstract
The development of inexpensive, whole-genome profiling enables a transition to allele-based breeding using genomic prediction models. These models consider alleles shared between lines to predict phenotypes and select new lines based on estimated breeding values. This approach can leverage highly unbalanced datasets that are common to breeding programs. The Southern Regional Performance Nursery (SRPN) is a public nursery established by the USDA-ARS in 1931 to characterize performance and quality of near-release wheat (Triticum aestivum L.) varieties from breeding programs in the US Central Plains. New entries are submitted annually and can be re-entered only once. The trial is grown at >30 locations each year and lines are evaluated for grain yield, disease resistance, and agronomic traits. Overall genetic gain is measured across years by including common check cultivars for comparison. We have generated whole-genome profiles via genotyping-by-sequencing (GBS) for 939 SRPN entries dating back to 1992 to explore the potential use of the nursery as a genomic selection (GS) training population (TP). The GS prediction models across years (average r = 0.33) outperformed year-to-year phenotypic correlation for yield (r = 0.27) for a majority of the years evaluated, suggesting that genomic selection has the potential to outperform low heritability selection on yield in these highly variable environments. We also examined the predictability of programs using both program-specific and whole-set TPs. Generally, the predictability of a program was similar with both approaches. These results suggest that wheat breeding programs can collaboratively leverage the immense datasets that are generated from regional testing networks.
|
28
|
Stapley J, Feulner PGD, Johnston SE, Santure AW, Smadja CM. Variation in recombination frequency and distribution across eukaryotes: patterns and processes. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0455. [PMID: 29109219 PMCID: PMC5698618 DOI: 10.1098/rstb.2016.0455] [Citation(s) in RCA: 203] [Impact Index Per Article: 33.8] [Accepted: 09/08/2017] [Indexed: 01/04/2023] Open
Abstract
Recombination, the exchange of DNA between maternal and paternal chromosomes during meiosis, is an essential feature of sexual reproduction in nearly all multicellular organisms. While the role of recombination in the evolution of sex has received theoretical and empirical attention, less is known about how recombination rate itself evolves and what influence this has on evolutionary processes within sexually reproducing organisms. Here, we explore the patterns of, and processes governing, recombination in eukaryotes. We summarize patterns of variation, integrating current knowledge with an analysis of linkage map data in 353 organisms. We then discuss proximate and ultimate processes governing recombination rate variation and consider how these influence evolutionary processes. Genome-wide recombination rates (cM/Mb) can vary more than tenfold across eukaryotes, and there is large variation in the distribution of recombination events across closely related taxa, populations and individuals. We discuss how variation in rate and distribution relates to genome architecture, genetic and epigenetic mechanisms, sex, environmental perturbations and variable selective pressures. There has been great progress in determining the molecular mechanisms governing recombination, and with the continued development of new modelling and empirical approaches, there is now also great opportunity to further our understanding of how and why recombination rate varies. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Jessica Stapley
- Centre for Adaptation to a Changing Environment, IBZ, ETH Zürich, 8092 Zürich, Switzerland
- Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, 6047 Kastanienbaum, Switzerland; Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
- Susan E Johnston
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JY, UK
- Anna W Santure
- School of Biological Sciences, University of Auckland, Auckland 1142, New Zealand
- Carole M Smadja
- Institut des Sciences de l'Evolution UMR 5554, CNRS, IRD, EPHE, Université de Montpellier, 3095 Montpellier cedex 05, France
|
29
|
Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations. G3-GENES GENOMES GENETICS 2018; 8:1755-1769. [PMID: 29588382 PMCID: PMC5940166 DOI: 10.1534/g3.117.300512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
Abstract
Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between-species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis shows that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution.
|
30
|
Mazumdar P, Binti Othman R, Mebus K, Ramakrishnan N, Ann Harikrishna J. Codon usage and codon pair patterns in non-grass monocot genomes. ANNALS OF BOTANY 2017; 120:893-909. [PMID: 29155926 PMCID: PMC5710610 DOI: 10.1093/aob/mcx112] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Received: 01/17/2017] [Accepted: 09/19/2017] [Indexed: 05/19/2023]
Abstract
BACKGROUND AND AIMS Studies on codon usage in monocots have focused on grasses, and observed patterns of this taxon were generalized to all monocot species. Here, non-grass monocot species were analysed to investigate the differences between grass and non-grass monocots. METHODS First, studies of codon usage in monocots were reviewed. The current information was then extended regarding codon usage, as well as codon-pair context bias, using four completely sequenced non-grass monocot genomes (Musa acuminata, Musa balbisiana, Phoenix dactylifera and Spirodela polyrhiza) for which comparable transcriptome datasets are available. Measurements were taken regarding relative synonymous codon usage, effective number of codons, derived optimal codon and GC content and then the relationships investigated to infer the underlying evolutionary forces. KEY RESULTS The research identified optimal codons, rare codons and preferred codon-pair context in the non-grass monocot species studied. In contrast to the bimodal distribution of GC3 (GC content in third codon position) in grasses, non-grass monocots showed a unimodal distribution. Disproportionate use of G and C (and of A and T) in two- and four-codon amino acids detected in the analysis rules out the mutational bias hypothesis as an explanation of genomic variation in GC content. There was found to be a positive relationship between CAI (codon adaptation index; predicts the level of expression of a gene) and GC3. In addition, a strong correlation was observed between coding and genomic GC content and negative correlation of GC3 with gene length, indicating a strong impact of GC-biased gene conversion (gBGC) in shaping codon usage and nucleotide composition in non-grass monocots. CONCLUSION Optimal codons in these non-grass monocots show a preference for G/C in the third codon position. These results support the concept that codon usage and nucleotide composition in non-grass monocots are mainly driven by gBGC.
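Relative synonymous codon usage (RSCU), one of the measurements named in this abstract, is conventionally defined as a codon's observed count divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below illustrates that standard definition in Python; the function name `rscu`, the toy sequence, and the use of the standard genetic code are assumptions for illustration, not the authors' pipeline.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def rscu(cds):
    """Return {codon: RSCU} for every sense codon whose amino acid occurs in cds."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group sense codons into synonymous families by amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            # observed count / (total / number of synonymous codons)
            values[c] = counts[c] * len(family) / total
    return values


# Hypothetical toy coding sequence: Met-Ala-Ala-Ala-Lys-stop.
example = rscu("ATGGCCGCTGCCAAATAA")
print(round(example["GCC"], 3), example["AAA"])
```

An RSCU above 1 means the codon is used more often than expected under equal usage; in the toy sequence, GCC accounts for two of three alanine codons in a four-codon family, so its RSCU is 8/3.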
Affiliation(s)
- Purabi Mazumdar
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- RofinaYasmin Binti Othman
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
- Katharina Mebus
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- N Ramakrishnan
- Electrical and Computer System Engineering, School of Engineering, Monash University Malaysia, Bandar Sunway, Malaysia
- Jennifer Ann Harikrishna
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
|
31
|
Niu Z, Xue Q, Wang H, Xie X, Zhu S, Liu W, Ding X. Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus. Int J Mol Sci 2017; 18:E2307. [PMID: 29099062 PMCID: PMC5713276 DOI: 10.3390/ijms18112307] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Received: 08/29/2017] [Revised: 09/27/2017] [Accepted: 10/20/2017] [Indexed: 01/03/2023] Open
Abstract
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences. Dendrobium is one of the largest genera in the orchid family. Evolutionary studies of the plastomic organization and base composition are limited in this genus. In this study, we obtained the high-quality plastome sequences of D. loddigesii and D. devonianum. The comparison results showed a nearly identical organization in Dendrobium plastomes, indicating that the plastomic organization is highly conserved in the Dendrobium genus. Furthermore, the impact of three evolutionary forces (selection, mutational biases, and GC-biased gene conversion, gBGC) on the variation of GC content in Dendrobium plastomes was evaluated. Our results revealed: (1) consistent GC content evolution trends and mutational biases in single-copy (SC) and inverted repeats (IRs) regions; and (2) that gBGC has influenced the plastome-wide GC content evolution. These results suggest that both mutational biases and gBGC affect GC content in the plastomes of the Dendrobium genus.
Affiliation(s)
- Zhitao Niu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Qingyun Xue
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Hui Wang
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Xuezhu Xie
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Shuying Zhu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Wei Liu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Xiaoyu Ding
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
|
32
|
Evolutionary forces affecting synonymous variations in plant genomes. PLoS Genet 2017; 13:e1006799. [PMID: 28531201 PMCID: PMC5460877 DOI: 10.1371/journal.pgen.1006799] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 12/08/2016] [Revised: 06/06/2017] [Accepted: 05/04/2017] [Indexed: 01/04/2023] Open
Abstract
Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow joint estimation of the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones.
In protein-coding genes, base composition varies strongly within and among plant genomes, especially at positions where changes do not alter the encoded protein (synonymous variations). Some species, such as the model plant Arabidopsis thaliana, are relatively GC-poor and homogeneous while others, such as grasses, are highly heterogeneous and GC-rich. The causes of these variations are still debated: are they mainly due to selective or neutral processes? Answering this question is important to correctly infer whether variations in base composition may have functional roles or not. We extended a population genetics method to jointly estimate the different forces that may affect synonymous variations and applied it to genomic datasets in 11 flowering plant species. We found that GC-biased gene conversion, a neutral process associated with recombination that mimics selection by favouring G and C bases, is a widespread and stronger process than selection and that it could explain the large variation in base composition observed in plant genomes. Our results bear implications for analysing plant genomes and for correctly interpreting what could be functional or not.
|
33
|
Wang J, Yu Y, Tao F, Zhang J, Copetti D, Kudrna D, Talag J, Lee S, Wing RA, Fan C. DNA methylation changes facilitated evolution of genes derived from Mutator-like transposable elements. Genome Biol 2016; 17:92. [PMID: 27154274 PMCID: PMC4858842 DOI: 10.1186/s13059-016-0954-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 10/01/2015] [Accepted: 04/14/2016] [Indexed: 01/17/2023] Open
Abstract
Background Mutator-like transposable elements, a class of DNA transposons, exist pervasively in both prokaryotic and eukaryotic genomes, with more than 10,000 copies identified in the rice genome. These elements can capture ectopic genomic sequences that lead to the formation of new gene structures. Here, based on whole-genome comparative analyses, we comprehensively investigated processes and mechanisms of the evolution of putative genes derived from Mutator-like transposable elements in ten Oryza species and the outgroup Leersia perieri, bridging ~20 million years of evolutionary history. Results Our analysis identified thousands of putative genes in each of the Oryza species, a large proportion of which have evidence of expression and contain chimeric structures. Consistent with previous reports, we observe that the putative Mutator-like transposable element-derived genes are generally GC-rich and mainly derive from GC-rich parental sequences. Furthermore, we determine that Mutator-like transposable elements capture parental sequences preferentially from genomic regions with low methylation levels and high recombination rates. We explicitly show that methylation levels in the internal and terminated inverted repeat regions of these elements, which might be directed by the 24-nucleotide small RNA-mediated pathway, are different and change dynamically over evolutionary time. Lastly, we demonstrate that putative genes derived from Mutator-like transposable elements tend to be expressed in mature pollen, which have undergone de-methylation programming, thereby providing a permissive expression environment for newly formed/transposable element-derived genes. Conclusions Our results suggest that DNA methylation may be a primary mechanism to facilitate the origination, survival, and regulation of genes derived from Mutator-like transposable elements, thus contributing to the evolution of gene innovation and novelty in plant genomes. 
Affiliation(s)
- Jun Wang
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
- Yeisoo Yu
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Feng Tao
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
- Jianwei Zhang
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Dario Copetti
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Dave Kudrna
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Jayson Talag
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Seunghee Lee
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Rod A Wing
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA; T.T. Chang Genetics Resources Center, International Rice Research Institute, Los Baños, Laguna, 4031, Philippines
- Chuanzhu Fan
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA.
|
34
|
McKain MR, Tang H, McNeal JR, Ayyampalayam S, Davis JI, dePamphilis CW, Givnish TJ, Pires JC, Stevenson DW, Leebens-Mack JH. A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution across the Poales. Genome Biol Evol 2016; 8:1150-64. [PMID: 26988252 PMCID: PMC4860692 DOI: 10.1093/gbe/evw060] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Indexed: 12/17/2022] Open
Abstract
Comparisons of flowering plant genomes reveal multiple rounds of ancient polyploidy characterized by large intragenomic syntenic blocks. Three such whole-genome duplication (WGD) events, designated as rho (ρ), sigma (σ), and tau (τ), have been identified in the genomes of cereal grasses. Precise dating of these WGD events is necessary to investigate how they have influenced diversification rates, evolutionary innovations, and genomic characteristics such as the GC profile of protein-coding sequences. The timing of these events has remained uncertain due to the paucity of monocot genome sequence data outside the grass family (Poaceae). Phylogenomic analysis of protein-coding genes from sequenced genomes and transcriptome assemblies from 35 species, including representatives of all families within the Poales, has resolved the timing of rho and sigma relative to speciation events and placed tau prior to divergence of Asparagales and the commelinids but after divergence with eudicots. Examination of gene family phylogenies indicates that rho occurred just prior to the diversification of Poaceae and sigma occurred before early diversification of Poales lineages but after the Poales-commelinid split. Additional lineage-specific WGD events were identified on the basis of the transcriptome data. Gene families exhibiting high GC content are underrepresented among those with duplicate genes that persisted following these genome duplications. However, genome duplications had little overall influence on lineage-specific changes in the GC content of coding genes. Improved resolution of the timing of WGD events in monocot history provides evidence for the influence of polyploidization on functional evolution and species diversification.
Affiliation(s)
- Michael R McKain
- Donald Danforth Plant Science Center, St. Louis, Missouri; Department of Plant Biology, University of Georgia
- Haibao Tang
- Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, Fujian Province, China; School of Plant Sciences, iPlant Collaborative, University of Arizona
- Joel R McNeal
- Department of Ecology, Evolution, and Organismal Biology, Kennesaw State University; Department of Plant Biology, University of Georgia
- Jerrold I Davis
- L. H. Bailey Hortorium and Department of Plant Biology, Cornell University
- Claude W dePamphilis
- Department of Biology and Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, Pennsylvania
- J Chris Pires
- Division of Biological Sciences, University of Missouri, Columbia
|
35
|
Dia N, Lavie L, Faye N, Méténier G, Yeramian E, Duroure C, Toguebaye BS, Frutos R, Niang MN, Vivarès CP, Ben Mamoun C, Cornillot E. Subtelomere organization in the genome of the microsporidian Encephalitozoon cuniculi: patterns of repeated sequences and physicochemical signatures. BMC Genomics 2016; 17:34. [PMID: 26744270 PMCID: PMC4704409 DOI: 10.1186/s12864-015-1920-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 01/28/2015] [Accepted: 09/11/2015] [Indexed: 12/23/2022] Open
Abstract
Background The microsporidian Encephalitozoon cuniculi is an obligate intracellular eukaryotic pathogen with a small nuclear genome (2.9 Mbp) consisting of 11 chromosomes. Although each chromosome end is known to contain a single rDNA unit, the incomplete assembly of subtelomeric regions following sequencing of the genome identified only 3 of the 22 expected rDNA units. While chromosome end assembly remains a difficult process in most eukaryotic genomes, it is of significant importance for pathogens because these regions encode factors important for virulence and host evasion. Results Here we report the first complete assembly of E. cuniculi chromosome ends, and describe a novel mosaic structure of segmental duplications (EXT repeats) in these regions. EXT repeats range in size between 3.5 and 23.8 kbp and contain four multigene families encoding membrane-associated proteins. Twenty-one recombination sites were identified in the sub-terminal region of E. cuniculi chromosomes. Our analysis suggests that these sites contribute to the diversity of chromosome end organization through double-strand break repair mechanisms. The region containing EXT repeats at chromosome extremities can be differentiated based on gene composition, GC content, recombination site density and chromosome landscape. Conclusion Together this study provides the complete structure of the chromosome ends of E. cuniculi GB-M1, and identifies important factors, which could play a major role in parasite diversity and host-parasite interactions. Comparison with other eukaryotic genomes suggests that terminal regions could be distinguished precisely based on gene content, genetic instability and base composition bias.
The diversity of processes associated with chromosome extremities and their biological consequences, as presented in this study, emphasizes that great effort will be necessary in the future to characterize these regions more carefully during whole-genome sequencing efforts.
Affiliation(s)
- Ndongo Dia
- Unité de Virologie Médicale, Institut Pasteur de Dakar, 36 Avenue Pasteur, B.P. 220, Dakar, Sénégal.
- Laurence Lavie
- Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes, Génome et Environnement, UMR 6023, CNRS, 63177, Aubière, France.
- Ngor Faye
- Laboratoire de Parasitologie Générale, Département de Biologie Animale, Faculté des Sciences et Technologies, Université Cheikh Anta Diop, Dakar, Sénégal.
- Guy Méténier
- Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes, Génome et Environnement, UMR 6023, CNRS, 63177, Aubière, France.
- Edouard Yeramian
- Unité de Bioinformatique Structurale, UMR 3528 CNRS, Institut Pasteur, 25-28, rue du Dr Roux, 75015, Paris, France.
- Christophe Duroure
- Laboratoire de Météorologie Physique, OPGC UMR 6016 CNRS-Université Blaise Pascal, 24 Avenue des Landais, 63177, Aubière Cedex, France.
- Bhen S Toguebaye
- Laboratoire de Parasitologie Générale, Département de Biologie Animale, Faculté des Sciences et Technologies, Université Cheikh Anta Diop, Dakar, Sénégal.
- Roger Frutos
- CIRAD, UMR 17, Cirad-Ird, TA-A17/G, Campus International de Baillarguet, 34398, Montpellier, France.
- Mbayame N Niang
- Unité de Virologie Médicale, Institut Pasteur de Dakar, 36 Avenue Pasteur, B.P. 220, Dakar, Sénégal.
- Christian P Vivarès
- Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes, Génome et Environnement, UMR 6023, CNRS, 63177, Aubière, France.
- Choukri Ben Mamoun
- Section of Infectious Disease and Department of Microbial Pathogenesis, Winchester Building WWW403D, Yale School of Medicine, 15 York St., New Haven, CT, 06520, USA.
- Emmanuel Cornillot
- Institut de Recherche en Cancérologie de Montpellier, IRCM - INSERM U1194 & Université de Montpellier & ICM, Institut régional du Cancer Montpellier, Campus Val d'Aurelle, 34298, Montpellier cedex 5, France; Institut de Biologie Computationnelle, IBC, Campus Saint Priest, 34090, Montpellier, France.
|
36
|
Sundararajan A, Dukowic-Schulze S, Kwicklis M, Engstrom K, Garcia N, Oviedo OJ, Ramaraj T, Gonzales MD, He Y, Wang M, Sun Q, Pillardy J, Kianian SF, Pawlowski WP, Chen C, Mudge J. Gene Evolutionary Trajectories and GC Patterns Driven by Recombination in Zea mays. FRONTIERS IN PLANT SCIENCE 2016; 7:1433. [PMID: 27713757 PMCID: PMC5031598 DOI: 10.3389/fpls.2016.01433] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 07/20/2016] [Accepted: 09/08/2016] [Indexed: 05/20/2023]
Abstract
Recombination occurring during meiosis is critical for creating genetic variation and plays an essential role in plant evolution. In addition to creating novel gene combinations, recombination can affect genome structure through altering GC patterns. In maize (Zea mays) and other grasses, another intriguing GC pattern exists. Maize genes show a bimodal GC content distribution that has been attributed to nucleotide bias in the third, or wobble, position of the codon. Recombination may be an underlying driving force given that recombination sites are often associated with high GC content. Here we explore the relationship between recombination and genomic GC patterns by comparing genic GC content at each of the three codon positions (GC1, GC2, and GC3, collectively termed GCx) to instances of a variable GC-rich motif that underlies double strand break (DSB) hotspots and to meiocyte-specific gene expression. Surprisingly, GCx bimodality in maize cannot be fully explained by the codon wobble hypothesis. High GCx genes show a strong overlap with the DSB hotspot motif, possibly providing a mechanism for the high evolutionary rates seen in these genes. On the other hand, genes that are turned on in meiosis (early prophase I) are biased against both high GCx genes and genes with the DSB hotspot motif, possibly allowing important meiotic genes to avoid DSBs. Our data suggest a strong link between the GC-rich motif underlying DSB hotspots and high GCx genes.
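As an illustration of the GCx quantities named in this abstract, the following is a minimal sketch of how GC1, GC2, and GC3 could be computed for a single in-frame coding sequence. The function name `gc_by_codon_position` and the example sequence are hypothetical and are not taken from the study; the authors' actual pipeline is not described here.

```python
def gc_by_codon_position(cds: str) -> dict[str, float]:
    """Return the GC fraction at codon positions 1, 2 and 3 of an in-frame CDS.

    Hypothetical helper for illustration; assumes the sequence starts in frame.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    result = {}
    for pos in range(3):
        # Every third base starting at this codon position
        bases = seq[pos:usable:3]
        # Count only unambiguous bases so that N's do not dilute the fraction
        called = [b for b in bases if b in "ACGT"]
        gc = sum(b in "GC" for b in called)
        result[f"GC{pos + 1}"] = gc / len(called) if called else float("nan")
    return result


# Made-up four-codon sequence: ATG GCC GCG TTC
example = gc_by_codon_position("ATGGCCGCGTTC")
print(example)
```

Applying such a function to every gene in an annotation and plotting a histogram of the GC3 values is one simple way to see whether the distribution is unimodal or bimodal.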
Affiliation(s)
| | | | | | | | - Nathan Garcia
- National Center for Genome Resources, Santa Fe, NM, USA
| | | | | | | | - Yan He
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| | - Minghui Wang
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
| | - Qi Sun
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
| | - Jaroslaw Pillardy
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
| | - Shahryar F. Kianian
- Cereal Disease Laboratory, United States Department of Agriculture – Agricultural Research Service, St. Paul, MN, USA
| | - Wojciech P. Pawlowski
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| | - Changbin Chen
- Department of Horticultural Science, University of Minnesota, St. Paul, MN, USA
| | - Joann Mudge
- National Center for Genome Resources, Santa Fe, NM, USA
- *Correspondence: Joann Mudge,
|
37
|
Chen F, Zhu Z, Zhou X, Yan Y, Dong Z, Cui D. High-Throughput Sequencing Reveals Single Nucleotide Variants in Longer-Kernel Bread Wheat. FRONTIERS IN PLANT SCIENCE 2016; 7:1193. [PMID: 27551288 PMCID: PMC4976665 DOI: 10.3389/fpls.2016.01193] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 02/24/2016] [Accepted: 07/25/2016] [Indexed: 05/09/2023]
Abstract
The transcriptomes of bread wheat Yunong 201 and its ethyl methanesulfonate derivative Yunong 3114 were obtained by next-generation sequencing technology. Single nucleotide variants (SNVs) in the wheat strains were explored and compared. A total of 5907 and 6287 non-synonymous SNVs were acquired for Yunong 201 and 3114, respectively. A total of 4021 genes with SNVs were obtained. The genes carrying non-synonymous SNVs were significantly involved in ATP binding, protein phosphorylation, and cellular protein metabolic process. The heat map analysis also indicated that most of these mutant genes were significantly differentially expressed at different developmental stages. The SNVs in these genes possibly contribute to the longer kernel length of Yunong 3114. Our data provide useful information on the wheat transcriptome for future studies on wheat functional genomics. This study could also help in illustrating the gene functions of the non-synonymous SNVs of Yunong 201 and 3114.
|
38
|
Bolívar P, Mugal CF, Nater A, Ellegren H. Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill-Robertson Interference, in an Avian System. Mol Biol Evol 2015; 33:216-27. [PMID: 26446902 PMCID: PMC4693978 DOI: 10.1093/molbev/msv214] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Indexed: 12/11/2022] Open
Abstract
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
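The four substitution classes used in this abstract can be made concrete with a short sketch. It assumes that each substitution is given as a polarized (ancestral, derived) base pair; the names `classify_substitution` and `count_classes` are hypothetical and the code is not the model used by the authors, which estimates rates rather than counting events.

```python
from collections import Counter

STRONG = {"G", "C"}  # "strong" (S) bases: three hydrogen bonds
WEAK = {"A", "T"}    # "weak" (W) bases: two hydrogen bonds


def classify_substitution(ancestral: str, derived: str) -> str:
    """Label a single-base substitution as S-to-S, S-to-W, W-to-S or W-to-W."""
    a, d = ancestral.upper(), derived.upper()
    valid = STRONG | WEAK
    if a not in valid or d not in valid or a == d:
        raise ValueError(f"not a valid substitution: {ancestral!r} -> {derived!r}")
    source = "S" if a in STRONG else "W"
    target = "S" if d in STRONG else "W"
    return f"{source}-to-{target}"


def count_classes(substitutions):
    """Tally substitution classes over an iterable of (ancestral, derived) pairs."""
    return Counter(classify_substitution(a, d) for a, d in substitutions)


# Made-up example data, for illustration only
counts = count_classes([("A", "G"), ("T", "C"), ("C", "T"), ("G", "C"), ("A", "T")])
print(counts)
```

Under gBGC, the W-to-S class is expected to be favored at fixation in regions of high recombination, which is why the abstract treats the classes separately.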
Affiliation(s)
- Paulina Bolívar
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Carina F Mugal
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Nater
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
|
39
|
The Impact of Recombination Hotspots on Genome Evolution of a Fungal Plant Pathogen. Genetics 2015; 201:1213-28. [PMID: 26392286 DOI: 10.1534/genetics.115.180968] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Received: 07/29/2015] [Accepted: 09/17/2015] [Indexed: 12/30/2022] Open
Abstract
Recombination has an impact on genome evolution by maintaining chromosomal integrity, affecting the efficacy of selection, and increasing genetic variability in populations. Recombination rates are a key determinant of the coevolutionary dynamics between hosts and their pathogens. Historic recombination events created devastating new pathogens, but the impact of ongoing recombination in sexual pathogens is poorly understood. Many fungal pathogens of plants undergo regular sexual cycles, and sex is considered to be a major factor contributing to virulence. We generated a recombination map at kilobase-scale resolution for the haploid plant pathogenic fungus Zymoseptoria tritici. To account for intraspecific variation in recombination rates, we constructed genetic maps from two independent crosses. We localized a total of 10,287 crossover events in 441 progeny and found that recombination rates were highly heterogeneous within and among chromosomes. Recombination rates on large chromosomes were inversely correlated with chromosome length. Short accessory chromosomes often lacked evidence for crossovers between parental chromosomes. Recombination was concentrated in narrow hotspots that were preferentially located close to telomeres. Hotspots were only partially conserved between the two crosses, suggesting that hotspots are short-lived and may vary according to genomic background. Genes located in hotspot regions were enriched in genes encoding secreted proteins. Population resequencing showed that chromosomal regions with high recombination rates were strongly correlated with regions of low linkage disequilibrium. Hence, genes in pathogen recombination hotspots are likely to evolve faster in natural populations and may represent a greater threat to the host.
|
40
|
The relationship of recombination rate, genome structure, and patterns of molecular evolution across angiosperms. BMC Evol Biol 2015; 15:194. [PMID: 26377000 PMCID: PMC4574184 DOI: 10.1186/s12862-015-0473-3] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Received: 02/19/2015] [Accepted: 09/01/2015] [Indexed: 12/31/2022] Open
Abstract
Background Although homologous recombination affects the efficacy of selection in populations, the pattern of recombination rate evolution and its effects on genome evolution across plants are largely unknown. Recombination can reduce genome size by enabling the removal of LTR retrotransposons, alter codon usage by GC biased gene conversion, contribute to complex histories of gene duplication and loss through tandem duplication, and enhance purifying selection on genes. Therefore, variation in recombination rate across species may explain some of the variation in genomic architecture as well as rates of molecular evolution. We used phylogenetic comparative methods to investigate the evolution of global meiotic recombination rate in angiosperms and its effects on genome architecture and selection at the molecular level using genetic maps and genome sequences from thirty angiosperm species. Results Recombination rate is negatively correlated with genome size, which is likely caused by the removal of LTR retrotransposons. After correcting recombination rates for euchromatin content, we also found an association between global recombination rate and average gene family size. This suggests a role for recombination in the preservation of duplicate genes or expansion of gene families. An analysis of the correlation between the ratio of nonsynonymous to synonymous substitution rates (dN/dS) and recombination rate in 3748 genes indicates that higher recombination rates are associated with an increased efficacy of purifying selection, suggesting that global recombination rates affect variation in rates of molecular evolution across distantly related angiosperm species, not just between populations. We also identified shifts in dN/dS for recombination proteins that are associated with shifts in global recombination rate across our sample of angiosperms. 
Conclusions Although our analyses only reveal correlations, not mechanisms, and do not include potential covariates of recombination rate, like effective population size, they suggest that global recombination rates may play an important role in shaping the macroevolutionary patterns of gene and genome evolution in plants. Interspecific recombination rate variation is tightly correlated with genome size as well as variation in overall LTR retrotransposon abundances. Recombination may shape gene-to-gene variation in dN/dS between species, which might impact the overall gene duplication and loss rates. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0473-3) contains supplementary material, which is available to authorized users.
|
41
|
Si W, Yuan Y, Huang J, Zhang X, Zhang Y, Zhang Y, Tian D, Wang C, Yang Y, Yang S. Widely distributed hot and cold spots in meiotic recombination as shown by the sequencing of rice F2 plants. THE NEW PHYTOLOGIST 2015; 206:1491-502. [PMID: 25664766 DOI: 10.1111/nph.13319] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Received: 12/02/2014] [Accepted: 12/29/2014] [Indexed: 05/02/2023]
Abstract
Numerous studies have argued that environmental variations may contribute to evolution through the generation of novel heritable variations via meiotic recombination, which plays a crucial role in crop domestication and improvement. Rice is one of the most important staple crops, but no direct estimate of recombination events has yet been made at a fine scale. Here, we address this limitation by sequencing 41 rice individuals with high sequencing coverage and c. 900,000 accurate markers. An average of 33.9 crossover (c. 4.53 cM Mb⁻¹) and 2.47 non-crossover events were detected per F2 plant, which is similar to the values in Arabidopsis. Although not all samples in the stress treatment group showed an increased number of crossover events, environmental stress increased the recombination rate in c. 28.5% of samples. Interestingly, the crossovers showed a highly uneven distribution among and along chromosomes, with c. 13.9% of the entire genome devoid of crossovers, including 11 of the 12 centromere regions, and c. 0.72% of the genome containing large numbers of crossovers (> 50 cM Mb⁻¹). The gene ontology (GO) categories showed that genes clustered within the recombination hot spot regions primarily tended to be involved in responses to environmental stimuli, suggesting that recombination plays an important role for adaptive evolution in rapidly changing environments.
Affiliation(s)
- Weina Si
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Yang Yuan
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Ju Huang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Xiaohui Zhang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Yanchun Zhang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Yadong Zhang
- Institute of Food Crops, Jiangsu Academy of Agricultural Science, Nanjing, 210014, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Cailin Wang
- Institute of Food Crops, Jiangsu Academy of Agricultural Science, Nanjing, 210014, China
| | - Yonghua Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210093, China
|
42
|
Glémin S, Arndt PF, Messer PW, Petrov D, Galtier N, Duret L. Quantification of GC-biased gene conversion in the human genome. Genome Res 2015; 25:1215-28. [PMID: 25995268 PMCID: PMC4510005 DOI: 10.1101/gr.185488.114] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Received: 10/08/2014] [Accepted: 05/18/2015] [Indexed: 11/25/2022]
Abstract
Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC.
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France; Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Peter F Arndt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Philipp W Messer
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
| | - Dmitri Petrov
- Department of Biology, Stanford University, Stanford, California 94305-5020, USA
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, 69622 Villeurbanne, France
|
43
|
Burgarella C, Gayral P, Ballenghien M, Bernard A, David P, Jarne P, Correa A, Hurtrez-Boussès S, Escobar J, Galtier N, Glémin S. Molecular Evolution of Freshwater Snails with Contrasting Mating Systems. Mol Biol Evol 2015; 32:2403-16. [PMID: 25980005 DOI: 10.1093/molbev/msv121] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Indexed: 12/21/2022] Open
Abstract
Because mating systems affect population genetics and ecology, they are expected to impact the molecular evolution of species. Self-fertilizing species experience reduced effective population size, recombination rates, and heterozygosity, which in turn should decrease the efficacy of natural selection, both adaptive and purifying, and the strength of meiotic drive processes such as GC-biased gene conversion. The empirical evidence is only partly congruent with these predictions: depending on the analyzed species, some, but not all, of the expected effects have been observed. One possible reason is that self-fertilization is an evolutionary dead-end, so that most current selfers recently evolved self-fertilization, and their genome has not yet been strongly impacted by selfing. Here, we investigate the molecular evolution of two groups of freshwater snails in which mating systems have likely been stable for several millions of years. Analyzing coding sequence polymorphism, divergence, and expression levels, we report a strongly reduced genetic diversity, decreased efficacy of purifying selection, slower rate of adaptive evolution, and weakened codon usage bias/GC-biased gene conversion in the selfer Galba compared with the outcrosser Physa, in full agreement with theoretical expectations. Our results demonstrate that self-fertilization, when effective in the long run, is a major driver of population genomic and molecular evolutionary processes. Despite the genomic effects of selfing, Galba truncatula seems to escape the demographic consequences of the genetic load. We suggest that the particular ecology of the species may buffer the negative consequences of selfing, shedding new light on the dead-end hypothesis.
Affiliation(s)
- Concetta Burgarella
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Philippe Gayral
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Marion Ballenghien
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Aurélien Bernard
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | | | | | - Ana Correa
- MIVEGEC (Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution, Contrôle), UMR (UM1-UM2-CNRS 5290-IRD224), IRD, Montpellier, France
| | - Sylvie Hurtrez-Boussès
- MIVEGEC (Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution, Contrôle), UMR (UM1-UM2-CNRS 5290-IRD224), IRD, Montpellier, France
| | - Juan Escobar
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
|
44
|
Wallberg A, Glémin S, Webster MT. Extreme recombination frequencies shape genome variation and evolution in the honeybee, Apis mellifera. PLoS Genet 2015; 11:e1005189. [PMID: 25902173 PMCID: PMC4406589 DOI: 10.1371/journal.pgen.1005189] [Citation(s) in RCA: 85] [Impact Index Per Article: 9.4] [Received: 12/19/2014] [Accepted: 04/01/2015] [Indexed: 01/10/2023] Open
Abstract
Meiotic recombination is a fundamental cellular process, with important consequences for evolution and genome integrity. However, we know little about how recombination rates vary across the genomes of most species and the molecular and evolutionary determinants of this variation. The honeybee, Apis mellifera, has extremely high rates of meiotic recombination, although the evolutionary causes and consequences of this are unclear. Here we use patterns of linkage disequilibrium in whole genome resequencing data from 30 diploid honeybees to construct a fine-scale map of rates of crossing over in the genome. We find that, in contrast to vertebrate genomes, the recombination landscape is not strongly punctate. Crossover rates strongly correlate with levels of genetic variation, but not divergence, which indicates a pervasive impact of selection on the genome. Germ-line methylated genes have reduced crossover rate, which could indicate a role of methylation in suppressing recombination. Controlling for the effects of methylation, we do not infer a strong association between gene expression patterns and recombination. The site frequency spectrum is strongly skewed from neutral expectations in honeybees: rare variants are dominated by AT-biased mutations, whereas GC-biased mutations are found at higher frequencies, indicative of a major influence of GC-biased gene conversion (gBGC), which we infer to generate an allele fixation bias 5–50 times the genomic average estimated in humans. We uncover further evidence that this repair bias specifically affects transitions and favours fixation of CpG sites. Recombination, via gBGC, therefore appears to have profound consequences on genome evolution in honeybees and interferes with the process of natural selection. These findings have important implications for our understanding of the forces driving molecular evolution.
Evolution results from changes in allele frequencies in populations. The main forces that cause such changes are natural selection and random genetic drift. However, an additional process, GC-biased gene conversion (gBGC), associated with meiotic recombination, affects the probability that alleles are passed from one generation to the next. The honeybee, Apis mellifera, has extremely high recombination rates, more than 20 times those observed in humans. However, the reason for this is unknown and the effects of such high recombination rates on evolution are not well understood. Here we use patterns of genetic variation in the genomes of 30 honeybees to infer variation in the rate of recombination across the genome. We find that recombination rates and levels of genetic variation are strongly correlated, which is indicative of a pervasive impact of natural selection on genetic variation. We also infer a major role of DNA methylation in determining recombination rates in genes. Patterns of genetic variation appear to be strongly skewed due to the effects of gBGC, suggesting that recombination generates a bias in transmission of alleles during meiosis. This process seems to be interfering with the efficacy of selection at removing deleterious alleles and favouring beneficial ones. Recombination therefore has a huge impact on genetic variation and evolution in honeybees and appears to play a dominant role in genome evolution.
Affiliation(s)
- Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Sylvain Glémin
- Institut des Sciences de l’Evolution (ISEM—UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), France
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Matthew T. Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
|
45
|
Clément Y, Fustier MA, Nabholz B, Glémin S. The bimodal distribution of genic GC content is ancestral to monocot species. Genome Biol Evol 2014; 7:336-48. [PMID: 25527839 PMCID: PMC4316631 DOI: 10.1093/gbe/evu278] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Indexed: 12/27/2022] Open
Abstract
In grasses such as rice or maize, the distribution of genic GC content is well known to be bimodal. It is mainly driven by GC content at third codon positions (GC3 for short). This feature is thought to be specific to grasses, as closely related species like banana have a unimodal GC3 distribution. GC3 is associated with numerous genomic features, and uncovering the origin of this peculiar distribution will help in understanding the potential roles and consequences of GC3 variations within and between genomes. Until recently, the origin of the peculiar GC3 distribution in grasses has remained unknown. Thanks to the recent publication of several complete genomes and transcriptomes of nongrass monocots, we studied more than 1,000 groups of one-to-one orthologous genes in seven grasses and three outgroup species (banana, palm tree, and yam). Using a maximum likelihood-based method, we reconstructed GC3 at several ancestral nodes. We found that the bimodal GC3 distribution observed in extant grasses is ancestral to both grasses and most monocot species, and that other species studied here have lost this peculiar structure. We also found that GC3 in grass lineages is globally evolving very slowly and that the decreasing GC3 gradient observed from 5′ to 3′ along coding sequences is also conserved and ancestral to monocots. This result strongly challenges the previous views on the specificity of grass genomes and we discuss its implications for the possible causes of the evolution of GC content in monocots.
Affiliation(s)
- Yves Clément
- Montpellier SupAgro, Unité Mixte de Recherche 1334, Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, Montpellier, France Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | | | - Benoit Nabholz
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
|
46
|
The red queen model of recombination hotspots evolution in the light of archaic and modern human genomes. PLoS Genet 2014; 10:e1004790. [PMID: 25393762 PMCID: PMC4230742 DOI: 10.1371/journal.pgen.1004790] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Received: 04/29/2014] [Accepted: 10/01/2014] [Indexed: 12/17/2022] Open
Abstract
Recombination is an essential process in eukaryotes, which increases diversity by disrupting genetic linkage between loci and ensures the proper segregation of chromosomes during meiosis. In the human genome, recombination events are clustered in hotspots, whose location is determined by the PRDM9 protein. There is evidence that the location of hotspots evolves rapidly, as a consequence of changes in the PRDM9 DNA-binding domain. However, the reasons for these changes and the rate at which they occur are not known. In this study, we investigated the evolution of human hotspot loci and of PRDM9 target motifs, both in modern and archaic human lineages (Denisovan), to quantify the dynamics of hotspot turnover during the recent period of human evolution. We show that present-day human hotspots are young: they have been active only during the last 10% of the time since the divergence from chimpanzee, becoming active shortly before the split between Denisovans and modern humans. Surprisingly, however, our analyses indicate that Denisovan recombination hotspots did not overlap with modern human ones, despite sharing similar PRDM9 target motifs. We further show that high-affinity PRDM9 target motifs are subject to a strong self-destructive drive, known as biased gene conversion (BGC), which should lead to the loss of the majority of them in the next 3 MYR. This depletion of PRDM9 genomic targets is expected to decrease fitness, and thereby to favor new PRDM9 alleles binding different motifs. Our refined estimates of the age and life expectancy of human hotspots provide empirical evidence in support of the Red Queen hypothesis of recombination hotspots evolution.
In eukaryotic genomes, recombination plays a central role by ensuring the proper segregation of chromosomes during meiosis and increasing genetic diversity at the population scale. Recombination events are not uniformly distributed along chromosomes, but cluster in narrow regions called hotspots. The absence of overlap between human and chimpanzee hotspots indicates that the location of these hotspots evolves rapidly. However, the reasons for this rapid dynamic are still unknown. To gain insight into the processes driving the evolution of recombination hotspots we analyzed the recent history of human hotspots, using the genome of a closely related archaic hominid, Denisovan. We searched for genomic signatures of past recombination activity and compared them to present-day patterns of recombination in humans. Our results show that human hotspots are younger than previously thought and that they are not conserved in Denisovans. Moreover, we confirm that hotspots are subject to a self-destruction process, due to biased gene conversion. We quantified this process, and showed that its intensity is strong enough to cause the fast turnover of human hotspots.
|
47
|
Evidence for stabilizing selection on codon usage in chromosomal rearrangements of Drosophila pseudoobscura. G3-GENES GENOMES GENETICS 2014; 4:2433-49. [PMID: 25326424 PMCID: PMC4267939 DOI: 10.1534/g3.114.014860] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Indexed: 12/30/2022]
Abstract
There has been a renewed interest in investigating the role of stabilizing selection acting on genome-wide traits such as codon usage bias. Codon bias, when synonymous codons are used at unequal frequencies, occurs in a wide variety of taxa. Standard evolutionary models explain the maintenance of codon bias through a balance of genetic drift, mutation and weak purifying selection. The efficacy of selection is expected to be reduced in regions of suppressed recombination. Contrary to observations in Drosophila melanogaster, some recent studies have failed to detect a relationship between the recombination rate, intensity of selection acting at synonymous sites, and the magnitude of codon bias as predicted under these standard models. Here, we examined codon bias in 2798 protein coding loci on the third chromosome of D. pseudoobscura using whole-genome sequences of 47 individuals, representing five common third chromosome gene arrangements. Fine-scale recombination maps were constructed using more than 1 million segregating sites. As expected, recombination was demonstrated to be significantly suppressed between chromosome arrangements, allowing for a direct examination of the relationship between recombination, selection, and codon bias. As with other Drosophila species, we observe a strong mutational bias away from the most frequently used codons. We find the rate of synonymous and nonsynonymous polymorphism is variable between different amino acids. However, we do not observe a reduction in codon bias or the strength of selection in regions of suppressed recombination as expected. Instead, we find that the interaction between weak stabilizing selection and mutational bias likely plays a role in shaping the composition of synonymous codons across the third chromosome in D. pseudoobscura.
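As an illustration of what "synonymous codons used at unequal frequencies" means in practice, the following is a minimal sketch of relative synonymous codon usage (RSCU), a common descriptive measure of codon bias. It is not the method used in the study above; the function name `rscu` and the input sequence are hypothetical, and the standard genetic code is assumed.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Return RSCU per codon for an in-frame coding sequence (hypothetical helper).

    RSCU = observed codon count / mean count over the codon's synonymous family.
    A value of 1.0 means no bias; values above 1.0 mean the codon is over-used.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TO_AA)

    # Group codons into synonymous families by encoded amino acid.
    families = {}
    for codon, aa in CODON_TO_AA.items():
        families.setdefault(aa, []).append(codon)

    result = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        # Skip stop codons, single-codon amino acids (Met, Trp), and unseen families.
        if aa == "*" or len(family) < 2 or total == 0:
            continue
        for c in family:
            result[c] = counts[c] * len(family) / total
    return result


if __name__ == "__main__":
    # Toy alanine-only sequence: GCT x3, GCC x1.
    print(rscu("GCTGCTGCTGCC"))
```

In the toy input, alanine is encoded three times by GCT and once by GCC, so GCT is used three times as often as expected under uniform usage of the four alanine codons.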
|
48
|
Glémin S, Clément Y, David J, Ressayre A. GC content evolution in coding regions of angiosperm genomes: a unifying hypothesis. Trends Genet 2014; 30:263-70. [PMID: 24916172 DOI: 10.1016/j.tig.2014.05.002] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Received: 03/18/2014] [Revised: 05/09/2014] [Accepted: 05/13/2014] [Indexed: 01/06/2023]
Abstract
In angiosperms (as in other species), GC content varies along and between genes, within a genome, and between genomes of different species, but the reason for this distribution is still an open question. Grass genomes are particularly intriguing because they exhibit a strong bimodal distribution of genic GC content and a sharp 5'-3' decreasing GC content gradient along most genes. Here, we propose a unifying model to explain the main patterns of GC content variation at the gene and genome scale. We argue that GC content patterns could be mainly determined by the interactions between gene structure, recombination patterns, and GC-biased gene conversion. Recent studies on fine-scale recombination maps in angiosperms support this hypothesis and previous results also fit this model. We propose that our model could be used as a null hypothesis to search for additional forces that affect GC content in angiosperms.
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, UMR 5554 CNRS, Université Montpellier 2, F-34095 Montpellier, France
- Yves Clément
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, UMR 5554 CNRS, Université Montpellier 2, F-34095 Montpellier, France; Montpellier SupAgro, Unité Mixte de Recherche 1334 Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34398 Montpellier, France
- Jacques David
- Montpellier SupAgro, Unité Mixte de Recherche 1334 Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34398 Montpellier, France
- Adrienne Ressayre
- INRA, UMR de Génétique Végétale, INRA/CNRS/Univ Paris-Sud/AgroParistech, Ferme du Moulon, F-91190 Gif sur Yvette, France
|
49
|
Nabholz B, Sarah G, Sabot F, Ruiz M, Adam H, Nidelet S, Ghesquière A, Santoni S, David J, Glémin S. Transcriptome population genomics reveals severe bottleneck and domestication cost in the African rice (Oryza glaberrima). Mol Ecol 2014; 23:2210-27. [PMID: 24684265 DOI: 10.1111/mec.12738] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Received: 11/25/2013] [Accepted: 03/19/2014] [Indexed: 12/17/2022]
Abstract
The African cultivated rice (Oryza glaberrima) was domesticated in West Africa 3000 years ago. Although less cultivated than the Asian rice (O. sativa), O. glaberrima landraces often display interesting adaptations to rustic environments (e.g. drought). Here, using RNA-seq technology, we were able to compare more than 12,000 transcripts between 9 O. glaberrima, 10 wild O. barthii and one O. meridionalis individuals. With a synonymous nucleotide diversity πs = 0.0006 per site, O. glaberrima appears to be the least genetically diverse crop grass ever documented. Using approximate Bayesian computation, we estimated that O. glaberrima experienced a severe bottleneck during domestication. This demographic scenario almost fully accounts for the pattern of genetic diversity across the O. glaberrima genome, as we detected very few outlier regions where positive selection may have further impacted genetic diversity. Moreover, the large excess of derived nonsynonymous substitutions that we detected suggests that the O. glaberrima population suffered from the 'cost of domestication'. In addition, we used this genome-scale data set to demonstrate that (i) O. barthii genetic diversity is positively correlated with recombination rate and negatively with gene density, (ii) expression level is negatively correlated with evolutionary constraint, and (iii) one region on chromosome 5 (position 4-6 Mb) exhibits a clear signature of introgression with an as yet unidentified Oryza species. This work represents the first genome-wide survey of African rice genetic diversity and paves the way for further comparison between the African and the Asian rice, notably regarding the genetics underlying domestication traits.
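To make the πs figure concrete, the following is a minimal sketch of how nucleotide diversity per site can be computed as the average number of pairwise differences among aligned sequences. It is not the pipeline used in the study above. The function name `nucleotide_diversity` and the toy alignment are hypothetical, and the sketch assumes the caller has already restricted the alignment to the sites of interest (for πs, synonymous sites only) and that there are no gaps or missing data.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site over all sequence pairs (hypothetical helper).

    Assumes equal-length, gap-free aligned sequences; every column is counted.
    """
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")

    pairs = list(combinations(seqs, 2))
    # Count mismatching columns for each pair, then sum over pairs.
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * length)


if __name__ == "__main__":
    # Toy alignment: three sequences of four sites each.
    print(nucleotide_diversity(["ACGT", "ACGA", "ACGT"]))
```

A value of 0.0006 per site would correspond, under these assumptions, to two randomly chosen sequences differing at roughly 6 out of every 10,000 synonymous sites.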
Affiliation(s)
- Benoit Nabholz
- Institut des Sciences de l'Evolution-Montpellier, UMR CNRS-UM2 5554, University Montpellier II, Montpellier, France; UMR AGAP 1334, Montpellier SupAgro, Montpellier, France
|
50
|
Jacquemin J, Ammiraju JSS, Haberer G, Billheimer DD, Yu Y, Liu LC, Rivera LF, Mayer K, Chen M, Wing RA. Fifteen million years of evolution in the Oryza genus shows extensive gene family expansion. MOLECULAR PLANT 2014; 7:642-56. [PMID: 24214894 DOI: 10.1093/mp/sst149] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Indexed: 05/02/2023]
Abstract
In analyzing gene families in the whole-genome sequences available for O. sativa (AA), O. glaberrima (AA), and O. brachyantha (FF), we observed large size expansions in the AA genomes compared to the FF genome for the super-families F-box and NB-ARC, and five additional families: the Aspartic proteases, BTB/POZ proteins (BTB), Glutaredoxins, Trypsin α-amylase inhibitor proteins, and Zf-Dof proteins. Their evolutionary dynamics were investigated to understand how and why such important size variations are observed between these closely related species. We show that expansions resulted from both amplification, largely by tandem duplications, and contraction by gene losses. For the F-box and NB-ARC gene families, the genes conserved in all species were under strong purifying selection while expanded orthologous genes were under more relaxed purifying selection. In F-box, NB-ARC, and BTB, the expanded groups were enriched in genes with little evidence of expression, in comparison with conserved groups. We also detected 87 loci under positive selection in the expanded groups. These results show that most of the duplicated copies in the expanded groups evolve neutrally after duplication because of functional redundancy but a fraction of these genes were preserved following neofunctionalization. Hence, the lineage-specific expansions observed between Oryza species were partly driven by directional selection.
Affiliation(s)
- Julie Jacquemin
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
|