1
|
Zhang X, Zhang S, Liu Z, Zhao W, Zhang X, Song J, Jia H, Yang W, Ma Y, Wang Y, Xie K, Budahn H, Wang H. Characterization and acceleration of genome shuffling and ploidy reduction in synthetic allopolyploids by genome sequencing and editing. Nucleic Acids Res 2022; 51:198-217. [PMID: 36583364 PMCID: PMC9841408 DOI: 10.1093/nar/gkac1209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 12/04/2022] [Accepted: 12/06/2022] [Indexed: 12/31/2022] Open
Abstract
Polyploidy and the subsequent ploidy reduction and genome shuffling are the major driving forces of genome evolution. Here, we revealed short-term allopolyploid genome evolution by sequencing a synthetic intergeneric hybrid (Raphanobrassica, RRCC). In this allotetraploid, the genome deletion was quick, while rearrangement was slow. The core and high-frequency genes tended to be retained while the specific and low-frequency genes tended to be deleted in the hybrid. The large-fragment deletions were enriched in the heterochromatin region and probably derived from chromosome breaks. The intergeneric translocations were primarily of short fragments dependent on homoeology, indicating a gene conversion origin. To accelerate genome shuffling, we developed an efficient genome editing platform for Raphanobrassica. By editing Fanconi Anemia Complementation Group M (FANCM) genes, homoeologous recombination, chromosome deletion and secondary meiosis with additional ploidy reduction were accelerated. FANCM was shown to be a checkpoint of meiosis and controller of ploidy stability. By simultaneously editing FLIP genes, gene conversion was precisely introduced, and mosaic genes were produced around the target site. This intergeneric hybrid and genome editing platform not only provides models that facilitate experimental evolution research by speeding up genome shuffling and conversion but also accelerates plant breeding by enhancing intergeneric genetic exchange and creating new genes.
Collapse
Affiliation(s)
- Xiaohui Zhang
- To whom correspondence should be addressed. Tel: +86 10 82105947; Fax: +86 10 62174123;
| | | | | | - Wei Zhao
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiaoxue Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Jiangping Song
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Huixia Jia
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Wenlong Yang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yang Ma
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yang Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs; Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Kabin Xie
- National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan); College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Holger Budahn
- Institute for Breeding Research on Horticultural Crops, Julius-Kuehn-Institute, Federal Research Centre for Cultivated Plants, D-06484 Quedlinburg, Germany
| | - Haiping Wang
- Correspondence may also be addressed to Haiping Wang. Tel: +86 10 82105942; Fax: +86 10 62174123;
| |
Collapse
|
2
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
3
|
Cossu RM, Casola C, Giacomello S, Vidalis A, Scofield DG, Zuccolo A. LTR Retrotransposons Show Low Levels of Unequal Recombination and High Rates of Intraelement Gene Conversion in Large Plant Genomes. Genome Biol Evol 2018; 9:3449-3462. [PMID: 29228262 PMCID: PMC5751070 DOI: 10.1093/gbe/evx260] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2017] [Indexed: 12/29/2022] Open
Abstract
The accumulation and removal of transposable elements (TEs) is a major driver of genome size evolution in eukaryotes. In plants, long terminal repeat (LTR) retrotransposons (LTR-RTs) represent the majority of TEs and form most of the nuclear DNA in large genomes. Unequal recombination (UR) between LTRs leads to removal of intervening sequence and formation of solo-LTRs. UR is a major mechanism of LTR-RT removal in many angiosperms, but our understanding of LTR-RT-associated recombination within the large, LTR-RT-rich genomes of conifers is quite limited. We employ a novel read-based methodology to estimate the relative rates of LTR-RT-associated UR within the genomes of four conifer and seven angiosperm species. We found the lowest rates of UR in the largest genomes studied, conifers and the angiosperm maize. Recombination may also resolve as gene conversion, which does not remove sequence, so we analyzed LTR-RT-associated gene conversion events (GCEs) in Norway spruce and six angiosperms. Opposite the trend for UR, we found the highest rates of GCEs in Norway spruce and maize. Unlike previous work in angiosperms, we found no evidence that rates of UR correlate with retroelement structural features in the conifers, suggesting that another process is suppressing UR in these species. Recent results from diverse eukaryotes indicate that heterochromatin affects the resolution of recombination, by favoring gene conversion over crossing-over, similar to our observation of opposed rates of UR and GCEs. Control of LTR-RT proliferation via formation of heterochromatin would be a likely step toward large genomes in eukaryotes carrying high LTR-RT content.
Collapse
Affiliation(s)
- Rosa Maria Cossu
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.,Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia (IIT), Genova, Italy
| | - Claudio Casola
- Department of Ecosystem Science and Management, Texas A&M University
| | - Stefania Giacomello
- Science for Life Laboratory, School of Biotechnology, Royal Institute of Technology, Solna, Sweden.,Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
| | - Amaryllis Vidalis
- Department of Ecology and Environmental Science, Umeå University, Sweden.,Section of Population Epigenetics and Epigenomics, Center of Life and Food Sciences Weihenstephan, Technische Universität München, Freising, Germany
| | - Douglas G Scofield
- Department of Ecology and Environmental Science, Umeå University, Sweden.,Department of Ecology and Genetics: Evolutionary Biology, Uppsala University, Sweden.,Uppsala Multidisciplinary Center for Advanced Computational Science, Uppsala University, Sweden
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.,Istituto di Genomica Applicata, Udine, Italy
| |
Collapse
|
4
|
Zhang Y, Tang D, Lin X, Ding M, Tong Z. Genome-wide identification of MADS-box family genes in moso bamboo (Phyllostachys edulis) and a functional analysis of PeMADS5 in flowering. BMC PLANT BIOLOGY 2018; 18:176. [PMID: 30176795 PMCID: PMC6122543 DOI: 10.1186/s12870-018-1394-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 08/27/2018] [Indexed: 05/18/2023]
Abstract
BACKGROUND MADS-box genes encode a large family of transcription factors that play significant roles in plant growth and development. Bamboo is an important non-timber forest product worldwide, but previous studies on the moso bamboo (Phyllostachys edulis) MADS-box gene family were not accurate nor sufficiently detailed. RESULTS Here, a complete genome-wide identification and characterization of the MADS-box genes in moso bamboo was conducted. There was an unusual lack of type-I MADS-box genes in the bamboo genome database ( http://202.127.18.221/bamboo/index.php ), and some of the PeMADS sequences are fragmented and/or inaccurate. We performed several bioinformatics techniques to obtain more precise sequences using transcriptome assembly. In total, 42 MADS-box genes, including six new type-I MADS-box genes, were identified in bamboo, and their structures, phylogenetic relationships, predicted conserved motifs and promoter cis-elements were systematically investigated. An expression analysis of the bamboo MADS-box genes in floral organs and leaves revealed that several key members are involved in bamboo inflorescence development, like their orthologous genes in Oryza. The ectopic overexpression of one MADS-box gene, PeMADS5, in Arabidopsis triggered an earlier flowering time and the development of an aberrant flower phenotype, suggesting that PeMADS5 acts as a floral activator and is involved in bamboo flowering. CONCLUSION We produced the most comprehensive information on MADS-box genes in moso bamboo. Additionally, a critical PeMADS gene (PeMADS5) responsible for the transition from vegetative to reproductive growth was identified and shown to be related to bamboo floral development.
Collapse
Affiliation(s)
- Yuting Zhang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, Zhejiang China
| | - Dingqin Tang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, Zhejiang China
| | - Xinchun Lin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, Zhejiang China
| | - Mingquan Ding
- The Key Laboratory for Quality Improvement of Agricultural Products of Zhejiang Province, College of Agriculture and Food Science, Zhejiang A & F University, Lin’an, Zhejiang China
| | - Zaikang Tong
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, Zhejiang China
| |
Collapse
|
5
|
Lee J, Kim JW, Lee IY, Ahn JH. Comparison of the complete genomes of two Echinochloa species: barnyard grass and jungle rice. Mitochondrial DNA B Resour 2017; 2:512-513. [PMID: 33473880 PMCID: PMC7800638 DOI: 10.1080/23802359.2017.1361355] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
We sequenced and compared the complete chloroplast (cp) genomes of the two Echinochloa species; E. crus-galli (KT983256) and E. colona (KT983255). The size of complete chloroplast genomes of the two species are 139,851 and 139,592 base pairs in length, respectively. They include a pair of inverted repeats (22,640 and 22,618 bp) separated by the small (12,518 and 12,519 bp) and large (82,053 and 81,837 bp) single copy regions. Both chloroplast genomes include a total of 136 genes. Phylogenetic analysis revealed that E. colona was diverged between 2.65 and 3.18 million years ago (Mya) from the E. oryzicola and E. crus-galli.
Collapse
Affiliation(s)
- Jeongran Lee
- Department of Agro-food Safety and Crop Protection, National Institute of Agricultural Sciences (NAS), Iseo, Republic of Korea
| | - Jin-Won Kim
- Department of Agro-food Safety and Crop Protection, National Institute of Agricultural Sciences (NAS), Iseo, Republic of Korea
| | - In-Yong Lee
- Department of Agro-food Safety and Crop Protection, National Institute of Agricultural Sciences (NAS), Iseo, Republic of Korea
| | - Jong Hwa Ahn
- National Instrumentation Center for Environment Management (NICEM), College of Agriculture and Life Sciences, Seoul National University, Seoul, Korea
| |
Collapse
|
6
|
Nah G, Im JH, Lim SH, Kim K, Choi AY, Yook MJ, Kim S, Kim C, Kim DS. Complete chloroplast genomes of two Miscanthus species. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4359-4360. [PMID: 26465710 DOI: 10.3109/19401736.2015.1089500] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The complete chloroplast (cp) genomes of two Miscanthus species, M. sinensis and M. sacchariflorus, were sequenced and investigated for genes, genome size variation, and polymorphisms. There are 170 genes in both cp genomes, consisting of 122 mRNA genes (84 protein-coding genes and 38 hypothetical genes), 40 tRNA genes, and 8 rRNA genes. The cp genome contains two inverted repeat (IR) regions, separated by large single copy (LSC) region and small single copy (SSC) region. Indels were responsible for 40 bp difference in cp genome size in two species. In addition, we established phylogenetic relationship with other monocot cp genomes, and estimated divergence time. The two Miscanthus species clustered together among other C4 monocot species and the divergence time of two Miscanthus species was approximately 0.5 1-0.84 Mya.
Collapse
Affiliation(s)
- Gyoungju Nah
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Ji-Hoon Im
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Soo-Hyun Lim
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Kyunghee Kim
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Ah Young Choi
- b College of Agriculture and Life Sciences, National Instrumentation Center for Environment Management (NICEM), Seoul National University , Seoul , Korea , and
| | - Min Jung Yook
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Soonok Kim
- c Biological and Genetic Resources Assessment Division, National Institute of Biological Resources , Incheon , Korea
| | - Changmu Kim
- c Biological and Genetic Resources Assessment Division, National Institute of Biological Resources , Incheon , Korea
| | - Do-Soon Kim
- a Department of Plant Science , College of Agriculture and Life Sciences, Research Institute of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| |
Collapse
|
7
|
Nah G, Im JH, Kim JW, Kim K, Lim J, Choi AY, Choi IY, Yang TJ, Park TS, Lee D, Kim DS. The complete chloroplast genomes of three Korean Echinochloa crus-galli accessions. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4357-4358. [PMID: 26466198 DOI: 10.3109/19401736.2015.1089499] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The complete chloroplast (cp) genomes of three Echinochloa crus-galli accessions (KR822684, KR822685, and KR822686) are reported in this work. The cp genome size is similar in three accessions, ranging from 139 846 bp to 139 860 bp. All three genomes have two inverted repeats (IR) of 22 748 bp per each IR with a large single copy (LSC) region of 81 833-81 844 bp and a small single copy (SSC) region of 12 517-12 520 bp. The total of 131 genes was identified in individual accession. Phylogenetic analysis revealed three Korean Echinochloa accessions belonged to E. crus-galli, and diverged less than 0.1 million years ago (Mya).
Collapse
Affiliation(s)
- Gyoungju Nah
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Ji-Hoon Im
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Jin-Won Kim
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Kyunghee Kim
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Jongsung Lim
- b National Instrumentation Center for Environment Management (NICEM), College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Ah Young Choi
- b National Instrumentation Center for Environment Management (NICEM), College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Ik-Young Choi
- b National Instrumentation Center for Environment Management (NICEM), College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Tae-Jin Yang
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| | - Tae-Seon Park
- c National Academy of Agricultural Science , RDA, Wanju , Jeollabuk-Do , Korea , and
| | - DoKyoung Lee
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea.,d Department of Crop Science , University of Illinois , Urbana-Champaign , Champaign , IL , USA
| | - Do-Soon Kim
- a Department of Plant Science , Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University , Seoul , Korea
| |
Collapse
|
8
|
Zhao M, Meyers BC, Cai C, Xu W, Ma J. Evolutionary patterns and coevolutionary consequences of MIRNA genes and microRNA targets triggered by multiple mechanisms of genomic duplications in soybean. THE PLANT CELL 2015; 27:546-62. [PMID: 25747880 PMCID: PMC4558674 DOI: 10.1105/tpc.15.00048] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Revised: 02/07/2015] [Accepted: 02/18/2015] [Indexed: 05/04/2023]
Abstract
The evolutionary dynamics of duplicated protein-encoding genes (PEGs) is well documented. However, the evolutionary patterns and consequences of duplicated MIRNAs and the potential influence on the evolution of their PEG targets are poorly understood. Here, we demonstrate the evolution of plant MIRNAs subsequent to a recent whole-genome duplication. Overall, the retention of MIRNA duplicates was correlated to the retention of adjacent PEG duplicates, and the retained MIRNA duplicates exhibited a higher level of interspecific preservation of orthologs than singletons, suggesting that the retention of MIRNA duplicates is related to their functional constraints and local genomic stability. Nevertheless, duplication status, rather than local genic collinearity, was the primary determinant of levels of nucleotide divergence of MIRNAs. In addition, the retention of duplicated MIRNAs appears to be associated with the retention of their corresponding duplicated PEG targets. Furthermore, we characterized the evolutionary novelty of a legume-specific microRNA (miRNA) family, which resulted from rounds of genomic duplication, and consequent dynamic evolution of its NB-LRR targets, an important gene family with primary roles in plant-pathogen interactions. Together, these observations depict evolutionary patterns and novelty of MIRNAs in the context of genomic duplication and evolutionary interplay between MIRNAs and their PEG targets mediated by miRNAs.
Collapse
Affiliation(s)
- Meixia Zhao
- Department of Agronomy, Purdue University, West Lafayette, Indiana 47906
| | - Blake C Meyers
- Department of Plant and Soil Sciences, Delaware Biotechnology Institute, University of Delaware, Newark, Delaware 19716
| | - Chunmei Cai
- College of Life Sciences, Qingdao Agricultural University, Qingdao 266109, China
| | - Wei Xu
- College of Life Sciences, Qingdao Agricultural University, Qingdao 266109, China
| | - Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, Indiana 47906 College of Life Sciences, Qingdao Agricultural University, Qingdao 266109, China
| |
Collapse
|
9
|
Gill N, Buti M, Kane N, Bellec A, Helmstetter N, Berges H, Rieseberg LH. Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome. BIOLOGY 2014; 3:295-319. [PMID: 24833511 PMCID: PMC4085609 DOI: 10.3390/biology3020295] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 03/16/2014] [Accepted: 03/25/2014] [Indexed: 12/19/2022]
Abstract
Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
Collapse
Affiliation(s)
- Navdeep Gill
- Department of Botany and The Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, BC, Canada.
| | - Matteo Buti
- Applied Rosaceous Genomics Group, Centre for Research and Innovation, Michele all'Adige (TN) P.IVA 020384102, Italy.
| | - Nolan Kane
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO 80309, USA.
| | - Arnaud Bellec
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Nicolas Helmstetter
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Hélène Berges
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Loren H Rieseberg
- Department of Botany and The Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, BC, Canada.
| |
Collapse
|
10
|
Kolano B, Bednara E, Weiss-Schneeweiss H. Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae). PLANT CELL REPORTS 2013; 32:1575-1588. [PMID: 23754338 PMCID: PMC3778962 DOI: 10.1007/s00299-013-1468-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Revised: 04/30/2013] [Accepted: 05/28/2013] [Indexed: 05/29/2023]
Abstract
High heterogeneity was observed among conserved domains of reverse transcriptase ( rt ) isolated from quinoa. Only one Ty1- copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51 %) and 39 % of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.
Collapse
Affiliation(s)
- Bozena Kolano
- Department of Plant Anatomy and Cytology, University of Silesia, Jagiellonska 28, 40-032, Katowice, Poland,
| | | | | |
Collapse
|
11
|
Manetti ME, Rossi M, Cruz GMQ, Saccaro NL, Nakabashi M, Altebarmakian V, Rodier-Goud M, Domingues D, D’Hont A, Van Sluys MA. Mutator System Derivatives Isolated from Sugarcane Genome Sequence. TROPICAL PLANT BIOLOGY 2012; 5:233-243. [PMID: 22905278 PMCID: PMC3418495 DOI: 10.1007/s12042-012-9104-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2012] [Accepted: 05/03/2012] [Indexed: 06/01/2023]
Abstract
Mutator-like transposase is the most represented transposon transcript in the sugarcane transcriptome. Phylogenetic reconstructions derived from sequenced transcripts provided evidence that at least four distinct classes exist (I-IV) and that diversification among these classes occurred early in Angiosperms, prior to the divergence of Monocots/Eudicots. The four previously described classes served as probes to select and further sequence six BAC clones from a genomic library of cultivar R570. A total of 579,352 sugarcane base pairs were produced from these "Mutator system" BAC containing regions for further characterization. The analyzed genomic regions confirmed that the predicted structure and organization of the Mutator system in sugarcane is composed of two true transposon lineages, each containing a specific terminal inverted repeat and two transposase lineages considered to be domesticated. Each Mutator transposase class displayed a particular molecular structure supporting lineage specific evolution. MUSTANG, previously described domesticated genes, are located in syntenic regions across Sacharineae and, as expected for a host functional gene, posses the same gene structure as in other Poaceae. Two sequenced BACs correspond to hom(eo)logous locus with specific retrotransposon insertions that discriminate sugarcane haplotypes. The comparative studies presented, add information to the Mutator systems previously identified in the maize and rice genomes by describing lineage specific molecular structure and genomic distribution pattern in the sugarcane genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12042-012-9104-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- M. E. Manetti
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - M. Rossi
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - G. M. Q. Cruz
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - N. L. Saccaro
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - M. Nakabashi
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - V. Altebarmakian
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| | - M. Rodier-Goud
- Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), UMR AGAP, Avenue Agropolis, 34398 Montpellier Cedex 5, France
| | - D. Domingues
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
- Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), UMR AGAP, Avenue Agropolis, 34398 Montpellier Cedex 5, France
| | - A. D’Hont
- Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), UMR AGAP, Avenue Agropolis, 34398 Montpellier Cedex 5, France
| | - M. A. Van Sluys
- Departamento de Botânica-IB-USP, GaTE Lab, Brasil, Rua do Matão, 277, 05508-900 São Paulo, SP Brazil
| |
Collapse
|
12
|
Figueroa DM, Bass HW. Development of pachytene FISH maps for six maize chromosomes and their integration with other maize maps for insights into genome structure variation. Chromosome Res 2012; 20:363-80. [PMID: 22588802 PMCID: PMC3391363 DOI: 10.1007/s10577-012-9281-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2012] [Revised: 03/27/2012] [Accepted: 03/28/2012] [Indexed: 12/18/2022]
Abstract
Integrated cytogenetic pachytene fluorescence in situ hybridization (FISH) maps were developed for chromosomes 1, 3, 4, 5, 6, and 8 of maize using restriction fragment length polymorphism marker-selected Sorghum propinquum bacterial artificial chromosomes (BACs) for 19 core bin markers and 4 additional genetic framework loci. Using transgenomic BAC FISH mapping on maize chromosome addition lines of oats, we found that the relative locus position along the pachytene chromosome did not change as a function of total arm length, indicative of uniform axial contraction along the fibers during mid-prophase for tested loci on chromosomes 4 and 5. Additionally, we cytogenetically FISH mapped six loci from chromosome 9 onto their duplicated syntenic regions on chromosomes 1 and 6, which have varying amounts of sequence divergence, using sorghum BACs homologous to the chromosome 9 loci. We found that successful FISH mapping was possible even when the chromosome 9 selective marker had no counterpart in the syntenic block. In total, these 29 FISH-mapped loci were used to create the most extensive pachytene FISH maps to date for these six maize chromosomes. The FISH-mapped loci were then merged into one composite karyotype for direct comparative analysis with the recombination nodule-predicted cytogenetic, genetic linkage, and genomic physical maps using the relative marker positions of the loci on all the maps. Marker colinearity was observed between all pair-wise map comparisons, although marker distribution patterns varied widely in some cases. As expected, we found that the recombination nodule-based predictions most closely resembled the cytogenetic map positions overall. Cytogenetic and linkage map comparisons agreed with previous studies showing a decrease in marker spacing in the peri-centromeric heterochromatin region on the genetic linkage maps. In fact, there was a general trend with most loci mapping closer towards the telomere on the linkage maps than on the cytogenetic maps, regardless of chromosome number or maize inbred line source, with just some of the telomeric loci exempted. Finally and somewhat surprisingly, we observed considerable variation between the relative arm positions of loci when comparing our cytogenetic FISH map to the B73 genomic physical maps, even where comparisons were to a B73-derived cytogenetic map. This variation is more evident between different chromosome arms, but less so within a given arm, ruling out any type of inbred-line dependent global features of linear deoxyribonucleic acid compared with the meiotic fiber organization. This study provides a means for analyzing the maize genome structure by producing new connections for integrating the cytogenetic, linkage, and physical maps of maize.
Collapse
Affiliation(s)
- Debbie M Figueroa
- Department of Biological Science, Florida State University, Tallahassee, 32306-4295, USA.
| | | |
Collapse
|
13
|
Tian Z, Yu Y, Lin F, Yu Y, Sanmiguel PJ, Wing RA, McCouch SR, Ma J, Jackson SA. Exceptional lability of a genomic complex in rice and its close relatives revealed by interspecific and intraspecific comparison and population analysis. BMC Genomics 2011; 12:142. [PMID: 21385395 PMCID: PMC3060143 DOI: 10.1186/1471-2164-12-142] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2010] [Accepted: 03/08/2011] [Indexed: 11/16/2022] Open
Abstract
Background Extensive DNA rearrangement of genic colinearity, as revealed by comparison of orthologous genomic regions, has been shown to be a general concept describing evolutionary dynamics of plant genomes. However, the nature, timing, lineages and adaptation of local genomic rearrangement in closely related species (e.g., within a genus) and haplotype variation of genomic rearrangement within populations have not been well documented. Results We previously identified a hotspot for genic rearrangement and transposon accumulation in the Orp region of Asian rice (Oryza sativa, AA) by comparison with its orthologous region in sorghum. Here, we report the comparative analysis of this region with its orthologous regions in the wild progenitor species (O. nivara, AA) of Asian rice and African rice (O. glaberrima) using the BB genome Oryza species (O. punctata) as an outgroup, and investigation of transposon insertion sites and a segmental inversion event in the AA genomes at the population level. We found that Orp region was primarily and recently expanded in the Asian rice species O. sativa and O. nivara. LTR-retrotransposons shared by the three AA-genomic regions have been fixed in all the 94 varieties that represent different populations of the AA-genome species/subspecies, indicating their adaptive role in genome differentiation. However, LTR-retrotransposons unique to either O. nivara or O. sativa regions exhibited dramatic haplotype variation regarding their presence or absence between or within populations/subpopulations. Conclusions The LTR-retrotransposon insertion hotspot in the Orp region was formed recently, independently and concurrently in different AA-genome species, and that the genic rearrangements detected in different species appear to be differentially triggered by transposable elements. This region is located near the end of the short arm of chromosome 8 and contains a high proportion of LTR-retrotransposons similar to observed in the centromeric region of this same chromosome, and thus may represent a genomic region that has recently switched from euchromatic to heterochromatic states. The haplotype variation of LTR-retrotransposon insertions within this region reveals substantial admixture among various subpopulations as established by molecular markers at the whole genome level, and can be used to develop retrotransposon junction markers for simple and rapid classification of O. sativa germplasm.
Collapse
Affiliation(s)
- Zhixi Tian
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Frenkel Z, Paux E, Mester D, Feuillet C, Korol A. LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes. BMC Bioinformatics 2010; 11:584. [PMID: 21118513 PMCID: PMC3098104 DOI: 10.1186/1471-2105-11-584] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/30/2010] [Indexed: 11/25/2022] Open
Abstract
Background Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that are then anchored to genetic maps with molecular markers. High Information Content Fingerprinting has become the method of choice for large and repetitive genomes such as those of maize, barley, and wheat. However, the high level of repeated DNA present in these genomes requires the application of very stringent criteria to ensure a reliable assembly with the FingerPrinted Contig (FPC) software, which often results in short contig lengths (of 3-5 clones before merging) as well as an unreliable assembly in some difficult regions. Difficulties can originate from a non-linear topological structure of clone overlaps, low power of clone ordering algorithms, and the absence of tools to identify sources of gaps in Minimal Tiling Paths (MTPs). Results To address these problems, we propose a novel approach that: (i) reduces the rate of false connections and Q-clones by using a new cutoff calculation method; (ii) obtains reliable clusters robust to the exclusion of single clone or clone overlap; (iii) explores the topological contig structure by considering contigs as networks of clones connected by significant overlaps; (iv) performs iterative clone clustering combined with ordering and order verification using re-sampling methods; and (v) uses global optimization methods for clone ordering and Band Map construction. The elements of this new analytical framework called Linear Topological Contig (LTC) were applied on datasets used previously for the construction of the physical map of wheat chromosome 3B with FPC. The performance of LTC vs. FPC was compared also on the simulated BAC libraries based on the known genome sequences for chromosome 1 of rice and chromosome 1 of maize. Conclusions The results show that compared to other methods, LTC enables the construction of highly reliable and longer contigs (5-12 clones before merging), the detection of "weak" connections in contigs and their "repair", and the elongation of contigs obtained by other assembly methods.
Collapse
Affiliation(s)
- Zeev Frenkel
- University of Haifa, Institute of Evolution, Haifa 31905, Israel.
| | | | | | | | | |
Collapse
|
15
|
Du J, Tian Z, Hans CS, Laten HM, Cannon SB, Jackson SA, Shoemaker RC, Ma J. Evolutionary conservation, diversity and specificity of LTR-retrotransposons in flowering plants: insights from genome-wide analysis and multi-specific comparison. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010; 63:584-98. [PMID: 20525006 DOI: 10.1111/j.1365-313x.2010.04263.x] [Citation(s) in RCA: 114] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The availability of complete or nearly complete genome sequences from several plant species permits detailed discovery and cross-species comparison of transposable elements (TEs) at the whole genome level. We initially investigated 510 long terminal repeat-retrotransposon (LTR-RT) families comprising 32370 elements in soybean (Glycine max (L.) Merr.). Approximately 87% of these elements were located in recombination-suppressed pericentromeric regions, where the ratio (1.26) of solo LTRs to intact elements (S/I) is significantly lower than that of chromosome arms (1.62). Further analysis revealed a significant positive correlation between S/I and LTR sizes, indicating that larger LTRs facilitate solo LTR formation. Phylogenetic analysis revealed seven Copia and five Gypsy evolutionary lineages that were present before the divergence of eudicot and monocot species, but the scales and timeframes within which they proliferated vary dramatically across families, lineages and species, and notably, a Copia lineage has been lost in soybean. Analysis of the physical association of LTR-RTs with centromere satellite repeats identified two putative centromere retrotransposon (CR) families of soybean, which were grouped into the CR (e.g. CRR and CRM) lineage found in grasses, indicating that the 'functional specification' of CR pre-dates the bifurcation of eudicots and monocots. However, a number of families of the CR lineage are not concentrated in centromeres, suggesting that their CR roles may now be defunct. Our data also suggest that the envelope-like genes in the putative Copia retrovirus-like family are probably derived from the Gypsy retrovirus-like lineage, and thus we propose the hypothesis of a single ancient origin of envelope-like genes in flowering plants.
Collapse
Affiliation(s)
- Jianchang Du
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Sanyal A, Ammiraju JSS, Lu F, Yu Y, Rambo T, Currie J, Kollura K, Kim HR, Chen J, Ma J, San Miguel P, Mingsheng C, Wing RA, Jackson SA. Orthologous comparisons of the Hd1 region across genera reveal Hd1 gene lability within diploid Oryza species and disruptions to microsynteny in Sorghum. Mol Biol Evol 2010; 27:2487-506. [PMID: 20522726 DOI: 10.1093/molbev/msq133] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Heading date is one of the most important quantitative traits responsible for the domestication of rice. We compared a 155-kb reference segment of the Oryza sativa ssp. japonica cv. Nipponbare genome surrounding Hd1, a major heading date gene in rice, with orthologous regions from nine diploid Oryza species that diverged over a relatively short time frame (∼16 My) to study sequence evolution around a domestication locus. The orthologous Hd1 region from Sorghum bicolor was included to compare and contrast the evolution in a more distant relative of rice. Consistent with other observations at the adh1/adh2, monoculm1, and sh2/a1 loci in grass species, we found high gene colinearity in the Hd1 region amidst size differences that were lineage specific and long terminal repeat retrotransposon driven. Unexpectedly, the Hd1 gene was deleted in O. glaberrima, whereas the O. rufipogon and O. punctata copies had degenerative mutations, suggesting that other heading date loci might compensate for the loss or nonfunctionality of Hd1 in these species. Compared with the japonica Hd1 region, the orthologous region in sorghum exhibited micro-rearrangements including gene translocations, seven additional genes, and a gene triplication and truncation event predating the divergence from Oryza.
Collapse
|
17
|
Tian Z, Rizzon C, Du J, Zhu L, Bennetzen JL, Jackson SA, Gaut BS, Ma J. Do genetic recombination and gene density shape the pattern of DNA elimination in rice long terminal repeat retrotransposons? Genome Res 2009; 19:2221-30. [PMID: 19789376 DOI: 10.1101/gr.083899.108] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
In flowering plants, the accumulation of small deletions through unequal homologous recombination (UR) and illegitimate recombination (IR) is proposed to be the major process counteracting genome expansion, which is caused primarily by the periodic amplification of long terminal repeat retrotransposons (LTR-RTs). However, the full suite of evolutionary forces that govern the gain or loss of transposable elements (TEs) and their distribution within a genome remains unclear. Here, we investigated the distribution and structural variation of LTR-RTs in relation to the rates of local genetic recombination (GR) and gene densities in the rice (Oryza sativa) genome. Our data revealed a positive correlation between GR rates and gene densities and negative correlations between LTR-RT densities and both GR and gene densities. The data also indicate a tendency for LTR-RT elements and fragments to be shorter in regions with higher GR rates; the size reduction of LTR-RTs appears to be achieved primarily through solo LTR formation by UR. Comparison of indica and japonica rice revealed patterns and frequencies of LTR-RT gain and loss within different evolutionary timeframes. Different LTR-RT families exhibited variable distribution patterns and structural changes, but overall LTR-RT compositions and genes were organized according to the GR gradients of the genome. Further investigation of non-LTR-RTs and DNA transposons revealed a negative correlation between gene densities and the abundance of DNA transposons and a weak correlation between GR rates and the abundance of long interspersed nuclear elements (LINEs)/short interspersed nuclear elements (SINEs). Together, these observations suggest that GR and gene density play important roles in shaping the dynamic structure of the rice genome.
Collapse
Affiliation(s)
- Zhixi Tian
- Department of Agronomy, Purdue University, West Lafayette, Indiana 47907, USA
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Tandem organization of StarkB element (22.8 kb) in the maize B chromosome. Mol Genet Genomics 2009; 282:131-9. [PMID: 19418069 DOI: 10.1007/s00438-009-0452-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2009] [Accepted: 04/17/2009] [Indexed: 10/20/2022]
Abstract
A very large repetitive element (StarkB, 22.8 kb) is present in the maize B chromosome, presumably not organized as tandem arrays. Results of the current study are contrary to this notion. Out of eighteen StarkB-carrying sequences, nine were the expected internal fragment of StarkB, and nine others were fragments spanning two StarkB elements. One of the two StarkB components, GrandeB, was flanked in all clones with identical target sequences, as opposed to other Grandes that are associated with different target sequences. Also observed was a prominent Southern signal associated with a fragment representing the junction of two adjacent StarkB units. A clone possessing a structure inverse to that of the second component of StarkB is proposed to be the initial element into which a GrandeB inserted to derive StarkB. Most, if not all, isolated StarkB arrays were not the original form, being disrupted by the invasion of various mobile elements intertwined with various stages of amplification.
Collapse
|
19
|
Huo N, Vogel JP, Lazo GR, You FM, Ma Y, McMahon S, Dvorak J, Anderson OD, Luo MC, Gu YQ. Structural characterization of Brachypodium genome and its syntenic relationship with rice and wheat. PLANT MOLECULAR BIOLOGY 2009; 70:47-61. [PMID: 19184460 DOI: 10.1007/s11103-009-9456-3] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2008] [Accepted: 01/07/2009] [Indexed: 05/22/2023]
Abstract
Brachypodium distachyon (Brachypodium) has been recently recognized as an emerging model system for both comparative and functional genomics in grass species. In this study, 55,221 repeat masked Brachypodium BAC end sequences (BES) were used for comparative analysis against the 12 rice pseudomolecules. The analysis revealed that approximately 26.4% of BES have significant matches with the rice genome and 82.4% of the matches were homologous to known genes. Further analysis of paired-end BES and approximately 1.0 Mb sequences from nine selected BACs proved to be useful in revealing conserved regions and regions that have undergone considerable genomic changes. Differential gene amplification, insertions/deletions and inversions appeared to be the common evolutionary events that caused variations of microcolinearity at different orthologous genomic regions. It was found that approximately 17% of genes in the two genomes are not colinear in the orthologous regions. Analysis of BAC sequences also revealed higher gene density (approximately 9 kb/gene) and lower repeat DNA content (approximately 13.1%) in Brachypodium when compared to the orthologous rice regions, consistent with the smaller size of the Brachypodium genome. The 119 annotated Brachypodium genes were BLASTN compared against the wheat EST database and deletion bin mapped wheat ESTs. About 77% of the genes retrieved significant matches in the EST database, while 9.2% matched to the bin mapped ESTs. In some cases, genes in single Brachypodium BACs matched to multiple ESTs that were mapped to the same deletion bins, suggesting that the Brachypodium genome will be useful for ordering wheat ESTs within the deletion bins and developing specific markers at targeted regions in the wheat genome.
Collapse
Affiliation(s)
- Naxin Huo
- Genomics and Gene Discovery Research Unit, USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Innes RW, Ameline-Torregrosa C, Ashfield T, Cannon E, Cannon SB, Chacko B, Chen NWG, Couloux A, Dalwani A, Denny R, Deshpande S, Egan AN, Glover N, Hans CS, Howell S, Ilut D, Jackson S, Lai H, Mammadov J, Del Campo SM, Metcalf M, Nguyen A, O'Bleness M, Pfeil BE, Podicheti R, Ratnaparkhe MB, Samain S, Sanders I, Ségurens B, Sévignac M, Sherman-Broyles S, Thareau V, Tucker DM, Walling J, Wawrzynski A, Yi J, Doyle JJ, Geffroy V, Roe BA, Maroof MAS, Young ND. Differential accumulation of retroelements and diversification of NB-LRR disease resistance genes in duplicated regions following polyploidy in the ancestor of soybean. PLANT PHYSIOLOGY 2008; 148:1740-59. [PMID: 18842825 PMCID: PMC2593655 DOI: 10.1104/pp.108.127902] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2008] [Accepted: 10/06/2008] [Indexed: 05/18/2023]
Abstract
The genomes of most, if not all, flowering plants have undergone whole genome duplication events during their evolution. The impact of such polyploidy events is poorly understood, as is the fate of most duplicated genes. We sequenced an approximately 1 million-bp region in soybean (Glycine max) centered on the Rpg1-b disease resistance gene and compared this region with a region duplicated 10 to 14 million years ago. These two regions were also compared with homologous regions in several related legume species (a second soybean genotype, Glycine tomentella, Phaseolus vulgaris, and Medicago truncatula), which enabled us to determine how each of the duplicated regions (homoeologues) in soybean has changed following polyploidy. The biggest change was in retroelement content, with homoeologue 2 having expanded to 3-fold the size of homoeologue 1. Despite this accumulation of retroelements, over 77% of the duplicated low-copy genes have been retained in the same order and appear to be functional. This finding contrasts with recent analyses of the maize (Zea mays) genome, in which only about one-third of duplicated genes appear to have been retained over a similar time period. Fluorescent in situ hybridization revealed that the homoeologue 2 region is located very near a centromere. Thus, pericentromeric localization, per se, does not result in a high rate of gene inactivation, despite greatly accelerated retrotransposon accumulation. In contrast to low-copy genes, nucleotide-binding-leucine-rich repeat disease resistance gene clusters have undergone dramatic species/homoeologue-specific duplications and losses, with some evidence for partitioning of subfamilies between homoeologues.
Collapse
Affiliation(s)
- Roger W Innes
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Ammiraju JSS, Lu F, Sanyal A, Yu Y, Song X, Jiang N, Pontaroli AC, Rambo T, Currie J, Collura K, Talag J, Fan C, Goicoechea JL, Zuccolo A, Chen J, Bennetzen JL, Chen M, Jackson S, Wing RA. Dynamic evolution of oryza genomes is revealed by comparative genomic analysis of a genus-wide vertical data set. THE PLANT CELL 2008; 20:3191-209. [PMID: 19098269 PMCID: PMC2630430 DOI: 10.1105/tpc.108.063727] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Revised: 12/01/2008] [Accepted: 12/06/2008] [Indexed: 05/18/2023]
Abstract
Oryza (23 species; 10 genome types) contains the world's most important food crop - rice. Although the rice genome serves as an essential tool for biological research, little is known about the evolution of the other Oryza genome types. They contain a historical record of genomic changes that led to diversification of this genus around the world as well as an untapped reservoir of agriculturally important traits. To investigate the evolution of the collective Oryza genome, we sequenced and compared nine orthologous genomic regions encompassing the Adh1-Adh2 genes (from six diploid genome types) with the rice reference sequence. Our analysis revealed the architectural complexities and dynamic evolution of this region that have occurred over the past approximately 15 million years. Of the 46 intact genes and four pseudogenes in the japonica genome, 38 (76%) fell into eight multigene families. Analysis of the evolutionary history of each family revealed independent and lineage-specific gain and loss of gene family members as frequent causes of synteny disruption. Transposable elements were shown to mediate massive replacement of intergenic space (>95%), gene disruption, and gene/gene fragment movement. Three cases of long-range structural variation (inversions/deletions) spanning several hundred kilobases were identified that contributed significantly to genome diversification.
Collapse
Affiliation(s)
- Jetty S S Ammiraju
- Arizona Genomics Institute, Department of Plant Sciences, BIO5 Institute, University of Arizona, Tucson, Arizona 85721, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
The 172-kb genomic DNA region of the O. rufipogon yld1.1 locus: comparative sequence analysis with O. sativa ssp. japonica and O. sativa ssp. indica. Funct Integr Genomics 2008; 9:97-108. [PMID: 18633654 DOI: 10.1007/s10142-008-0091-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2008] [Revised: 06/08/2008] [Accepted: 06/08/2008] [Indexed: 10/21/2022]
Abstract
Common wild rice (Oryza rufipogon) plays an important role by contributing to modern rice breeding. In this paper, we report the sequence and analysis of a 172-kb genomic DNA region of wild rice around the RM5 locus, which is associated with the yield QTL yld1.1. Comparative sequence analysis between orthologous RM5 regions from Oryza sativa ssp. japonica, O. sativa ssp. indica and O. rufipogon revealed a high level of conserved synteny in the content, homology, structure, orientation, and physical distance of all 14 predicted genes. Twelve of the putative genes were supported by matches to proteins with known function, whereas two were predicted by homology to rice and other plant expressed sequence tags or complementary DNAs. The remarkably high level of conservation found in coding, intronic and intergenic regions may indicate high evolutionary selection on the RM5 region. Although our analysis has not defined which gene(s) determine the yld1.1 phenotype, allelic variation and the insertion of transposable elements, among other nucleotide changes, represent potential variation responsible for the yield QTL. However, as suggested previously, two putative receptor-like protein kinase genes remain the key suspects for yld1.1.
Collapse
|
23
|
Shyu CR, Harnsomburana J, Green J, Barb AS, Kazic T, Schaeffer M, Coe E. Searching and mining visually observed phenotypes of maize mutants. J Bioinform Comput Biol 2008; 5:1193-213. [PMID: 18172925 DOI: 10.1142/s0219720007003181] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Revised: 08/06/2007] [Accepted: 08/13/2007] [Indexed: 11/18/2022]
Abstract
There are thousands of maize mutants, which are invaluable resources for plant research. Geneticists use them to study underlying mechanisms of biochemistry, cell biology, cell development, and cell physiology. To streamline the understanding of such complex processes, researchers need the most current versions of genetic and physical maps, tools with the ability to recognize novel phenotypes or classify known phenotypes, and an intimate knowledge of the biochemical processes generating physiological and phenotypic effects. They must also know how all of these factors change and differ among species, diverse alleles, germplasms, and environmental conditions. While there are robust databases, such as MaizeGDB, for some of these types of raw data, other crucial components are missing. Moreover, the management of visually observed mutant phenotypes is still in its infant stage, let alone the complex query methods that can draw upon high-level and aggregated information to answer the questions of geneticists. In this paper, we address the scientific challenge and propose to develop a robust framework for managing the knowledge of visually observed phenotypes, mining the correlation of visual characteristics with genetic maps, and discovering the knowledge relating to cross-species conservation of visual and genetic patterns. The ultimate goal of this research is to allow a geneticist to submit phenotypic and genomic information on a mutant to a knowledge base and ask, "What genes or environmental factors cause this visually observed phenotype?".
Collapse
Affiliation(s)
- Chi-Ren Shyu
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA.
| | | | | | | | | | | | | |
Collapse
|
24
|
Doust A. Architectural evolution and its implications for domestication in grasses. ANNALS OF BOTANY 2007; 100:941-50. [PMID: 17478546 PMCID: PMC2759198 DOI: 10.1093/aob/mcm040] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2006] [Revised: 11/28/2006] [Accepted: 01/22/2007] [Indexed: 05/15/2023]
Abstract
BACKGROUND The cereal crops domesticated from grasses provide a large percentage of the calories consumed by humans. Domestication and breeding in individual cereals has historically occurred in isolation, although this is rapidly changing with comparative genomics of the sequenced or soon-to-be sequenced genomes of rice, sorghum, maize and Brachypodium. Genetic information transferred through genomic comparisons is helping our understanding of genetically less tractable crops such as the hexaploid wheats and polyploid sugarcane, as well as the approx. 10 000 species of wild grasses. In turn, phylogenetic analysis helps put our knowledge of the morphology of cereal crops into an evolutionary context. GRASS ARCHITECTURE: Domestication often involves a change in the pattern and timing of branching, which affects both vegetative and inflorescence architecture, and ultimately yield. Cereal grasses exhibit two main forms of vegetative architecture: the pooid and erhartoid cereals such as wheat and rice have multiple basal tillers, while panicoid cereals such as maize, sorghum and the millets have few tillers or even only a single main stem. These differences are reflected in the differences between the wild species of pooid and some erhartoid grasses, which emphasize basal branching over axillary branching, and the panicoid grasses, where axillary branching is more frequently found. A combination of phylogenetic and genomic analysis is beginning to reveal the similarities and differences between different cereal crops, and relate these to the diversity of wild grasses to which they are related. Recent work on genes controlling branching emphasizes that developmental genetics needs to be viewed in both an evolutionary and ecological framework, if it is to be useful in understanding how morphology evolves. Increasingly, exploring the phylogenetic context of the crop grasses will suggest new ways to identify and create combinations of morphological traits that will best suit our future needs.
Collapse
Affiliation(s)
- Andrew Doust
- University of Missouri-St. Louis, 1 University Boulevard, St Louis, MO 63121, USA.
| |
Collapse
|
25
|
Schlueter JA, Lin JY, Schlueter SD, Vasylenko-Sanders IF, Deshpande S, Yi J, O'Bleness M, Roe BA, Nelson RT, Scheffler BE, Jackson SA, Shoemaker RC. Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing. BMC Genomics 2007; 8:330. [PMID: 17880721 PMCID: PMC2077340 DOI: 10.1186/1471-2164-8-330] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 09/19/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Soybean, Glycine max (L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Department of Energy Joint Genome Institute and United States Department of Agriculture jointly announced the sequencing of the soybean genome. One of the initial concerns is to what extent sequence identity in homeologous regions would have on whole genome shotgun sequence assembly. RESULTS Seventeen BACs representing approximately 2.03 Mb were sequenced as representative potential homeologous regions from the soybean genome. Genetic mapping of each BAC shows that 11 of the 20 chromosomes are represented. Sequence comparisons between homeologous BACs shows that the soybean genome is a mosaic of retained paleopolyploid regions. Some regions appear to be highly conserved while other regions have diverged significantly. Large-scale "batch" reassembly of all 17 BACs combined showed that even the most homeologous BACs with upwards of 95% sequence identity resolve into their respective homeologous sequences. Potential assembly errors were generated by tandemly duplicated pentatricopeptide repeat containing genes and long simple sequence repeats. Analysis of a whole-genome shotgun assembly of 80,000 randomly chosen JGI-DOE sequence traces reveals some new soybean-specific repeat sequences. CONCLUSION This analysis investigated both the structure of the paleopolyploid soybean genome and the potential effects retained homeology will have on assembling the whole genome shotgun sequence. Based upon these results, homeologous regions similar to those characterized here will not cause major assembly issues.
Collapse
Affiliation(s)
| | - Jer-Young Lin
- Purdue Genetics Program, Purdue University, West Lafayette, IN 47907, USA
| | | | | | - Shweta Deshpande
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK 73019, USA
| | - Jing Yi
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK 73019, USA
| | - Majesta O'Bleness
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK 73019, USA
| | - Bruce A Roe
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK 73019, USA
| | - Rex T Nelson
- USDA-ARS, Corn Insect and Crop Genetics Research Unit, and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
| | | | - Scott A Jackson
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - Randy C Shoemaker
- USDA-ARS, Corn Insect and Crop Genetics Research Unit, and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
26
|
Schlueter JA, Lin JY, Schlueter SD, Vasylenko-Sanders IF, Deshpande S, Yi J, O'Bleness M, Roe BA, Nelson RT, Scheffler BE, Jackson SA, Shoemaker RC. Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing. BMC Genomics 2007. [PMID: 17880721 DOI: 10.1186/1471‐2164‐8‐330] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Soybean, Glycine max (L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Department of Energy Joint Genome Institute and United States Department of Agriculture jointly announced the sequencing of the soybean genome. One of the initial concerns is to what extent sequence identity in homeologous regions would have on whole genome shotgun sequence assembly. RESULTS Seventeen BACs representing approximately 2.03 Mb were sequenced as representative potential homeologous regions from the soybean genome. Genetic mapping of each BAC shows that 11 of the 20 chromosomes are represented. Sequence comparisons between homeologous BACs shows that the soybean genome is a mosaic of retained paleopolyploid regions. Some regions appear to be highly conserved while other regions have diverged significantly. Large-scale "batch" reassembly of all 17 BACs combined showed that even the most homeologous BACs with upwards of 95% sequence identity resolve into their respective homeologous sequences. Potential assembly errors were generated by tandemly duplicated pentatricopeptide repeat containing genes and long simple sequence repeats. Analysis of a whole-genome shotgun assembly of 80,000 randomly chosen JGI-DOE sequence traces reveals some new soybean-specific repeat sequences. CONCLUSION This analysis investigated both the structure of the paleopolyploid soybean genome and the potential effects retained homeology will have on assembling the whole genome shotgun sequence. Based upon these results, homeologous regions similar to those characterized here will not cause major assembly issues.
Collapse
|
27
|
Liu R, Vitte C, Ma J, Mahama AA, Dhliwayo T, Lee M, Bennetzen JL. A GeneTrek analysis of the maize genome. Proc Natl Acad Sci U S A 2007; 104:11844-9. [PMID: 17615239 PMCID: PMC1913904 DOI: 10.1073/pnas.0704258104] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2007] [Indexed: 11/18/2022] Open
Abstract
Analysis of the sequences of 74 randomly selected BACs demonstrated that the maize nuclear genome contains approximately 37,000 candidate genes with homologues in other plant species. An additional approximately 5,500 predicted genes are severely truncated and probably pseudogenes. The distribution of genes is uneven, with approximately 30% of BACs containing no genes. BAC gene density varies from 0 to 7.9 per 100 kb, whereas most gene islands contain only one gene. The average number of genes per gene island is 1.7. Only 72% of these genes show collinearity with the rice genome. Particular LTR retrotransposon families (e.g., Gyma) are enriched on gene-free BACs, most of which do not come from pericentromeres or other large heterochromatic regions. Gene-containing BACs are relatively enriched in different families of LTR retrotransposons (e.g., Ji). Two major bursts of LTR retrotransposon activity in the last 2 million years are responsible for the large size of the maize genome, but only the more recent of these is well represented in gene-containing BACs, suggesting that LTR retrotransposons are more efficiently removed in these domains. The results demonstrate that sample sequencing and careful annotation of a few randomly selected BACs can provide a robust description of a complex plant genome.
Collapse
Affiliation(s)
- Renyi Liu
- *Department of Genetics, University of Georgia, Athens, GA 30602; and
| | - Clémentine Vitte
- *Department of Genetics, University of Georgia, Athens, GA 30602; and
| | - Jianxin Ma
- *Department of Genetics, University of Georgia, Athens, GA 30602; and
| | | | - Thanda Dhliwayo
- Department of Agronomy, Iowa State University, Ames, IA 50011
| | - Michael Lee
- Department of Agronomy, Iowa State University, Ames, IA 50011
| | | |
Collapse
|
28
|
Zhang S, Gu YQ, Singh J, Coleman-Derr D, Brar DS, Jiang N, Lemaux PG. New insights into Oryza genome evolution: high gene colinearity and differential retrotransposon amplification. PLANT MOLECULAR BIOLOGY 2007; 64:589-600. [PMID: 17534720 DOI: 10.1007/s11103-007-9178-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 04/26/2007] [Indexed: 05/15/2023]
Abstract
An approximately 247-kb genomic region from FF genome of wild rice Oryza brachyantha, possessing the smallest Oryza genome, was compared to the orthologous approximately 450-kb region from AA genome, O. sativa L. ssp. japonica. 37 of 38 genes in the orthologous regions are shared between japonica and O. brachyantha. Analyses of nucleotide substitution in coding regions suggest the two genomes diverged approximately 10 million years ago. Comparisons of transposable elements (TEs) reveal that the density of DNA TEs in O. brachyantha is comparable to O. sativa; however, the density of RNA TEs is dramatically lower. The genomic fraction of RNA TEs in japonica is two times greater than in O. brachyantha. Differences, particularly in RNA TEs, in this region and in BAC end sequences from five wild and two cultivated Oryza species explain major genome size differences between sativa and brachyantha. Gene expression analyses of three ObDREB1 genes in the sequenced region indicate orthologous genes retain similar expression patterns following cold stress. Our results demonstrate that size and number of RNA TEs play a major role in genomic differentiation and evolution in Oryza. Additionally, distantly related O. brachyantha shares colinearity with O. sativa, offering opportunities to use comparative genomics to explore the genetic diversity of wild species to improve cultivated rice.
Collapse
Affiliation(s)
- Shibo Zhang
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
| | | | | | | | | | | | | |
Collapse
|
29
|
Jannoo N, Grivet L, Chantret N, Garsmeur O, Glaszmann JC, Arruda P, D'Hont A. Orthologous comparison in a gene-rich region among grasses reveals stability in the sugarcane polyploid genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2007; 50:574-85. [PMID: 17425713 DOI: 10.1111/j.1365-313x.2007.03082.x] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Modern sugarcane (Saccharum spp.) is an important grass that contributes 60% of the raw sugar produced worldwide and has a high biofuel production potential. It was created about a century ago through hybridization of two highly polyploid species, namely S. officinarum and S. spontaneum. We investigated genome dynamics in this highly polyploid context by analyzing two homoeologous sequences (97 and 126 kb) in a region that has already been studied in several cereals. Our findings indicate that the two Saccharum species diverged 1.5-2 million years ago from one another and 8-9 million years ago from sorghum. The two sugarcane homoeologous haplotypes show perfect colinearity as well as high gene structure conservation. Apart from the insertion of a few retrotransposable elements, high homology was also observed for the non-transcribed regions. Relative to sorghum, the sugarcane sequences displayed colinearity, with the exception of two genes present only in sorghum, and striking homology in most non-coding parts of the genome. The gene distribution highlighted high synteny and colinearity with rice, and partial colinearity with each homoeologous maize region, which became perfect when the sequences were combined. The haplotypes observed in sugarcane may thus closely represent the ancestral Andropogoneae haplotype. This analysis of sugarcane haplotype organization at the sequence level suggests that the high ploidy in sugarcane did not induce generalized reshaping of its genome, thus challenging the idea that polyploidy quickly induces generalized rearrangement of genomes. These results also confirm the view that sorghum is the model of choice for sugarcane.
Collapse
Affiliation(s)
- Nazeema Jannoo
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | | | | | | | | | | | | |
Collapse
|
30
|
Ma J, Wing RA, Bennetzen JL, Jackson SA. Plant centromere organization: a dynamic structure with conserved functions. Trends Genet 2007; 23:134-9. [PMID: 17275131 DOI: 10.1016/j.tig.2007.01.004] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2006] [Revised: 10/30/2006] [Accepted: 01/17/2007] [Indexed: 11/25/2022]
Abstract
Although the structural features of centromeres from most multicellular eukaryotes remain to be characterized, recent analyses of the complete sequences of two centromeric regions of rice, together with data from Arabidopsis thaliana and maize, have illuminated the considerable size variation and sequence divergence of plant centromeres. Despite the severe suppression of meiotic chromosomal exchange in centromeric and pericentromeric regions of rice, the centromere core shows high rates of unequal homologous recombination in the absence of chromosomal exchange, resulting in frequent and extensive DNA rearrangement. Not only is the sequence of centromeric tandem and non-tandem repeats highly variable but also the copy number, spacing, order and orientation, providing ample natural variation as the basis for selection of superior centromere performance. This review article focuses on the structural and evolutionary dynamics of plant centromere organization and the potential molecular mechanisms responsible for the rapid changes of centromeric components.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | |
Collapse
|
31
|
Bossolini E, Wicker T, Knobel PA, Keller B. Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2007; 49:704-17. [PMID: 17270010 DOI: 10.1111/j.1365-313x.2006.02991.x] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Brachypodium sylvaticum and Brachypodium distachyon were recently proposed as new model plants because of their small genomes and their phylogenetic position between rice and Triticeae crops. We sequenced a 371-kb region in B. sylvaticum, the largest genomic sequence available so far from this species, providing quantitative data on gene conservation, collinearity and phylogeny. We compared it with orthologous regions from rice and wheat. Brachypodium and wheat show perfect macro-collinearity of genetic markers, whereas rice contains an approximately 220-kb inversion. Rice contains almost twice as many genes as Brachypodium in the region studied, whereas wheat has about 40% more. Through comparative annotation, we identified alternative transcripts and improved the annotation for several rice genes, indicating that approximately 15% of rice genes might require re-annotation. Surprisingly, our data suggest that 10-15% of functional sequences in small grass genomes may not encode any proteins. From available genomic and expressed sequence tag sequences, we estimated Brachypodium to have diverged from wheat about 35-40 Mya, significantly more recently than the divergence of rice and wheat. However, our data also indicate that orthologous regions from Brachypodium and wheat differ considerably in gene content, thus the Brachypodium genome sequence probably cannot replace genomic studies in the large Triticeae genomes.
Collapse
Affiliation(s)
- Eligio Bossolini
- Institute of Plant Biology, University of Zürich, Zollikerstrasse 107, 8008 Zürich, Switzerland
| | | | | | | |
Collapse
|
32
|
Vitte C, Bennetzen JL. Analysis of retrotransposon structural diversity uncovers properties and propensities in angiosperm genome evolution. Proc Natl Acad Sci U S A 2006; 103:17638-43. [PMID: 17101966 PMCID: PMC1693799 DOI: 10.1073/pnas.0605618103] [Citation(s) in RCA: 178] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2006] [Indexed: 11/18/2022] Open
Abstract
Analysis of LTR retrotransposon structures in five diploid angiosperm genomes uncovered very different relative levels of different types of genomic diversity. All species exhibited recent LTR retrotransposon mobility and also high rates of DNA removal by unequal homologous recombination and illegitimate recombination. The larger plant genomes contained many LTR retrotransposon families with >10,000 copies per haploid genome, whereas the smaller genomes contained few or no LTR retrotransposon families with >1,000 copies, suggesting that this differential potential for retroelement amplification is a primary factor in angiosperm genome size variation. The average ratios of transition to transversion mutations (Ts/Tv) in diverging LTRs were >1.5 for each species studied, suggesting that these elements are mostly 5-methylated at cytosines in an epigenetically silenced state. However, the diploid wheat Triticum monococcum and barley have unusually low Ts/Tv values (respectively, 1.9 and 1.6) compared with maize (3.9), medicago (3.6), and lotus (2.5), suggesting that this silencing is less complete in the two Triticeae. Such characteristics as the ratios of point mutations to indels (insertions and deletions) and the relative efficiencies of DNA removal by unequal homologous recombination compared with illegitimate recombination were highly variable between species. These latter variations did not correlate with genome size or phylogenetic relatedness, indicating that they frequently change during the evolutionary descent of plant lineages. In sum, the results indicate that the different sizes, contents, and structures of angiosperm genomes are outcomes of the same suite of mechanistic processes, but acting with different relative efficiencies in different plant lineages.
Collapse
Affiliation(s)
- Clémentine Vitte
- Department of Genetics, University of Georgia, Athens, GA 30602-7223
| | | |
Collapse
|
33
|
Pelgas B, Beauseigle S, Acheré V, Jeandroz S, Bousquet J, Isabel N. Comparative genome mapping among Picea glauca, P. mariana x P. rubens and P. abies, and correspondence with other Pinaceae. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2006; 113:1371-93. [PMID: 17061103 DOI: 10.1007/s00122-006-0354-7] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2006] [Accepted: 06/21/2006] [Indexed: 05/12/2023]
Abstract
A composite linkage map was constructed from four individual maps for the conifer Picea glauca (Moench) Voss, from anonymous and gene-specific markfers (714 AFLPs, 38 SSRs, and 53 ESTPs). A total of 12 linkage groups were delineated with an average marker density of 2.7 cM. Macro-synteny and macro-colinearity comparisons with two other composite linkage maps developed for the species complex P. mariana (Mill.) B.S.P. x P. rubens Sarg., and for P. abies (L.) Karst. revealed an identical number of linkage groups and a remarkable conservation of the gene content and gene order of linkage groups over the million years since the split between these taxa. Identical gene order among taxa was observed for 10 of the 12 assembled composite linkage groups. The discovery of one breakdown in synteny between P. glauca and the other two taxa indicated the occurrence of an inter-chromosomal rearrangement involving an insertional translocation. Analysis of marker colinearity also revealed a putative segmental duplication. The combined information from these three Picea genomes validated and improved large-scale genome comparisons at the inter-generic level in the family Pinaceae by allowing for the identification of 11 homoeologous linkage groups between Picea and Pinus, and nine such groups between Picea and Pseudotsuga menziesii. Notably, the analysis of synteny among the three genera revealed a putative case of chromosomal fission and an inter-chromosomal rearrangement in the genome of P. menziesii. Both of these changes are inter-connected, indicating much instability in this part of the P. menziesii genome. Overall, the macro-structure of the Pinaceae genome was well conserved, which is notable given the Cretaceous origin of its main lineages.
Collapse
Affiliation(s)
- Betty Pelgas
- Centre de Recherche en Biologie Forestière, Pavillon Charles-Eugène Marchand, Université Laval, Québec, QC, Canada, G1K 7P4
| | | | | | | | | | | |
Collapse
|
34
|
Schlueter JA, Scheffler BE, Schlueter SD, Shoemaker RC. Sequence conservation of homeologous bacterial artificial chromosomes and transcription of homeologous genes in soybean (Glycine max L. Merr.). Genetics 2006; 174:1017-28. [PMID: 16888343 PMCID: PMC1602103 DOI: 10.1534/genetics.105.055020] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2005] [Accepted: 07/19/2006] [Indexed: 11/18/2022] Open
Abstract
The paleopolyploid soybean genome was investigated by sequencing homeologous BAC clones anchored by duplicate N-hydroxycinnamoyl/benzoyltransferase (HCBT) genes. The homeologous BACs were genetically mapped to linkage groups C1 and C2. Annotation of the 173,747- and 98,760-bp BACs showed that gene conservation in both order and orientation is high between homeologous regions with only a single gene insertion/deletion and local tandem duplications differing between the regions. The nucleotide sequence conservation extends into intergenic regions as well, probably due to conserved regulatory sequences. Most of the homeologs appear to have a role in either transcription/DNA binding or cellular signaling, suggesting a potential preference for retention of duplicate genes with these functions. Reverse transcriptase-PCR analysis of homeologs showed that in the tissues sampled, most homeologs have not diverged greatly in their transcription profiles. However, four cases of changes in transcription were identified, primarily in the HCBT gene cluster. Because a mapped locus corresponds to a soybean cyst nematode (SCN) QTL, the potential role of HCBT genes in response to SCN is discussed. These results are the first sequenced-based analysis of homeologous BACs in soybean, a diploidized paleopolyploid.
Collapse
Affiliation(s)
- Jessica A Schlueter
- Department of Genetics, Developmental and Cellular Biology, Iowa State University, Iowa 50011, USA
| | | | | | | |
Collapse
|
35
|
Piegu B, Guyot R, Picault N, Roulin A, Saniyal A, Kim H, Collura K, Brar DS, Jackson S, Wing RA, Panaud O. Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. Genome Res 2006; 16:1262-9. [PMID: 16963705 PMCID: PMC1581435 DOI: 10.1101/gr.5290206] [Citation(s) in RCA: 377] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Retrotransposons are the main components of eukaryotic genomes, representing up to 80% of some large plant genomes. These mobile elements transpose via a "copy and paste" mechanism, thus increasing their copy number while active. Their accumulation is now accepted as the main factor of genome size increase in higher eukaryotes, besides polyploidy. However, the dynamics of this process are poorly understood. In this study, we show that Oryza australiensis, a wild relative of the Asian cultivated rice O. sativa, has undergone recent bursts of three LTR-retrotransposon families. This genome has accumulated more than 90,000 retrotransposon copies during the last three million years, leading to a rapid twofold increase of its size. In addition, phenetic analyses of these retrotransposons clearly confirm that the genomic bursts occurred posterior to the radiation of the species. This provides direct evidence of retrotransposon-mediated variation of genome size within a plant genus.
Collapse
Affiliation(s)
- Benoit Piegu
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS-IRD, Université de Perpignan, Perpignan 66860, France
| | - Romain Guyot
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS-IRD, Université de Perpignan, Perpignan 66860, France
| | - Nathalie Picault
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS-IRD, Université de Perpignan, Perpignan 66860, France
| | - Anne Roulin
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS-IRD, Université de Perpignan, Perpignan 66860, France
| | - Abhijit Saniyal
- Agricultural Genomics, Purdue University, West Lafayette, Indiana 47907, USA
| | - Hyeran Kim
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
| | - Kristi Collura
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
| | - Darshan S. Brar
- Plant Breeding Genetics and Biochemistry Division, International Rice Research Institute, Manila 1099, Philippines, USA
| | - Scott Jackson
- Agricultural Genomics, Purdue University, West Lafayette, Indiana 47907, USA
| | - Rod A. Wing
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
| | - Olivier Panaud
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS-IRD, Université de Perpignan, Perpignan 66860, France
- Corresponding author.E-mail ; fax 33-04-468664899
| |
Collapse
|
36
|
Messing J, Dooner HK. Organization and variability of the maize genome. CURRENT OPINION IN PLANT BIOLOGY 2006; 9:157-63. [PMID: 16459130 DOI: 10.1016/j.pbi.2006.01.009] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2005] [Accepted: 01/24/2006] [Indexed: 05/06/2023]
Abstract
With a size approximating that of the human genome, the maize genome is about to become the largest plant genome yet sequenced. Contributing to that size are a whole-genome duplication event and a retrotransposition explosion that produced a large amount of repetitive DNA. This DNA is greatly under-represented in cDNA collections, so analysis of the maize transcriptome has been an expedient way of assessing the gene content of maize. Over 2 million maize cDNA sequences are now available, making maize the third most widely studied organism, behind mouse and man. To date, the sequencing of large-sized DNA clones has been largely driven by the genetic interests of different investigators. The recent construction of a physical map that is anchored to the genetic map will aid immensely in the maize genome-sequencing effort. However, studies showing that the repetitive DNA component is highly polymorphic among maize inbred lines point to the need to sample vertically a few specific regions of the genome to evaluate the extent and importance of this variability.
Collapse
Affiliation(s)
- Joachim Messing
- Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, 190 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
| | | |
Collapse
|
37
|
Rabinowicz PD, Bennetzen JL. The maize genome as a model for efficient sequence analysis of large plant genomes. CURRENT OPINION IN PLANT BIOLOGY 2006; 9:149-56. [PMID: 16459129 DOI: 10.1016/j.pbi.2006.01.015] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2006] [Accepted: 01/20/2006] [Indexed: 05/06/2023]
Abstract
The genomes of flowering plants vary in size from about 0.1 to over 100 gigabase pairs (Gbp), mostly because of polyploidy and variation in the abundance of repetitive elements in intergenic regions. High-quality sequences of the relatively small genomes of Arabidopsis (0.14 Gbp) and rice (0.4 Gbp) have now been largely completed. The sequencing of plant genomes that have a more representative size (the mean for flowering plant genomes is 5.6 Gbp) has been seen as a daunting task, partly because of their size and partly because of the numerous highly conserved repeats. Nevertheless, creative strategies and powerful new tools have been generated recently in the plant genetics community, so that sequencing large plant genomes is now a realistic possibility. Maize (2.4-2.7 Gbp) will be the first gigabase-size plant genome to be sequenced using these novel approaches. Pilot studies on maize indicate that the new gene-enrichment, gene-finishing and gene-orientation technologies are efficient, robust and comprehensive. These strategies will succeed in sequencing the gene-space of large genome plants, and in locating all of these genes and adjacent sequences on the genetic and physical maps.
Collapse
Affiliation(s)
- Pablo D Rabinowicz
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, Maryland 20850, USA
| | | |
Collapse
|
38
|
Ma J, Bennetzen JL. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice. Proc Natl Acad Sci U S A 2006; 103:383-8. [PMID: 16381819 PMCID: PMC1326179 DOI: 10.1073/pnas.0509810102] [Citation(s) in RCA: 109] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | | |
Collapse
|
39
|
Devos KM, Ma J, Pontaroli AC, Pratt LH, Bennetzen JL. Analysis and mapping of randomly chosen bacterial artificial chromosome clones from hexaploid bread wheat. Proc Natl Acad Sci U S A 2005; 102:19243-8. [PMID: 16357197 PMCID: PMC1323192 DOI: 10.1073/pnas.0509473102] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The current view of wheat genome composition is that genes are compartmentalized into gene-rich and gene-poor regions. This model can be tested by analyzing randomly selected bacterial artificial chromosome (BAC) clones for gene content, followed by placement of these BACs onto physical and genetic maps. Map localization could be difficult for BACs that consist entirely of repeated elements. We therefore developed a technique where repeat junctions are used to generate unique markers. Four BAC clones from hexaploid wheat variety Chinese Spring were randomly selected and sequenced at 4- to 6-fold redundancy. About 50% of the BAC sequences corresponded to previously identified repeats, mainly LTR-retrotransposons, whereas most of the remaining DNA consisted of sequences with unknown origin or function. The average gene content was <1%, although each BAC contained one or two identified genes. Repeat boundaries were amplified and used to map each clone to a chromosome arm. Extrapolation from wheat-rice comparative knowledge suggests that three of the four BAC clones originate from "gene-rich" regions of the wheat genome. Nevertheless, because these BACs carry only a single gene (two BACs) or two genes (one BAC), the predicted gene density is approximately 1 gene per 75 kb, which is considerably lower than previously estimated gene densities (one gene per 5-20 kb) for gene-rich regions in wheat. This analysis of randomly selected wheat BAC clones suggests that genes are more evenly distributed in wheat than previously believed and substantiates the need for large-scale random BAC sequencing to determine wheat genome organization.
Collapse
Affiliation(s)
- Katrien M Devos
- Department of Crop and Soil Sciences, University of Georgia, Athens, GA 30602, USA.
| | | | | | | | | |
Collapse
|
40
|
Ma J, Jackson SA. Retrotransposon accumulation and satellite amplification mediated by segmental duplication facilitate centromere expansion in rice. Genome Res 2005; 16:251-9. [PMID: 16354755 PMCID: PMC1361721 DOI: 10.1101/gr.4583106] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The abundance of repetitive DNA varies greatly across centromeres within an individual or between different organisms. To shed light on the molecular mechanisms of centromere repeat proliferation, we performed structural analysis of LTR-retrotransposons, mostly centromere retrotransposons of rice (CRRs), and phylogenetic analysis of CentO satellite repeats harbored in the core region of the rice chromosome 4 centromere (CEN4). The data obtained demonstrate that the CRRs in the centromeric region we investigated have been enriched more significantly by recent rounds of segmental duplication than by original integration of active elements, suggesting that segmental duplication is an important process for CRR accumulation in the centromeric region. Our results also indicate that segmental duplication of large arrays of satellite repeats is primarily responsible for the amplification of satellite repeats, contributing to rapid reshuffling of CentO satellites. Intercentromere satellite homogenization was revealed by genome-wide comparison of CentO satellite monomers. However, a 10-bp duplication present in nearly half of the CEN4 monomers was found to be completely absent in rice centromere 8 (CEN8), suggesting that CEN4 and CEN8 may represent two different stages in the evolution of rice centromeres. These observations, obtained from the only complex eukaryotic centromeres to have been completely sequenced thus far, depict the evolutionary dynamics of rice centromeres with respect to the nature, timing, and process of centromeric repeat amplification.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|