151
|
Kawabe A, Hansson B, Hagenblad J, Forrest A, Charlesworth D. Centromere locations and associated chromosome rearrangements in Arabidopsis lyrata and A. thaliana. Genetics 2006; 173:1613-9. [PMID: 16648590 PMCID: PMC1526690 DOI: 10.1534/genetics.106.057182] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We analyzed linkage and chromosomal positions of genes in A. lyrata ssp. petraea that are located near the centromere (CEN) regions of A. thaliana, using at least two genes from the short and long arms of each chromosome. In our map, genes from all 10 A. thaliana chromosome arms are also tightly linked in A. lyrata. Genes from the regions on the two sides of CEN5 have distant map localizations in A. lyrata (genes on the A. thaliana short-arm genes are on linkage group AL6, and long-arm genes are on AL7), but genes from the other four A. thaliana centromere regions remain closely linked in A. lyrata. The observation of complete linkage between short- and long-arm centromere genes, but not between genes in other genome regions that are separated by similar physical distances, suggests that crossing-over frequencies near the A. lyrata ssp. petraea centromere regions are low, as in A. thaliana. Thus, the centromere positions appear to be conserved between A. thaliana and A. lyrata, even though three centromeres have been lost in A. thaliana, and the core satellite sequences in the two species are very different. We can now definitively identify the three centromeres that were eliminated in the fusions that formed the A. thaliana chromosomes. However, we cannot tell whether genes were lost along with these centromeres, because such genes are absent from the A. thaliana genome, which is the sole source of markers for our mapping.
Collapse
Affiliation(s)
- Akira Kawabe
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
| | | | | | | | | |
Collapse
|
152
|
Murata M, Shibata F, Yokota E. The origin, meiotic behavior, and transmission of a novel minichromosome in Arabidopsis thaliana. Chromosoma 2006; 115:311-9. [PMID: 16607510 DOI: 10.1007/s00412-005-0045-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2005] [Revised: 12/09/2005] [Accepted: 12/12/2005] [Indexed: 11/28/2022]
Abstract
A plant carrying a small extra chromosome was found in Landsberg erecta ecotype of Arabidopsis thaliana. Fluorescence in situ hybridization revealed that this minichromosome was derived from the short arm of chromosome 4. The size of this "mini4S" chromosome was estimated to be approximately 7.5 Mb on the basis of previously reported data and the amount of the centromeric major satellite (180-bp family) present, which was determined to be about 1 Mb, or about one third of that in the normal chromosome 4. No pairing between mini4S and its original chromosome 4 was observed at pachytene and metaphase I stages. The transmission of mini4S through pollen was limited, but about 30% of selfed progeny carried the mini4S chromosomes. The transmission rates considerably increased when the mini4S chromosomes were transferred to plants with a Columbia background by successive backcrosses. This suggests that the stability of the minichromosomes is controlled genetically by factors that can vary between ecotypes.
Collapse
Affiliation(s)
- Minoru Murata
- Research Institute for Bioresources, Okayama University, Kurashiki, 710-0046, Japan.
| | | | | |
Collapse
|
153
|
Morgante M. Plant genome organisation and diversity: the year of the junk! Curr Opin Biotechnol 2006; 17:168-73. [PMID: 16530402 DOI: 10.1016/j.copbio.2006.03.001] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2006] [Revised: 02/28/2006] [Accepted: 03/02/2006] [Indexed: 01/21/2023]
Abstract
Having gained a thorough understanding of the structure and organization of model plant genomes, such as those of Arabidopsis thaliana and rice, we have now started to investigate the most interesting aspect of genome structure - its variations. Variation in DNA sequence is responsible for the genetic component of phenotypic variation (i.e. the component upon which both natural and artificial selection act). Recent studies have started to shed light on sequence variation outside of the genic regions, owing mainly to large insertion/deletion (indel) polymorphisms caused by the presence or absence of transposable elements of different classes. In addition to long terminal repeat retrotransposons, DNA transposons have been shown to be responsible for these polymorphisms. These comprise Helitrons, CACTA and Mu-like elements that are capable of acquiring and piecing together fragments of plant genes and are often expressed. Future analyses of the functional roles of intergenic sequence variation will tell us if we will need to pay more attention not only to genes, but also to the 'junk' DNA surrounding them.
Collapse
Affiliation(s)
- Michele Morgante
- Dipartimento di Scienze Agrarie ed Ambientali, Universita' di Udine, Via delle Scienze 208, I-33100 Udine, Italy.
| |
Collapse
|
154
|
Wang Y, Tang X, Cheng Z, Mueller L, Giovannoni J, Tanksley SD. Euchromatin and pericentromeric heterochromatin: comparative composition in the tomato genome. Genetics 2006; 172:2529-40. [PMID: 16489216 PMCID: PMC1456407 DOI: 10.1534/genetics.106.055772] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2005] [Accepted: 02/06/2006] [Indexed: 01/13/2023] Open
Abstract
Eleven sequenced BACs were annotated and localized via FISH to tomato pachytene chromosomes providing the first global insights into the compositional differences of euchromatin and pericentromeric heterochromatin in this model dicot species. The results indicate that tomato euchromatin has a gene density (6.7 kb/gene) similar to that of Arabidopsis and rice. Thus, while the euchromatin comprises only 25% of the tomato nuclear DNA, it is sufficient to account for approximately 90% of the estimated 38,000 nontransposon genes that compose the tomato genome. Moreover, euchromatic BACs were largely devoid of transposons or other repetitive elements. In contrast, BACs assigned to the pericentromeric heterochromatin had a gene density 10-100 times lower than that of the euchromatin and are heavily populated by retrotransposons preferential to the heterochromatin-the most abundant transposons belonging to the Jinling Ty3/gypsy-like retrotransposon family. Jinling elements are highly methylated and rarely transcribed. Nonetheless, they have spread throughout the pericentromeric heterochromatin in tomato and wild tomato species fairly recently-well after tomato diverged from potato and other related solanaceous species. The implications of these findings on evolution and on sequencing the genomes of tomato and other solanaceous species are discussed.
Collapse
Affiliation(s)
- Ying Wang
- Department of Plant Breeding and Genetics, U.S. Department of Agriculture--Agricultural Research Service, Plant, Soil, and Nutrition Lab, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | | | |
Collapse
|
155
|
Paterson AH. Leafing through the genomes of our major crop plants: strategies for capturing unique information. Nat Rev Genet 2006; 7:174-84. [PMID: 16485017 DOI: 10.1038/nrg1806] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Crop plants not only have economic significance, but also comprise important botanical models for evolution and development. This is reflected by the recent increase in the percentage of publicly available sequence data that are derived from angiosperms. Further genome sequencing of the major crop plants will offer new learning opportunities, but their large, repetitive, and often polyploid genomes present challenges. Reduced-representation approaches - such as EST sequencing, methyl filtration and Cot-based cloning and sequencing - provide increased efficiency in extracting key information from crop genomes without full-genome sequencing. Combining these methods with phylogenetically stratified sampling to allow comparative genomic approaches has the potential to further accelerate progress in angiosperm genomics.
Collapse
Affiliation(s)
- Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, Georgia 30602, USA.
| |
Collapse
|
156
|
Wong NC, Wong LH, Quach JM, Canham P, Craig JM, Song JZ, Clark SJ, Choo KHA. Permissive transcriptional activity at the centromere through pockets of DNA hypomethylation. PLoS Genet 2006; 2:e17. [PMID: 16477312 PMCID: PMC1361766 DOI: 10.1371/journal.pgen.0020017] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2005] [Accepted: 12/29/2005] [Indexed: 02/02/2023] Open
Abstract
DNA methylation is a hallmark of transcriptional silencing, yet transcription has been reported at the centromere. To address this apparent paradox, we employed a fully sequence-defined ectopic human centromere (or neocentromere) to investigate the relationship between DNA methylation and transcription. We used sodium bisulfite PCR and sequencing to determine the methylation status of 2,041 CpG dinucleotides distributed across a 6.76-Mbp chromosomal region containing a neocentromere. These CpG dinucleotides were associated with conventional and nonconventional CpG islands. We found an overall hypermethylation of the neocentric DNA at nonconventional CpG islands that we designated as CpG islets and CpG orphans. The observed hypermethylation was consistent with the presence of a presumed transcriptionally silent chromatin state at the neocentromere. Within this neocentric chromatin, specific sites of active transcription and the centromeric chromatin boundary are defined by DNA hypomethylation. Our data demonstrate, for the first time to our knowledge, a correlation between DNA methylation and centromere formation in mammals, and that transcription and “chromatin-boundary activity” are permissible at the centromere through the selective hypomethylation of pockets of sequences without compromising the overall silent chromatin state and function of the centromere. The centromere is a chromosomal structure that is vital for the correct partitioning of chromosomes during cell division. Recent studies in a number of different species have shown that transcription is permissible within the centromere, but the mode of transcription regulation at the centromere remains unclear. DNA methylation is a well-characterized mechanism for the genomic regulation of transcription. Here, the authors investigate the relationship between DNA methylation and transcription activity at a functional human centromere. They demonstrate a high level of DNA methylation across the centromere but identify pockets of DNA sequences within the methylated domain that are non-methylated. These pockets correspond to sites of transcription and/or boundaries that separate major centromeric chromatin sub-domains. This study shows the complexity of the centromere as it uses DNA methylation to both maintain a tight chromatin structure and to allow transcription to occur.
Collapse
Affiliation(s)
- Nicholas C Wong
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Parkville, Victoria, Australia
| | - Lee H Wong
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Parkville, Victoria, Australia
| | - Julie M Quach
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
| | - Paul Canham
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
| | - Jeffrey M Craig
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Parkville, Victoria, Australia
| | - Jenny Z Song
- Epigenetics Group, the Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
| | - Susan J Clark
- Epigenetics Group, the Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
| | - K. H. Andy Choo
- Chromosome Research Laboratory, Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Parkville, Victoria, Australia
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
157
|
Abstract
Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.
Collapse
Affiliation(s)
- Anne E Hall
- Howard Hughes Medical Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | | | | |
Collapse
|
158
|
Ma J, Bennetzen JL. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice. Proc Natl Acad Sci U S A 2006; 103:383-8. [PMID: 16381819 PMCID: PMC1326179 DOI: 10.1073/pnas.0509810102] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | | |
Collapse
|
159
|
Ma J, Jackson SA. Retrotransposon accumulation and satellite amplification mediated by segmental duplication facilitate centromere expansion in rice. Genome Res 2005; 16:251-9. [PMID: 16354755 PMCID: PMC1361721 DOI: 10.1101/gr.4583106] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The abundance of repetitive DNA varies greatly across centromeres within an individual or between different organisms. To shed light on the molecular mechanisms of centromere repeat proliferation, we performed structural analysis of LTR-retrotransposons, mostly centromere retrotransposons of rice (CRRs), and phylogenetic analysis of CentO satellite repeats harbored in the core region of the rice chromosome 4 centromere (CEN4). The data obtained demonstrate that the CRRs in the centromeric region we investigated have been enriched more significantly by recent rounds of segmental duplication than by original integration of active elements, suggesting that segmental duplication is an important process for CRR accumulation in the centromeric region. Our results also indicate that segmental duplication of large arrays of satellite repeats is primarily responsible for the amplification of satellite repeats, contributing to rapid reshuffling of CentO satellites. Intercentromere satellite homogenization was revealed by genome-wide comparison of CentO satellite monomers. However, a 10-bp duplication present in nearly half of the CEN4 monomers was found to be completely absent in rice centromere 8 (CEN8), suggesting that CEN4 and CEN8 may represent two different stages in the evolution of rice centromeres. These observations, obtained from the only complex eukaryotic centromeres to have been completely sequenced thus far, depict the evolutionary dynamics of rice centromeres with respect to the nature, timing, and process of centromeric repeat amplification.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
160
|
Fang Y, Spector DL. Centromere positioning and dynamics in living Arabidopsis plants. Mol Biol Cell 2005; 16:5710-8. [PMID: 16195344 PMCID: PMC1289415 DOI: 10.1091/mbc.e05-08-0706] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2005] [Revised: 09/13/2005] [Accepted: 09/15/2005] [Indexed: 11/11/2022] Open
Abstract
The organization and dynamics of the genome have been shown to influence gene expression in many organisms. Data from mammalian tissue culture cells have provided conflicting conclusions with regard to the extent to which chromatin organization is inherited from mother to daughter nuclei. To gain insight into chromatin organization and dynamics, we developed transgenic Arabidopsis lines in which centromeres were tagged with a green fluorescent protein fusion of the centromere-specific histone H3. Using four-dimensional (4-D) live cell imaging, we show that Arabidopsis centromeres are constrained at the nuclear periphery during interphase and that the organization of endoreduplicated sister centromeres is cell type dependent with predominant clustering in root epidermal cells and dispersion in leaf epidermal cells. 4-D tracking of the entire set of centromeres through mitosis, in growing root meristematic cells, demonstrated that global centromere position is not precisely transmitted from the mother cell to daughter cells. These results provide important insight into our understanding of chromatin organization among different cells of a living organism.
Collapse
Affiliation(s)
- Yuda Fang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | | |
Collapse
|
161
|
Neves N, Delgado M, Silva M, Caperta A, Morais-Cecílio L, Viegas W. Ribosomal DNA heterochromatin in plants. Cytogenet Genome Res 2005; 109:104-11. [PMID: 15753565 DOI: 10.1159/000082388] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2003] [Accepted: 02/19/2004] [Indexed: 11/19/2022] Open
Abstract
The aim of this review is to integrate earlier results and recent findings to present the current state-of-the-art vision concerning the dynamic behavior of the ribosomal DNA (rDNA) fraction in plants. The global organization and behavioral features of rDNA make it a most useful system to analyse the relationship between chromatin topology and gene expression patterns. Correlations between several heterochromatin fractions and rDNA arrays demonstrate the heterochromatic nature of the rDNA and reveal the importance of the genomic environment and of developmental controls in modulating its dynamics.
Collapse
Affiliation(s)
- N Neves
- Secção de Genética, Centro de Botânica Aplicada à Agricultura, Instituto Superior de Agronomia, Lisboa, Portugal
| | | | | | | | | | | |
Collapse
|
162
|
Barbazuk WB, Bedell JA, Rabinowicz PD. Reduced representation sequencing: a success in maize and a promise for other plant genomes. Bioessays 2005; 27:839-48. [PMID: 16015589 DOI: 10.1002/bies.20262] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Plant, and particularly cereal genomes, are challenging to sequence due to their large size and high repetitive DNA content. Gene-enrichment strategies are alternative or complementary approaches to complete genome sequencing that yield, rapidly and inexpensively, useful sequence data from large and complex genomes. The maize genome is large (2.7 Gbp) and contains large amounts of conserved repetitive elements. Furthermore, the high allelic diversity found between maize inbred lines may necessitate sequencing several inbred lines in order to recover the maize "gene pool". Two gene-enrichment approaches, methylation filtration (MF) and high C(o)t (HC) sequencing have been tested in maize and their ability to sample the gene space has been examined. Combined with other genomic sequencing strategies, gene-enriched genomic sequencing is a practical way to examine the maize gene pool, to order and orient the genic sequences on the genome, and to enable investigation of gene content of other complex plant genomes.
Collapse
Affiliation(s)
- W Brad Barbazuk
- Donald Danforth Plant Science Center, St. Louis, MO 63132, USA.
| | | | | |
Collapse
|
163
|
Mondragon-Palomino M, Gaut BS. Gene conversion and the evolution of three leucine-rich repeat gene families in Arabidopsis thaliana. Mol Biol Evol 2005; 22:2444-56. [PMID: 16120808 DOI: 10.1093/molbev/msi241] [Citation(s) in RCA: 108] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The high number of duplicated genes in plant genomes provides a potential template for gene conversion and unequal crossing-over. Within a gene family these two processes can render all members homogeneous or generate diversity by reassorting variants among paralogs. The latter is especially feasible in families where gene diversity confers a selective advantage and thus conversion events are likely to be retained. Consequently, the most complete record of gene conversion is expected to be most evident in gene families commonly subjected to positive selection. Here, we describe the extent and characteristics of gene conversion and unequal crossing-over in the coding and noncoding regions of nucleotide-binding site leucine-rich repeat (NBS-LRR), receptor-like kinases (RLK), and receptor-like proteins (RLP) in the plant Arabidopsis thaliana. Members of these three gene families are associated with disease resistance and their pathogen-recognition domain is a documented target of positive selection. Our bioinformatic approach to study the major family features that may influence gene conversion revealed that in these families there is a significant association between the occurrence of gene conversion and high levels of sequence similarity, close physical clustering, gene orientation, and recombination rate. We discuss these results in the context of the overlap between gene conversion and positive selection during the evolutionary expansion of the NBS-LRR, RLK, and RLP gene families.
Collapse
|
164
|
Chen LL, Gao F. Detection of nucleolar organizer and mitochondrial DNA insertion regions based on the isochore map of Arabidopsis thaliana. FEBS J 2005; 272:3328-36. [PMID: 15978039 DOI: 10.1111/j.1742-4658.2005.04748.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Eukaryotic genomes are composed of isochores, i.e. long sequences relatively homogeneous in GC content. In this paper, the isochore structure of Arabidopsis thaliana genome has been studied using a windowless technique based on the Z curve method and intuitive curves are drawn for all the five chromosomes. Using these curves, we can calculate the GC content at any resolution, even at the base level. It is observed that all the five chromosomes are composed of several GC-rich and AT-rich regions alternatively. Usually, these regions, named 'isochore-like regions', have large fluctuations in the GC content. Five isochores with little fluctuations are also observed. Detailed analyses have been performed for these isochores. A GC-rich 'isochore-like region' and a GC-isochore in chromosome II and IV, respectively, are the nucleolar organizer regions (NORs), and genes located in the two regions prefer to use GC-ending codons. Another GC-isochore located in chromosome II is a mitochondrial DNA insertion region, the position and size of this region is precisely predicted by the current method. The amino acid usage and codon preference of genes in this organellar-to-nuclear transfer region show significant difference from other regions. Moreover, the centromeres are located in GC-rich 'isochore-like regions' in all the five chromosomes. The current method can provide a useful tool for analyzing whole genomic sequences of eukaryotes.
Collapse
Affiliation(s)
- Ling-Ling Chen
- Laboratory for Computational Biology, Shandong Provincial Research Center for Bioinformatic Engineering and Techniques, Shandong University of Technology, Zibo, China.
| | | |
Collapse
|
165
|
Abstract
There are clear theoretical reasons and many well-documented examples which show that repetitive, DNA is essential for genome function. Generic repeated signals in the DNA are necessary to format expression of unique coding sequence files and to organise additional functions essential for genome replication and accurate transmission to progeny cells. Repetitive DNA sequence elements are also fundamental to the cooperative molecular interactions forming nucleoprotein complexes. Here, we review the surprising abundance of repetitive DNA in many genomes, describe its structural diversity, and discuss dozens of cases where the functional importance of repetitive elements has been studied in molecular detail. In particular, the fact that repeat elements serve either as initiators or boundaries for heterochromatin domains and provide a significant fraction of scaffolding/matrix attachment regions (S/MARs) suggests that the repetitive component of the genome plays a major architectonic role in higher order physical structuring. Employing an information science model, the 'functionalist' perspective on repetitive DNA leads to new ways of thinking about the systemic organisation of cellular genomes and provides several novel possibilities involving repeat elements in evolutionarily significant genome reorganisation. These ideas may facilitate the interpretation of comparisons between sequenced genomes, where the repetitive DNA component is often greater than the coding sequence component.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago, 920 E. 58th Street, Chicago, IL 60637, USA.
| | | |
Collapse
|
166
|
Yasuhara JC, DeCrease CH, Wakimoto BT. Evolution of heterochromatic genes of Drosophila. Proc Natl Acad Sci U S A 2005; 102:10958-63. [PMID: 16033869 PMCID: PMC1176909 DOI: 10.1073/pnas.0503424102] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Heterochromatin is generally associated with gene silencing, yet in Drosophila melanogaster, heterochromatin harbors hundreds of functional protein-encoding genes, some of which depend on heterochromatin for expression. Here we document a recent evolutionary transition of a gene cluster from euchromatin to heterochromatin, which occurred <20 million years ago in the drosophilid lineage. This finding reveals evolutionary fluidity between these two genomic compartments and provides a powerful approach to identifying differences between euchromatic and heterochromatic genes. Promoter mapping of orthologous gene pairs led to the discovery of the "slippery promoter," characterized by multiple transcriptional start sites predominantly at adenines, as a common promoter type found in both heterochromatic and euchromatic genes of Drosophila. Promoter type is diverse within the gene cluster but largely conserved between heterochromatic and euchromatic genes, eliminating the hypothesis that adaptation to heterochromatin required major alterations in promoter structure. Transition to heterochromatin is consistently associated with gene expansion due to the accumulation of transposable elements and increased A-T content. We conclude that heterochromatin-dependent regulation requires specialized enhancers or higher-order interactions and propose a facilitating role for transposable elements.
Collapse
Affiliation(s)
- Jiro C Yasuhara
- Department of Biology, University of Washington, Box 351800, Seattle, WA 98195, USA
| | | | | |
Collapse
|
167
|
Guyot R, Cheng X, Su Y, Cheng Z, Schlagenhauf E, Keller B, Ling HQ. Complex organization and evolution of the tomato pericentromeric region at the FER gene locus. PLANT PHYSIOLOGY 2005; 138:1205-15. [PMID: 16009996 PMCID: PMC1176395 DOI: 10.1104/pp.104.058099] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Tomato (Lycopersicon esculentum) is a model species for molecular biology research and a candidate for large-scale genome sequencing. Pericentromeric heterochromatin constitutes a large portion of the tomato chromosomes. However, the knowledge of the structure, organization, and evolution of such regions remains very limited. Here, we report the analysis of a 198-kb sequence near the FER gene, located in a distal part of pericentromeric heterochromatin on the long arm of tomato chromosome 6. Nine genes, one pseudogene, and 55 transposable elements (TEs) were identified, showing a low gene density (19.8 kb/gene) and a high content of transposable elements (>45% of the sequence). Six genes (56 B23_g3, g5, g7, g8, g9, and g10) have perfect matches (>98% identity) with tomato expressed sequence tags. Two genes (56 B23_g1 and g6), which share <98% sequence identity with expressed sequence tags, were confirmed for transcriptional activity by reverse transcription-PCR. The genes were not uniformly distributed along the sequence and grouped into gene islands separated by stretches of retrotransposons, forming a pattern similar to that found in the gene-rich regions of the large genomes of maize (Zea mays) and Triticeae. Long terminal repeat retrotransposons account for 60% of the TE sequence length. Sixteen of 55 TEs were completely new and remain unclassified. Surprisingly, five of the seven identified DNA transposons were closely associated with coding regions. The action of transposable elements and DNA rearrangements form the molecular basis of the dynamic genome evolution at the FER locus. Multiple rounds of genome duplication in Arabidopsis (Arabidopsis thaliana) and subsequent gene loss have generated a mosaic pattern of conservation between tomato and Arabidopsis orthologous sequences. Our data show that the distal parts of pericentromeric heterochromatin may contain many valuable genes and that these regions form an evolutionary active part of the tomato genome.
Collapse
Affiliation(s)
- Romain Guyot
- Institute of Plant Biology, University of Zurich, 8008 Zurich, Switzerland
| | | | | | | | | | | | | |
Collapse
|
168
|
Li J, Thomson M, McCouch SR. Fine mapping of a grain-weight quantitative trait locus in the pericentromeric region of rice chromosome 3. Genetics 2005; 168:2187-95. [PMID: 15611185 PMCID: PMC1448733 DOI: 10.1534/genetics.104.034165] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
As the basis for fine mapping of a grain-weight QTL, gw3.1, a set of near isogenic lines (NILs), was developed from an Oryza sativa, cv. Jefferson x O. rufipogon (IRGC105491) population based on five generations of backcrossing and seven generations of selfing. Despite the use of an interspecific cross for mapping and the pericentromeric location of the QTL, we observed no suppression of recombination and have been able to narrow down the location of the gene underlying this QTL to a 93.8-kb region. The locus was associated with transgressive variation for grain size and grain weight in this population and features prominently in many other inter- and intraspecific crosses of rice. The phenotype was difficult to evaluate due to the large amount of variance in size and weight among grains on a panicle and between grains on primary and secondary panicles, underscoring the value of using multiple approaches to phenotyping, including extreme sampling and NIL group-mean comparisons. The fact that a QTL for kernel size has also been identified in a homeologous region of maize chromosome 1 suggests that this locus, in which the dominant O. rufipogon allele confers small seed size, may be associated with domestication in cereals.
Collapse
Affiliation(s)
- Jiming Li
- Department of Plant Breeding, Cornell University, Ithaca, New York 14853, USA
| | | | | |
Collapse
|
169
|
Yang TJ, Lee S, Chang SB, Yu Y, de Jong H, Wing RA. In-depth sequence analysis of the tomato chromosome 12 centromeric region: identification of a large CAA block and characterization of pericentromere retrotranposons. Chromosoma 2005; 114:103-17. [PMID: 15965704 DOI: 10.1007/s00412-005-0342-8] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2005] [Revised: 03/21/2005] [Accepted: 03/21/2005] [Indexed: 11/30/2022]
Abstract
We sequenced a continuous 326-kb DNA stretch of a microscopically defined centromeric region of tomato chromosome 12. A total of 84% of the sequence (270 kb) was composed of a nested complex of repeat sequences including 27 retrotransposons, two transposable elements, three MITEs, two terminal repeat retrotransposons in miniature (TRIMs), ten unclassified repeats and three chloroplast DNA insertions. The retrotransposons were grouped into three families of Ty3-Gypsy type long terminal repeat (LTR) retrotransposons (PCRT1-PCRT3) and one LINE-like retrotransposon (PCRT4). High-resolution fluorescence in situ hybridization analyses on pachytene complements revealed that PCRT1a occurs on the pericentromere heterochromatin blocks. PCRT1 was the prevalent retrotransposon family occupying more than 60% of the 326-kb sequence with 19 members grouped into eight subfamilies (PCRT1a-PCRT1h) based on LTR sequence. The PCRT1a subfamily is a rapidly amplified element occupying tens of megabases. The other PCRT1 subfamilies (PCRT1b-PCRT1h) were highly degenerated and interrupted by insertions of other elements. The PCRT1 family shows identity with a previously identified tomato-specific repeat TGR2 and a CENP-B like sequence. A second previously described genomic repeat, TGR3, was identified as a part of the LTR sequence of an Athila-like PCRT2 element of which four copies were found in the 326-kb stretch. A large block of trinucleotide microsatellite (CAA)n occupies the centromere and large portions of the flanking pericentromere heterochromatin blocks of chromosome 12 and most of the other chromosomes. Five putative genes in the remaining 14% of the centromere region were identified, of which one is similar to a transcription regulator (ToCPL1) and a candidate jointless-2 gene.
Collapse
Affiliation(s)
- Tae-Jin Yang
- Brassica Genomics Team, National Institute of Agricultural Biotechnology, RDA, Suwon 441-707, South Korea
| | | | | | | | | | | |
Collapse
|
170
|
Hall SE, Luo S, Hall AE, Preuss D. Differential rates of local and global homogenization in centromere satellites from Arabidopsis relatives. Genetics 2005; 170:1913-27. [PMID: 15937135 PMCID: PMC1449784 DOI: 10.1534/genetics.104.038208] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Higher eukaryotic centromeres contain thousands of satellite repeats organized into tandem arrays. As species diverge, new satellite variants are homogenized within and between chromosomes, yet the processes by which particular sequences are dispersed are poorly understood. Here, we isolated and analyzed centromere satellites in plants separated from Arabidopsis thaliana by 5-20 million years, uncovering more rapid satellite divergence compared to primate alpha-satellite repeats. We also found that satellites derived from the same genomic locus were more similar to each other than satellites derived from disparate genomic regions, indicating that new sequence alterations were homogenized more efficiently at a local, rather than global, level. Nonetheless, the presence of higher-order satellite arrays, similar to those identified in human centromeres, indicated limits to local homogenization and suggested that sequence polymorphisms may play important functional roles. In two species, we defined more extensive polymorphisms, identifying physically separated and highly distinct satellite types. Taken together, these data show that there is a balance between plant satellite homogenization and the persistence of satellite variants. This balance could ultimately generate sufficient sequence divergence to cause mating incompatibilities between plant species, while maintaining adequate conservation within a species for centromere activity.
Collapse
MESH Headings
- Amino Acid Sequence
- Arabidopsis/genetics
- Base Sequence
- Centromere/genetics
- Chromatin Immunoprecipitation
- Consensus Sequence
- DNA, Plant/analysis
- DNA, Satellite/genetics
- DNA, Satellite/metabolism
- Fluorescein-5-isothiocyanate
- Fluorescent Antibody Technique, Direct
- Fluorescent Dyes
- Genome, Plant
- Heterochromatin/metabolism
- In Situ Hybridization, Fluorescence
- Indoles
- Microscopy, Fluorescence
- Molecular Sequence Data
- Phylogeny
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
Collapse
Affiliation(s)
- Sarah E Hall
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | |
Collapse
|
171
|
Kendal WS, Suomela BP. Large-scale genomic correlations in Arabidopsis thaliana relate to chromosomal structure. BMC Genomics 2005; 6:82. [PMID: 15932642 PMCID: PMC1166552 DOI: 10.1186/1471-2164-6-82] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2005] [Accepted: 06/02/2005] [Indexed: 11/10/2022] Open
Abstract
Background The chromosomes of the plant Arabidopsis thaliana contain various genomic elements, distributed with appreciable spatial heterogeneity. Clustering of and/or correlations between these elements presumably should reflect underlying functional or structural factors. We studied the positional density fluctuations and correlations between genes, indels, single nucleotide polymorphisms (SNPs), retrotransposons, 180 bp tandem repeats, and conserved centromeric sequences (CCSs) in Arabidopsis in order to elucidate any patterns and possible responsible factors for their genomic distributions. Results The spatial distributions of all these elements obeyed a common pattern: the density profiles of each element within chromosomes exhibited low-frequency fluctuations indicative of regional clustering, and the individual density profiles tended to correlate with each other at large measurement scales. This pattern could be attributed to the influence of major chromosomal structures, such as centromeres. At smaller scales the correlations tended to weaken – evidence that localized cis-interactions between the different elements had a comparatively minor, if any, influence on their placement. Conclusion The conventional notion that retrotransposon insertion sites are strongly influenced by cis-interactions was not supported by these observations. Moreover, we would propose that large-scale chromosomal structure has a dominant influence on the intrachromosomal distributions of genomic elements, and provides for an additional shared hierarchy of genomic organization within Arabidopsis.
Collapse
Affiliation(s)
- Wayne S Kendal
- Division of Radiation Oncology, The Ottawa Hospital Regional Cancer Centre, Ottawa, Ontario, K1H 1C4 Canada
- The Ottawa Hospital Research Institute, Ottawa, Ontario, K1H 8L6 Canada
| | - Brian P Suomela
- Ontario Genomics Innovation Centre, Ottawa, Ontario, K1H 8L6 Canada
| |
Collapse
|
172
|
Abstract
High-diversity genes represent an important class of loci in organismal genomes. Since elevated levels of nucleotide variation are a key component of the molecular signature for balancing selection or local adaptation, high-diversity genes may represent loci whose alleles are selectively maintained as balanced polymorphisms. Comparison of 4300 random shotgun sequence fragments of the Arabidopsis thaliana Ler ecotype genome with the whole genomic sequence of the Col-0 ecotype identified 60 genes with putatively high levels of intraspecific variability. Eleven of these genes were sequenced in multiple A. thaliana accessions, 3 of which were found to display elevated levels of nucleotide polymorphism. These genes encode the myb-like transcription factor MYB103, a putative soluble starch synthase I, and a homeodomain-leucine zipper transcription factor. Analysis of these genes and 4-7 flanking genes in 14-20 A. thaliana ecotypes revealed that two of these loci show other characteristics of balanced polymorphisms, including broad peaks of nucleotide diversity spanning multiple linked genes and an excess of intermediate-frequency polymorphisms. Scanning genomes for high-diversity genomic regions may be useful in approaches to adaptive trait locus mapping for uncovering candidate balanced polymorphisms.
Collapse
Affiliation(s)
- Jennifer M Cork
- Department of Genetics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | |
Collapse
|
173
|
Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Schölkopf B, Weigel D, Lohmann JU. A gene expression map of Arabidopsis thaliana development. Nat Genet 2005; 37:501-6. [PMID: 15806101 DOI: 10.1038/ng1543] [Citation(s) in RCA: 1794] [Impact Index Per Article: 94.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2004] [Accepted: 02/24/2005] [Indexed: 11/09/2022]
Abstract
Regulatory regions of plant genes tend to be more compact than those of animal genes, but the complement of transcription factors encoded in plant genomes is as large or larger than that found in those of animals. Plants therefore provide an opportunity to study how transcriptional programs control multicellular development. We analyzed global gene expression during development of the reference plant Arabidopsis thaliana in samples covering many stages, from embryogenesis to senescence, and diverse organs. Here, we provide a first analysis of this data set, which is part of the AtGenExpress expression atlas. We observed that the expression levels of transcription factor genes and signal transduction components are similar to those of metabolic genes. Examining the expression patterns of large gene families, we found that they are often more similar than would be expected by chance, indicating that many gene families have been co-opted for specific developmental processes.
Collapse
Affiliation(s)
- Markus Schmid
- Max Planck Institute for Developmental Biology, Spemannstrasse 37-39, 72076 Tübingen, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
174
|
Lam SY, Horn SR, Radford SJ, Housworth EA, Stahl FW, Copenhaver GP. Crossover interference on nucleolus organizing region-bearing chromosomes in Arabidopsis. Genetics 2005; 170:807-12. [PMID: 15802520 PMCID: PMC1450402 DOI: 10.1534/genetics.104.040055] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In most eukaryotes, crossovers are not independently distributed along the length of a chromosome. Instead, they appear to avoid close proximity to one another--a phenomenon known as crossover interference. Previously, for three of the five Arabidopsis chromosomes, we measured the strength of interference and suggested a model wherein some crossovers experience interference while others do not. Here we show, using the same model, that the fraction of interference-insensitive crossovers is significantly smaller on the remaining two chromosomes. Since these two chromosomes bear the Arabidopsis NOR domains, the possibility that these chromosomal regions influence interference is discussed.
Collapse
Affiliation(s)
- Sandy Y Lam
- Department of Biology, University of North Carolina, Chapel Hill, 27599, USA
| | | | | | | | | | | |
Collapse
|
175
|
Linkiewicz AM, Qi LL, Gill BS, Ratnasiri A, Echalier B, Chao S, Lazo GR, Hummel DD, Anderson OD, Akhunov ED, Dvorák J, Pathan MS, Nguyen HT, Peng JH, Lapitan NLV, Gustafson JP, La Rota CM, Sorrells ME, Hossain KG, Kalavacharla V, Kianian SF, Sandhu D, Bondareva SN, Gill KS, Conley EJ, Anderson JA, Fenton RD, Close TJ, McGuire PE, Qualset CO, Dubcovsky J. A 2500-locus bin map of wheat homoeologous group 5 provides insights on gene distribution and colinearity with rice. Genetics 2005; 168:665-76. [PMID: 15514043 PMCID: PMC1448825 DOI: 10.1534/genetics.104.034835] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We constructed high-density deletion bin maps of wheat chromosomes 5A, 5B, and 5D, including 2338 loci mapped with 1052 EST probes and 217 previously mapped loci (total 2555 loci). This information was combined to construct a consensus chromosome bin map of group 5 including 24 bins. A relatively higher number of loci were mapped on chromosome 5B (38%) compared to 5A (34%) and 5D (28%). Differences in the levels of polymorphism among the three chromosomes were partially responsible for these differences. A higher number of duplicated loci was found on chromosome 5B (42%). Three times more loci were mapped on the long arms than on the short arms, and a significantly higher number of probes, loci, and duplicated loci were mapped on the distal halves than on the proximal halves of the chromosome arms. Good overall colinearity was observed among the three homoeologous group 5 chromosomes, except for the previously known 5AL/4AL translocation and a putative small pericentric inversion in chromosome 5A. Statistically significant colinearity was observed between low-copy-number ESTs from wheat homoeologous group 5 and rice chromosomes 12 (88 ESTs), 9 (72 ESTs), and 3 (84 ESTs).
Collapse
Affiliation(s)
- A M Linkiewicz
- Department of Agronomy and Range Science, University of California, Davis, California 95616, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
176
|
Puertas MJ, García-Chico R, Sotillo E, González-Sánchez M, Manzanero S. Movement ability of rye terminal neocentromeres. Cytogenet Genome Res 2005; 109:120-7. [PMID: 15753567 DOI: 10.1159/000082390] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2004] [Accepted: 04/19/2004] [Indexed: 11/19/2022] Open
Abstract
Rye terminal neocentromeres were analyzed in various aspects. Plants with and without neocentromeres were crossed to determine the possible genetic control on their formation. The segregation obtained in our work is consistent with the hypothesis of two trans-acting genes determining neocentric activity in such a way that individuals with no neocentromeres at all would carry all non-activating alleles, whereas one activating allele might permit the activation of a few neocentromeres. Individuals with four activating alleles would show the maximum frequency of neocentromeres per cell. Anti-tubulin immunolabelling was used to visualize the interaction between the neocentromeres and the microtubules. In most cases an end-on interaction between neocentromeres and microtubules was observed, but a few neocentromeres were observed free of them. Spikes were irradiated at early meiosis to determine whether acentric fragments carrying subtelomeric heterochromatin were able to behave as neocentromeres. In no case were acentric fragments observed to form an extension polewards as they did in whole chromosomes. Broken chromosomes joined by a thin thread of chromatin to the centromeric region
Collapse
Affiliation(s)
- M J Puertas
- Departamento de Genética, Facultad de Biología, Universidad Complutense, Madrid, Spain.
| | | | | | | | | |
Collapse
|
177
|
Schneeberger RG, Zhang K, Tatarinova T, Troukhan M, Kwok SF, Drais J, Klinger K, Orejudos F, Macy K, Bhakta A, Burns J, Subramanian G, Donson J, Flavell R, Feldmann KA. Agrobacterium T-DNA integration in Arabidopsis is correlated with DNA sequence compositions that occur frequently in gene promoter regions. Funct Integr Genomics 2005; 5:240-53. [PMID: 15744539 DOI: 10.1007/s10142-005-0138-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2004] [Revised: 12/30/2004] [Accepted: 01/01/2005] [Indexed: 10/25/2022]
Abstract
Mobile insertion elements such as transposons and T-DNA generate useful genetic variation and are important tools for functional genomics studies in plants and animals. The spectrum of mutations obtained in different systems can be highly influenced by target site preferences inherent in the mechanism of DNA integration. We investigated the target site preferences of Agrobacterium T-DNA insertions in the chromosomes of the model plant Arabidopsis thaliana. The relative frequencies of insertions in genic and intergenic regions of the genome were calculated and DNA composition features associated with the insertion site flanking sequences were identified. Insertion frequencies across the genome indicate that T-strand integration is suppressed near centromeres and rDNA loci, progressively increases towards telomeres, and is highly correlated with gene density. At the gene level, T-DNA integration events show a statistically significant preference for insertion in the 5' and 3' flanking regions of protein coding sequences as well as the promoter region of RNA polymerase I transcribed rRNA gene repeats. The increased insertion frequencies in 5' upstream regions compared to coding sequences are positively correlated with gene expression activity and DNA sequence composition. Analysis of the relationship between DNA sequence composition and gene activity further demonstrates that DNA sequences with high CG-skew ratios are consistently correlated with T-DNA insertion site preference and high gene expression. The results demonstrate genomic and gene-specific preferences for T-strand integration and suggest that DNA sequences with a pronounced transition in CG- and AT-skew ratios are preferred targets for T-DNA integration.
Collapse
|
178
|
Matsuo M, Ito Y, Yamauchi R, Obokata J. The rice nuclear genome continuously integrates, shuffles, and eliminates the chloroplast genome to cause chloroplast-nuclear DNA flux. THE PLANT CELL 2005; 17:665-75. [PMID: 15705954 PMCID: PMC1069690 DOI: 10.1105/tpc.104.027706] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2004] [Accepted: 12/27/2004] [Indexed: 05/19/2023]
Abstract
Plastid DNA fragments are often found in the plant nuclear genome, and DNA transfer from plastids to the nucleus is ongoing. However, successful gene transfer is rare. What happens to compensate for this? To address this question, we analyzed nuclear-localized plastid DNA (nupDNA) fragments throughout the rice (Oryza sativa ssp japonica) genome, with respect to their age, size, structure, and integration sites on chromosomes. The divergence of nupDNA sequences from the sequence of the present plastid genome strongly suggests that plastid DNA has been transferred repeatedly to the nucleus in rice. Age distribution profiles of the nupDNA population, together with the size and structural characteristics of each fragment, revealed that once plastid DNAs are integrated into the nuclear genome, they are rapidly fragmented and vigorously shuffled, and surprisingly, 80% of them are eliminated from the nuclear genome within a million years. Large nupDNA fragments preferentially localize to the pericentromeric region of the chromosomes, where integration and elimination frequencies are markedly higher. These data indicate that the plant nuclear genome is in equilibrium between frequent integration and rapid elimination of the chloroplast genome and that the pericentromeric regions play a significant role in facilitating the chloroplast-nuclear DNA flux.
Collapse
Affiliation(s)
- Mitsuhiro Matsuo
- Center for Gene Research, Nagoya University, Nagoya 464-8602, Japan
| | | | | | | |
Collapse
|
179
|
Abstract
Although heterochromatin has been studied for 80 years, its genetic function and molecular organization have remained elusive. In almost all organisms, heterochromatin has been regarded as genetically inactive chromosome regions. However, from genetic and genomic studies in Drosophila melanogaster and other organisms including humans, it is now clear that transcriptionally active domains are present within constitutive heterochromatin. These domains contain essential coding genes whose expression during development ensures the formation of the proper biochemical and morphological phenotypes, together with several gene models defined by genome annotation whose functions still need to be determined.
Collapse
Affiliation(s)
- Patrizio Dimitri
- Dipartimento di Genetica e Biologia Molecolare, Università La Sapienza, 70-00185 Roma, Italy.
| | | | | | | |
Collapse
|
180
|
Salzberg SL, Hotopp JCD, Delcher AL, Pop M, Smith DR, Eisen MB, Nelson WC. Serendipitous discovery of Wolbachia genomes in multiple Drosophila species. Genome Biol 2005; 6:R23. [PMID: 15774024 PMCID: PMC1088942 DOI: 10.1186/gb-2005-6-3-r23] [Citation(s) in RCA: 113] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2004] [Revised: 01/24/2005] [Accepted: 01/24/2005] [Indexed: 11/10/2022] Open
Abstract
By searching the publicly available repository of DNA sequencing trace data, we discovered three new species of the bacterial endosymbiont Wolbachia pipientis in three different species of fruit fly: Drosophila ananassae, D. simulans, and D. mojavensis. Background The Trace Archive is a repository for the raw, unanalyzed data generated by large-scale genome sequencing projects. The existence of this data offers scientists the possibility of discovering additional genomic sequences beyond those originally sequenced. In particular, if the source DNA for a sequencing project came from a species that was colonized by another organism, then the project may yield substantial amounts of genomic DNA, including near-complete genomes, from the symbiotic or parasitic organism. Results By searching the publicly available repository of DNA sequencing trace data, we discovered three new species of the bacterial endosymbiont Wolbachia pipientis in three different species of fruit fly: Drosophila ananassae, D. simulans, and D. mojavensis. We extracted all sequences with partial matches to a previously sequenced Wolbachia strain and assembled those sequences using customized software. For one of the three new species, the data recovered were sufficient to produce an assembly that covers more than 95% of the genome; for a second species the data produce the equivalent of a 'light shotgun' sampling of the genome, covering an estimated 75-80% of the genome; and for the third species the data cover approximately 6-7% of the genome. Conclusions The results of this study reveal an unexpected benefit of depositing raw data in a central genome sequence repository: new species can be discovered within this data. The differences between these three new Wolbachia genomes and the previously sequenced strain revealed numerous rearrangements and insertions within each lineage and hundreds of novel genes. The three new genomes, with annotation, have been deposited in GenBank.
Collapse
Affiliation(s)
- Steven L Salzberg
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | | | - Arthur L Delcher
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | - Mihai Pop
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | - Douglas R Smith
- Agencourt Bioscience Corporation, 100 Cumming Center, Beverley, MA 01915, USA
| | - Michael B Eisen
- Center for Integrative Genomics, University of California, Berkeley, CA 94720, USA
| | - William C Nelson
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| |
Collapse
|
181
|
Phelps-Durr TL, Birchler JA. An asymptotic determination of minimum centromere size for the maize B chromosome. Cytogenet Genome Res 2005; 106:309-13. [PMID: 15292608 DOI: 10.1159/000079304] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2003] [Accepted: 01/22/2004] [Indexed: 11/19/2022] Open
Abstract
The maize B chromosome is a dispensable chromosome and therefore serves as a model system to study centromere function. The B centromere region is estimated to be approximately 9,000 kb in size and contains a 1.4 kb repeat that is specific to this centromere. When maintained as a univalent, the B chromosome occasionally undergoes centric misdivision. Consecutive misdivision analysis of the maize B chromosome centromere has generated a collection of functional centromeres that are greatly reduced in complexity. These small centromeres are often correlated with strongly reduced meiotic transmission. Molecular analyses of the misdivision collection have revealed that the smallest functional maize B centromere is a minimum of 110 kb in size. Considering the collection as a whole, meiotic transmission becomes severely compromised when the estimated centromere size is reduced to a few hundred kilobases.
Collapse
Affiliation(s)
- T L Phelps-Durr
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | | |
Collapse
|
182
|
Abstract
The genomes of grasses and cereals include a diverse and large collection of selfish genetic elements, many of which are fossil relics of ancient origin. Some of these elements are active and, because of their selfish nature and the way in which they exist to perpetuate themselves, they cause a conflict for genomes both within and between species in hybrids and allopolyploids. The conflict arises from how the various elements may undergo 'drive', through transposition, centromere and neocentromere drive, and in mitotic and meiotic drive processes in supernumerary B chromosomes. Experimental and newly formed hybrids and polyploids, where new combinations of genomes are brought together for the first time, find themselves sharing a common nuclear and cytoplasmic environment, and they can respond with varying degrees of instability to adjust to their new partnerships. B chromosomes are harmful to fertility and to the physiology of the cells and plants that carry them. In this review we take a broad view of genome conflict, drawing together aspects arising from a range of genetic elements that have not hitherto been considered in their entirety, and we find some common themes linking these various elements in their activities.
Collapse
Affiliation(s)
- Neil Jones
- Institute of Biological Sciences, The University of Wales Aberystwyth, Ceredigion, SY23 3DD, UK.
| | | |
Collapse
|
183
|
Schmid KJ, Ramos-Onsins S, Ringys-Beckstein H, Weisshaar B, Mitchell-Olds T. A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 2005; 169:1601-15. [PMID: 15654111 PMCID: PMC1449538 DOI: 10.1534/genetics.104.033795] [Citation(s) in RCA: 174] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The simultaneous analysis of multiple genomic loci is a powerful approach to studying the effects of population history and natural selection on patterns of genetic variation of a species. By surveying nucleotide sequence polymorphism at 334 randomly distributed genomic regions in 12 accessions of Arabidopsis thaliana, we examined whether a standard neutral model of nucleotide sequence polymorphism is consistent with observed data. The average nucleotide diversity was 0.0071 for total sites and 0.0083 for silent sites. Although levels of diversity are variable among loci, no correlation with local recombination rate was observed, but polymorphism levels were correlated for physically linked loci (<250 kb). We found that observed distributions of Tajima's D- and D/D(min)- and of Fu and Li's D-, D*- and F-, F*-statistics differed significantly from the expected distributions under a standard neutral model due to an excess of rare polymorphisms and high variances. Observed and expected distributions of Fay and Wu's H were not different, suggesting that demographic processes and not selection at multiple loci are responsible for the deviation from a neutral model. Maximum-likelihood comparisons of alternative demographic models like logistic population growth, glacial refugia, or past bottlenecks did not produce parameter estimates that were more consistent with observed patterns. However, exclusion of highly polymorphic "outlier loci" resulted in a fit to the logistic growth model. Various tests of neutrality revealed a set of candidate loci that may evolve under selection.
Collapse
Affiliation(s)
- Karl J Schmid
- Department of Genetics and Evolution, Max-Planck Institute of Chemical Ecology, Jena, Germany.
| | | | | | | | | |
Collapse
|
184
|
Khrustaleva LI, de Melo PE, van Heusden AW, Kik C. The integration of recombination and physical maps in a large-genome monocot using haploid genome analysis in a trihybrid allium population. Genetics 2005; 169:1673-85. [PMID: 15654085 PMCID: PMC1449564 DOI: 10.1534/genetics.104.038687] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Integrated mapping in large-genome monocots has been carried out on a limited number of species. Furthermore, integrated maps are difficult to construct for these species due to, among other reasons, the specific plant populations needed. To fill these gaps, Alliums were chosen as target species and a new strategy for constructing suitable populations was developed. This strategy involves the use of trihybrid genotypes in which only one homeolog of a chromosome pair is recombinant due to interspecific recombination. We used genotypes from a trihybrid Allium cepa x (A. roylei x A. fistulosum) population. Recombinant chromosomes 5 and 8 from the interspecific parent were analyzed using genomic in situ hybridization visualization of recombination points and the physical positions of recombination were integrated into AFLP linkage maps of both chromosomes. The integrated maps showed that in Alliums recombination predominantly occurs in the proximal half of chromosome arms and that 57.9% of PstI/MseI markers are located in close proximity to the centromeric region, suggesting the presence of genes in this region. These findings are different from data obtained on cereals, where recombination rate and gene density tends to be higher in distal regions.
Collapse
Affiliation(s)
- L I Khrustaleva
- Plant Research International, Wageningen University and Research Center, The Netherlands
| | | | | | | |
Collapse
|
185
|
Stanley Kim H, Yu Y, Snesrud EC, Moy LP, Linford LD, Haas BJ, Nierman WC, Quackenbush J. Transcriptional divergence of the duplicated oxidative stress-responsive genes in the Arabidopsis genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2005; 41:212-20. [PMID: 15634198 DOI: 10.1111/j.1365-313x.2004.02295.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Previous studies have indicated that Arabidopsis thaliana experienced a genome-wide duplication event shortly before its divergence from Brassica followed by extensive chromosomal rearrangements and deletions. While a large number of the duplicated genes have significantly diverged or lost their sister genes, we found 4222 pairs that are still highly conserved, and as a result had similar functional assignments during the annotation of the genome sequence. Using whole-genome DNA microarrays, we identified 906 duplicated gene pairs in which at least one member exhibited a significant response to oxidative stress. Among these, only 117 pairs were up- or down-regulated in both pairs and many of these exhibited dissimilar patterns of expression. Examination of the expression patterns of PAL1 and PAL2, ACD1 and ACD2, genes coding for two Hsp20s, various P450s, and electron transfer flavoproteins suggests Arabidopsis evolved a number of distinct oxidative stress response mechanisms using similar gene sets following the duplication of its genome.
Collapse
Affiliation(s)
- H Stanley Kim
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | | | | | | | | | | | | | | |
Collapse
|
186
|
Meister A. Calculation of binding length of base-specific DNA dyes by comparison of sequence and flow cytometric data. Application to Oryza sativa and Arabidopsis thaliana. J Theor Biol 2004; 232:93-7. [PMID: 15498596 DOI: 10.1016/j.jtbi.2004.07.022] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2004] [Revised: 07/26/2004] [Accepted: 07/26/2004] [Indexed: 11/15/2022]
Abstract
From biochemical experiments it has been found that AT- and GC-specific dyes need a certain number of consecutive bases of the same type for binding one dye molecule. From known base sequences the amount of bases included in dye binding can be calculated and compared with experimental data from flow cytometry. Oryza sativa and Arabidopsis thaliana are the first higher plants which are nearly completely (>90%) sequenced. From the published sequences the theoretical fluorescence intensity of base-specific dyes in relation to a base-unspecific dye is calculated for different binding lengths. These values are compared with the actual fluorescence intensities of nuclei analyzed by flow cytometry. For all investigated dyes (DAPI, Hoechst 33258, Hoechst 33342 (all AT specific) and Mithramycin A (GC specific)) a binding length of 1 bp results from the comparison of theoretical and experimental data. This is, however, in disagreement with former results on dye binding. The main reason for the discrepancy seems to be the remaining gap in the sequencing of the Arabidopsis genome.
Collapse
Affiliation(s)
- Armin Meister
- Department of Cytogenetics, Institute of Plant Genetics and Crop Plant Research, Corrensstr. 3, D-06466 Gatersleben, Germany.
| |
Collapse
|
187
|
Kawabe A, Nasuda S. Structure and genomic organization of centromeric repeats in Arabidopsis species. Mol Genet Genomics 2004; 272:593-602. [PMID: 15586291 DOI: 10.1007/s00438-004-1081-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Accepted: 10/05/2004] [Indexed: 10/26/2022]
Abstract
Centromeric repetitive sequences were isolated from Arabidopsis halleri ssp. gemmifera and A. lyrata ssp. kawasakiana. Two novel repeat families isolated from A. gemmifera were designated pAge1 and pAge2. These repeats are 180 bp in length and are organized in a head-to-tail manner. They are similar to the pAL1 repeats of A. thaliana and the pAa units of A. arenosa. Both A. gemmifera and A. kawasakiana possess the pAa, pAge1 and pAge2 repeat families. Sequence comparisons of different centromeric repeats revealed that these families share a highly conserved region of approximately 50 bp. Within each of the four repeat families, two or three regions showed low levels of sequence variation. The average difference in nucleotide sequence was approximately 10% within families and 30% between families, which resulted in clear distinctions between families upon phylogenetic analysis. FISH analysis revealed that the localization patterns for the pAa, pAge1 and pAge2 families were chromosome specific in A. gemmifera and A. kawasakiana. In one pair of chromosomes in A. gemmifera, and three pairs of chromosomes in A. kawasakiana, two repeat families were present. The presence of three families of centromeric repeats in A. gemmifera and A. kawasakiana indicates that the first step toward homogenization of centromeric repeats occurred at the chromosome level.
Collapse
Affiliation(s)
- A Kawabe
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, 606-8502, Kyoto, Japan
| | | |
Collapse
|
188
|
Ma H, Moore PH, Liu Z, Kim MS, Yu Q, Fitch MMM, Sekioka T, Paterson AH, Ming R. High-density linkage mapping revealed suppression of recombination at the sex determination locus in papaya. Genetics 2004; 166:419-36. [PMID: 15020433 PMCID: PMC1470706 DOI: 10.1534/genetics.166.1.419] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A high-density genetic map of papaya (Carica papaya L.) was constructed using 54 F(2) plants derived from cultivars Kapoho and SunUp with 1501 markers, including 1498 amplified fragment length polymorphism (AFLP) markers, the papaya ringspot virus coat protein marker, morphological sex type, and fruit flesh color. These markers were mapped into 12 linkage groups at a LOD score of 5.0 and recombination frequency of 0.25. The 12 major linkage groups covered a total length of 3294.2 cM, with an average distance of 2.2 cM between adjacent markers. This map revealed severe suppression of recombination around the sex determination locus with a total of 225 markers cosegregating with sex types. The cytosine bases were highly methylated in this region on the basis of the distribution of methylation-sensitive and -insensitive markers. This high-density genetic map is essential for cloning of specific genes of interest such as the sex determination gene and for the integration of genetic and physical maps of papaya.
Collapse
Affiliation(s)
- Hao Ma
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, Hawaii 96822, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
189
|
Abstract
Centromere function is remarkably conserved between species, yet the satellite sequences that make up centromeric DNA are highly divergent. Proteins that bind these sequences appear to be evolving under positive selection, supporting a model wherein the interplay between centromeric repeats and the proteins that bind them creates an opportunity for an intriguing phenomenon known as centromere-based meiotic drive.
Collapse
Affiliation(s)
- Gregory P Copenhaver
- Department of Biology and The Carolina Center for Genome Sciences, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
190
|
Rudd MK, Schueler MG, Willard HF. Sequence organization and functional annotation of human centromeres. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2004; 68:141-9. [PMID: 15338612 DOI: 10.1101/sqb.2003.68.141] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Affiliation(s)
- M K Rudd
- Institute for Genome Sciences & Policy, Department of Molecular Genetics and Microbiology, Duke University, Durham, North Carolina 27710, USA
| | | | | |
Collapse
|
191
|
Peterson-Burch BD, Nettleton D, Voytas DF. Genomic neighborhoods for Arabidopsis retrotransposons: a role for targeted integration in the distribution of the Metaviridae. Genome Biol 2004; 5:R78. [PMID: 15461796 PMCID: PMC545598 DOI: 10.1186/gb-2004-5-10-r78] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2004] [Revised: 08/03/2004] [Accepted: 09/02/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution. RESULTS We identified the full complement of A. thaliana long terminal repeat (LTR) retroelements using RetroMap, a software tool that iteratively searches genome sequences for reverse transcriptases and then defines retroelement insertions. Relative ages of full-length elements were estimated by assessing sequence divergence between LTRs: the Pseudoviridae were significantly younger than the Metaviridae. All retroelement insertions were mapped onto the genome sequence and their distribution was distinctly non-uniform. Although both Pseudoviridae and Metaviridae tend to cluster within pericentromeric heterochromatin, this association is significantly more pronounced for all three Metaviridae sublineages (Metavirus, Tat and Athila). Among these, Tat and Athila are strictly associated with pericentromeric heterochromatin. CONCLUSIONS The non-uniform genomic distribution of the Pseudoviridae and the Metaviridae can be explained by a variety of factors including target-site bias, selection against integration into euchromatin and pericentromeric accumulation of elements as a result of suppression of recombination. However, comparisons based on the age of elements and their chromosomal location indicate that integration-site specificity is likely to be the primary factor determining distribution of the Athila and Tat sublineages of the Metaviridae. We predict that, like retroelements in yeast, the Athila and Tat elements target integration to pericentromeric regions by recognizing a specific feature of pericentromeric heterochromatin.
Collapse
Affiliation(s)
| | - Dan Nettleton
- Department of Statistics, 124 Snedecor Hall, Iowa State University, Ames, IA 50011, USA
| | - Daniel F Voytas
- Department of Genetics, Development and Cell Biology, 1035A Roy J. Carver Co-Lab, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
192
|
Luo S, Hall AE, Hall SE, Preuss D. Whole-genome fractionation rapidly purifies DNA from centromeric regions. Nat Methods 2004; 1:67-71. [PMID: 15782155 DOI: 10.1038/nmeth703] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Accepted: 08/03/2004] [Indexed: 11/09/2022]
Abstract
The condensed centromeric regions of higher eukaryotic chromosomes contain satellite sequences, transposons and retroelements, as well as transcribed genes that perform a variety of functions. These chromosomal domains nucleate kinetochores, mediate sister chromatid cohesion and inhibit recombination, yet their characterization has often lagged behind that of chromosome arms. Here, we describe a whole-genome fractionation technique that rapidly identifies bacterial artificial chromosome (BAC) clones derived from plant centromeric regions. This approach, which relies on hybridization of methylated genomic DNA, revealed BACs that correspond to the genetically mapped and sequenced Arabidopsis thaliana centromeric regions. Extending this method to other species in the Brassicaceae family identified centromere-linked clones and provided genome-wide estimates of methylated DNA abundance. Sequencing these clones will elucidate the changes that occur during plant centromere evolution. This genomic fractionation technique could identify centromeric DNA in genomes with similar methylation and repetitive DNA content, including those from crops and mammals.
Collapse
Affiliation(s)
- Song Luo
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | |
Collapse
|
193
|
She X, Horvath JE, Jiang Z, Liu G, Furey TS, Christ L, Clark R, Graves T, Gulden CL, Alkan C, Bailey JA, Sahinalp C, Rocchi M, Haussler D, Wilson RK, Miller W, Schwartz S, Eichler EE. The structure and evolution of centromeric transition regions within the human genome. Nature 2004; 430:857-64. [PMID: 15318213 DOI: 10.1038/nature02806] [Citation(s) in RCA: 144] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2004] [Accepted: 07/02/2004] [Indexed: 11/09/2022]
Abstract
An understanding of how centromeric transition regions are organized is a critical aspect of chromosome structure and function; however, the sequence context of these regions has been difficult to resolve on the basis of the draft genome sequence. We present a detailed analysis of the structure and assembly of all human pericentromeric regions (5 megabases). Most chromosome arms (35 out of 43) show a gradient of dwindling transcriptional diversity accompanied by an increasing number of interchromosomal duplications in proximity to the centromere. At least 30% of the centromeric transition region structure originates from euchromatic gene-containing segments of DNA that were duplicatively transposed towards pericentromeric regions at a rate of six-seven events per million years during primate evolution. This process has led to the formation of a minimum of 28 new transcripts by exon exaptation and exon shuffling, many of which are primarily expressed in the testis. The distribution of these duplicated segments is nonrandom among pericentromeric regions, suggesting that some regions have served as preferential acceptors of euchromatic DNA.
Collapse
Affiliation(s)
- Xinwei She
- Department of Genetics, Center for Computational Genomics and the Center for Human Genetics, Case Western Reserve University School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
194
|
Dussle CM, Hahn V, Knapp SJ, Bauer E. PlArg from Helianthus argophyllus is unlinked to other known downy mildew resistance genes in sunflower. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 109:1083-6. [PMID: 15221147 DOI: 10.1007/s00122-004-1722-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2003] [Accepted: 05/06/2004] [Indexed: 05/04/2023]
Abstract
The PlArg locus in the sunflower (Helianthus annuus L.) inbred line Arg1575-2 conferring resistance to at least four tested races (300, 700, 730, 770) of downy mildew (Plasmopara halstedii) was localized by the use of simple sequence repeat (SSR) markers. Bulked segregant analysis (BSA) was conducted on 126 individuals of an F2 progeny from a cross between a downy mildew susceptible line, CmsHA342, and Arg1575-2. Twelve SSR markers linked to the PlArg locus were identified. All markers were located proximal to PlArg on linkage group LG1 based on the map of Yu et al. (2003) in a window of 9.3 cM. Since PlArg was mapped to a linkage group different from all other Pl genes previously mapped with SSRs, it can be concluded that PlArg provides a new source of resistance against P. halstedii in sunflower.
Collapse
Affiliation(s)
- C M Dussle
- State Plant Breeding Institute (720), University of Hohenheim, 70593, Stuttgart, Germany
| | | | | | | |
Collapse
|
195
|
Abstract
The complete sequence of rice centromere 8 reveals a small amount of centromere-specific satellite sequence in blocks interrupted by retrotransposons and other repetitive DNA, in an arrangement that is similar in size and content to other centromeres of multicellular eukaryotes. The complete sequence of rice centromere 8 reveals a small amount of centromere-specific satellite sequence in blocks interrupted by retrotransposons and other repetitive DNA, in an arrangement that is strikingly similar in overall size and content to other centromeres of multicellular eukaryotes.
Collapse
Affiliation(s)
- Jonathan C Lamb
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - James Theuri
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - James A Birchler
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
196
|
Zhang R, Zhang CT. Isochore Structures in the Genome of the Plant Arabidopsis thaliana. J Mol Evol 2004; 59:227-38. [PMID: 15486696 DOI: 10.1007/s00239-004-2617-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2003] [Accepted: 02/10/2004] [Indexed: 10/26/2022]
Abstract
Arabidopsis thaliana is an important model system for the study of plant biology. We have analyzed the complete genome sequences of Arabidopsis by using a newly developed windowless method for the GC content computation, the cumulative GC profile. It is shown that the Arabidopsis genome is organized into a mosaic structure of isochores. All the centromeric regions are located in GC-rich isochores, called centromere-isochores, which are characterized by a high GC content but low gene and T-DNA insertion densities. This characteristic distinguishes centromere-isochores from the other class of GC-rich isochores, called GC-isochores, which have high gene and T-DNA insertion densities. Consequently, 15 isochores have been identified, i.e., 7 AT-isochores, 3 GC-isochores, and 5 centromere-isochores. The genes in centromere-isochores, which have the highest GC content, have much shorter intron lengths and lower intron numbers, compared to those of the other two types. There is also considerable difference in the numbers and lengths of transposable elements (TEs) between AT and GC-isochores, i.e., the TE number (length) of AT-isochores is 6.3 (7.3) times that of GC-isochores. It is generally believed that TEs are accumulated in the regions surrounding the centromeres. However, within these TE-rich regions, there are regions of extremely low TE numbers (TE deserts), which correspond to the positions of centromere-isochores. In addition, a heterochromatic knob is located at the boundary of an AT-isochore. Furthermore, we show that the differences in GC content among isochores are mainly due to the GC content variation of introns, the third codon positions and intergenic regions.
Collapse
Affiliation(s)
- Ren Zhang
- Department of Epidemiology and Biostatistics, Tianjin Cancer Institute and Hospital, 300060 Tianjin, China
| | | |
Collapse
|
197
|
Abstract
Arabidopsis thaliana has emerged in recent years as a leading model for understanding the structure and function of higher eukaryotic centromeres. Arabidopsis centromeres, like those of virtually all higher eukaryotes, encompass large DNA domains consisting of a complex combination of unique, dispersed middle repetitive and highly repetitive DNA. For this reason, they have required creative analysis using molecular, genetic, cytological and genomic techniques. This synergy of approaches, reinforced by rapid progress in understanding how proteins interact with the centromere DNA to form a complete functional unit, has made Arabidopsis one the best understood centromere systems. Yet major problems remain to be solved: gaining a complete structural definition of the centromere has been surprisingly difficult, and developing synthetic mini-chromosomes in plants has been even more challenging.
Collapse
Affiliation(s)
- Gregory P Copenhaver
- Department of Biology, The Carolina Center for Genome Sciences, The University of North Carolina at Chapel Hill, CB 3280, Coker Hall 305, Chapel Hill, NC 27599, USA.
| |
Collapse
|
198
|
Heslop-Harrison JS, Brandes A, Schwarzacher T. Tandemly repeated DNA sequences and centromeric chromosomal regions of Arabidopsis species. Chromosome Res 2004; 11:241-53. [PMID: 12769291 DOI: 10.1023/a:1022998709969] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Despite their common function, centromeric DNA sequences are not conserved between organisms. Most centromeres of animals and plants so far investigated have now been shown to consist of large blocks of tandemly repeated satellite sequences that are embedded in recombination-deficient heterochromatic regions. This central domain of satellite sequences that is postulated to mediate spindle attachment is surrounded by pericentromeric sequences incorporating various classes of repetitive sequences often including retroelements. The centromeric satellite DNA sequences are amongst the most rapidly evolving sequences and pose some fundamental problems of maintaining function. In this overview, we will discuss work on centromeric repetitive sequences in Arabidopsis thaliana and its relatives, and highlight some of the common features that are emerging when analysing closely related species.
Collapse
Affiliation(s)
- J S Heslop-Harrison
- CREST Project, Department of Biology, University of Leicester, Leicester LE1 7RH, UK.
| | | | | |
Collapse
|
199
|
Rudd S, Frisch M, Grote K, Meyers BC, Mayer K, Werner T. Genome-wide in silico mapping of scaffold/matrix attachment regions in Arabidopsis suggests correlation of intragenic scaffold/matrix attachment regions with gene expression. PLANT PHYSIOLOGY 2004; 135:715-22. [PMID: 15208419 PMCID: PMC514109 DOI: 10.1104/pp.103.037861] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2004] [Revised: 03/26/2004] [Accepted: 03/28/2004] [Indexed: 05/17/2023]
Abstract
We carried out a genome-wide prediction of scaffold/matrix attachment regions (S/MARs) in Arabidopsis. Results indicate no uneven distribution on the chromosomal level but a clear underrepresentation of S/MARs inside genes. In cases where S/MARs were predicted within genes, these intragenic S/MARs were preferentially located within the 5'-half, most prominently within introns 1 and 2. Using Arabidopsis whole-genome expression data generated by the massively parallel signature sequencing methodology, we found a negative correlation between S/MAR-containing genes and transcriptional abundance. Expressed sequence tag data correlated the same way with S/MAR-containing genes. Thus, intragenic S/MARs show a negative correlation with transcription level. For various genes it has been shown experimentally that S/MARs can function as transcriptional regulators and that they have an implication in stabilizing expression levels within transgenic plants. On the basis of a genome-wide in silico S/MAR analysis, we found a significant correlation between the presence of intragenic S/MARs and transcriptional down-regulation.
Collapse
Affiliation(s)
- Stephen Rudd
- Munich Information Center for Protein Sequences/Institute for Bioinformatics, GSF-National Research Center for Environment and Health, 85764 Neuherberg, Germany
| | | | | | | | | | | |
Collapse
|
200
|
Shibata F, Murata M. Differential localization of the centromere-specific proteins in the major centromeric satellite of Arabidopsis thaliana. J Cell Sci 2004; 117:2963-70. [PMID: 15161939 DOI: 10.1242/jcs.01144] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The 180 bp family of tandem repetitive sequences, which constitutes the major centromeric satellite in Arabidopsis thaliana, is thought to play important roles in kinetochore assembly. To assess the centromere activities of the 180 bp repeats, we performed indirect fluorescence immunolabeling with antibodies against phosphorylated histone H3 at Ser10, HTR12 (Arabidopsis centromeric histone H3 variant) and AtCENP-C (Arabidopsis CENP-C homologue) for the A. thaliana cell cultures. The immunosignals from all three antibodies appeared on all sites of the 180 bp repeats detected by fluorescence in situ hybridization. However, some of the 180 bp repeat clusters, particularly those that were long or stretched at interphase, were not fully covered with the signals from anti-HTR12 or AtCENP-C. Chromatin fiber immunolabeling clearly revealed that the centromeric proteins examined in this study, localize only at the knobs on the extended chromatin fibers, which form a limited part of the 180 bp clusters. Furthermore, outer HTR12 and inner phosphohistone H3 (Ser10) localization at the kinetochores of metaphase chromosomes suggests that two kinds of histone H3 (a centromere variant and a phosphorylated form) might be linked to different roles in centromere functionality; the former for spindle-fiber attachment, and the latter for chromatid cohesion.
Collapse
Affiliation(s)
- Fukashi Shibata
- Core Research for Evolutionary Science and Technology, Japan Science and Technology, Kawaguchi 332-0012, Japan
| | | |
Collapse
|