1
|
Ricard G, de Graaf RM, Dutilh BE, Duarte I, van Alen TA, van Hoek AH, Boxma B, van der Staay GWM, Moon-van der Staay SY, Chang WJ, Landweber LF, Hackstein JHP, Huynen MA. Macronuclear genome structure of the ciliate Nyctotherus ovalis: single-gene chromosomes and tiny introns. BMC Genomics 2008; 9:587. [PMID: 19061489 PMCID: PMC2633312 DOI: 10.1186/1471-2164-9-587] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2008] [Accepted: 12/05/2008] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Nyctotherus ovalis is a single-celled eukaryote that has hydrogen-producing mitochondria and lives in the hindgut of cockroaches. Like all members of the ciliate taxon, it has two types of nuclei, a micronucleus and a macronucleus. N. ovalis generates its macronuclear chromosomes by forming polytene chromosomes that subsequently develop into macronuclear chromosomes by DNA elimination and rearrangement. RESULTS We examined the structure of these gene-sized macronuclear chromosomes in N. ovalis. We determined the telomeres, subtelomeric regions, UTRs, coding regions and introns by sequencing a large set of macronuclear DNA sequences (4,242) and cDNAs (5,484) and comparing them with each other. The telomeres consist of repeats CCC(AAAACCCC)n, similar to those in spirotrichous ciliates such as Euplotes, Sterkiella (Oxytricha) and Stylonychia. Per sequenced chromosome we found evidence for either a single protein-coding gene, a single tRNA, or the complete ribosomal RNAs cluster. Hence the chromosomes appear to encode single transcripts. In the short subtelomeric regions we identified a few overrepresented motifs that could be involved in gene regulation, but there is no consensus polyadenylation site. The introns are short (21-29 nucleotides), and a significant fraction (1/3) of the tiny introns is conserved in the distantly related ciliate Paramecium tetraurelia. As has been observed in P. tetraurelia, the N. ovalis introns tend to contain in-frame stop codons or have a length that is not dividable by three. This pattern causes premature termination of mRNA translation in the event of intron retention, and potentially degradation of unspliced mRNAs by the nonsense-mediated mRNA decay pathway. CONCLUSION The combination of short leaders, tiny introns and single genes leads to very minimal macronuclear chromosomes. The smallest we identified contained only 150 nucleotides.
Collapse
Affiliation(s)
- Guénola Ricard
- Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Geert Nijmegen, the Netherlands.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Sperling L, Dessen P, Zagulski M, Pearlman RE, Migdalski A, Gromadka R, Froissard M, Keller AM, Cohen J. Random sequencing of Paramecium somatic DNA. EUKARYOTIC CELL 2002; 1:341-52. [PMID: 12455983 PMCID: PMC118014 DOI: 10.1128/ec.1.3.341-352.2002] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2001] [Accepted: 02/15/2002] [Indexed: 11/20/2022]
Abstract
We report a random survey of 1 to 2% of the somatic genome of the free-living ciliate Paramecium tetraurelia by single-run sequencing of the ends of plasmid inserts. As in all ciliates, the germ line genome of Paramecium (100 to 200 Mb) is reproducibly rearranged at each sexual cycle to produce a somatic genome of expressed or potentially expressed genes, stripped of repeated sequences, transposons, and AT-rich unique sequence elements limited to the germ line. We found the somatic genome to be compact (>68% coding, estimated from the sequence of several complete library inserts) and to feature uniformly small introns (18 to 35 nucleotides). This facilitated gene discovery: 722 open reading frames (ORFs) were identified by similarity with known proteins, and 119 novel ORFs were tentatively identified by internal comparison of the data set. We determined the phylogenetic position of Paramecium with respect to eukaryotes whose genomes have been sequenced by the distance matrix neighbor-joining method by using random combined protein data from the project. The unrooted tree obtained is very robust and in excellent agreement with accepted topology, providing strong support for the quality and consistency of the data set. Our study demonstrates that a random survey of the somatic genome of Paramecium is a good strategy for gene discovery in this organism.
Collapse
Affiliation(s)
- Linda Sperling
- Centre de Génétique Moléculaire, CNRS, 91198 Gif-sur-Yvette Cedex, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Prescott DM, Dizick SJ. A unique pattern of intrastrand anomalies in base composition of the DNA in hypotrichs. Nucleic Acids Res 2000; 28:4679-88. [PMID: 11095678 PMCID: PMC115164 DOI: 10.1093/nar/28.23.4679] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The 50 non-coding bases immediately internal to the telomeric repeats in the two 5' ends of macronuclear DNA molecules of a group of hypotrichous ciliates are anomalous in composition, consisting of 61% purines and 39% pyrimidines, A>T (ratio of 44:32), and G>C (ratio of 17:7). These ratio imbalances violate parity rule 2, according to which A should equal T and G should equal C within a DNA strand and therefore pyrimidines should equal purines. The purine-rich and base ratio imbalances are in marked contrast to the rest of the non-coding parts of the molecules, which have the theoretically expected purine content of 50%, with A = T and G = C. The ORFs contain an average of 52% purines as a result of bias in codon usage. The 50 bases that flank the 5' ends of macronuclear sequences in micronuclear DNA (12 cases) consist of approximately 50% purines. Thus, the 50 bases in the 5' ends of macronuclear sequences in micronuclear DNA are islands of purine richness in which A>T and G>C. These islands may serve as signals for the excision of macronuclear molecules during macronuclear development. We have found no published reports of coding or non-coding native DNA with such anomalous base composition.
Collapse
Affiliation(s)
- D M Prescott
- University of Colorado, Department of Molecular, Cellular and Developmental Biology, Boulder, CO 80309-0347, USA.
| | | |
Collapse
|
4
|
Grimm M, Brünen-Nieweler C, Junker V, Heckmann K, Beier H. The hypotrichous ciliate Euplotes octocarinatus has only one type of tRNACys with GCA anticodon encoded on a single macronuclear DNA molecule. Nucleic Acids Res 1998; 26:4557-65. [PMID: 9753721 PMCID: PMC147889 DOI: 10.1093/nar/26.20.4557] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Deviations from the universal genetic code have evolved independently several times in ciliated protozoa. Thus, in some species UAA and UAG are no longer used as termination codons, but are read as glutamine, whereas in the genus Euplotes , UGA is translated as cysteine. We have investigated the nature of the tRNACys isoacceptor responsible for decoding UGA in Euplotes cells. Southern hybridization analyses indicated that a single DNA molecule of 630 bp encoding tRNACys exists in the macronucleus of Euplotes octocarinatus . Cloning and sequencing of this fragment revealed that it contains only one copy of a tRNACys gene, which codes for a normal tRNACys with GCA anticodon. This is the first report of the characterization of a tRNA gene in any hypotrichous ciliate. It contains putative signals for initiation and termination of transcription by RNA polymerase III and can be transcribed efficiently in vitro in HeLa cell nuclear extract. Intensive studies on the DNA and tRNA level involving PCR analyses have not disclosed the existence of any tRNA Cys isoacceptor with UCA or ICA anticodons. Translation of the UGA codon by tRNA sub GCA sup Cys necessitates a G:A mispairing in the first anticodon position. We discuss a number of aspects which might contribute to the finding that a near-cognate tRNA isoacceptor efficiently translates the UGA stop codon.
Collapse
MESH Headings
- Animals
- Anticodon/genetics
- Base Sequence
- Blotting, Southern
- Cell Nucleus/genetics
- Codon, Terminator
- DNA, Protozoan/genetics
- Escherichia coli/genetics
- Euplotes/genetics
- Genes, Protozoan
- Genetic Code
- Genomic Library
- HeLa Cells
- Humans
- Molecular Sequence Data
- RNA Polymerase III/metabolism
- RNA, Protozoan/genetics
- RNA, Transfer, Cys/genetics
- RNA, Transfer, Cys/isolation & purification
- Restriction Mapping
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Analysis, DNA
- Transcription, Genetic
Collapse
Affiliation(s)
- M Grimm
- Institut für Biochemie, Bayerische Julius-Maximilians-Universität, Biozentrum, Am Hubland, D-97074 Würzburg, Germany
| | | | | | | | | |
Collapse
|
5
|
Bender J, Klein A. The telomere binding protein of Euplotes crassus prevents non-specific transcription initiation but has no role in positioning transcription initiation complexes. Nucleic Acids Res 1997; 25:2877-82. [PMID: 9280899 PMCID: PMC146823 DOI: 10.1093/nar/25.14.2877] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Model substrates mimicking the telomeric as well as the 5'-untranslated regions in front of a 5'-terminal part of a macronuclear gene of Euplotes crassus were transcribed in vitro using cellular extracts. The obtained transcripts were characterized by primer extension and shown to start at the natural initiation points. The situation changed in the absence of telomere binding protein or with substrates lacking functional telomeric sequences. In these cases non-specific transcription was observed. Since it had been previously found that transcription starts are frequently located at an apparently fixed distance from the telomere, a model substrate was constructed which carried a duplication of the non-transcribed region. This resulted in only one transcription start site, the position of which was conserved relative to the start of the open reading frame but moved away from the telomere by the distance of the duplication.
Collapse
Affiliation(s)
- J Bender
- Molecular Genetics, Department of Biology, University of Marburg, D-35032, Marburg, Germany
| | | |
Collapse
|
6
|
|
7
|
Huttenlauch I, Geisler N, Plessmann U, Peck RK, Weber K, Stick R. Major epiplasmic proteins of ciliates are articulins: cloning, recombinant expression, and structural characterization. J Cell Biol 1995; 130:1401-12. [PMID: 7559761 PMCID: PMC2120578 DOI: 10.1083/jcb.130.6.1401] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
The cytoskeleton of certain protists comprises an extensive membrane skeleton, the epiplasm, which contributes to the cell shape and patterning of the species-specific cortical architecture. The isolated epiplasm of the ciliated protist Pseudomicrothorax dubius consists of two major groups of proteins with molecular masses of 78-80 kD and 11-13 kD, respectively. To characterize the structure of these proteins, peptide sequences of two major polypeptides (78-80 kD) as well as a cDNA representing the entire coding sequence of a minor and hitherto unidentified component (60 kD; p60) of the epiplasm have been determined. All three polypeptides share sequence similarities. They contain repeated valine- and proline-rich motifs of 12 residues with the consensus VPVP--V-V-V-. In p60 the central core domain consists of 24 tandemly repeated VPV motifs. Within the repeat motifs positively and negatively charged residues, when present, show an alternating pattern in register with the V and P positions. Recombinant p60 was purified in 8 M urea and dialyzed against buffer. Infrared spectroscopic measurements indicate 30% beta-sheet. Electron microscopy reveals short filamentous polymers with a rather homogenous diameter (approximately 15-20 nm), but variable lengths. The small polymers form thicker filaments, ribbons, and larger sheets or tubes. A core domain similar to that of P. dubius p60 is also found in the recently described epiplasmic proteins of the flagellate Euglena, the so-called articulins. Our results show that the members of this protein family are not restricted to flagellates, but are also present in the distantly related ciliates where they are major constituents of the epiplasm. Comparison of flagellate and ciliate articulins highlights common features of this novel family of cytoskeletal proteins.
Collapse
Affiliation(s)
- I Huttenlauch
- Max Planck Institute for Biophysical Chemistry, Department of Biochemistry, Göttingen, Federal Republic of Germany
| | | | | | | | | | | |
Collapse
|
8
|
Hoffman DC, Anderson RC, DuBois ML, Prescott DM. Macronuclear gene-sized molecules of hypotrichs. Nucleic Acids Res 1995; 23:1279-83. [PMID: 7753617 PMCID: PMC306850 DOI: 10.1093/nar/23.8.1279] [Citation(s) in RCA: 52] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The macronuclear genome of hypotrichous ciliates consists of DNA molecules of gene-sized length. A macronuclear DNA molecule contains a single coding region. We have analyzed the many hypotrich macronuclear DNA sequences sequenced by us and others. No highly conserved promoter sequences nor replication initiation sequences have been identified in the 5' nor in the 3' non-translated regions, suggesting that promoter function in hypotrichs may differ from other eukaryotes. The macronuclear genes are intron-poor; approximately 19% of the genes sequenced to date have one to three introns. Not all macronuclear DNA molecules may be transcribed; some macronuclear molecules may not have any coding function. Codon bias in hypotrichs is different in many respects from other ciliates and from other eukaryotes.
Collapse
Affiliation(s)
- D C Hoffman
- University of Colorado, Department of Molecular, Cellular and Developmental Biology, Boulder 80309-0347, USA
| | | | | | | |
Collapse
|
9
|
Abstract
Ciliates contain two types of nuclei: a micronucleus and a macronucleus. The micronucleus serves as the germ line nucleus but does not express its genes. The macronucleus provides the nuclear RNA for vegetative growth. Mating cells exchange haploid micronuclei, and a new macronucleus develops from a new diploid micronucleus. The old macronucleus is destroyed. This conversion consists of amplification, elimination, fragmentation, and splicing of DNA sequences on a massive scale. Fragmentation produces subchromosomal molecules in Tetrahymena and Paramecium cells and much smaller, gene-sized molecules in hypotrichous ciliates to which telomere sequences are added. These molecules are then amplified, some to higher copy numbers than others. rDNA is differentially amplified to thousands of copies per macronucleus. Eliminated sequences include transposonlike elements and sequences called internal eliminated sequences that interrupt gene coding regions in the micronuclear genome. Some, perhaps all, of these are excised as circular molecules and destroyed. In at least some hypotrichs, segments of some micronuclear genes are scrambled in a nonfunctional order and are recorded during macronuclear development. Vegetatively growing ciliates appear to possess a mechanism for adjusting copy numbers of individual genes, which corrects gene imbalances resulting from random distribution of DNA molecules during amitosis of the macronucleus. Other distinctive features of ciliate DNA include an altered use of the conventional stop codons.
Collapse
Affiliation(s)
- D M Prescott
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder 80309-0347
| |
Collapse
|
10
|
Abstract
The alpha- and the beta-tubulin genes of the hypotrichous ciliate Euplotes octocarinatus were isolated from a size-selected macronuclear DNA library. The alpha-tubulin gene is located on a 1,587 bp macronuclear DNA molecule and the beta-tubulin gene on a 1,524 bp macronuclear DNA molecule. Sequencing revealed that all the cysteine residues of the two genes are encoded by the common cysteine codons UGU and UGC and none by an UGA codon. This is in contrast to the genes of E. octocarinatus sequenced so far, where some of the cysteines are encoded by the opal codon UGA. The tubulin genes end like other Euplotes genes with a TAA. They do not contain introns. The last codon for an amino acid in the alpha-tubulin gene is a GAA which codes for glutamic acid. This is in contrast to what has been reported for most alpha-tubulin genes, but it supports for other hypotrichous ciliates. No evidence for the existence of more than one type of alpha- and one type of beta-tubulin genes could be obtained.
Collapse
Affiliation(s)
- A Liang
- Institut für Allgemeine Zoologie und Genetik Universität Münster, Germany
| | | | | |
Collapse
|
11
|
Ghosh S, Jaraczewski JW, Klobutcher LA, Jahn CL. Characterization of transcription initiation, translation initiation, and poly(A) addition sites in the gene-sized macronuclear DNA molecules of Euplotes. Nucleic Acids Res 1994; 22:214-21. [PMID: 7907176 PMCID: PMC307774 DOI: 10.1093/nar/22.2.214] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The DNA in the transcriptionally active macronucleus of the hypotrichous ciliate Euplotes crassus exists as short, linear molecules with each molecule encoding a single genetic function. Previous work has indicated that coding regions occupy the majority of macronuclear DNA molecules. In the present study we have defined the transcription initiation sites and poly(A) addition sites for a number of different macronuclear genes in Euplotes crassus. Our results indicate that mature transcripts represent all but approximately 100-200 bases of the non-telomeric sequences in macronuclear DNA molecules. We have also examined the sequences in the vicinity of transcription start sites, poly(A) addition sites, and translation initiation sites for Euplotes species genes in an attempt to define the cis-acting elements that control these processes. Our results indicate that some of the common sequence elements known to control these processes in higher eukaryotes are likely not utilized by Euplotes genes. The data do indicate the presence of other conserved sequences both preceding and at the site of poly(A) addition, as well as at the site of translation initiation. These conserved sequences may serve an analogous role in these organisms. Finally, we have found that most macronuclear DNA molecules have transcription initiation sites within 30 bp of the telomere, suggesting that the telomere may play a role in promoting transcription.
Collapse
Affiliation(s)
- S Ghosh
- Department of Biochemistry, University of Connecticut Health Center, Farmington 06030
| | | | | | | |
Collapse
|