1
|
Anjos A, Milani D, Bardella VB, Paladini A, Cabral-de-Mello DC. Evolution of satDNAs on holocentric chromosomes: insights from hemipteran insects of the genus Mahanarva. CHROMOSOME RESEARCH : AN INTERNATIONAL JOURNAL ON THE MOLECULAR, SUPRAMOLECULAR AND EVOLUTIONARY ASPECTS OF CHROMOSOME BIOLOGY 2023; 31:5. [PMID: 36705735 DOI: 10.1007/s10577-023-09710-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 11/09/2022] [Accepted: 12/05/2022] [Indexed: 01/28/2023]
Abstract
Satellite DNAs (satDNAs) constitute one of the main components of eukaryote genomes and are involved in chromosomal organization and diversification. Although largely studied, little information was gathered about their evolution on holocentric species, i.e., diffuse centromeres, which, due to differences in repeat organization, could result in different evolutionary patterns. Here, we combined bioinformatics and cytogenetic approaches to evaluate the evolution of the satellitomes in Mahanarva holocentric insects. In two species, de novo identification revealed a high number of satDNAs, 110 and 113, with an extreme monomer length range of 18-4228 bp. The overall abundance of satDNAs was observed to be 6.67% in M. quadripunctata and 1.98% in M. spectabilis, with different abundances for the shared satDNAs. Chromosomal mapping of the most abundant repeats of M. quadripunctata and M. spectabilis on other Mahanarva reinforced the dynamic nature of satDNAs. Variable patterns of chromosomal distribution for the satDNAs were noticed, with the occurrence of clusters on distinct numbers of chromosomes and at different positions and the occurrence of scattered signals or nonclustered satDNAs. Altogether, our data demonstrated the high dynamism of satDNAs in Mahanarva with the involvement of this genomic fraction in chromosome diversification of the genus. The general characteristics and patterns of evolution of satDNAs are similar to those observed on monocentric chromosomes, suggesting that the differential organization of genome compartments observed on holocentric chromosomes compared with monocentric chromosomes does not have a large impact on the evolution of satDNAs. Analysis of the satellitomes of other holocentric species in a comparative manner will shed light on this issue.
Collapse
Affiliation(s)
- Allison Anjos
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências, UNESP, Rio Claro, SP, 13506-900, Brazil
| | - Diogo Milani
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências, UNESP, Rio Claro, SP, 13506-900, Brazil
| | - Vanessa B Bardella
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências, UNESP, Rio Claro, SP, 13506-900, Brazil
| | - Andressa Paladini
- Departamento de Ecologia e Evolução, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
| | - Diogo C Cabral-de-Mello
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências, UNESP, Rio Claro, SP, 13506-900, Brazil.
- Department of Experimental Biology, Genetics Area, University of Jaén, Paraje las Lagunillas s/n, 23071, Jaen, Spain.
| |
Collapse
|
2
|
Subirana JA, Albà MM, Messeguer X. High evolutionary turnover of satellite families in Caenorhabditis. BMC Evol Biol 2015; 15:218. [PMID: 26438045 PMCID: PMC4595182 DOI: 10.1186/s12862-015-0495-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Accepted: 09/22/2015] [Indexed: 02/07/2023] Open
Abstract
Background The high density of tandem repeat sequences (satellites) in nematode genomes and the availability of genome sequences from several species in the group offer a unique opportunity to better understand the evolutionary dynamics and the functional role of these sequences. We take advantage of the previously developed SATFIND program to study the satellites in four Caenorhabditis species and investigate these questions. Methods The identification and comparison of satellites is carried out in three steps. First we find all the satellites present in each species with the SATFIND program. Each satellite is defined by its length, number of repeats, and repeat sequence. Only satellites with at least ten repeats are considered. In the second step we build satellite families with a newly developed alignment program. Satellite families are defined by a consensus sequence and the number of satellites in the family. Finally we compare the consensus sequence of satellite families in different species. Results We give a catalog of individual satellites in each species. We have also identified satellite families with a related sequence and compare them in different species. We analyze the turnover of satellites: they increased in size through duplications of fragments of 100-300 bases. It appears that in many cases they have undergone an explosive expansion. In C. elegans we have identified a subset of large satellites that have strong affinity for the centromere protein CENP-A. We have also compared our results with those obtained from other species, including one nematode and three mammals. Conclusions Most satellite families found in Caenorhabditis are species-specific; in particular those with long repeats. A subset of these satellites may facilitate the formation of kinetochores in mitosis. Other satellite families in C. elegans are either related to Helitron transposons or to meiotic pairing centers. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0495-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Juan A Subirana
- Department of Computer Science, Universitat Politècnica de Catalunya, Jordi Girona 31, Barcelona, 08034, Spain. .,Evolutionary Genomics Group, Research Programme on Biomedical Informatics (GRIB) - Hospital del Mar Research Institute (IMIM), Universitat Pompeu Fabra (UPF), Dr. Aiguader 86, Barcelona, 08003, Spain.
| | - M Mar Albà
- Evolutionary Genomics Group, Research Programme on Biomedical Informatics (GRIB) - Hospital del Mar Research Institute (IMIM), Universitat Pompeu Fabra (UPF), Dr. Aiguader 86, Barcelona, 08003, Spain.
| | - Xavier Messeguer
- Department of Computer Science, Universitat Politècnica de Catalunya, Jordi Girona 31, Barcelona, 08034, Spain.
| |
Collapse
|
3
|
Glauser DA. The multiplicity of alternative splicing decisions in Caenorhabditis elegans is linked to specific intronic regulatory motifs and minisatellites. BMC Genomics 2014; 15:364. [PMID: 24884695 PMCID: PMC4039745 DOI: 10.1186/1471-2164-15-364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Accepted: 04/15/2014] [Indexed: 11/28/2022] Open
Abstract
Background Alternative splicing diversifies the pool of messenger RNA molecules encoded by individual genes. This diversity is particularly high when multiple splicing decisions cause a combinatorial arrangement of several alternate exons. We know very little on how the multiple decisions occurring during the maturation of single transcripts are coordinated and whether specific sequence elements might be involved. Results Here, the Caenorhabditis elegans genome was surveyed in order to identify sequence elements that might play a specific role in the regulation of multiple splicing decisions. The introns flanking alternate exons in transcripts whose maturation involves multiple alternative splicing decisions were compared to those whose maturation involves a single decision. Fifty-eight penta-, hexa-, and hepta-meric elements, clustered in 17 groups, were significantly over-represented in genes subject to multiple alternative splicing decisions. Most of these motifs relate to known splicing regulatory elements and appear to be well conserved in the related species Caenorhabditis briggsae. The usage of specific motifs is not linked to the gene product function, but rather depends on the gene structure, since it is influenced by the distance separating the multiple splicing decision sites. Two of these motifs are part of the CeRep25B minisatellite, which is also over-represented at the vicinity of alternative splicing regions. Most of the remaining motifs are not part of repeated sequence elements, but tend to occur in specific heterologous pairs in genes subject to multiple alternative splicing decisions. Conclusions The existence of specific intronic sequence elements linked to multiple alternative splicing decisions is intriguing and suggests that these elements might have some specialized regulatory role during splicing. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-364) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dominique A Glauser
- Department of Biology, University of Fribourg, Chemin du Musée 10, 1700 Fribourg, Switzerland.
| |
Collapse
|
4
|
Manicardi GC, Mandrioli M, Blackman RL. The cytogenetic architecture of the aphid genome. Biol Rev Camb Philos Soc 2014; 90:112-25. [PMID: 24593177 DOI: 10.1111/brv.12096] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Revised: 02/04/2014] [Accepted: 02/07/2014] [Indexed: 12/28/2022]
Abstract
In recent years aphids, with their well-defined polyphenism, have become favoured as model organisms for the study of epigenetic processes. The availability of the pea aphid (Acyrthosiphon pisum) genome sequence has engendered much research aimed at elucidating the mechanisms by which the phenotypic plasticity of aphids is inherited and controlled. Yet so far this research effort has paid little attention to the cytogenetic processes that play a vital part in the organisation, expression and inheritance of the aphid genome. Aphids have holocentric chromosomes, which have very different properties from the chromosomes with localised centromeres that are found in most other organisms. Here we review the diverse forms of aphid chromosome behaviour that occur during sex determination and male and female meiosis, often in response to environmental changes and mediated by endocrine factors. Remarkable differences occur, even between related species, that could have significant effects on the inheritance of all or parts of the genome. In relation to this, we review the particular features of the distribution of heterochromatin, rDNA genes and other repetitive DNA in aphid chromosomes, and discuss the part that these may play in the epigenetic modification of chromatin structure and function.
Collapse
Affiliation(s)
- Gian Carlo Manicardi
- Department of Life Sciences, University of Modena and Reggio Emilia, Besta Building, via Amendola 2, 42122, Reggio Emilia, Italy
| | | | | |
Collapse
|
5
|
Monti V, Manicardi GC, Mandrioli M. Distribution and molecular composition of heterochromatin in the holocentric chromosomes of the aphid Rhopalosiphum padi (Hemiptera: Aphididae). Genetica 2010; 138:1077-84. [DOI: 10.1007/s10709-010-9493-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Accepted: 08/27/2010] [Indexed: 11/24/2022]
|
6
|
Abstract
The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. We also study two chromosomes of man and mouse with a length similar to the whole genomes of the other species. We provide a list of the most common sequences of 9–14 bases in each genome. As expected, they are present in human Alu sequences. Our programs may also give a graph and a list of their position in the genome. Detection of clusters is also possible. In most cases, these sequences contain few alternating regions. Their intrinsic structure and their influence on nucleosome formation are not known. In particular, we have found new features of short sequences in C. elegans, which are distributed in heterogeneous clusters. They appear as punctuation marks in the chromosomes. Such clusters are not found in either A. thaliana or D. melanogaster. We discuss the possibility that they play a role in centromere function and homolog recognition in meiosis.
Collapse
Affiliation(s)
- Juan A Subirana
- Departament d'Enginyeria Química, Universitat Politècnica de Catalunya, Av. Diagonal 647, E-08028, Barcelona, Spain.
| | | |
Collapse
|
7
|
Cutter AD, Dey A, Murray RL. Evolution of the Caenorhabditis elegans genome. Mol Biol Evol 2009; 26:1199-234. [PMID: 19289596 DOI: 10.1093/molbev/msp048] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
A fundamental problem in genome biology is to elucidate the evolutionary forces responsible for generating nonrandom patterns of genome organization. As the first metazoan to benefit from full-genome sequencing, Caenorhabditis elegans has been at the forefront of research in this area. Studies of genomic patterns, and their evolutionary underpinnings, continue to be augmented by the recent push to obtain additional full-genome sequences of related Caenorhabditis taxa. In the near future, we expect to see major advances with the onset of whole-genome resequencing of multiple wild individuals of the same species. In this review, we synthesize many of the important insights to date in our understanding of genome organization and function that derive from the evolutionary principles made explicit by theoretical population genetics and molecular evolution and highlight fertile areas for future research on unanswered questions in C. elegans genome evolution. We call attention to the need for C. elegans researchers to generate and critically assess nonadaptive hypotheses for genomic and developmental patterns, in addition to adaptive scenarios. We also emphasize the potential importance of evolution in the gonochoristic (female and male) ancestors of the androdioecious (hermaphrodite and male) C. elegans as the source for many of its genomic and developmental patterns.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology and the Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
8
|
Hokii Y, Kubo A, Ogasawara T, Nogi Y, Taneda A, Arai R, Muto A, Ushida C. Twelve novel C. elegans RNA candidates isolated by two-dimensional polyacrylamide gel electrophoresis. Gene 2005; 365:83-7. [PMID: 16356666 DOI: 10.1016/j.gene.2005.09.039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2005] [Revised: 07/18/2005] [Accepted: 09/07/2005] [Indexed: 10/25/2022]
Abstract
C. elegans small RNAs (<50 nt) were separated by two-dimensional gel electrophoresis (2D-PAGE). cDNAs were prepared from the RNAs extracted from randomly chosen 2D-PAGE spots. Although many cDNA sequences corresponded to parts of known RNAs, twelve novel small RNA candidates were identified: eleven from 2D-PAGE spots of the mixed-stage worm RNA preparation and one from those of the embryonic RNA preparation. These are encoded in the intergenic regions, in the introns of protein-coding genes, in the anti-sense strand of protein-coding sequences and repetitive sequence regions of the genome. None of them showed a characteristic structure of miRNAs, suggesting that they are candidates of other or new classes of RNAs.
Collapse
MESH Headings
- Animals
- Base Sequence
- Caenorhabditis elegans/embryology
- Caenorhabditis elegans/genetics
- Caenorhabditis elegans/growth & development
- Caenorhabditis elegans Proteins/genetics
- Chromosomes
- Cloning, Molecular
- DNA, Complementary/genetics
- Electrophoresis, Gel, Two-Dimensional/methods
- Embryo, Nonmammalian
- Exons
- Genes, Helminth
- Genome
- Introns
- MicroRNAs/chemistry
- MicroRNAs/genetics
- MicroRNAs/isolation & purification
- Molecular Sequence Data
- Nucleic Acid Conformation
- RNA, Antisense/chemistry
- RNA, Antisense/genetics
- RNA, Antisense/isolation & purification
- RNA, Helminth/chemistry
- RNA, Helminth/genetics
- RNA, Helminth/isolation & purification
- RNA, Small Interfering/chemistry
- RNA, Small Interfering/genetics
- RNA, Small Interfering/isolation & purification
- Repetitive Sequences, Nucleic Acid
- Sequence Analysis, RNA
Collapse
Affiliation(s)
- Yusuke Hokii
- Department of Biochemistry and Biotechnology, Faculty of Agriculture and Life Science, Hirosaki University, Hirosaki, Aomori, 036-8561, Japan
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Denver DR, Morris K, Lynch M, Thomas WK. High mutation rate and predominance of insertions in the Caenorhabditis elegans nuclear genome. Nature 2004; 430:679-82. [PMID: 15295601 DOI: 10.1038/nature02697] [Citation(s) in RCA: 258] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2004] [Accepted: 06/01/2004] [Indexed: 11/09/2022]
Abstract
Mutations have pivotal functions in the onset of genetic diseases and are the fundamental substrate for evolution. However, present estimates of the spontaneous mutation rate and spectrum are derived from indirect and biased measurements. For instance, mutation rate estimates for Caenorhabditis elegans are extrapolated from observations on a few genetic loci with visible phenotypes and vary over an order of magnitude. Alternative approaches in mammals, relying on phylogenetic comparisons of pseudogene loci and fourfold degenerate codon positions, suffer from uncertainties in the actual number of generations separating the compared species and the inability to exclude biases associated with natural selection. Here we provide a direct and unbiased estimate of the nuclear mutation rate and its molecular spectrum with a set of C. elegans mutation-accumulation lines that reveal a mutation rate about tenfold higher than previous indirect estimates and an excess of insertions over deletions. Because deletions dominate patterns of C. elegans pseudogene variation, our observations indicate that natural selection might be significant in promoting small genome size, and challenge the prevalent assumption that pseudogene divergence accurately reflects the spontaneous mutation spectrum.
Collapse
Affiliation(s)
- Dee R Denver
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.
| | | | | | | |
Collapse
|
10
|
Mandrioli M, Manicardi GC, Marec F. Cytogenetic and molecular characterization of the MBSAT1 satellite DNA in holokinetic chromosomes of the cabbage moth, Mamestra brassicae (Lepidoptera). Chromosome Res 2003; 11:51-6. [PMID: 12675305 DOI: 10.1023/a:1022058032217] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Digestion of Mamestra brassicae DNA with DraI produced a prominent fragment of approximately 200 bp and a ladder of electrophoretic bands with molecular weights which are a multiple of 200 bp. Southern blotting revealed that this ladder is composed of DNA fragments that are multimers of the 200-bp DraI band suggesting that DraI isolated a satellite that has been called Mamestra brassicae satellite DNA1 (MBSAT1). MBSAT1 is the first satellite DNA isolated in Lepidoptera. In-situ DraI digestion of chromosome spreads, together with fluorescent in-situ hybridization, showed that MBSAT1 sequences are clustered in heterochromatin of the sex chromosomes, Z and W. MBSAT1 was 234 bp long with an AT content of 60.7%. The curvature-propensity plot suggested a curvature in the MBSAT1 structure.
Collapse
Affiliation(s)
- Mauro Mandrioli
- Dipartimento di Biologia Animale, Università di Modena e Reggio Emilia, Via Campi 213/D, 41100 Modena, Italy.
| | | | | |
Collapse
|
11
|
Abstract
All eukaryotic DNA transposons reported so far belong to a single category of elements transposed by the so-called "cut-and-paste" mechanism. Here, we report a previously unknown category of eukaryotic DNA transposons, Helitron, which transpose by rolling-circle replication. Autonomous Helitrons encode a 5'-to-3' DNA helicase and nuclease/ligase similar to those encoded by known rolling-circle replicons. Helitron-like transposons have conservative 5'-TC and CTRR-3' termini and do not have terminal inverted repeats. They contain 16- to 20-bp hairpins separated by 10--12 nucleotides from the 3'-end and transpose precisely between the 5'-A and T-3', with no modifications of the AT target sites. Together with their multiple diverged nonautonomous descendants, Helitrons constitute approximately 2% of both the Arabidopsis thaliana and Caenorhabditis elegans genomes and also colonize the Oriza sativa genome. Sequence conservation suggests that Helitrons continue to be transposed.
Collapse
Affiliation(s)
- V V Kapitonov
- Genetic Information Research Institute, 2081 Landings Drive, Mountain View, CA 94043, USA.
| | | |
Collapse
|
12
|
Sanford C, Perry MD. Asymmetrically distributed oligonucleotide repeats in the Caenorhabditis elegans genome sequence that map to regions important for meiotic chromosome segregation. Nucleic Acids Res 2001; 29:2920-6. [PMID: 11452017 PMCID: PMC55808 DOI: 10.1093/nar/29.14.2920] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2001] [Accepted: 05/30/2001] [Indexed: 11/14/2022] Open
Abstract
The roundworm Caenorhabditis elegans has a haploid karyotype containing six linear chromosomes. The termini of worm chromosomes have been proposed to play an important role in meiotic prophase, either when homologs are participating in a genome-wide search for their proper partners or in the initiation of synapsis. For each chromosome one end appears to stimulate crossing-over with the correct homolog; the other end lacks this property. We have used a bioinformatics approach to identify six repetitive sequence elements in the sequenced C.elegans genome whose distribution closely parallels these putative meiotic pairing centers (MPC) or homolog recognition regions (HRR). We propose that these six DNA sequence elements, which are largely chromosome specific, may correspond to the genetically defined HRR/MPC elements.
Collapse
Affiliation(s)
- C Sanford
- Department of Molecular and Medical Genetics, Faculty of Medicine, University of Toronto, 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada
| | | |
Collapse
|
13
|
Duret L, Marais G, Biémont C. Transposons but not retrotransposons are located preferentially in regions of high recombination rate in Caenorhabditis elegans. Genetics 2000; 156:1661-9. [PMID: 11102365 PMCID: PMC1461346 DOI: 10.1093/genetics/156.4.1661] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We analyzed the distribution of transposable elements (TEs: transposons, LTR retrotransposons, and non-LTR retrotransposons) in the chromosomes of the nematode Caenorhabditis elegans. The density of transposons (DNA-based elements) along the chromosomes was found to be positively correlated with recombination rate, but this relationship was not observed for LTR or non-LTR retrotransposons (RNA-based elements). Gene (coding region) density is higher in regions of low recombination rate. However, the lower TE density in these regions is not due to the counterselection of TE insertions within exons since the same positive correlation between TE density and recombination rate was found in noncoding regions (both in introns and intergenic DNA). These data are not compatible with a global model of selection acting against TE insertions, for which an accumulation of elements in regions of reduced recombination is expected. We also found no evidence for a stronger selection against TE insertions on the X chromosome compared to the autosomes. The difference in distribution of the DNA and RNA-based elements along the chromosomes in relation to recombination rate can be explained by differences in the transposition processes.
Collapse
Affiliation(s)
- L Duret
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, 69622 Villeurbanne Cedex, France.
| | | | | |
Collapse
|
14
|
Han CG, Frank MJ, Ohtsubo H, Ohtsubo E. New transposable elements identified as insertions in rice transposon Tnr1. Genes Genet Syst 2000; 75:69-77. [PMID: 10925785 DOI: 10.1266/ggs.75.69] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Tnr1 (235 bp long) is a transposable element in rice. Polymerase chain reactions (PCRs) done with a primer(s) that hybridizes to terminal inverted repeat sequences (TIRs) of Tnr1 detected new Tnr1 members with one or two insertions in rice genomes. Six identified insertion sequences (Tnr4, Tnr5, Tnr11, Tnr12, Tnr13 and RIRE9) did not have extensive homology to known transposable elements, rather they had structural features characteristic of transposable elements. Tnr4 (1767 bp long) had imperfect 64-bp TIRs and appeared to generate duplication of a 9-bp sequence at the target site. However, the TIR sequences were not homologous to those of known transposable elements, indicative that Tnr4 is a new transposable element. Tnr5 (209 bp long) had imperfect 46-bp TIRs and appeared to generate duplication of sequence TTA like that of some elements of the Tourist family. Tnr11 (811 bp long) had 73-bp TIRs with significant homology to those of Tnr1 and Stowaway and appeared to generate duplication of sequence TA, indicative that Tnr11 is a transposable element of the Tnr1/Stowaway family. Tnr12 (2426 bp long) carried perfect 9-bp TIRs, which began with 5'-CACTA- -3' from both ends and appeared to generate duplication of a 3-bp target sequence, indicative that Tnr12 is a transposable element of the En/Spm family. Tnr13 (347 bp long) had 31-bp TIRs and appeared to generate duplication of an 8-bp target sequence. Two sequences, one the transposon-like element Crackle, had partial homology in the Tnr13 ends. All five insertions appear to be defective elements derived from autonomous ones encoding the transposase gene. All had characteristic tandem repeat sequences which may be recognized by transposase. The sixth insertion sequence, named RIRE9 (3852 bp long), which begins with 5'-TG- -3' and ends with 5'- -CA-3', appeared to generate duplication of a 5-bp target sequence. These and other structural features indicate that this insertion is a solo LTR (long terminal repeat) of a retrotransposon. The transposable elements described above could be identified as insertions into Tnr1, which do not deleteriously affect the growth of rice cells.
Collapse
Affiliation(s)
- C G Han
- Institute of Molecular and Cellular Biosciences, University of Tokyo, Japan
| | | | | | | |
Collapse
|
15
|
Popovici C, Roubin R, Coulier F, Pontarotti P, Birnbaum D. The family of Caenorhabditis elegans tyrosine kinase receptors: similarities and differences with mammalian receptors. Genome Res 1999; 9:1026-39. [PMID: 10568743 DOI: 10.1101/gr.9.11.1026] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transmembrane receptors with tyrosine kinase activity (RTK) constitute a superfamily of proteins present in all metazoans that is associated with the control and regulation of cellular processes. They have been the focus of numerous studies and are a good subject for comparative analyses of multigene families in different species aimed at understanding metazoan evolution. The sequence of the genome of the nematode worm Caenorhabditis elegans is available. This offers a good opportunity to study the superfamily of nematode RTKs in its entirety and to compare it with its mammalian counterpart. We show that the C. elegans RTKs constitute various groups with different phylogenetic relationships with mammalian RTKs. A group of four RTKs show structural similarity with the three mammalian receptors for the vascular endothelial growth factors. Another group comprises RTKs with a short extracellular region, a feature not known in mammals; the genes encoding these RTKs are clustered on chromosome II with other gene families, including genes encoding chitinase-like proteins. Most of the C. elegans RTKs have no direct orthologous relationship with any mammalian RTK, providing an illustration of the importance of the separate evolution of the different phyla.
Collapse
Affiliation(s)
- C Popovici
- Laboratoire d'Oncologie Moléculaire, U.119 Institut National de la Santé et de la Recherche Médicale (INSERM), Marseille, France
| | | | | | | | | |
Collapse
|
16
|
Williams SA. Deep within the filarial genome: progress of the filarial genome project. PARASITOLOGY TODAY (PERSONAL ED.) 1999; 15:219-24. [PMID: 10366827 DOI: 10.1016/s0169-4758(99)01454-4] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Four years ago, a WHO/United Nations Development Programme/World Bank-sponsored genome project to study the filarial lymphatic nematode parasite Brugia malayi was initiated. The project took as its aims gene discovery for drug target and vaccine candidate identification, genome mapping, dissemination of genomic data to the world community and training of endemic country partners in genomic research. In this article, the principal investigators in the laboratories behind the project describe the background to the project, the data now emerging and goals for the future. Open access to filarial genome data is emphasized.
Collapse
Affiliation(s)
- S A Williams
- Department of Biological Sciences, Clark Science Center, Smith College, Northampton, MA 01063, USA.
| |
Collapse
|
17
|
Pilgrim D. CeRep25B forms chromosome-specific minisatellite arrays in Caenorhabditis elegans. Genome Res 1998; 8:1192-201. [PMID: 9847081 PMCID: PMC310793 DOI: 10.1101/gr.8.11.1192] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
With the completion of the Genome Sequencing Project, it is now possible to rapidly and accurately determine the frequency and position of a particular repeat sequence in the Caenorhabditis elegans genome. Several repeat sequences with a variety of characteristics have been examined and with few exceptions they show a near-random distribution throughout the genome. We characterized several genes near the left end of Chromosome III in the C. elegans genome, and found a 24-bp minisatellite repeat sequence present in the introns of two unrelated genes. This prompted a search of the databank for other occurrences of this sequence. Multiple copy arrays of this repeat are all located on the same autosome and fall in two clusters: one near the left end, and one in the central region separated by approximately 10 Mb. There are >200 copies of this repeat on the chromosome. This euchromatic repeat sequence seems unrelated to gene expression, is absent from homologous sites in a related species, is unstable in Escherichia coli, and is polymorphic between different wild isolates of C. elegans. Most CeRep25B units in the array match the consensus sequence very well, suggesting that either this repeat originated quite recently or its sequence is functionally constrained. Although chromosome-specific repeat sequences have been reported previously in many organisms, such sequences are usually structural and heterochromatic (e.g., centromeric alpha-satellite) or on the mammalian sex chromosomes. This report describes the first confirmed instance from a whole genome sequencing project of an autosomal euchromatic chromosome-specific minisatellite repeat.
Collapse
Affiliation(s)
- D Pilgrim
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada T6G 2E9.
| |
Collapse
|
18
|
Rezsohazy R, van Luenen HG, Durbin RM, Plasterk RH. Tc7, a Tc1-hitch hiking transposon in Caenorhabditis elegans. Nucleic Acids Res 1997; 25:4048-54. [PMID: 9321656 PMCID: PMC147001 DOI: 10.1093/nar/25.20.4048] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We have found a novel transposon in the genome of Caenorhabditis elegans. Tc7 is a 921 bp element, made up of two 345 bp inverted repeats separated by a unique, internal sequence. Tc7 does not contain an open reading frame. The outer 38 bp of the inverted repeat show 36 matches with the outer 38 bp of Tc1. This region of Tc1 contains the Tc1-transposase binding site. Furthermore, Tc7 is flanked by TA dinucleotides, just like Tc1, which presumably correspond to the target duplication generated upon integration. Since Tc7 does not encode its own transposase but contains the Tc1-transposase binding site at its extremities, we tested the ability of Tc7 to jump upon forced expression of Tc1 transposase in somatic cells. Under these conditions Tc7 jumps at a frequency similar to Tc1. The target site choice of Tc7 is identical to that of Tc1. These data suggest that Tc7 shares with Tc1 all the sequences minimally required to parasitize upon the Tc1 transposition machinery. The genomic distribution of Tc7 shows a striking clustering on the X chromosome where two thirds of the elements (20 out of 33) are located. Related transposons in C. elegans do not show this asymmetric distribution.
Collapse
Affiliation(s)
- R Rezsohazy
- Division of Molecular Biology, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | | | | | | |
Collapse
|
19
|
Sonnhammer EL, Durbin R. A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene X 1995; 167:GC1-10. [PMID: 8566757 DOI: 10.1016/0378-1119(95)00714-8] [Citation(s) in RCA: 575] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Graphical dot-matrix plots can provide the most complete and detailed comparison of two sequences. Presented here is DOTTER2, a dot-plot program for X-windows which can compare DNA or protein sequences, and also DNA versus protein. The main novel feature of DOTTER is that the user can vary the stringency cutoffs interactively, so that the dot-matrix only needs to be calculated once. This is possible thanks to a 'Greyramp tool' that was developed to change the displayed stringency of the matrix by dynamically changing the greyscale rendering of the dots. The Greyramp tool allows the user to interactively change the lower and upper score limit for the greyscale rendering. This allows exploration of the separation between signal and noise, and fine-grained visualisation of different score levels in the dot-matrix. Other useful features are dot-matrix compression, mouse-controlled zooming, sequence alignment display and saving/loading of dot-matrices. Since the matrix only has to be calculated once and since the algorithm is fast and linear in space, DOTTER is practical to use even for sequences as long as cosmids. DOTTER was integrated in the gene-modelling module of the genomic database system ACEDB3. This was done via the homology viewer BLIXEM in a way that also allows segments from the BLAST suite of searching programs to be superimposed on top of the full dot-matrix. This feature can also be used for very quick finding of the strongest matches. As examples, we analyse a Caenorhabditis elegans cosmid with several tandem repeat families, and illustrate how DOTTER can improve gene modelling.
Collapse
|
20
|
Berks M. The C. elegans genome sequencing project. C. elegans Genome Mapping and Sequencing Consortium. Genome Res 1995; 5:99-104. [PMID: 9132275 DOI: 10.1101/gr.5.2.99] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- M Berks
- The Sanger Centre, Hinxton, Cambridgeshire, UK, mb 1 @sanger.ac.uk
| |
Collapse
|
21
|
Abstract
A great amount of genomic DNA in multicellular eukaryotic organisms is regarded as junk because it has no real function in protein coding. However, there is growing evidence that noncoding DNA can play a vital role in the regulation of gene expression during development (Lee et al., 1993). This indicates that the so-called junk DNA may have essential functions that are yet to be found (Nowak, 1994). A novel binary model of noncoding repetitive DNA sequence is proposed to illustrate its possible structure and implications in genome organization and development.
Collapse
Affiliation(s)
- S J Ting
- Department of Physics, Tunghai University, Taichung, Taiwan, Republic of China
| |
Collapse
|
22
|
Coulson A, Huynh C, Kozono Y, Shownkeen R. The physical map of the Caenorhabditis elegans genome. Methods Cell Biol 1995; 48:533-50. [PMID: 8531741 DOI: 10.1016/s0091-679x(08)61402-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- A Coulson
- Sanger Centre, Cambridge, United Kingdom
| | | | | | | |
Collapse
|
23
|
La Volpe A. A repetitive DNA family, conserved throughout the evolution of free-living nematodes. J Mol Evol 1994; 39:473-7. [PMID: 7807537 DOI: 10.1007/bf00173416] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
This paper concerns the molecular evolution of a tandemly repeated DNA family, RcC9, originally identified in Caenorhabditis elegans. The minimum unit of periodicity of this family is the pentanucleotide [nGAAn] and its complement [nTTCn] recurring several times in alternating tandem arrays. This consensus sequence is identical to that of the heat-shock element (HSE). Multiple HSEs are present in the regulatory regions of heat-inducible genes in a wide range of eukaryotic species; HSEs mediate transcriptional activation through the binding of the heat-shock factor (HSF). I describe some repeated DNA families sharing this same consensus and found in nematode species other than C. elegans. Although the consensus is conserved, the repeated sequence diverged between species to the point that cross-hybridization is abolished. Evolutionary implications will be discussed.
Collapse
Affiliation(s)
- A La Volpe
- CNR International Institute of Genetics and Biophysics, Napoli, Italy
| |
Collapse
|
24
|
Sapolsky RJ, Brendel V, Karlin S. A comparative analysis of distinctive features of yeast protein sequences. Yeast 1993; 9:1287-98. [PMID: 8154180 DOI: 10.1002/yea.320091202] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The recently published sequence of yeast chromosome III (YCIII) provides the longest continuous stretch of a eukaryotic DNA molecule sequenced to date (315 kb). The sequence contains 116 distinct AUG-initiated open reading frames of at least 200 codons in length, more than 50 of which had not been described previously nor bear significant similarity to known proteins. We have analysed the YCIII known and putative protein sequences with respect to significant statistical features which might reflect on structural and functional characteristics. The YCIII proteins have striking similarities and differences in their sequence attribute distributions compared to the corresponding distributions for all available yeast sequences and other protein collections. Nine examples of YCIII proteins with distinctive sequence features are discussed in detail.
Collapse
Affiliation(s)
- R J Sapolsky
- Department of Mathematics, Stanford University, CA 94305-2125
| | | | | |
Collapse
|
25
|
Cangiano G, La Volpe A. Repetitive DNA sequences located in the terminal portion of the Caenorhabditis elegans chromosomes. Nucleic Acids Res 1993; 21:1133-9. [PMID: 8096635 PMCID: PMC309273 DOI: 10.1093/nar/21.5.1133] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
We describe the distribution along the chromosomes of Caenorhabditis elegans of two repetitive DNA families, RcS5 and Cerep3 and interstitial telomeric sequences. Both families show, among other interesting features, a preferential location in the terminal 30% of the chromosomes. It is known that in these regions of the genome the frequency of recombination is much higher than in the central portion, genes are rarer and sequences important for chromosome disjunction may lie.
Collapse
Affiliation(s)
- G Cangiano
- CNR International Institute of Genetics and Biophysics, Naples, Italy
| | | |
Collapse
|