1
|
Robicheau BM, Susko E, Harrigan AM, Snyder M. Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome. Genome Biol Evol 2018; 9:380-397. [PMID: 28204512 PMCID: PMC5381670 DOI: 10.1093/gbe/evw307] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/30/2016] [Indexed: 12/20/2022] Open
Abstract
Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an “rDNA-like signal”, representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance.
Collapse
Affiliation(s)
- Brent M Robicheau
- Department of Biology, Acadia University, Wolfville, Nova Scotia, Canada
| | - Edward Susko
- Center for Comparative Genomics and Evolutionary Bioinformatics, Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Amye M Harrigan
- Department of Biology, Acadia University, Wolfville, Nova Scotia, Canada
| | - Marlene Snyder
- Department of Biology, Acadia University, Wolfville, Nova Scotia, Canada
| |
Collapse
|
2
|
Kumar V, Dickey AM, Seal DR, Shatters RG, Osborne LS, McKenzie CL. Unexpected High Intragenomic Variation in Two of Three Major Pest Thrips Species Does Not Affect Ribosomal Internal Transcribed Spacer 2 (ITS2) Utility for Thrips Identification. Int J Mol Sci 2017; 18:ijms18102100. [PMID: 28984819 PMCID: PMC5666782 DOI: 10.3390/ijms18102100] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 09/28/2017] [Accepted: 09/30/2017] [Indexed: 11/16/2022] Open
Abstract
The mitochondrial cytochrome oxidase I gene (mtCO1) and the ribosomal internal transcribed spacer 2 region (ITS2) are among the most widely used molecular markers for insect taxonomic characterization. Three economically important species of thrips, Scirtothripsdorsalis, Thripspalmi, and Frankliniellaoccidentalis were selected to examine the extent of intragenomic variation within these two marker regions in the family Thripidae, and determine if this variation would affect the utility of markers in thrips molecular diagnostics. For each species, intragenomic (within individual) variation and intergenomic (among individuals) variation was assessed by cloning and sequencing PCR-amplified copies. Intergenomic variation was generally higher than intragenomic variation except in cases where intergenomic variation was very low, as in mtCO1 from S.dorsalis and F.occidentalis. Intragenomic variation was detected in both markers in all three of the thrips species, however, 2-3 times more intragenomic variation was observed for ITS2 than mtCO1 in both S.dorsalis and T.palmi. Furthermore, levels of intragenomic variation were low for both of the genes in F.occidentalis. In all of the three thrips species, no sex-based clustering of haplotypes was observed in either marker. Unexpected high intragenomic variation in ITS2 for two of three thrips species did not interfere with thrips diagnostics. However, caution should be taken in applying ITS2 to certain studies of S.dorsalis and T.palmi when high levels of intragenomic variation could be problematic or confounding. In such studies, mtCO1 may be a preferable marker. Possible reasons for discrepancies in intragenomic variation among genomic regions are discussed.
Collapse
Affiliation(s)
- Vivek Kumar
- Department of Entomology and Nematology, Mid-Florida Research and Education Center, University of Florida, Apopka, FL 32703, USA; (A.M.D.); (L.S.O.)
- U.S. Horticultural Research Laboratory, USDA-ARS, Fort Pierce, FL 34945, USA; (R.G.S.); (C.L.M.)
- Department of Entomology and Nematology, Tropical Research and Education Center, University of Florida, Homestead, FL 33031, USA;
- Correspondence: ; Tel.: +1-772-462-5978
| | - Aaron M. Dickey
- Department of Entomology and Nematology, Mid-Florida Research and Education Center, University of Florida, Apopka, FL 32703, USA; (A.M.D.); (L.S.O.)
- U.S. Horticultural Research Laboratory, USDA-ARS, Fort Pierce, FL 34945, USA; (R.G.S.); (C.L.M.)
- Present Address: U.S. Meat Animal Research Center, USDA-ARS, Clay Center, NE 68933, USA
| | - Dakshina R. Seal
- Department of Entomology and Nematology, Tropical Research and Education Center, University of Florida, Homestead, FL 33031, USA;
| | - Robert G. Shatters
- U.S. Horticultural Research Laboratory, USDA-ARS, Fort Pierce, FL 34945, USA; (R.G.S.); (C.L.M.)
| | - Lance S. Osborne
- Department of Entomology and Nematology, Mid-Florida Research and Education Center, University of Florida, Apopka, FL 32703, USA; (A.M.D.); (L.S.O.)
| | - Cindy L. McKenzie
- U.S. Horticultural Research Laboratory, USDA-ARS, Fort Pierce, FL 34945, USA; (R.G.S.); (C.L.M.)
| |
Collapse
|
3
|
Makalowski W, Boguski MS. Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. Proc Natl Acad Sci U S A 1998; 95:9407-12. [PMID: 9689093 PMCID: PMC21351 DOI: 10.1073/pnas.95.16.9407] [Citation(s) in RCA: 330] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
We have rigorously defined 2,820 orthologous mRNA and protein sequence pairs from rats, mice, and humans. Evolutionary rate analyses indicate that mammalian genes are evolving 17-30% more slowly than previous textbook values. Data are presented on the average properties of mRNA and protein sequences, on variations in sequence conservation in coding and noncoding regions, and on the absolute and relative frequencies of repetitive elements and splice sites in untranslated regions of mRNAs. Our data set contains 1,880 unique human/rodent sequence pairs that represent about 2-4% of all mammalian genes. Of the 1,880 human orthologs, 70% are present on a new gene map of the human genome, thus providing a valuable resource for cross-referencing human and rodent genomes. In addition to comparative mapping, these results have practical applications in the interpretation of noncoding sequence conservation between syntenic regions of human and mouse genomic sequence, and in the design and calibration of gene expression arrays.
Collapse
Affiliation(s)
- W Makalowski
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
4
|
Safrany G, Hidvegi EJ. New tandem repeat region in the non-transcribed spacer of human ribosomal RNA gene. Nucleic Acids Res 1989; 17:3013-22. [PMID: 2726449 PMCID: PMC317709 DOI: 10.1093/nar/17.8.3013] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
A new repetitive DNA region was identified in the non-transcribed spacer of human rDNA, namely a long (4.6 kb) sequence motif (Xbal element) was present in two copies. The repeating unit composed of two parts. One of them consisted of unique nucleotide sequences, interrupted by some simple sequences. The other, about 3.1 kb long one assembled only from highly repeated simple sequences. The unique sequence region contained two, inverted copies of the human AluI type repetitive DNA family. The authors suggest that the XbaI elements may flank the tandem arrays of human rRNA genes as terminal repeats and they might function both as the origin of rDNA replication and/or site of homologous recombination.
Collapse
Affiliation(s)
- G Safrany
- Frederic Joliot-Curie National Research Institute for Radiobiology and Radiohygiene, Budapest, Hungary
| | | |
Collapse
|
5
|
Braaten DC, Thomas JR, Little RD, Dickson KR, Goldberg I, Schlessinger D, Ciccodicola A, D'Urso M. Locations and contexts of sequences that hybridize to poly(dG-dT).(dC-dA) in mammalian ribosomal DNAs and two X-linked genes. Nucleic Acids Res 1988; 16:865-81. [PMID: 3267216 PMCID: PMC334724 DOI: 10.1093/nar/16.3.865] [Citation(s) in RCA: 58] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Sequences located several kilobases both 5' and 3' of the stably transcribed portion of several genes hybridize to radio-labeled pure fragments of the alternating sequence poly (dG-dT) (dC-dA) ["poly(GT)"]. The genes include the ribosomal DNA of mouse, rat, and human, and also human glucose-6-phosphate dehydrogenase (G6PD) and mouse hypoxanthine-guanine phosphoribosyl transferase (HPRT). HPRT has additional hybridizing sequences in introns. Fragments that include the hybridizing sequences and up to 300 bp of adjoining DNA show perfect runs of poly(GT) (greater than 30bp) in all but the human 5' region of rDNA, which shows a somewhat different alternating purine:pyrimidine sequence, poly(GTAT) (36bp). Within 150 bp of these sequences in various instances are found a number of other sequences reported to affect DNA conformation in model systems. Most marked is an enhancement of sequences matching at least 67% to the consensus binding sequence for topoisomerase II. Two to ten-fold less of such sequences were found in other sequenced portions of the nontranscribed spacer or in the transcribed portion of rDNA. The conservation of the locations of tracts of alternating purine:pyrimidine between evolutionarily diverse species is consistent with a possible functional role for these sequences.
Collapse
Affiliation(s)
- D C Braaten
- Department of Microbiology and Immunology, Washington University School of Medicine, St Louis, MO 63110
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
We report the isolation and partial characterization of distinct EcoRI fragments of the Chinese hamster genome which contain regions complementary to a 1-kb portion of the mature 18 S ribosomal RNA molecule. This previously undescribed 18 S rDNA-like region, which we have termed a "fragment of ribosomal DNA" (frDNA), has been shown by sequence analysis to correspond to a region extending 1 kb upstream from the 3' terminus of the mature 18 S rRNA. Within the five frDNA-containing clones described here, no other region of the ribosomal RNA cistron was detected, making it unlikely that these are polymorphic forms of the ribosomal DNA repeat. The 18 S rDNA-complementary region appears to be flanked by an imperfect direct repeat, which could have been the result of the retroinsertion of a fragment of ribosomal RNA. Directly adjacent to the 18 S rDNA-like region we have identified nonribosomal sequences which appear common to all of the frDNA-containing clones we examined. At least eight different-sized EcoRI fragments contain frDNAs and the abundance of the frDNAs appears to be of the order of 30 per genome. The occurrence of multiple copies of this ribosomal-nonribosomal chimera suggests that, once formed, the chimera was duplicated within the genome.
Collapse
Affiliation(s)
- P J Wejksnora
- Department of Biological Sciences, University of Wisconsin, Milwaukee 53201
| | | | | |
Collapse
|
7
|
Di Nocera PP, Graziani F, Lavorgna G. Genomic and structural organization of Drosophila melanogaster G elements. Nucleic Acids Res 1986; 14:675-91. [PMID: 3003691 PMCID: PMC339457 DOI: 10.1093/nar/14.2.675] [Citation(s) in RCA: 52] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The properties and the genomic organization of G elements, a moderately repeated DNA family of D. melanogaster, are reported. G elements lack terminal repeats, generate target site duplications at the point of insertion and exhibit at one end a stretch of A residues of variable length. In a large number of recombinant clones analyzed G elements occur in tandem arrays, interspersed with specific ribosomal DNA (rDNA) segments. This arrangement results from the insertion of members of the G family within the nontranscribed spacer (NTS) of rDNA units. Similarity of the site of integration of G elements to that of ribosomal DNA insertions suggests that distinct DNA sequences might have been inserted into rDNA through a partly common pathway.
Collapse
|