1
|
Scully OJ, Chua PJ, Harve KS, Bay BH, Yip GW. Serglycin in Health and Diseases. Anat Rec (Hoboken) 2012; 295:1415-20. [DOI: 10.1002/ar.22536] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2012] [Revised: 06/17/2012] [Accepted: 06/23/2012] [Indexed: 11/08/2022]
|
2
|
Suzuki R, Saitou N. Exploration for functional nucleotide sequence candidates within coding regions of mammalian genes. DNA Res 2011; 18:177-87. [PMID: 21586532 PMCID: PMC3111233 DOI: 10.1093/dnares/dsr010] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The primary role of a protein coding gene is to encode amino acids. Therefore, synonymous sites of codons, which do not change the encoded amino acid, are regarded as evolving neutrally. However, if a certain region of a protein coding gene contains a functional nucleotide element (e.g. splicing signals), synonymous sites in the region may have selective pressure. The existence of such elements would be detected by searching regions of low nucleotide substitution. We explored invariant nucleotide sequences in 10 790 orthologous genes of six mammalian species (Homo sapiens, Macaca mulatta, Mus musculus, Rattus norvegicus, Bos taurus, and Canis familiaris), and extracted 4150 sequences whose conservation is significantly stronger than other regions of the gene and named them significantly conserved coding sequences (SCCSs). SCCSs are observed in 2273 genes. The genes are mainly involved with development, transcriptional regulation, and the neurons, and are expressed in the nervous system and the head and neck organs. No strong influence of conventional factors that affect synonymous substitution was observed in SCCSs. These results imply that SCCSs may have double function as nucleotide element and protein coding sequence and retained in the course of mammalian evolution.
Collapse
Affiliation(s)
- Rumiko Suzuki
- Department of Genetics, School of Life Science, Graduate University for Advanced Studies, Mishima 411-8540, Japan
| | | |
Collapse
|
3
|
Smith AM, Sanchez MJ, Follows GA, Kinston S, Donaldson IJ, Green AR, Göttgens B. A novel mode of enhancer evolution: the Tal1 stem cell enhancer recruited a MIR element to specifically boost its activity. Genome Res 2008; 18:1422-32. [PMID: 18687876 PMCID: PMC2527711 DOI: 10.1101/gr.077008.108] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Altered cis-regulation is thought to underpin much of metazoan evolution, yet the underlying mechanisms remain largely obscure. The stem cell leukemia TAL1 (also known as SCL) transcription factor is essential for the normal development of blood stem cells and we have previously shown that the Tal1 +19 enhancer directs expression to hematopoietic stem cells, hematopoietic progenitors, and to endothelium. Here we demonstrate that an adjacent region 1 kb upstream (+18 element) is in an open chromatin configuration and carries active histone marks but does not function as an enhancer in transgenic mice. Instead, it boosts activity of the +19 enhancer both in stable transfection assays and during differentiation of embryonic stem (ES) cells carrying single-copy reporter constructs targeted to the Hprt locus. The +18 element contains a mammalian interspersed repeat (MIR) which is essential for the +18 function and which was transposed to the Tal1 locus approximately 160 million years ago at the time of the mammalian/marsupial branchpoint. Our data demonstrate a previously unrecognized mechanism whereby enhancer activity is modulated by a transposon exerting a "booster" function which would go undetected by conventional transgenic approaches.
Collapse
Affiliation(s)
- Aileen M Smith
- University of Cambridge Department of Haematology, Cambridge Institute for Medical Research, Cambridge CB2 2XY, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
4
|
Hrabcová I, Kypr J. The Longest (A+T) and (G+C) Blocks in the Human and Other Genomes. J Biomol Struct Dyn 2008; 25:337-45. [DOI: 10.1080/07391102.2008.10507182] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
5
|
|
6
|
Abstract
DNA replication initiation sites and initiation frequencies over 12. 5 kb of the human c-myc locus, including 4.6 kb of new 5' sequence, were determined based on short nascent DNA abundance measured by competitive polymerase chain reaction using 21 primer sets. In previous measurements, no comparative quantitation of nascent strand abundance was performed, and distinction of major from minor initiation sites was not feasible. Two major initiation sites were identified in this study. One predominant site has been located at approximately 0.5 kb upstream of exon 1 of the c-myc gene, and a second new major site is located in exon 2. The site in exon 2 has not been previously identified. In addition, there are other sites that may act as less frequently used initiation sites, some of which may correspond to sites in previous reports. Furthermore, a comparison of the abundance of DNA replication intermediates over this same region of the c-myc locus between HeLa and normal skin fibroblast (NSF) cells indicated that the relative distribution was very similar, but that nascent strand abundance in HeLa cells was approximately twice that in NSF relative to the abundance at the lamin B2 origin. This increased activity at initiation sites in the c-myc locus may mainly be influenced by regulators at higher levels in transformed cells like HeLa.
Collapse
Affiliation(s)
- L Tao
- McGill Cancer Center, McGill University, Montreal, Quebec, Canada
| | | | | | | | | |
Collapse
|
7
|
Abstract
We characterized short interspersed elements (SINEs), of the CORE-suprafamily in egg-laying (monotremes), pouched (marsupials) and placental mammals. Five families of these repeats distinguished by the presence of distinct LINE-related 3'-segments shared tRNA-like promoter and the central core region. The putative active elements were reconstructed from the alignment of genomic repeats representing molecular fossils of sequences that amplified in the past and since then underwent multiple mutations. Their mode of proliferation by retroposition was indicated by the presence of: (1) internal RNA PolIII promoter; (2) simple sequence repeated tail; (3) direct repeats; and (4) subfamilies recording the evolution of elements. The copy number of CORE-SINEs in placental genomes was estimated at about 300,000; they were highly divergent and apparently ceased to amplify before radiation of these lineages. On the other hand, among almost half a million fossil elements present in marsupials and monotremes, the youngest subfamilies could still be retropositionally active. CORE-SINEs terminate in sequence repeats of a few nucleotides similar to their 3'-segment LINE-homologues, CR1, L2 and Bov-B. These three LINE elements fall into clades distinct from that of L1 elements which, similar to their co-amplifying SINEs, end in a poly(A) tail. We propose a model in which new CORE-families, with distinct 3'-segments, are created at the RNA level due to template switching between LINE and CORE-RNA during reverse transcription. The proposed mechanism suggests that such an adaptation to the changing amplification machinery facilitated the survival and prosperity of CORE-elements over long evolutionary periods in different lineages.
Collapse
Affiliation(s)
- N Gilbert
- Centre de recherche de l'Hôpital Sainte-Justine, Centre de cancérologie Charles Bruneau, Montréal, H3T 1C5, Canada
| | | |
Collapse
|
8
|
Gilbert N, Labuda D. CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs. Proc Natl Acad Sci U S A 1999; 96:2869-74. [PMID: 10077603 PMCID: PMC15861 DOI: 10.1073/pnas.96.6.2869] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A 65-bp "core" sequence is dispersed in hundreds of thousands copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3' ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome.
Collapse
Affiliation(s)
- N Gilbert
- Centre de Recherche de l'Hôpital Sainte-Justine, Centre de Cancérologie Charles Bruneau, Québec H3T 1C5, Canada
| | | |
Collapse
|
9
|
Murray J, Buard J, Neil DL, Yeramian E, Tamaki K, Hollies C, Jeffreys AJ. Comparative Sequence Analysis of Human Minisatellites Showing Meiotic Repeat Instability. Genome Res 1999. [DOI: 10.1101/gr.9.2.130] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The highly variable human minisatellites MS32 (D1S8), MS31A (D7S21), and CEB1 (D2S90) all show recombination-based repeat instability restricted to the germline. Mutation usually results in polar interallelic conversion or occasionally in crossovers, which, at MS32 at least, extend into DNA flanking the repeat array, defining a localized recombination hotspot and suggesting that cis-acting elements in flanking DNA can influence repeat instability. Therefore, comparative sequence analysis was performed to search for common flanking elements associated with these unstable loci. All three minisatellites are located in GC-rich DNA abundant in dispersed and tandem repetitive elements. There were no significant sequence similarities between different loci upstream of the unstable end of the repeat array. Only one of the three loci showed clear evidence for putative coding sequences near the minisatellite. No consistent patterns of thermal stability or DNA secondary structure were shared by DNA flanking these loci. This work extends previous data on the genomic environment of minisatellites. In addition, this work suggests that recombinational activity is not controlled by primary or secondary characteristics of the DNA sequence flanking the repeat array and is not obviously associated with gene promoters as seen in yeast.[The sequence data described in this paper have been submitted to the GenBank data library under accession nos. AF048727(CEB1), AF048728 (MS31A), and AF048729 (MS32).]
Collapse
|
10
|
Murray J, Buard J, Neil DL, Yeramian E, Tamaki K, Hollies C, Jeffreys AJ. Comparative sequence analysis of human minisatellites showing meiotic repeat instability. Genome Res 1999; 9:130-6. [PMID: 10022977 PMCID: PMC310713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/1998] [Accepted: 12/10/1998] [Indexed: 02/10/2023]
Abstract
The highly variable human minisatellites MS32 (D1S8), MS31A (D7S21), and CEB1 (D2S90) all show recombination-based repeat instability restricted to the germline. Mutation usually results in polar interallelic conversion or occasionally in crossovers, which, at MS32 at least, extend into DNA flanking the repeat array, defining a localized recombination hotspot and suggesting that cis-acting elements in flanking DNA can influence repeat instability. Therefore, comparative sequence analysis was performed to search for common flanking elements associated with these unstable loci. All three minisatellites are located in GC-rich DNA abundant in dispersed and tandem repetitive elements. There were no significant sequence similarities between different loci upstream of the unstable end of the repeat array. Only one of the three loci showed clear evidence for putative coding sequences near the minisatellite. No consistent patterns of thermal stability or DNA secondary structure were shared by DNA flanking these loci. This work extends previous data on the genomic environment of minisatellites. In addition, this work suggests that recombinational activity is not controlled by primary or secondary characteristics of the DNA sequence flanking the repeat array and is not obviously associated with gene promoters as seen in yeast.
Collapse
Affiliation(s)
- J Murray
- Department of Genetics, University of Leicester, Leicester LE1 7RH, UK
| | | | | | | | | | | | | |
Collapse
|
11
|
Matassi G, Labuda D, Bernardi G. Distribution of the mammalian-wide interspersed repeats (MIRs) in the isochores of the human genome. FEBS Lett 1998; 439:63-5. [PMID: 9849878 DOI: 10.1016/s0014-5793(98)01338-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The distribution of MIRs (mammalian-wide interspersed repeats) was investigated in 164 human sequences (> or = 100 kb), which were assigned, according to their GC level, to isochore families L, H1, H2 and H3. MIR elements, whose total number in the genome was estimated to be about 3.3 x 10(5), were found to be unevenly distributed in human isochores. The majority of MIRs (55%) were found in the L isochore family. In contrast, MIR density was highest in H2, closely followed by H1, whereas densities in L and H3 were 2- and 3-fold lower than in H2, respectively. For this reason, the assessment of MIR distribution by inter-repeat PCR led to an overestimation of MIR numbers in H2 isochore and an underestimation in L isochores.
Collapse
Affiliation(s)
- G Matassi
- Laboratoire de Génétique Moléculaire, Institut Jacques Monod, Paris, France
| | | | | |
Collapse
|
12
|
Lako M, Strachan T, Bullen P, Wilson DI, Robson SC, Lindsay S. Isolation, characterisation and embryonic expression of WNT11, a gene which maps to 11q13.5 and has possible roles in the development of skeleton, kidney and lung. Gene 1998; 219:101-10. [PMID: 9757009 DOI: 10.1016/s0378-1119(98)00393-x] [Citation(s) in RCA: 98] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The Wnt gene family encodes a set of signalling molecules, thought to play an important role in key processes of embryonic development. In vertebrates as a whole 20 different Wnt genes have been identified to date, however, a complement of only 16 have been identified in man and for some of these the complete coding sequences are unavailable. We have recently isolated the full-length cDNA sequence of a new human WNT gene, WNT11, investigated its genomic organisation and performed detailed expression studies in early human embryos. These have shown that the expression of human WNT11 is restricted to the perichondrium of the developing skeleton, lung mesenchyme, the tips of the ureteric buds and other areas of the urogenital system and the cortex of the adrenal gland. This, for the first time, provides information for the embryonic expression of human WNT11. We have mapped WNT11 to 11q13.5 and this together with its expression in the perichondrium of the developing skeleton, makes it a plausible candidate gene for HBM, which has been previously linked to markers from this region.
Collapse
Affiliation(s)
- M Lako
- Department of Human Genetics, University of Newcastle upon Tyne, Newcastle upon Tyne, UK
| | | | | | | | | | | |
Collapse
|
13
|
Prokhortchouk EB, Prokhortchouk AV, Rouzov AS, Kiselev SL, Lukanidin EM, Georgiev GP. A minisatellite "core" element constitutes a novel, chromatin-specific activator of mts1 gene transcription. J Mol Biol 1998; 280:227-36. [PMID: 9654447 DOI: 10.1006/jmbi.1998.1857] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Expression of the mts1 gene is often associated with malignant transformation of tumor cells. Transcription of the gene is controlled by a number of positive and negative regulatory elements, all of them being localized in the first intron (+38 to +1215) of the mts1 gene. Through analysis of the distribution of DNase I hypersensitive sites in the first intron of the gene we revealed a structurally conserved region that consisted of a non-canonical NFkB binding site and a minisatellite "core" element. Deletion of the minisatellite core DNA in the context of the first intron had no effect on its regulatory capacity when assayed in transient transfections, while a fivefold decrease was observed in a pool of stably transfected cells. The minisatellite core sequence CTGGGCAGGCAG is involved in DNA-protein interactions in vivo, and is similar to a binding site for the previously identified minisatellite DNA sequence binding protein (Msbp-1). The core DNA interacted in vitro with a protein that had an apparent molecular mass of 40 kDa. These data indicate that the minisatellite DNA represents the novel, chromatin-specific element in the mts1 complex enhancer.
Collapse
Affiliation(s)
- E B Prokhortchouk
- Laboratory of Molecular Cancer Genetics, Institute of Gene Biology, Vavilova 34/5, Moscow, 117334, Russia
| | | | | | | | | | | |
Collapse
|
14
|
Tulko JS, Korotkov EV, Phoenix DA. MIRs are present in coding regions of human genes. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1998; 8:31-8. [PMID: 9522118 DOI: 10.3109/10425179709020882] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
By using a weighted function and the method of enlarged similarity a search has been performed to identify mammalian interspersed repeats (MIRs) in DNA sequences from the EMBL data bank. The existence of MIRs is shown in coding regions of human genes and also in chicken and duck genomes. It is possible to conclude from the results obtained that MIRs were established in the coding regions of some genes and may have taken part in gene evolution. Furthermore, MIRs may have been amplified in vertebrate genomes before the origin of mammals.
Collapse
Affiliation(s)
- J S Tulko
- Bioengineering Center, Russian Academy of Sciences, Moscow, Russia
| | | | | |
Collapse
|
15
|
Nguyen TD, Chen P, Huang WD, Chen H, Johnson D, Polansky JR. Gene structure and properties of TIGR, an olfactomedin-related glycoprotein cloned from glucocorticoid-induced trabecular meshwork cells. J Biol Chem 1998; 273:6341-50. [PMID: 9497363 DOI: 10.1074/jbc.273.11.6341] [Citation(s) in RCA: 251] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Expression of the trabecular meshwork inducible glucocorticoid response (TIGR) gene progressively increases from barely detectable levels to greater than 2% of total cellular mRNA over 10 days exposure of trabecular meshwork (TM) cells to dexamethasone. Cycloheximide blocked most of the TIGR mRNA induction, suggesting a requirement for ongoing protein synthesis. The genomic structure of TIGR (approximately 20 kilobases) consists of 3 exons, and a 5-kilobase promoter region that contains 13 predicted hormone response elements, including several glucocorticoid regulatory elements, and other potentially important regulatory motifs. TIGR cDNA encodes an olfactomedin-related glycoprotein of 504 amino acids with motifs for N- and O-linked glycosylation, glycosaminoglycan initiation, hyaluronic acid binding, and leucine zippers. Recombinant TIGR (rTIGR) showed oligomerization and specific binding to TM cells. Anti-rTIGR antibody detected multiple translational/post-translational forms of TIGR produced by the cells (including secreted 66 kDa/55 kDa glycoproteins/proteins in the media and 55 kDa cellular proteins), whereas Northern blot showed a single mRNA species. The findings suggest potential mechanisms by which TIGR could obstruct the aqueous humor fluid flow and participate in the pathogenesis of glaucoma.
Collapse
Affiliation(s)
- T D Nguyen
- Cellular Pharmacology Laboratories, Department of Ophthalmology, School of Medicine, University of California, San Francisco, California 94143-0730, USA.
| | | | | | | | | | | |
Collapse
|
16
|
Rainville IR, Albertini RJ, Nicklas JA. Breakpoints and junctional regions of intragenic deletions in the HPRT gene in human T-Cells. SOMATIC CELL AND MOLECULAR GENETICS 1995; 21:309-26. [PMID: 8619128 DOI: 10.1007/bf02257466] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
DNA sequences of the deletion breakpoints of 24 human T-lymphocyte hprt gene mutations are reported. These independent deletions ranged in size from 18 to 15655 base pairs. Seven of the 21 in vivo mutations arose in normal adults, three in normal children, eight in radioimmunotherapy patients and three in platinum chemotherapy patients. One in vitro mutation was isolated after 93cGy radon exposure and two after 300cGy gamma radiation. The breakpoints were found to be non-random and a cluster of small deletions in exon 6 is reported. Ten of the mutations had 2-5bp direct repeats at the breakpoints. There was no excess of "deletion-associated" motifs over that expected by chance. Some breakpoints do occur at consensus topoisomerase II cleavage sites and the centromeric end of a Donehower sequence occurs exactly at a telomeric breakpoint. Three mutants had breakpoints at hairpins expected by the model of Glickman and Ripley.
Collapse
Affiliation(s)
- I R Rainville
- VCC Genetics Laboratory, University of Vermont, Burlington, Vermont 05401, USA
| | | | | |
Collapse
|
17
|
Murnane JP, Morales JF. Use of a mammalian interspersed repetitive (MIR) element in the coding and processing sequences of mammalian genes. Nucleic Acids Res 1995; 23:2837-9. [PMID: 7659505 PMCID: PMC307119 DOI: 10.1093/nar/23.15.2837] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The mammalian interspersed repetitive (MIR) element was amplified in mammals 130 million years ago. The MIR element is at least 260 bp in length and is found in approximately 105 copies in the mammalian genome. We analyzed copies of the MIR element in the DNA of various mammals to determine its relationship to the structure and function of genes, in an attempt to identify specific uses of the MIR element within the mammalian genome. We found that alternative splicing within the acetylcholine receptor gene in humans takes place within the MIR element and results in the incorporation of part of the MIR element into the coding sequence of this gene. Furthermore, the polyadenylation signal (AATAAA) at the 3' end of four different mammalian genes is derived from the MIR element. These uses of the MIR element suggest that other regulatory sequences found within the mammalian genome originated from ancient transposable elements, many of which may no longer be recognizable.
Collapse
Affiliation(s)
- J P Murnane
- Laboratory of Radiobiology and Enviromental Health, University of California, San Francisco 94143-0750, USA
| | | |
Collapse
|
18
|
Jurka J, Zietkiewicz E, Labuda D. Ubiquitous mammalian-wide interspersed repeats (MIRs) are molecular fossils from the mesozoic era. Nucleic Acids Res 1995; 23:170-5. [PMID: 7870583 PMCID: PMC306646 DOI: 10.1093/nar/23.1.170] [Citation(s) in RCA: 101] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Short interspersed elements (SINEs) are ubiquitous in mammalian genomes. Remarkable variety of these repeats among placental orders indicates that most of them amplified in each lineage independently, following mammalian radiation. Here, we present an ancient family of repeats, whose sequence divergence and common occurrence among placental mammals, marsupials and monotremes indicate their amplification during the Mesozoic era. They are called MIRs for abundant Mammalian-wide Interspersed Repeats. With approximately 120,000 copies still detectable in the human genome (0.2-0.3% DNA), MIRs represent a 'fossilized' record of a major genetic event preceding the radiation of placental orders.
Collapse
Affiliation(s)
- J Jurka
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
| | | | | |
Collapse
|
19
|
Smit AF, Riggs AD. MIRs are classic, tRNA-derived SINEs that amplified before the mammalian radiation. Nucleic Acids Res 1995; 23:98-102. [PMID: 7870595 PMCID: PMC306635 DOI: 10.1093/nar/23.1.98] [Citation(s) in RCA: 162] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Short Interspersed Nucleotide Elements (SINEs) are highly abundant in mammalian genomes. The term SINE has come to be restricted to short retroposons with internal RNA polymerase III promoter sites in a region derived from a structural RNA (usually a tRNA). Here we describe a novel, 260 bp tRNA-derived SINE, some fragments of which have been noted before to be repetitive in mammalian DNA. Unlike previously reported SINEs, which are restricted to closely related species, copies of this element can be found in all mammalian genomes, including marsupials. It is therefore called MIR for mammalian-wide interspersed repeat. Their high divergence and their presence at orthologous sites in different mammals indicate that MIRs, at least in part, amplified before the mammalian radiation. Next to Alu, MIRs are the most common interspersed repeat in primates with an estimated 300,000 copies still discernible, which account for 1 to 2% of our DNA. Interestingly, a small, central region of MIR appears to be much better conserved in the genomic copies than the rest of the sequence.
Collapse
Affiliation(s)
- A F Smit
- Department of Biology, Beckman Research Institute of the City of Hope, Duarte, CA 91010
| | | |
Collapse
|
20
|
Korotkov EV. Fast method of homology and purine-pyrimidine mutual relations between DNA sequences search. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1994; 4:413-5. [PMID: 7841466 DOI: 10.3109/10425179409010192] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
A new algorithm for scanning sequences is described. This algorithm uses the boolean operators AND and OR. The mutual information between the sequences is used as a measure of sequence interrelation. It allows evaluation of the probability of accidental sequence interrelation in a quantitative manner. The proposed algorithm was used for searching for MB1 repeats in human and other mammalian sequences.
Collapse
Affiliation(s)
- E V Korotkov
- N. N. Semenov Institute of Chemical Physics, Moscow, Russia
| |
Collapse
|
21
|
Russell L, Forsdyke DR. The third human homolog of a murine gene encoding an inhibitor of stem cell proliferation is truncated and linked to a CpG island-containing upstream sequence. DNA Cell Biol 1993; 12:157-75. [PMID: 8097094 DOI: 10.1089/dna.1993.12.157] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The murine gene, MIP1 alpha, encodes a cytokine (macrophage inflammatory protein 1 alpha) that inhibits the proliferation of bone marrow stem cells. Two human homologs have been characterized, G0S19-1 and G0S19-2. Like MIP1 alpha, these genes contain three exons, the first of which encodes a hydrophobic signal sequence. The existence of a third human G0S19 gene, present in one in four individuals, has been predicted from restriction enzyme analyses. This paper reports that a previously identified human genomic clone containing a G0S19 sequence (G0S19-3), corresponds to the third gene. However, the first G0S19 exon is missing. The sequence differs from those of G0S19-1 and G0S19-2 upstream of a point 31 nucleotides from the junction of the first intron with the second exon. This upstream sequence contains a CpG island and is named "CpG island-containing upstream sequence," CUS. Apart from the G0S19-3-associated copy found only in individuals with the third G0S19 gene, all individuals have one DNA species hybridizing strongly to a CUS-specific probe and at least two less strongly hybridizing species. The CUS has potential binding sites for transcription factors AP-1, AP-2, AP-3, AP-4, and Sp1, a Donehower conserved repetitive element, and motifs characteristic of cytokine, oncogene, and retroviral promoters. Thus, the CUS might promote the transcription of sequences with which it became associated. We suggest that the CUS-G0S19-3 sequence was generated by recombination between a G0S19-2 gene and a member of a novel CUS-associated gene family.
Collapse
Affiliation(s)
- L Russell
- Department of Biochemistry, Queen's University, Kingston, Ontario, Canada
| | | |
Collapse
|
22
|
Beck S, Kelly A, Radley E, Khurshid F, Alderton RP, Trowsdale J. DNA sequence analysis of 66 kb of the human MHC class II region encoding a cluster of genes for antigen processing. J Mol Biol 1992; 228:433-41. [PMID: 1453454 DOI: 10.1016/0022-2836(92)90832-5] [Citation(s) in RCA: 130] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The genomic sequence of a 66,109 bp long region within the human MHC has been determined by manual and automated DNA sequencing. From cDNA mapping and sequencing data it is known that this region contains a cluster of at least four genes that are believed to be involved in antigen processing. Here, we describe the genomic organization of these genes, which comprise two proteasome-related genes (LMP2 and LMP7), thought to be involved in the proteolytic degradation of cytoplasmic antigens and two ABC transporter genes (TAP1 and TAP2), thought to be involved in pumping of the degraded peptides across the endoplasmic reticulum membrane. Analysis of the sequence homology and the intron/exon structures of the corresponding genes suggests that one gene pair arose by duplication from the other. Comparison of the available sequence data from other organisms shows striking conservation (70 to 84%) of this gene cluster in human, mouse and rat. The presence of several potential interferon stimulated response elements (ISREs) is in agreement with the experimentally observed up-regulation of these genes with gamma-interferon.
Collapse
Affiliation(s)
- S Beck
- Imperial Cancer Research Fund, London, England
| | | | | | | | | | | |
Collapse
|
23
|
Abstract
We report a collection of 53 prototypic sequences representing known families of repetitive elements from the human genome. The prototypic sequences are either consensus sequences or selected examples of repetitive sequences. The collection includes: prototypes for high and medium reiteration frequency interspersed repeats, long terminal repeats of endogenous retroviruses, alphoid repeats, telomere-associated repeats, and some miscellaneous repeats. The collection is annotated and available electronically.
Collapse
Affiliation(s)
- J Jurka
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
| | | | | |
Collapse
|
24
|
Humphries DE, Nicodemus CF, Schiller V, Stevens RL. The human serglycin gene. Nucleotide sequence and methylation pattern in human promyelocytic leukemia HL-60 cells and T-lymphoblast Molt-4 cells. J Biol Chem 1992. [PMID: 1377686 DOI: 10.1016/s0021-9258(18)42248-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The complete nucleotide sequence of the 16.7-kb human gene that encodes the peptide core (serglycin) of a secretory granule proteoglycan was determined, thus representing the first proteoglycan peptide core gene to be sequenced in its entirety. The exons, intron 1, and intron 2 comprised 7, 53, and 40% of the gene, respectively. Nineteen Alu-repetitive DNA sequences were interspersed in the gene, accounting for 28% of the total nucleotides in intron 1 and 40% of the nucleotides in intron 2. The nucleotide sequence was then used in an examination of the methylation pattern of the human serglycin gene in human promyelocytic leukemia HL-60 cells that contain serglycin mRNA and in T-lymphoblast Molt-4 cells that do not. With polymerase chain reaction methodology, 13 DNA probes of 250-880 base pairs in length were generated that corresponded to unique, non-Alu sequences spaced throughout the entire human serglycin gene. When blots containing genomic DNA digested with HpaII or MspI were examined with these genomic probes, it was discovered that the 5'-flanking region and intron 1 of the serglycin gene in HL-60 cells were both substantially less methylated than intron 2. In contrast, the entire serglycin gene in Molt-4 cells was highly methylated. Because hypomethylated genes generally are transcribed more efficiently than hypermethylated genes, the high level of serglycin mRNA in HL-60 cells probably is a consequence of the low level of methylation of intron 1 and the 5'-flanking region of the serglycin gene in these cells.
Collapse
Affiliation(s)
- D E Humphries
- Department of Veterans Affairs Outpatient Clinic, Boston, Massachusetts
| | | | | | | |
Collapse
|
25
|
Monnat RJ, Hackmann AF, Chiaverotti TA. Nucleotide sequence analysis of human hypoxanthine phosphoribosyltransferase (HPRT) gene deletions. Genomics 1992; 13:777-87. [PMID: 1639404 DOI: 10.1016/0888-7543(92)90153-j] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
We have determined the nucleotide sequences of 10 intragenic human HPRT gene deletion junctions isolated from thioguanine-resistant PSV811 Werner syndrome fibroblasts or from HL60 myeloid leukemia cells. Deletion junctions were located by fine structure blot hybridization mapping and then amplified with flanking oligonucleotide primer pairs for DNA sequence analysis. The junction region sequences from these 10 HPRT mutants contained 13 deletions ranging in size from 57 bp to 19.3 kb. Three DNA inversions of 711, 368, and 20 bp were associated with tandem deletions in two mutants. Each mutant contained the deletion of one or more HPRT exon, thus explaining the thioguanine-resistant cellular phenotype. Deletion junction and donor nucleotide sequence alignments suggest that all of these HPRT gene rearrangements were generated by the nonhomologous recombination of donor DNA duplexes that share little nucleotide sequence identity. This result is surprising, given the potential for homologous recombination between copies of repeated DNA sequences that constitute approximately a third of the human HPRT locus. No difference in deletion structure or complexity was observed between deletions isolated from Werner syndrome or from HL60 mutants. This suggests that the Werner syndrome deletion mutator uses deletion mutagenesis pathway(s) that are similar or identical to those used in other human somatic cells.
Collapse
Affiliation(s)
- R J Monnat
- Department of Pathology SM-30, University of Washington, Seattle 98195
| | | | | |
Collapse
|
26
|
Affiliation(s)
- R E Tashian
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor 48109
| |
Collapse
|
27
|
Chen EY, Cheng A, Lee A, Kuang WJ, Hillier L, Green P, Schlessinger D, Ciccodicola A, D'Urso M. Sequence of human glucose-6-phosphate dehydrogenase cloned in plasmids and a yeast artificial chromosome. Genomics 1991; 10:792-800. [PMID: 1889820 DOI: 10.1016/0888-7543(91)90465-q] [Citation(s) in RCA: 83] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The sequence of 20,114 bp of DNA including the human glucose-6-phosphate dehydrogenase (G6PD) gene was determined. The region included a prominent CpG island, starting about 680 nucleotides upstream of the transcription start site, extending about 1050 nucleotides downstream of the start site, and ending just at the start of the first intron. The transcribed region from the start site to the poly(A) addition site covers 15,860 bp. The sequence of the 13 exons agreed with published cDNA sequence and for the 11 exons tested, with the corresponding sequence in a yeast artificial chromosome (YAC). The latter confirms YAC cloning fidelity at the DNA sequence level. Sixteen Alu sequences constitute 24% of the total sequence tract. Four were outside the borders of the mRNA transcript of the gene; all the others were found in a large (9858 bp) intron between exons 2 and 3. Two Alu clusters each contain Alus lying between the monomers of another.
Collapse
Affiliation(s)
- E Y Chen
- Department of Molecular Biology, Genentech, Inc., South San Francisco, California 94080
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Carter PE, Duponchel C, Tosi M, Fothergill JE. Complete nucleotide sequence of the gene for human C1 inhibitor with an unusually high density of Alu elements. EUROPEAN JOURNAL OF BIOCHEMISTRY 1991; 197:301-8. [PMID: 2026152 DOI: 10.1111/j.1432-1033.1991.tb15911.x] [Citation(s) in RCA: 89] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The complete (17159 bp) nucleotide sequence of the gene for the human C1 inhibitor has been determined. The transcription initiation site was examined by primer extension using human liver mRNA, and the messenger 5'-end sequence was determined on clones obtained by the anchored polymerase chain reaction. The gene of this serpin molecule is split by seven introns, with junctions of phases zero and one. An outstanding feature of the intron sequences is the occurrence of 17 AluI repeats of all four ancestral subgroups, indicating that the gene has been invaded during consecutive waves of Alu amplification, including a recent one. These Alu repeats form the sites of deletion and insertion in several known lesions in the C1-inhibitor gene. There is no obvious promoter site of the TATA-box type at the 5' end of the gene, but instead it contains a region of potential H-DNA structure similar to that found upstream of the human c-myc gene.
Collapse
Affiliation(s)
- P E Carter
- Department of Molecular and Cell Biology, University of Aberdeen, Scotland
| | | | | | | |
Collapse
|
29
|
Edwards A, Voss H, Rice P, Civitello A, Stegemann J, Schwager C, Zimmermann J, Erfle H, Caskey CT, Ansorge W. Automated DNA sequencing of the human HPRT locus. Genomics 1990; 6:593-608. [PMID: 2341149 DOI: 10.1016/0888-7543(90)90493-e] [Citation(s) in RCA: 272] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The complete sequence of 57 kb of the human HPRT locus has been determined using automated fluorescent DNA sequencing. The strategy employed increasingly directed sequencing methods: A randomly generated M13 library was sequenced to generate contiguous overlapping sets of sequences (contigs). M13 clones at the ends of these contigs were further sequenced using M13 (universal and reverse) and custom oligonucleotide primers to order the contigs and to complete the sequencing project. The human HPRT sequence includes 1676 bp 5' and 15,238 bp 3' to exons 1 and 9, respectively. The sequence contains 49 representatives of the Alu repeat, along with several other types of repetitive sequences. The Alu sequences exhibit a biased orientation, with those sequences in the first half of the locus oriented in the minus direction relative to transcription of the gene (3'----5' = 77%, P less than 0.005) and those sequences in the latter half of the locus oriented randomly (5'----3' = 67%, P less than 0.5). The development and performance of the sequencing strategy and the features of the human HPRT gene are presented.
Collapse
Affiliation(s)
- A Edwards
- Department of Cell Biology, Baylor College of Medicine, Houston, Texas 77030
| | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Levy-Wilson B, Fortier C. The limits of the DNase I-sensitive domain of the human apolipoprotein B gene coincide with the locations of chromosomal anchorage loops and define the 5′ and 3′ boundaries of the gene. J Biol Chem 1989. [DOI: 10.1016/s0021-9258(19)30066-3] [Citation(s) in RCA: 97] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
|
31
|
Armour JA, Wong Z, Wilson V, Royle NJ, Jeffreys AJ. Sequences flanking the repeat arrays of human minisatellites: association with tandem and dispersed repeat elements. Nucleic Acids Res 1989; 17:4925-35. [PMID: 2762114 PMCID: PMC318084 DOI: 10.1093/nar/17.13.4925] [Citation(s) in RCA: 87] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
We present DNA sequences flanking cloned hypervariable human minisatellites. In addition to providing confirmatory evidence that minisatellites cluster with other tandem repeats, these flanking sequences contain a high frequency of interspersed repetitive elements. These elements include a retroviral LTR-like sequence, from which one of the minisatellites appears to have expanded, and a recently described short interspersed repeat. We present our own findings concerning this element, in particular that those examples studied do not show significant evolutionary conservation, despite suggestions that the element may have a cis-acting function.
Collapse
Affiliation(s)
- J A Armour
- Department of Genetics, University of Leicester, UK
| | | | | | | | | |
Collapse
|