1
|
Abstract
While less than 1.5% of the mammalian genome encodes proteins, it is now evident that the vast majority is transcribed, mainly into non-protein-coding RNAs. This raises the question of what fraction of the genome is functional, i.e., composed of sequences that yield functional products, are required for the expression (regulation or processing) of these products, or are required for chromosome replication and maintenance. Many of the observed noncoding transcripts are differentially expressed, and, while most have not yet been studied, increasing numbers are being shown to be functional and/or trafficked to specific subcellular locations, as well as exhibit subtle evidence of selection. On the other hand, analyses of conservation patterns indicate that only approximately 5% (3%-8%) of the human genome is under purifying selection for functions common to mammals. However, these estimates rely on the assumption that reference sequences (usually ancient transposon-derived sequences) have evolved neutrally, which may not be the case, and if so would lead to an underestimate of the fraction of the genome under evolutionary constraint. These analyses also do not detect functional sequences that are evolving rapidly and/or have acquired lineage-specific functions. Indeed, many regulatory sequences and known functional noncoding RNAs, including many microRNAs, are not conserved over significant evolutionary distances, and recent evidence from the ENCODE project suggests that many functional elements show no detectable level of sequence constraint. Thus, it is likely that much more than 5% of the genome encodes functional information, and although the upper bound is unknown, it may be considerably higher than currently thought.
Collapse
Affiliation(s)
- Michael Pheasant
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland 4072, Australia
| | | |
Collapse
|
2
|
Eller CD, Regelson M, Merriman B, Nelson S, Horvath S, Marahrens Y. Repetitive sequence environment distinguishes housekeeping genes. Gene 2006; 390:153-65. [PMID: 17141428 PMCID: PMC1857324 DOI: 10.1016/j.gene.2006.09.018] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2006] [Revised: 09/18/2006] [Accepted: 09/24/2006] [Indexed: 12/14/2022]
Abstract
Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes while various longer (>400-bp) repetitive sequences ("repeats"), including Long Interspersed Nuclear Element-1 (LINE-1) elements, are excluded from these regions. We further show that isochore membership does not distinguish housekeeping genes from tissue-specific genes and that repetitive sequence environment distinguishes housekeeping genes from tissue-specific genes in every isochore. The distinct repetitive sequence environment, in combination with other previously published sequence properties of housekeeping genes, was used to develop a method of predicting housekeeping genes on the basis of DNA sequence alone. Using expression across tissue types as a measure of success, we demonstrate that repetitive sequence environment is by far the most important sequence feature identified to date for distinguishing housekeeping genes.
Collapse
Affiliation(s)
- C. Daniel Eller
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Moira Regelson
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Barry Merriman
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Stan Nelson
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Steve Horvath
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
- UCLA Department of Biostatistics, School of Public Health, Box 951772, Los Angeles, California 90095-1772, USA
| | - York Marahrens
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
- * to whom correspondence should be addressed: York Marahrens, UCLA Department of Human Genetics, Gonda Center, Room 4554b, 695 Charles E. Young Drive, Los Angeles, CA 90095, USA, Phone: (310) 267-2466, Fax: (310) 794-5446, E-mail:
| |
Collapse
|
3
|
Gama Sosa MA, Rosas DH, DeGasperi R, Morita E, Hutchison MR, Ruprecht RM. Negative regulation of the 5' long terminal repeat (LTR) by the 3' LTR in the murine proviral genome. J Virol 1994; 68:2662-70. [PMID: 8139043 PMCID: PMC236743 DOI: 10.1128/jvi.68.4.2662-2670.1994] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
To assess the influence of the 3' long terminal repeat (LTR) on the promoter/enhancer activity of the 5' LTR, a set of isogenic retroviral vectors differing only in the U3 region of the 3' LTR was constructed. These U3 elements were derived from viruses with different tissue tropism. The 5' LTR originated from Moloney murine leukemia virus and directed the transcription of a reporter gene (chloramphenicol acetyltransferase [CAT] gene), giving rise to plasmids of the general configuration LTR-CAT-LTR'. Following transfection of these chimeric constructs into various cell types, the CAT activity in a given cell line was inversely related to the activity of the downstream U3 region when used in a single-LTR construct in that cell type, indicating negative regulation of the 5' LTR by the chimeric 3' LTR'. Our data indicate that a highly active 3' LTR interferes with gene expression from the 5' LTR. Potential mechanisms for this down-regulation are discussed.
Collapse
|
4
|
Ehrlich M, Ehrlich KC. Effect of DNA methylation on the binding of vertebrate and plant proteins to DNA. EXS 1993; 64:145-68. [PMID: 8418948 DOI: 10.1007/978-3-0348-9118-9_7] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Affiliation(s)
- M Ehrlich
- Department of Biochemistry, Tulane Medical School, New Orleans, LA 70112
| | | |
Collapse
|
5
|
Two regions of the mouse mammary tumor virus long terminal repeat regulate the activity of its promoter in mammary cell lines. Mol Cell Biol 1991. [PMID: 1708094 DOI: 10.1128/mcb.11.5.2529] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
In vivo expression of the mouse mammary tumor virus (MMTV) is restricted to a few organs, with the highest rate of transcription found in the mammary gland. Using a series of mammary and nonmammary murine cell lines, we have identified two regulatory elements, located upstream of the hormone responsive element, that specifically regulate the MMTV promoter. The first element displays an enhancerlike activity and is coincident with the binding of a nuclear factor (designated MP4; position -1078 to -1052 in the long terminal repeat) whose presence is apparently restricted to mammary cell lines. The second regulatory region mediates a repressive activity and is mapped to the long terminal repeat segment from -415 to -483. This repression is specific for a particular subtype of mammary cells (RAC cells) able to grow under two differentiation states (A. Sonnenberg, H. Daams, J. Calafat, and J. Hilgers, Cancer Res. 46:5913-5922, 1986). The MMTV promoter in mammary cell lines thus appears to be modulated by two cis-acting elements that are likely to be involved in tissue-specific expression in vivo.
Collapse
|
6
|
Lefebvre P, Berard DS, Cordingley MG, Hager GL. Two regions of the mouse mammary tumor virus long terminal repeat regulate the activity of its promoter in mammary cell lines. Mol Cell Biol 1991; 11:2529-37. [PMID: 1708094 PMCID: PMC360022 DOI: 10.1128/mcb.11.5.2529-2537.1991] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
In vivo expression of the mouse mammary tumor virus (MMTV) is restricted to a few organs, with the highest rate of transcription found in the mammary gland. Using a series of mammary and nonmammary murine cell lines, we have identified two regulatory elements, located upstream of the hormone responsive element, that specifically regulate the MMTV promoter. The first element displays an enhancerlike activity and is coincident with the binding of a nuclear factor (designated MP4; position -1078 to -1052 in the long terminal repeat) whose presence is apparently restricted to mammary cell lines. The second regulatory region mediates a repressive activity and is mapped to the long terminal repeat segment from -415 to -483. This repression is specific for a particular subtype of mammary cells (RAC cells) able to grow under two differentiation states (A. Sonnenberg, H. Daams, J. Calafat, and J. Hilgers, Cancer Res. 46:5913-5922, 1986). The MMTV promoter in mammary cell lines thus appears to be modulated by two cis-acting elements that are likely to be involved in tissue-specific expression in vivo.
Collapse
Affiliation(s)
- P Lefebvre
- Hormone Action and Oncogenesis Section, National Cancer Institute Bethesda, Maryland 20892
| | | | | | | |
Collapse
|
7
|
Ono T, Shinya K, Uehara Y, Okada S. Endogenous virus genomes become hypomethylated tissue--specifically during aging process of C57BL mice. Mech Ageing Dev 1989; 50:27-36. [PMID: 2561001 DOI: 10.1016/0047-6374(89)90056-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
In an attempt to find out a cause for age-dependent derepression of endogenous viruses, extents of DNA methylation at the endogenous B- and C-type ecotropic viruses in brain, liver and spleen of C57BL/6NJc1 were examined at three ages, newborn, young adult and old. Both endogenous viruses showed a slight but significant tissue-specific either hypo- or hypermethylation during post-natal developmental phase in the three tissues. After maturation, however, no such change was detectable at most of the sites examined. The exceptions were C-type ecotropic virus in brain and B-type virus in spleen, where the age-dependent decreases of methylation were observed. The changes seemed to be continuations of preceding developmental hypomethylation. They indicated that the hypomethylation could be one of the causes for the age-dependent derepression of endogenous virus. It was further suggested that a mechanism to stop the developmental changes of DNA methylation at the maturation of individuals would be important in considering the reasons for the changes in senescent phase.
Collapse
Affiliation(s)
- T Ono
- Radiation Biology Center, Kyoto University, Japan
| | | | | | | |
Collapse
|
8
|
Knepper JE, Kittrell FS, Medina D, Butel JS. Spontaneous progression of hyperplastic outgrowths of the D1 lineage to mammary tumors: expression of mouse mammary tumor virus and cellular proto-oncogenes. Mol Carcinog 1989; 1:229-38. [PMID: 2551332 DOI: 10.1002/mc.2940010405] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Mammary cancer in mice is characterized by progression through defined stages of preneoplasia, with the most common preneoplastic stage being the hyperplastic alveolar nodule (HAN). We determined the relative levels of RNA expression of various cellular proto-oncogenes and endogenous mouse mammary tumor virus genes in outgrowths and tumors of three sublines of the transplantable D1 HAN preneoplastic outgrowth line. The three sublines differed in relative tumor-producing capabilities. Subline D1B produced a high incidence of tumors with short latency periods, whereas sublines D1C and D1D produced low incidences of tumors with long latency periods. No consistent alteration in proto-oncogene expression correlated with relative tumorigenicity, although tumors frequently contained higher levels of one or more proto-oncogene transcripts as compared with preneoplastic tissue. Slightly elevated (2- to 6-fold) levels of different oncogene transcripts were detected in 13 of 17 tumors as compared with outgrowth tissue, including abl (2 tumors), fps (5 tumors), Ha-ras (6 tumors), and Ki-ras (8 tumors). One tumor contained 45 times more Ki-ras-specific RNA than outgrowth tissue because of a comparable amplification of Ki-ras DNA sequences. Elevated levels of Ha-ras occurred more frequently in tumors of a high-incidence subline than in a less-aggressive subline (5/10 vs 1/7), but this difference was not statistically significant. However, consistent changes in MMTV expression accompanied progression from preneoplastic tissues to mammary tumors. All 17 tumors displayed reduced levels of the MMTV-specific long terminal repeat (LTR) transcript (1.6 kb) as compared with HAN tissue; tumors with moderate levels of LTR transcript expressed the 3.8-kb envelope message as well, one not detected in HANs. Expression of the LTR transcript is apparently influenced by factors in addition to the methylation status of endogenous mouse mammary tumor virus genes, which was similar in outgrowths and tumors. As the survey of representative proto-oncogenes failed to identify a uniform change between HAN and tumors, it is likely that other genes are involved in tumor progression in the mammary gland.
Collapse
MESH Headings
- Animals
- DNA Probes
- DNA, Neoplasm/genetics
- DNA, Neoplasm/isolation & purification
- DNA, Viral/genetics
- DNA, Viral/isolation & purification
- Gene Expression
- Genes, Viral
- Hyperplasia
- Mammary Neoplasms, Experimental/genetics
- Mammary Neoplasms, Experimental/microbiology
- Mammary Neoplasms, Experimental/pathology
- Mammary Tumor Virus, Mouse/genetics
- Mice
- Mice, Inbred BALB C
- Proto-Oncogenes
Collapse
Affiliation(s)
- J E Knepper
- Division of Molecular Virology, Baylor College of Medicine, Houston, Texas
| | | | | | | |
Collapse
|
9
|
Gama-Sosa MA, Breznik T, Butel JS, Medina D, Cohen JC. Mammary preneoplasia and tumorigenesis in the BALB/c mouse: structure and modification of mouse mammary tumor virus DNA sequences. Virus Res 1987; 7:1-15. [PMID: 3031894 DOI: 10.1016/0168-1702(87)90054-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Mouse mammary tumor virus (MMTV) DNA sequences were examined in mammary tissues from BALB/c mice, including both preneoplastic hyperplastic alveolar nodule (HAN) outgrowth lines, tumors derived from those preneoplastic tissues, and DMBA-induced mammary tumors. Over 60 different HAN and tumors samples were analyzed. The D2 HAN line contained one additional provirus, whereas Cv-2 and Cv-4 HAN lines that are infected with exogenous virus exhibited multiple virus integration events. D2 tumors showed the same provirus pattern as D2 HANs, whereas Cv-2 and Cv-4 tumors exhibited a subset of the acquired proviruses found in the parental HAN populations. Differential methylation patterns of virus-specific sequences were observed that resembled those described for other tissues in which viral DNA replication and integration has occurred, i.e., acquired proviruses were hypomethylated and endogenous proviruses were methylated. In tumors that arose from HAN lines and exhibited only endogenous proviruses, demethylation of the subgenomic Mtv-6 locus was observed. Demethylation of Mtv-6 was not detected in any of the preneoplastic tissues. Altered methylation of Mtv-8 and -9 was observed in both Cv-2 and Cv-4 tumors. Finally, mammary tumors induced by DMBA carried no acquired proviruses and demethylation of endogenous MMTV proviruses was demonstrated.
Collapse
MESH Headings
- 5-Methylcytosine
- 9,10-Dimethyl-1,2-benzanthracene
- Animals
- Cell Transformation, Neoplastic
- Cytosine/analogs & derivatives
- Cytosine/analysis
- DNA Restriction Enzymes
- DNA, Viral/analysis
- Female
- Genes, Viral
- Mammary Glands, Animal/microbiology
- Mammary Neoplasms, Experimental/chemically induced
- Mammary Neoplasms, Experimental/microbiology
- Mammary Tumor Virus, Mouse/genetics
- Mammary Tumor Virus, Mouse/physiology
- Methylation
- Mice
- Mice, Inbred BALB C
- Precancerous Conditions/microbiology
- Recombination, Genetic
- Virus Replication
Collapse
|
10
|
Dickson C. Molecular aspects of mouse mammary tumor virus biology. INTERNATIONAL REVIEW OF CYTOLOGY 1987; 108:119-47. [PMID: 2822592 DOI: 10.1016/s0074-7696(08)61437-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- C Dickson
- Imperial Cancer Research Fund, Lincoln's Inn Fields, London, England
| |
Collapse
|
11
|
Knepper JE, Medina D, Butel JS. Differential expression of endogenous mouse mammary tumor virus genes during development of the BALB/c mammary gland. J Virol 1986; 59:518-21. [PMID: 3016314 PMCID: PMC253107 DOI: 10.1128/jvi.59.2.518-521.1986] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Expression of endogenous mouse mammary tumor virus sequences varied over the course of development of the mammary gland during primary pregnancy and lactation in virus-free BALB/c mice. Although RNA from all regions of the genome was detected, both the level and temporal regulation of expression were different for long terminal repeat-, env-, and gag-pol-specific RNAs. Analysis of the methylation status of proviral DNA indicated differential accessibility of the three endogenous units during development. The results demonstrated noncoordinate regulation of mouse mammary tumor virus expression with respect to provirus template utilized and specific transcripts accumulated.
Collapse
|
12
|
Slagle BL, Butel JS. Identification and characterization of a mouse mammary tumor virus protein uniquely expressed on the surface of BALB/cV mammary tumor cells. Virology 1985; 143:127-42. [PMID: 2997998 PMCID: PMC7130887 DOI: 10.1016/0042-6822(85)90102-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
A unique subline of BALB/c mice, designated BALB/cV, exhibits an intermediate mammary tumor incidence (47%) and harbors a distinct milk-transmitted mouse mammary tumor virus (MMTV). The BALB/cV subline was used to study the molecular basis of potential virus-host interactions involving cell surface-expressed MMTV proteins. Cell surface iodination identified virus-specific proteins expressed on BALB/cV primary mammary tumor cells grown in culture. In contrast to (C3H)MMTV-producing cell lines which expressed MMTV gp52, BALB/cV tumor cells lacked gp52 and expressed instead a 68K, env-related protein. The 68Kenv protein was also detected on the surface of metabolically labeled BALB/cV tumor cells by an external immunoprecipitation technique. The expression of 68Kenv was restricted to mammary tissues of BALB/cV mice that also expressed other MMTV proteins. Biochemical analysis established that 68Kenv was not modified by N-linked glycosylation. 125I-labeled 68Kenv was rapidly released into the media of tumor cell cultures and was recovered both in the form of a soluble protein and in a 100,000 g pellet. The biologic function of this cell surface-expressed viral protein remains unknown.
Collapse
|
13
|
Slagle BL, Wheeler DA, Hager GL, Medina D, Butel JS. Molecular basis of altered mouse mammary tumor virus expression in the D-2 hyperplastic alveolar nodule line of BALB/c mice. Virology 1985; 143:1-15. [PMID: 2414907 DOI: 10.1016/0042-6822(85)90091-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The preneoplastic D-2 hyperplastic outgrowth line, which was derived from a hormone-induced hyperplastic alveolar nodule (HAN) of a BALB/c mouse, was used for a detailed analysis of mouse mammary tumor virus (MMTV) expression. The D-2 HAN line has previously been shown to express viral RNA representative of the entire genome, although viral particles have been noted only rarely. The MMTV-specific mRNA, protein, and DNA content of the D-2 tissues was defined in an effort to better understand the molecular basis of the aberrant virus expression. Northern blotting techniques demonstrated the presence of properly processed 8.9 kb (genomic) and 3.6 kb (envelope) mRNA. Protein electroblotting procedures established the presence of properly processed viral core protein p28. In contrast, the envelope precursor polyprotein was not processed into detectable levels of gp52. Analysis of MMTV proviral content by Southern blot methodology revealed the presence of a newly acquired provirus which serves as a marker for the clonal nature of the D-2 line. The origin of the new provirus is unknown. Methylation studies established that the new proviral insert is hypomethylated and, therefore, is likely serving as the template for the MMTV expression observed in the D-2 HAN line. These characteristics of the D-2 line make it an excellent system in which to study the role, if any, of MMTV in the progression of D-2 preneoplastic tissues to the tumor phenotype.
Collapse
|