1
|
Gasperini M, Tome JM, Shendure J. Towards a comprehensive catalogue of validated and target-linked human enhancers. Nat Rev Genet 2020; 21:292-310. [PMID: 31988385 PMCID: PMC7845138 DOI: 10.1038/s41576-019-0209-0] [Citation(s) in RCA: 159] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/13/2019] [Indexed: 12/14/2022]
Abstract
The human gene catalogue is essentially complete, but we lack an equivalently vetted inventory of bona fide human enhancers. Hundreds of thousands of candidate enhancers have been nominated via biochemical annotations; however, only a handful of these have been validated and confidently linked to their target genes. Here we review emerging technologies for discovering, characterizing and validating human enhancers at scale. We furthermore propose a new framework for operationally defining enhancers that accommodates the heterogeneous and complementary results that are emerging from reporter assays, biochemical measurements and CRISPR screens.
Collapse
Affiliation(s)
- Molly Gasperini
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Jacob M Tome
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
- Allen Discovery Center for Cell Lineage, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
2
|
Hoffmann FG, Opazo JC, Storz JF. New genes originated via multiple recombinational pathways in the beta-globin gene family of rodents. Mol Biol Evol 2008; 25:2589-600. [PMID: 18780876 DOI: 10.1093/molbev/msn200] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Species differences in the size or membership composition of multigene families can be attributed to lineage-specific additions of new genes via duplication, losses of genes via deletion or inactivation, and the creation of chimeric genes via domain shuffling or gene fusion. In principle, it should be possible to infer the recombinational pathways responsible for each of these different types of genomic change by conducting detailed comparative analyses of genomic sequence data. Here, we report an attempt to unravel the complex evolutionary history of the beta-globin gene family in a taxonomically diverse set of rodent species. The main objectives were: 1) to characterize the genomic structure of the beta-globin gene cluster of rodents; 2) to assign orthologous and paralogous relationships among duplicate copies of beta-like globin genes; and 3) to infer the specific recombinational pathways responsible for gene duplications, gene deletions, and the creation of chimeric fusion genes. Results of our comparative genomic analyses revealed that variation in gene family size among rodent species is mainly attributable to the differential gain and loss of later expressed beta-globin genes via unequal crossing-over. However, two distinct recombinational mechanisms were implicated in the creation of chimeric fusion genes. In muroid rodents, a chimeric gamma/epsilon fusion gene was created by unequal crossing-over between the embryonic epsilon- and gamma-globin genes. Interestingly, this gamma/epsilon fusion gene was generated in the same fashion as the "anti-Lepore" 5'-delta-(beta/delta)-beta-3' duplication mutant in humans (the reciprocal exchange product of the pathological hemoglobin Lepore deletion mutant). By contrast, in the house mouse, Mus musculus, a chimeric beta/delta fusion pseudogene was created by a beta-globin --> delta-globin gene conversion event. Although the gamma/epsilon and beta/delta fusion genes share a similar chimeric gene structure, they originated via completely different recombinational pathways.
Collapse
|
3
|
Differential loss of embryonic globin genes during the radiation of placental mammals. Proc Natl Acad Sci U S A 2008; 105:12950-5. [PMID: 18755893 DOI: 10.1073/pnas.0804392105] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The differential gain and loss of genes from homologous gene families represents an important source of functional variation among the genomes of different species. Differences in gene content between species are primarily attributable to lineage-specific gene gains via duplication and lineage-specific losses via deletion or inactivation. Here, we use a comparative genomic approach to investigate this process of gene turnover in the beta-globin gene family of placental mammals. By analyzing genomic sequence data from representatives of each of the main superordinal clades of placental mammals, we were able to reconstruct pathways of gene family evolution during the basal radiation of this physiologically and morphologically diverse vertebrate group. Our analysis revealed that an initial expansion of the nonadult portion of the beta-globin gene cluster in the ancestor of placental mammals was followed by the differential loss and retention of ancestral gene lineages, thereby generating variation in the complement of embryonic globin genes among contemporary species. The sorting of epsilon-, gamma-, and eta-globin gene lineages among the basal clades of placental mammals has produced species differences in the functional types of hemoglobin isoforms that can be synthesized during the course of embryonic development.
Collapse
|
4
|
Evidence for contrasting modes of selection at interacting globin genes in the European rabbit (Oryctolagus cuniculus). Heredity (Edinb) 2008; 100:602-9. [PMID: 18493260 DOI: 10.1038/hdy.2008.26] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
In hybrid zones between genetically differentiated populations, variation in locus-specific rates of introgression may reflect adaptation to different environments or adaptation to different genetic backgrounds. The European rabbit, Oryctolagus cuniculus, is well-suited to studies of such hybrid zone dynamics because it is composed of two genetically divergent subspecies that hybridize in a zone of secondary contact in central Iberia. A species-wide survey of allozyme variation revealed a broad range of locus-specific divergence levels (F(ST) ranged from 0 to 0.54, mean F(ST)=0.16). Interestingly, the two loci that fell at opposite ends of the distribution of F(ST) values, haemoglobin alpha-chain (HBA) and haemoglobin beta-chain (HBB), encode interacting subunits of the haemoglobin protein. The contrasting patterns of spatial variation at these two loci could not be reconciled under a neutral model of population structure. The HBA gene exhibited higher-than-expected levels of population differentiation, consistent with a history of spatially varying selection. The HBB gene exhibited lower-than-expected levels of population differentiation, consistent with some form of spatially uniform selection. Patterns of linkage disequilibrium and allele frequency variation do not appear to fit any simple model of two-locus epistatic selection.
Collapse
|
5
|
Abstract
The genomes from three mammals (human, mouse, and rat), two worms, and several yeasts have been sequenced, and more genomes will be completed in the near future for comparison with those of the major model organisms. Scientists have used various methods to align and compare the sequenced genomes to address critical issues in genome function and evolution. This review covers some of the major new insights about gene content, gene regulation, and the fraction of mammalian genomes that are under purifying selection and presumed functional. We review the evolutionary processes that shape genomes, with particular attention to variation in rates within genomes and along different lineages. Internet resources for accessing and analyzing the treasure trove of sequence alignments and annotations are reviewed, and we discuss critical problems to address in new bioinformatic developments in comparative genomics.
Collapse
Affiliation(s)
- Webb Miller
- The Center for Comparative Genomics and Bioinformatics, The Huck Institutes of Life Sciences, Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA.
| | | | | | | |
Collapse
|
6
|
Pennacchio LA. Insights from human/mouse genome comparisons. Mamm Genome 2003; 14:429-36. [PMID: 12925891 DOI: 10.1007/s00335-002-4001-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2003] [Accepted: 02/20/2003] [Indexed: 10/27/2022]
Abstract
Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish ( Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.
Collapse
Affiliation(s)
- Len A Pennacchio
- Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, California, USA.
| |
Collapse
|
7
|
Pennacchio LA, Rubin EM. Comparative genomic tools and databases: providing insights into the human genome. J Clin Invest 2003. [DOI: 10.1172/jci200317842] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
8
|
Pennacchio LA, Rubin EM. Comparative genomic tools and databases: providing insights into the human genome. J Clin Invest 2003; 111:1099-106. [PMID: 12697725 PMCID: PMC152942 DOI: 10.1172/jci17842] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Affiliation(s)
- Len A Pennacchio
- Genome Sciences Department, MS 84-171, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA.
| | | |
Collapse
|
9
|
Hardison RC, Roskin KM, Yang S, Diekhans M, Kent WJ, Weber R, Elnitski L, Li J, O'Connor M, Kolbe D, Schwartz S, Furey TS, Whelan S, Goldman N, Smit A, Miller W, Chiaromonte F, Haussler D. Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res 2003; 13:13-26. [PMID: 12529302 PMCID: PMC430971 DOI: 10.1101/gr.844103] [Citation(s) in RCA: 225] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2002] [Accepted: 11/14/2002] [Indexed: 11/24/2022]
Abstract
Six measures of evolutionary change in the human genome were studied, three derived from the aligned human and mouse genomes in conjunction with the Mouse Genome Sequencing Consortium, consisting of (1) nucleotide substitution per fourfold degenerate site in coding regions, (2) nucleotide substitution per site in relics of transposable elements active only before the human-mouse speciation, and (3) the nonaligning fraction of human DNA that is nonrepetitive or in ancestral repeats; and three derived from human genome data alone, consisting of (4) SNP density, (5) frequency of insertion of transposable elements, and (6) rate of recombination. Features 1 and 2 are measures of nucleotide substitutions at two classes of "neutral" sites, whereas 4 is a measure of recent mutations. Feature 3 is a measure dominated by deletions in mouse, whereas 5 represents insertions in human. It was found that all six vary significantly in megabase-sized regions genome-wide, and many vary together. This indicates that some regions of a genome change slowly by all processes that alter DNA, and others change faster. Regional variation in all processes is correlated with, but not completely accounted for, by GC content in human and the difference between GC content in human and mouse.
Collapse
Affiliation(s)
- Ross C Hardison
- Department of Biochemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Endrizzi M, Huang S, Scharf JM, Kelter AR, Wirth B, Kunkel LM, Miller W, Dietrich WF. Comparative sequence analysis of the mouse and human Lgn1/SMA interval. Genomics 1999; 60:137-51. [PMID: 10486205 DOI: 10.1006/geno.1999.5910] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Human chromosome 5q11.2-q13.3 and its ortholog on mouse chromosome 13 contain candidate genes for an inherited human neurodegenerative disorder called spinal muscular atrophy (SMA) and for an inherited mouse susceptibility to infection with Legionella pneumophila (Lgn1). These homologous genomic regions also have unusual repetitive organizations that create practical difficulties in mapping and raise interesting issues about the evolutionary origin of the repeats. In an attempt to analyze this region in detail, and as a way to identify additional candidate genes for these diseases, we have determined the sequence of 179 kb of the mouse Lgn1/SMA interval. We have analyzed this sequence using BLAST searches and various exon prediction programs to identify potential genes. Since these methods can generate false-positive exon declarations, our alignments of the mouse sequence with available human orthologous sequence allowed us to discriminate rapidly among this collection of potential coding regions by indicating which regions were well conserved and were more likely to represent actual coding sequence. As a result of our analysis, we accurately mapped two additional genes in the SMA interval that can be tested for involvement in the pathogenesis of SMA. While no new Lgn1 candidates emerged, we have identified new genetic markers that exclude Smn as an Lgn1 candidate. In addition to providing important resources for studying SMA and Lgn1, our data provide further evidence of the value of sequencing the mouse genome as a means to help with the annotation of the human genomic sequence and vice versa.
Collapse
Affiliation(s)
- M Endrizzi
- Department of Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Rishi AK, Sun RJ, Gao Y, Hsu CK, Gerald TM, Sheikh MS, Dawson MI, Reichert U, Shroot B, Fornace AJ, Brewer G, Fontana JA. Post-transcriptional regulation of the DNA damage-inducible gadd45 gene in human breast carcinoma cells exposed to a novel retinoid CD437. Nucleic Acids Res 1999; 27:3111-9. [PMID: 10454607 PMCID: PMC148537 DOI: 10.1093/nar/27.15.3111] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The biologically active synthetic retinoid CD437 (6-[3-adamantyl-4-hydroxyphenyl]-2-naphthalene, AHPN) and different human breast carcinoma (HBC) cell lines were used to examine the possible mechanism(s) of gadd45 induction. Northern blot analysis of mRNA isolated from MCF-7, MDA-MB-468 and MDA-MB-231 HBC cell lines demonstrated a progressive increase in the 1.4 kb gadd45 transcript after exposure to 1 microM CD437. Western blot analysis showed increased gadd45 protein levels in MDA-MB-468 HBC cells following exposure to CD437. CD437 increased gadd45 mRNA levels by approximately 20-fold in MDA-MB-468 cells, however, the transcriptional activity was increased approximately 2-3-fold as demonstrated by the human gadd45 promoter-luciferase reporter construct and nuclear run-off assays. Sublines of MDA-MB-468 HBC cells expressing stably integrated GADD45 cDNA fragments were obtained and CD437-dependent induction of GADD45 analyzed. We report that approximately 300 nt located in the 5"-untranslated region (5"-UTR) of gadd45 mRNA are involved in the CD437-dependent 4-fold enhanced stability of gadd45 transcripts. MDA-MB-468 cells were stably transfected with either a plasmid having a CMV promoter-driven rabbit beta-globin gene or plasmids having a CMV promoter-driven chimeric gadd45 5"-UTR-rabbit beta-globin gene, where the entire gadd45 5"-UTR (from +1 to +298) or a 45 bp subfragment of the gadd45 5"-UTR (from +10 to +55) was positioned at the 5"-end of the rabbit beta-globin gene. CD437 was found to up-regulate expression of both the chimeric gadd45 -rabbit beta-globin transcripts, suggesting that cis element(s) involved in the CD437-dependent enhanced stability of gadd45 mRNA are contained in the 45 nt of the 5"-UTR of the gadd45 mRNA.
Collapse
Affiliation(s)
- A K Rishi
- Department of Internal Medicine and Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, USA,
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Li W. Statistical properties of open reading frames in complete genome sequences. COMPUTERS & CHEMISTRY 1999; 23:283-301. [PMID: 10404621 DOI: 10.1016/s0097-8485(99)00014-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Some statistical properties of open reading frames in all currently available complete genome sequences are analyzed (seventeen prokatyotic genomes, and 16 chromosome sequences from the yeast genome). The size distribution of open reading frames is characterized by various techniques, such as quantile tables, QQ-plots, rank-size plots (Zipf's plots), and spatial densities. The issue of the influence of CG% on the size distribution is addressed. When yeast chromosomes are compared with archaeal and eubacterial genomes, they tend to have more long open reading frames. There is little or no evidence to reject the null hypothesis that open reading frames on six different reading frames and two strands distribute similarly. A topic of current interest, the base composition asymmetry in open reading frames between the two strands, is studied using regression analysis. The base composition asymmetry at three codon positions is analyzed separately. It was shown in these genome sequences that the first codon position is G- and A-rich (i.e. purine-rich); there is a co-existence of A- and T-rich branches at the second codon position; and the third codon position is weakly T-rich.
Collapse
Affiliation(s)
- W Li
- Laboratory of Statistical Genetics, Rockefeller University, New York, NY 10021, USA.
| |
Collapse
|
13
|
zu Putlitz J, Wands JR. Specific inhibition of hepatitis B virus replication by sense RNA. ANTISENSE & NUCLEIC ACID DRUG DEVELOPMENT 1999; 9:241-52. [PMID: 10435749 DOI: 10.1089/oli.1.1999.9.241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
We describe effects of sense RNA molecules on hepatitis B virus (HBV) replication and antigen synthesis in transiently transfected cells. When certain subgenomic fragments of HBV were expressed as sense RNA together with a replication-competent genome of HBV, they inhibited HBV replication by up to 75% and HBsAg secretion by up to 60%. The corresponding antisense sequences had a 50% inhibitory effect in one case and no effect in another case. The sense RNA species did not inhibit duck hepatitis B virus (DHBV) replication, suggesting specific inhibitory effects. HBV transcript levels were unaltered in the presence of sense RNA species, consistent with an inhibitory effect mediated at the posttranscriptional level. The inhibition of HBV replication by overexpression of sense RNA derived from the viral genome represents an example of sense cosuppression of an animal virus.
Collapse
Affiliation(s)
- J zu Putlitz
- Molecular Hepatology Laboratory, Massachusetts General Hospital Cancer Center and Harvard Medical School, Boston 02129, USA
| | | |
Collapse
|
14
|
Chiu CH, Gregoire L, Gumucio D, Muniz J, Lancaster W, Goodman M. Model for the fetal recruitment of simian ?-globin genes based on findings from two new world monkeysCebus apella andCallithrix jacchus (Platyrrhini, Primates). ACTA ACUST UNITED AC 1999. [DOI: 10.1002/(sici)1097-010x(19990415)285:1<27::aid-jez4>3.0.co;2-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
15
|
Calzolari R, McMorrow T, Yannoutsos N, Langeveld A, Grosveld F. Deletion of a region that is a candidate for the difference between the deletion forms of hereditary persistence of fetal hemoglobin and deltabeta-thalassemia affects beta- but not gamma-globin gene expression. EMBO J 1999; 18:949-58. [PMID: 10022837 PMCID: PMC1171187 DOI: 10.1093/emboj/18.4.949] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The analysis of a number of cases of beta-globin thalassemia and hereditary persistence of fetal hemoglobin (HPFH) due to large deletions in the beta-globin locus has led to the identification of several DNA elements that have been implicated in the switch from human fetal gamma- to adult beta-globin gene expression. We have tested this hypothesis for an element that covers the minimal distance between the thalassemia and HPFH deletions and is thought to be responsible for the difference between a deletion HPFH and deltabeta-thalassemia, located 5' of the delta-globin gene. This element has been deleted from a yeast artificial chromosome (YAC) containing the complete human beta-globin locus. Analysis of this modified YAC in transgenic mice shows that early embryonic expression is unaffected, but in the fetal liver it is subject to position effects. In addition, the efficiency of transcription of the beta-globin gene is decreased, but the developmental silencing of the gamma-globin genes is unaffected by the deletion. These results show that the deleted element is involved in the activation of the beta-globin gene perhaps through the loss of a structural function required for gene activation by long-range interactions.
Collapse
Affiliation(s)
- R Calzolari
- Department of Cell Biology and Genetics, Medical Genetics Centre, Faculty of Medicine, Erasmus University, PO Box 1738, 3000 DR Rotterdam, The Netherlands
| | | | | | | | | |
Collapse
|
16
|
zu Putlitz J, Wieland S, Blum HE, Wands JR. Antisense RNA complementary to hepatitis B virus specifically inhibits viral replication. Gastroenterology 1998; 115:702-13. [PMID: 9721168 DOI: 10.1016/s0016-5085(98)70150-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
BACKGROUND & AIMS Chronic infection with the hepatitis B virus (HBV) is a major public health problem, and currently available therapies have limited efficacy. Gene therapy strategies for HBV infection are under active investigation. We evaluated the potential of antisense RNA transcribed from antisense genes to interfere with HBV replication. METHODS Subgenomic fragments of the HBV genome were studied with respect to the property of inhibiting HBV replication when intracellularly expressed in the antisense orientation. RESULTS Antisense RNAs derived from the HBV genome specifically inhibited HBV replication and antigen expression in human hepatocellular carcinoma cells by 60%-75%. DNA sequences corresponding to the identified RNAs had no effect on HBV replication, indicating that inhibitory effects are mediated by RNA. Transcripts corresponding to the inhibitory subgenomic fragments were present at high levels. One antisense RNA was found to reduce the amount of pregenomic RNA encapsidated into core particles as a molecular mechanism of antiviral effects. CONCLUSIONS Certain antisense RNA molecules will have substantial antiviral effects against HBV. Antisense RNAs derived from the HBV genome are promising candidates as antiviral agents and may serve as novel tools to identify functionally important regions of HBV transcripts.
Collapse
Affiliation(s)
- J zu Putlitz
- Molecular Hepatology Laboratory, Massachusetts General Hospital Cancer Center and Harvard Medical School, Boston, Massachusetts, USA
| | | | | | | |
Collapse
|
17
|
Chiu CH, Schneider H, Slightom JL, Gumucio DL, Goodman M. Dynamics of regulatory evolution in primate beta-globin gene clusters: cis-mediated acquisition of simian gamma fetal expression patterns. Gene 1997; 205:47-57. [PMID: 9461379 DOI: 10.1016/s0378-1119(97)00476-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Phylogenetic reconstructions by parsimony were carried out on an enlarged body of primate gamma1 and gamma2-globin sequences. The results confirm that gamma1 and gamma2 arose from a tandem duplication in an ancient simian lineage ancestral to both platyrrhines (New World monkeys) and catarrhines (Old World monkeys and hominoids). Gene conversions between the two gamma homologs were frequent over the gamma gene proper but less frequent over the 5' flanking and very infrequent over the 3' flanking regions. The ancient platyrrhine conversion in the most distal 5' flanking region had the polarity of gamma2-->gamma1. Recent platyrrhine conversions between 5' regulatory sequences were very infrequent, in striking contrast to catarrhines which have large, uninterrupted stretches of converted 5' regulatory sequences. Comparisons of reconstructed ancestral primate and simian gamma promoter sequences revealed an accumulation of 21 nucleotide substitutions concentrated in or near cis-elements that may have mediated the change from embryonic to fetal gamma expression. Almost all 21 substitutions were retained in the lineages leading to functional gamma genes of extant catarrhines (both gamma1 and gamma2) and platyrrhines (most often gamma2). Fewer of these simian specific substitutions were retained in the platyrrhine gamma1 genes and new mutations occurred more often in the platyrrhine gamma1 than gamma2 promoters.
Collapse
Affiliation(s)
- C H Chiu
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI 48201, USA
| | | | | | | | | |
Collapse
|
18
|
Hardison RC, Oeltjen J, Miller W. Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res 1997; 7:959-66. [PMID: 9331366 DOI: 10.1101/gr.7.10.959] [Citation(s) in RCA: 209] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Affiliation(s)
- R C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
19
|
Bailey AD, Shen CC, Shen CK. Molecular origin of the mosaic sequence arrangements of higher primate alpha-globin duplication units. Proc Natl Acad Sci U S A 1997; 94:5177-82. [PMID: 9144211 PMCID: PMC24652 DOI: 10.1073/pnas.94.10.5177] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The human adult alpha-globin locus consists of three pairs of homology blocks (X, Y, and Z) interspersed with three nonhomology blocks (I, II, and III), and three Alu family repeats, Alu1, Alu2, and Alu3. It has been suggested that an ancient primate alpha-globin-containing unit was ancestral to the X, Y, and Z and the Alu1/Alu2 repeats. However, the evolutionary origin of the three nonhomologous blocks has remained obscure. We have now analyzed the sequence organization of the entire adult alpha-globin locus of gibbon (Hylobates lar). DNA segments homologous to human block I occur in both duplication units of the gibbon alpha-globin locus. Detailed interspecies sequence comparisons suggest that nonhomologous blocks I and II, as well as another sequence, IV, were all part of the ancestral alpha-globin-containing unit prior to its tandem duplication. However, sometime thereafter, block I was deleted from the human alpha1-globin-containing unit, and block II was also deleted from the alpha2-globin-containing unit in both human and gibbon. These were probably independent events both mediated by independent illegitimate recombination processes. Interestingly, the end points of these deletions coincide with potential insertion sites of Alu family repeats. These results suggest that the shaping of DNA segments in eukaryotic genomes involved the retroposition of repetitive DNA elements in conjunction with simple DNA recombination processes.
Collapse
Affiliation(s)
- A D Bailey
- Section of Molecular and Cellular Biology, University of California, Davis, CA 95616, USA
| | | | | |
Collapse
|
20
|
Oliveira PL, Kawooya JK, Ribeiro JM, Meyer T, Poorman R, Alves EW, Walker FA, Machado EA, Nussenzveig RH, Padovan GJ. A heme-binding protein from hemolymph and oocytes of the blood-sucking insect, Rhodnius prolixus. Isolation and characterization. J Biol Chem 1995; 270:10897-901. [PMID: 7738030 DOI: 10.1074/jbc.270.18.10897] [Citation(s) in RCA: 75] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
A heme-binding protein has been isolated and characterized from both the hemolymph and oocytes of the blood-sucking insect, Rhodnius prolixus. The protein from both sources is identical in most aspects studied. The Rhodnius heme-binding protein (RHBP) is composed of a single 15-kDa polypeptide chain coiled in a highly alpha-helical structure which binds non-covalently one heme/polypeptide chain. This RHBP is not produced by limited degradation of hemoglobin from the vertebrate host, since specific polyclonal antibodies against it do not cross-react with rabbit hemoglobin, and since it differs from hemoglobin in having a distinct amino-acid composition and NH2-terminal sequence. The spectrum of the dithionite-reduced protein has peaks at 426, 530, and 559 nm and resembles that of a b-type cytochrome. RHBP from hemolymph is not saturated with heme and promptly binds heme added to the solution. The oocyte protein, on the other hand, is fully saturated and is not capable of binding additional heme.
Collapse
Affiliation(s)
- P L Oliveira
- Departamento de Bioquímica Médica, Universidade Federal do Rio de Janeiro, Brasil
| | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
James-Pederson M, Yost S, Shewchuk B, Zeigler T, Miller R, Hardison R. Flanking and intragenic sequences regulating the expression of the rabbit alpha-globin gene. J Biol Chem 1995; 270:3965-73. [PMID: 7533158 DOI: 10.1074/jbc.270.8.3965] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Despite their descent from a common ancestral gene and the requirement for coordinated, tissue-specific regulation, the alpha- and beta-globin genes in many mammals are regulated in distinctly different ways. Unlike the beta-globin gene, the rabbit alpha-globin gene is transiently expressed at a high level without an added enhancer in transfected erythroid and non-erythroid cells. By examining a series of alpha/beta fusion genes, we show that internal sequences of the rabbit alpha-globin gene (within the first two exons and introns) are required along with the 5' flank for this enhancer-independent expression. Furthermore, deletion of the introns of the alpha-globin gene, or replacement by introns of the beta-globin gene, results in severely decreased expression of the transfecting genes. Hybrid constructs between segments of the alpha-globin gene and a luciferase gene confirm that internal alpha-globin sequences are needed for high level production of RNA in transfected cells. The flanking and internal sequences implicated in regulation of the rabbit alpha-globin gene coincide with a prominent CpG-rich island and may comprise an extended promoter (including both flanking and intragenic sequences) that is active in transfected cells without an enhancer.
Collapse
Affiliation(s)
- M James-Pederson
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park 16802
| | | | | | | | | | | |
Collapse
|
22
|
Koop B. Human and rodent genomes: Discrepant patterns of DNA sequence similarity. Anim Biotechnol 1994. [DOI: 10.1080/10495399409525810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
23
|
Volloch V, Schweitzer B, Rits S. Ligation-mediated amplification of RNA from murine erythroid cells reveals a novel class of beta globin mRNA with an extended 5'-untranslated region. Nucleic Acids Res 1994; 22:2507-11. [PMID: 8041612 PMCID: PMC308202 DOI: 10.1093/nar/22.13.2507] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Ligation-mediated RNA amplification was developed as a tool for analysis and determination of the termini of RNA molecules [Volloch et al. (1991) Proc. Natl. Acad. Sci. USA 88: 10671-10675]. In this approach, T4 RNA ligase is used to join cellular RNA with a defined ribo-oligonucleotide. Although several additional enzymatic steps are involved in this type of analysis, the reliability of the entire procedure is determined by the initial ligation step, which marks and preserves the termini of cellular RNA molecules. We applied this approach to the analysis of the 5' terminus of beta globin mRNA in various murine erythroid cells. As expected, we detected RNA molecules with 5' ends terminating at the regular cap site as well as globin RNA molecules truncated at the 5' end. Unexpectedly, we also detected a class of beta globin mRNA which is identical to regular beta globin mRNA in every respect but contains 17, 29, or 31 additional nucleotides 5' to the regular cap site. These extensions correspond precisely to the genomic segments just upstream of the regular cap site and are probably generated by initiation of transcription of the globin gene upstream from the regular cap site. It is likely that the extended globin RNA is transcribed not from the TATA promoter, which regulates the transcription of regular murine globin mRNA, but from the GATA regulatory element located 30 nucleotides upstream of the 31-nucleotide extension, in a position identical to that of the active GATA promoter of the TATA-less chicken beta globin gene. The evolutionary conservation of this relationship suggests the importance of the GATA promoter element of the mouse beta globin gene and its possible involvement in developmental regulation of expression of this gene.
Collapse
Affiliation(s)
- V Volloch
- Boston Biomedical Research Institute, MA 02114
| | | | | |
Collapse
|
24
|
Schrenzel MD, Watson JL, Ferrick DA. Characterization of horse (Equus caballus) T-cell receptor beta chain genes. Immunogenetics 1994; 40:135-44. [PMID: 7913080 DOI: 10.1007/bf00188177] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Genes encoding the horse (Equus caballus) T-cell receptor beta chain (TCRB) were cloned and characterized. Of 33 cDNA clones isolated from the mesenteric lymph node, 30 had functionally rearranged gene segments, and three contained germline sequences. Sixteen unique variable segments (TCRBV), 14 joining genes (TCRBJ), and two constant region genes (TCRBC) were identified. Horse TCRBV were grouped into nine families based on similarity to human sequences. TCRBV2 and TCRBV12 were the most commonly represented horse families. Analysis of predicted protein structure revealed the presence of conserved regions similar to those seen in TCRB of other species. A decanucleotide promoter sequence homologous to those found in humans and mice was located in the 5' untranslated region of one horse gene. Germline sequences included the 5' region of the TCRBD2 gene with flanking heptamer/nonamer recombination signals and portions of the TCRBJ2-C2 intron. Southern blot hybridizations demonstrated restriction fragment length polymorphisms at the TCRBC locus among different horse breeds.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Base Sequence
- Cloning, Molecular
- DNA, Complementary/genetics
- Gene Rearrangement, beta-Chain T-Cell Antigen Receptor/genetics
- Horses/classification
- Horses/genetics
- Horses/immunology
- Lymph Nodes/immunology
- Mesentery/immunology
- Molecular Sequence Data
- Polymorphism, Restriction Fragment Length
- Receptors, Antigen, T-Cell, alpha-beta/classification
- Receptors, Antigen, T-Cell, alpha-beta/genetics
- Sequence Analysis
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Species Specificity
Collapse
Affiliation(s)
- M D Schrenzel
- Department of Pathology, Microbiology and Immunology, School of Veterinary Medicine, University of California, Davis 95616
| | | | | |
Collapse
|
25
|
Koop BF, Hood L. Striking sequence similarity over almost 100 kilobases of human and mouse T-cell receptor DNA. Nat Genet 1994; 7:48-53. [PMID: 8075639 DOI: 10.1038/ng0594-48] [Citation(s) in RCA: 122] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
We report here the comparative DNA sequence analysis of nearly 100 kilobases of contiguous DNA in the C delta to C alpha region of the alpha/delta T cell receptor loci (TCRAC/TCRDC) of mouse and man. This analysis--the largest genomic sequence comparison so far--provides new insights into the functions of the T cell receptor genes as well as the surrounding chromosome structure through the identification of actively conserved DNA sequences. In this comparison we have identified a very high level of organizational and noncoding sequence similarity (approximately 71%) in contrast to previous findings in the beta-globin gene cluster. This observation begins to question the notion that much of the chromosomal non-coding sequence is junk.
Collapse
Affiliation(s)
- B F Koop
- Department of Biology, University of Victoria, British Columbia, Canada
| | | |
Collapse
|
26
|
Goodman M, Bailey WJ, Hayasaka K, Stanhope MJ, Slightom J, Czelusniak J. Molecular evidence on primate phylogeny from DNA sequences. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 1994; 94:3-24. [PMID: 8042704 DOI: 10.1002/ajpa.1330940103] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Evidence from DNA sequences on the phylogenetic systematics of primates is congruent with the evidence from morphology in grouping Cercopithecoidea (Old World monkeys) and Hominoidea (apes and humans) into Catarrhini, Catarrhini and Platyrrhini (ceboids or New World monkeys) into Anthropoidea, Lemuriformes and Lorisiformes into Strepsirhini, and Anthropoidea, Tarsioidea, and Strepsirhini into Primates. With regard to the problematic relationships of Tarsioidea, DNA sequences group it with Anthropoidea into Haplorhini. In addition, the DNA evidence favors retaining Cheirogaleidae within Lemuriformes in contrast to some morphological studies that favor placing Cheirogaleids in Lorisiformes. While parsimony analysis of the present DNA sequence data provides only modest support for Haplorhini as a monophyletic taxon, it provides very strong support for Hominoidea, Catarrhini, Anthropoidea, and Strepsirhini as monophyletic taxa. The parsimony DNA evidence also rejects the hypothesis that megabats are the sister group of either Primates or Dermoptera (flying lemur) or a Primate-Dermoptera clade and instead strongly supports the monophyly of Chiroptera, with megabats grouping with microbats at considerable distance from Primates. In contrast to the confused morphological picture of sister group relationships within Hominoidea, orthologous noncoding DNA sequences (spanning alignments involving as many as 20,000 base positions) now provide by the parsimony criterion highly significant evidence for the sister group relationships defined by a cladistic classification that groups the lineages to all extant hominoids into family Hominidae, divides this ape family into subfamilies Hylobatinae (gibbons) and Homininae, divides Homininae into tribes Pongini (orangutans) and Hominini, and divides Hominini into subtribes Gorillina (gorillas) and Hominina (humans and chimpanzees). A likelihood analysis of the largest body of these noncoding orthologues and counts of putative synapomorphies using the full range of sequence data from mitochondrial and nuclear genomes also find that humans and chimpanzees share the longest common ancestry.
Collapse
Affiliation(s)
- M Goodman
- Department of Anatomy, Wayne State University School of Medicine, Detroit, Michigan 48201
| | | | | | | | | | | |
Collapse
|
27
|
Reitman M, Grasso JA, Blumenthal R, Lewit P. Primary sequence, evolution, and repetitive elements of the Gallus gallus (chicken) beta-globin cluster. Genomics 1993; 18:616-26. [PMID: 8307571 DOI: 10.1016/s0888-7543(05)80364-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The DNA sequence of the Gallus gallus (chicken) beta-globin cluster was completed and analyzed. This G + C-rich region is 23.7 kb in length and includes the rho-, beta H-, beta A-, and epsilon-globin genes, the enhancer found between the beta A and epsilon genes, and three upstream DNase I hypersensitive sites. The CpG dinucleotides are nonrandomly distributed, being present at an increased relative frequency near the promoters and upstream hypersensitive sites. The cluster has an unusually low TA dinucleotide frequency. The upstream hypersensitive sites (5'HS1, 5'HS2, and 5'HS3) contain DNA sequence motifs recognized by erythroid transcription factors. However, no significant sequence similarity was found among the upstream hypersensitive sites and the beta A/epsilon enhancer. The G. gallus upstream site sequences were not similar to the upstream sites of the mammalian globin clusters, probably due to the small size of the functional regions and large evolutionary distance between the classes. The avian cluster evolved by gene duplication from an ancestor beta-globin gene, first producing the epsilon and the rho/beta H/beta A ancestor genes, then the rho and the beta H/beta A ancestor genes, and finally the beta H- and beta A-globins. Four probable gene conversions can be documented: beta A to beta H, epsilon to beta H, and rho/epsilon (twice). The cluster shows a massive overrepresentation of a non-LTR retrotransposon, CR1, which accounts for 16% of the DNA. We suggest that the locus is a preferred site for CR1 insertion.
Collapse
Affiliation(s)
- M Reitman
- Diabetes Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892
| | | | | | | |
Collapse
|
28
|
Hardison R, Xu J, Jackson J, Mansberger J, Selifonova O, Grotch B, Biesecker J, Petrykowska H, Miller W. Comparative analysis of the locus control region of the rabbit beta-like gene cluster: HS3 increases transient expression of an embryonic epsilon-globin gene. Nucleic Acids Res 1993; 21:1265-72. [PMID: 8464710 PMCID: PMC309292 DOI: 10.1093/nar/21.5.1265] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The rabbit homolog to the locus control region (LCR) of the human beta-like globin gene cluster was isolated, and long segments containing the DNase I hypersensitive sites (HS) were sequenced. The order and spacing of HS4, HS3, HS2 and HS1 are conserved between rabbit and human. Alignment of these sequences with their homologs from human, goat, and mouse shows that very long segments of DNA match between species, for over a thousand base pairs on either side of the previously identified functional cores, indicating that some important functions are found outside the cores. The activity of rabbit HS2 and HS3 was tested by attaching each to a novel reporter gene constructed by inserting the luciferase coding region into the rabbit epsilon-globin gene. In contrast to previous reports showing no effect of human or mouse HS3 on transient expression, both the rabbit HS2 and HS3 DNA fragments separately increased transient expression from the epsilon-luciferase hybrid gene and expression from stably integrated constructs in K562 erythroleukemia cells.
Collapse
Affiliation(s)
- R Hardison
- Department of Molecular and Cell Biology, Pennsylvania State University, University Park 16802
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Hardison R, Chao KM, Adamkiewicz M, Price D, Jackson J, Zeigler T, Stojanovic N, Miller W. Positive and negative regulatory elements of the rabbit embryonic epsilon-globin gene revealed by an improved multiple alignment program and functional analysis. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1993; 4:163-76. [PMID: 8161819 DOI: 10.3109/10425179309015629] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The epsilon-globin genes of mammals are expressed in early embryos, but are silenced during fetal and adult erythropoiesis. As a guide to defining the regulatory elements involved in this developmental switch, we have searched the sequences of epsilon-globin genes from different mammals for highly conserved segments. The search was facilitated by the development of a new program, called yama, to generate a multiple alignment of very long sequences using an improved scoring scheme. This allowed us to generate a multiple alignment of sequences from a more divergent group than previously analyzed, as demonstrated here for representatives of four mammalian orders. In parallel experiments, we constructed a series of deletion mutations in the 5' flank of the rabbit epsilon-globin gene and tested their effect on an epsilon-globin-luciferase hybrid reporter gene. These results show that 121 bp of 5' flank, containing CACC, CCAAT and ATA motifs, is sufficient for expression in erythroid K562 cells. Both positive and negative cis-acting control sequences are located between 218 and 394 bp 5' to the cap site, in a region previously proposed to be a silencer. The positive regulatory sequence contains conserved binding sites for the nuclear protein YY1 adjacent to another highly conserved sequence. The negative element contains a conserved sequence followed by a purine-rich segment. This analysis maps the upstream control sequences more precisely and points to a very complex regulatory scheme for this gene.
Collapse
Affiliation(s)
- R Hardison
- Department of Molecular and Cell Biology, Pennsylvania State University, University Park 16802
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Affiliation(s)
- J L Slightom
- Molecular Biology Unit, Upjohn Company, Kalamazoo, Michigan 49007
| |
Collapse
|
31
|
Abstract
We propose a method of measuring the human genomic deleterious mutation rate based on comparison of the rate of evolution per nucleotide per generation of neutral sequences, microN, with the overall rate of evolution of unique sequence DNA, microO. Data on microN, which are based on pseudogenes, permit an estimate of the total zygotic (twice the genomic) mutation rate, UT = 2 microNn = 2(2 x 10(-8)(3 x 10(9)) approximately 10(2), where n is the number of nucleotides per genome. Data on microO can be obtained by comparison of representative samples of relatively short homologous sequences from the genomes of Homo and a related species. The fraction of the genome that is controlled by negative selection is fS = (microN - microO)/microN and the deleterious zygotic mutation rate, the expected number of new deleterious alleles carried by a zygote, is US = UTfS. If fS > 0.01, US > 1, which would have important implications for human genetics and evolutionary biology.
Collapse
Affiliation(s)
- A S Kondrashov
- Institute of Mathematical Problems of Biology, Pushchino, Moscow Region, Russia
| | | |
Collapse
|
32
|
Jiménez G, Gale KB, Enver T. The mouse beta-globin locus control region: hypersensitive sites 3 and 4. Nucleic Acids Res 1992; 20:5797-803. [PMID: 1454540 PMCID: PMC334418 DOI: 10.1093/nar/20.21.5797] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The human beta-globin LCR plays a key role in the transcriptional regulation of the beta-globin locus and comprises four erythroid specific DNase I hypersensitive sites, designated 5'HS1-4. We have now isolated genomic clones containing 5'HS3 and 5'HS4 of the mouse beta-globin LCR. 5'HS3 and 5'HS4 are located 15 kb and 22 kb upstream of the mouse epsilon y-globin gene, respectively. Sequence analysis of murine 5'HS3 and 5'HS4 reveals a significant degree of sequence conservation with their human homologues, including the presence of recognition sites for functionally relevant transcription factors. 5'HS3 and 5'HS4 regions were found to form hypersensitive sites in nuclei from murine erythroid cells, but not in nuclei from a variety of nonerythroid haematopoietic cell lines. Analysis of different mouse strains revealed the existence of a polymorphism that alters the spacing between 5'HS3 and 5'HS4. Taken together, our results emphasize the extent of evolutionary conservation and complexity of mammalian beta-globin LCRs. Finally, the cloning of mouse 5'HS3 and 5'HS4 will facilitate the molecular analysis of LCR function in the mouse model.
Collapse
Affiliation(s)
- G Jiménez
- Leukaemia Research Fund Centre, Institute of Cancer Research, Chester Beatty Laboratories, London, UK
| | | | | |
Collapse
|
33
|
Koop BF, Wilson RK, Wang K, Vernooij B, Zallwer D, Kuo CL, Seto D, Toda M, Hood L. Organization, structure, and function of 95 kb of DNA spanning the murine T-cell receptor C alpha/C delta region. Genomics 1992; 13:1209-30. [PMID: 1505954 DOI: 10.1016/0888-7543(92)90039-u] [Citation(s) in RCA: 102] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We have analyzed the organization, structure, and function of the murine T-cell receptor C alpha/C delta region. This region spans 94.6 kb of DNA and contains the C alpha and C delta genes, as well as the V delta 5, J delta 2, and 50 different J alpha gene segments. Within this sequence we have identified 15 new J alpha gene segments, 40 new 5' RNA splice signals, and 40 new DNA rearrangement signals for the J alpha gene segments. The murine C alpha/C delta sequence contains an exceptionally high level of coding sequence with over 5.7% of the total sequence found in the exons. This is much more than that found in the beta-globin locus and the HPRT locus. Using the sequence data obtained from the C alpha/C delta region, we have designed simple assays to test for J alpha gene segment transcription and to determine the level of polymorphism for simple repeat sequences among different inbred strains of mice using the polymerase chain reaction. Furthermore, comparisons of this 95 kb of sequence with the available sequence from homologous regions of other species have led to the identification of a highly conserved sequence that is present throughout vertebrates and in the mouse binds lymphocyte-specific nuclear proteins. Comparisons of a 10-kb region, which includes the C alpha gene in human and mouse, average 66% sequence similarity. These studies support the contention that large-scale DNA sequencing projects of homologous regions of mouse and human will provide powerful new tools for studying the biology and evolution of loci such as the T-cell receptor and for identifying and posing new questions about the functions of conserved sequences.
Collapse
Affiliation(s)
- B F Koop
- Division of Biology, California Institute of Technology, Pasadena 91125
| | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Tagle DA, Stanhope MJ, Siemieniak DR, Benson P, Goodman M, Slightom JL. The beta globin gene cluster of the prosimian primate Galago crassicaudatus: nucleotide sequence determination of the 41-kb cluster and comparative sequence analyses. Genomics 1992; 13:741-60. [PMID: 1639402 DOI: 10.1016/0888-7543(92)90150-q] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The nucleotide sequence of the beta globin gene cluster of the prosimian Galago crassicaudatus has been determined. A total sequence spanning 41,101 bp contains and links together previously published sequences of the five galago beta-like globin genes (5'-epsilon-gamma-psi eta-delta-beta-3'). A computer-aided search for middle interspersed repetitive sequences identified 10 LINE (L1) elements, including a 5' truncated repeat that is orthologous to the full-length L1 element found in the human epsilon-gamma intergenic region. SINE elements that were identified included one Alu type I repeat, four Alu type II repeats, and two methionine tRNA-derived Monomer (type III) elements. Alu type II and Monomer sequences are unique to the galago genome. Structural analyses of the cluster sequence reveals that it is relatively A+T rich (about 62%) and regions with high G+C content are associated primarily with globin coding regions. Comparative analyses with the beta globin cluster sequences of human, rabbit, and mouse reveal extensive sequence homologies in their genic regions, but only human, galago, and rabbit sequences share extensive intergenic sequence homologies. Divergence analyses of aligned intergenic and flanking sequences from orthologous human, galago, and rabbit sequences show a gradation in the rate of nucleotide sequence evolution along the cluster where sequences 5' of the epsilon globin gene region show the least sequence divergence and sequences just 5' of the beta globin gene region show the greatest sequence divergence.
Collapse
Affiliation(s)
- D A Tagle
- Department of Molecular Biology, Wayne State University School of Medicine, Detroit, Michigan 48201
| | | | | | | | | | | |
Collapse
|
35
|
Bailey WJ, Hayasaka K, Skinner CG, Kehoe S, Sieu LC, Slightom JL, Goodman M. Reexamination of the African hominoid trichotomy with additional sequences from the primate beta-globin gene cluster. Mol Phylogenet Evol 1992; 1:97-135. [PMID: 1342932 DOI: 10.1016/1055-7903(92)90024-b] [Citation(s) in RCA: 98] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Additional DNA sequence information from a range of primates, including 13.7 kb from pygmy chimpanzee (Pan paniscus), was added to data sets of beta-globin gene cluster sequence alignments that span the gamma 1, gamma 2, and psi eta loci and their flanking and intergenic regions. This enlarged body of data was used to address the issue of whether the ancestral separations of gorilla, chimpanzee, and human lineages resulted from only one trichotomous branching or from two dichotomous branching events. The degree of divergence, corrected for superimposed substitutions, seen in the beta-globin gene cluster between human alleles is about a third to a half that observed between two species of chimpanzee and about a fourth that between human and chimpanzee. The divergence either between chimpanzee and gorilla or between human and gorilla is slightly greater than that between human and chimpanzee, suggesting that the ancestral separations resulted from two closely spaced dichotomous branchings. Maximum parsimony analysis further strengthened the evidence that humans and chimpanzees share the longest common ancestry. Support for this human-chimpanzee clade is statistically significant at P = 0.002 over a human-gorilla clade or a chimpanzee-gorilla clade. An analysis of expected and observed homoplasy revealed that the number of sequence changes uniquely shared by human and chimpanzee lineages is too large to be attributed to homoplasy. Molecular clock calculations that accommodated lineage variations in rates of molecular evolution yielded hominoid branching times that ranged from 17-19 million years ago (MYA) for the separation of gibbon from the other hominoids to 5-7 MYA for the separation of chimpanzees from humans. Based on the relatively late dates and mounting corroborative evidence from unlinked nuclear genes and mitochondrial DNA for the close sister grouping of humans and chimpanzees, a cladistic classification would place all apes and humans in the same family. Within this family, gibbons would be placed in one subfamily and all other extant hominoids in another subfamily. The later subfamily would be divided into a tribe for orangutans and another tribe for gorillas, chimpanzees, and humans. Finally, gorillas would be placed in one subtribe with chimpanzees and humans in another, although this last division is not as strongly supported as the other divisions.
Collapse
Affiliation(s)
- W J Bailey
- Department of Molecular Biology and Genetics, Wayne State University School of Medicine, Detroit, Michigan 48201
| | | | | | | | | | | | | |
Collapse
|
36
|
Mark WH, Signorelli K, Blum M, Kwee L, Lacy E. Genomic structure of the locus associated with an insertional mutation in line 4 transgenic mice. Genomics 1992; 13:159-66. [PMID: 1577480 DOI: 10.1016/0888-7543(92)90216-f] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
In line 4 transgenic mice, six to eight copies of a 50-kb lambda recombinant clone are arranged in a head-to-tail tandem array on chromosome 3. Embryos homozygous for the transgene become arrested in their development on Day 5 of gestation shortly after implantation. The insertion site was cloned using a small segment of the transgene as a probe. Comparison of the insertion site with the wildtype locus indicates that a 22-kb deletion of host DNA has occurred in line 4 mice. Restriction enzyme analyses showed that neither the tandem array nor the flanking chromosomal DNA had any detectable rearrangements. Sequencing of the junctions between host and foreign DNA, however, revealed insertions of small fragments of DNA of unknown origin as well as an insertion of a DNA segment derived from another region of the transgene. Therefore, disruption of an essential gene in the line 4 transgenic mouse may have been caused by the insertion of 300-400 kb of foreign DNA or a deletion of sequences in the host genome.
Collapse
Affiliation(s)
- W H Mark
- DeWitt Wallace Research Laboratory, Sloan-Kettering Institute, Memorial Sloan-Kettering Cancer Center, New York, New York
| | | | | | | | | |
Collapse
|
37
|
Hayasaka K, Fitch DH, Slightom JL, Goodman M. Fetal recruitment of anthropoid gamma-globin genes. Findings from phylogenetic analyses involving the 5'-flanking sequences of the psi gamma 1 globin gene of spider monkey Ateles geoffroyi. J Mol Biol 1992; 224:875-81. [PMID: 1569563 DOI: 10.1016/0022-2836(92)90568-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We determined the nucleotide sequence of a 2.5 kb DNA fragment (1 kb is 10(3) base-pairs) that includes exon 1, intron 1 and about 1.4 kb of 5'-flanking DNA of the spider monkey gamma 1-globin pseudogene locus and compared this sequence to its homologous from other primates and rabbit. This region of the gamma 1 locus of spider monkey still retains conserved regulatory elements, suggesting that it became a pseudogene late in New World monkey phylogeny. In the 250 base-pair region immediately 5' from the transcription start site where many known regulatory elements are located, a higher rate of nucleotide substitutions occurred in the ancestral anthropoid (human, ape and monkey) lineage than in the prosimian (galago) lineage, as was also the case for non-synonymous substitutions in the coding region. The opposite pattern was observed for most other non-coding regions and for synonymous substitutions. These substitution patterns correlate with the embryonic-to-fetal transformation of the gamma-globin genes of the ancestral anthropoids. Analysis of the 5'-flanking sequences suggests that 11 gene conversion events have occurred in the anthropoid gamma-gene lineages. In the parts of the 5'-flanking region where no gene conversions have been detected, gamma 2-gene sequences have accumulated more nucleotide changes than gamma 1, which suggests that the gamma 2 gene was the more redundant duplicate that may have accumulated first the nucleotide changes responsible for the anthropoid fetal pattern of gamma-globin gene expression.
Collapse
Affiliation(s)
- K Hayasaka
- Department of Anatomy and Cell Biology, Wayne State University School of Medicine, Detroit, MI 48201
| | | | | | | |
Collapse
|
38
|
Li H, Subar M, Lee KM, Boussios T. Cloning and sequence analysis of two embryonic beta-like globin cDNAs (y and z) of hamster. BIOCHIMICA ET BIOPHYSICA ACTA 1992; 1130:218-20. [PMID: 1339291 DOI: 10.1016/0167-4781(92)90532-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
A cDNA library was prepared from poly(A) mRNA extracted from 9-day hamster-yolk-sac erythroid cells. Two clones containing inserts coding for embryonic beta-like z or y globin-chains were isolated. Their identity was confirmed by (a) translation of hybrid selected mRNAs and (b) nucleotide sequence analysis of the inserts and comparison to the embryonic beta-like globin genes of Balb/c mouse. Availability of sequences for embryonic and adult globin cDNAs will aid in investigations of the molecular mechanisms of the globin switch in hamster YSEC.
Collapse
Affiliation(s)
- H Li
- Medical Service, St. Luke's-Roosevelt Hospital Center, New York, NY 10025
| | | | | | | |
Collapse
|
39
|
Perrin-Pecontal P, Gouy M, Nigon VM, Trabuchet G. Evolution of the primate beta-globin gene region: nucleotide sequence of the delta-beta-globin intergenic region of gorilla and phylogenetic relationships between African apes and man. J Mol Evol 1992; 34:17-30. [PMID: 1556740 DOI: 10.1007/bf00163849] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
A 6.0-kb DNA fragment from Gorilla gorilla including the 5' part of the beta-globin gene and about 4.5 kb of its upstream flanking region was cloned and sequenced. The sequence was compared to the human, chimpanzee, and macaque delta-beta intergenic region. This analysis reveals four tandemly repeated sequences (RS), at the same location in the four species, showing a variable number of repeats generating both intraspecific (polymorphism) and interspecific variability. These tandem arrays delimit five regions of unique sequence called IG for intergenic. The divergence for these IG sequences is 1.85 +/- 0.22% between human and gorilla, which is not significantly different from the value estimated in the same region between chimpanzee and human (1.62 +/- 0.21%). The CpG and TpA dinucleotides are avoided. CpGs evolve faster than other sequence sites but do not confuse phylogenetic inferences by producing parallel mutations in different lineages. About 75% of CpG doublets have become TpG or CpA since the common ancestor, in agreement with the methylation/deamination pattern. Comparison of this intergenic region gives information on branching order within Hominoidea. Parsimony and distance-based methods when applied to the delta-beta intergenic region provide evidence (although not statistically significant) that human and chimpanzee are more closely related to each other than to gorilla. CpG sites are indeed rich in information by carrying substitutions along the short internal branch. Combining these results with those on the psi eta-delta intergenic region, shows in a statistically significant way that chimpanzee is the closest relative of human.
Collapse
Affiliation(s)
- P Perrin-Pecontal
- Centre de Génétique Moléculaire et Cellulaire, UMR 106, Université Claude Bernard, Lyon I, France
| | | | | | | |
Collapse
|
40
|
Shaw JP, Marks J, Shen CK. The adult alpha-globin locus of Old World monkeys: an abrupt breakdown of sequence similarity to human is defined by an Alu family repeat insertion site. J Mol Evol 1991; 33:506-13. [PMID: 1779433 DOI: 10.1007/bf02102803] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The haploid genomes of all known primates have two or more adult alpha-globin genes contained within tandemly arranged duplication units. Although the tandem duplication event generating these alpha-globin loci is believed to occur prior to the divergence of primates, a number of length polymorphisms exist within the loci among different primate species. In order to understand the molecular basis of these length polymorphisms, we have cloned and determined the nucleotide sequence of a major portion of the rhesus monkey adult alpha-globin locus. Sequence comparison to human suggests that the length difference between the adult alpha-globin loci of human and Old World monkey is the result of one or more DNA recombination processes, all of which appeared to be related to the transposition of Alu family repeats. First, the finding of a monomeric Alu family repeat at the junction between nonhomology block I and homology block Y of the alpha 2 gene-containing unit in rhesus macaque suggests that the dimeric Alu family repeat, Alu 3, at the orthologous position in human was generated by insertion of a monomeric Alu family repeat into the 3' end of another preexisting Alu family repeat. Second, two Alu family repeats, Alu 1 and Alu 2, exist in human at the 3' end of each of the two X homology blocks, respectively. However, this pair of paralogous Alu family repeats is absent at the corresponding positions in rhesus macaques. This raises interesting questions regarding the evolutionary origin of Alu 1 and Alu 2. Finally, DNA sequences immediately downstream from the insertion site of Alu 2 are completely different between human and rhesus macaque.(ABSTRACT TRUNCATED AT 250 WORDS)
Collapse
Affiliation(s)
- J P Shaw
- Department of Genetics, University of California, Davis 95616
| | | | | |
Collapse
|
41
|
Hardison R, Krane D, Vandenbergh D, Cheng JF, Mansberger J, Taddie J, Schwartz S, Huang XQ, Miller W. Sequence and comparative analysis of the rabbit alpha-like globin gene cluster reveals a rapid mode of evolution in a G + C-rich region of mammalian genomes. J Mol Biol 1991; 222:233-49. [PMID: 1960725 DOI: 10.1016/0022-2836(91)90209-o] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
A sequence of 10,621 base-pairs from the alpha-like globin gene cluster of rabbit has been determined. It includes the sequence of gene zeta 1 (a pseudogene for the rabbit embryonic zeta-globin), the functional rabbit alpha-globin gene, and the theta 1 pseudogene, along with the sequences of eight C repeats (short interspersed repeats in rabbit) and a J sequence implicated in recombination. The region is quite G + C-rich (62%) and contains two CpG islands. As expected for a very G + C-rich region, it has an abundance of open reading frames, but few of the long open reading frames are associated with the coding regions of genes. Alignments between the sequences of the rabbit and human alpha-like globin gene clusters reveal matches primarily in the immediate vicinity of genes and CpG islands, while the intergenic regions of these gene clusters have many fewer matches than are seen between the beta-like globin gene clusters of these two species. Furthermore, the non-coding sequences in this portion of the rabbit alpha-like globin gene cluster are shorter than in human, indicating a strong tendency either for sequence contraction in the rabbit gene cluster or for expansion in the human gene cluster. Thus, the intergenic regions of the alpha-like globin gene clusters have evolved in a relatively fast mode since the mammalian radiation, but not exclusively by nucleotide substitution. Despite this rapid mode of evolution, some strong matches are found 5' to the start sites of the human and rabbit alpha genes, perhaps indicating conservation of a regulatory element. The rabbit J sequence is over 1000 base-pairs long; it contains a C repeat at its 5' end and an internal region of homology to the 3'-untranslated region of the alpha-globin gene. Part of the rabbit J sequence matches with sequences within the X homology block in human. Both of these regions have been implicated as hot-spots for recombination, hence the matching sequences are good candidates for such a function. All the interspersed repeats within both gene clusters are retroposon SINEs that appear to have inserted independently in the rabbit and human lineages.
Collapse
Affiliation(s)
- R Hardison
- Department of Molecular and Cell Biology, Pennsylvania State University, University Park 16802
| | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Schwartz S, Miller W, Yang CM, Hardison RC. Software tools for analyzing pairwise alignments of long sequences. Nucleic Acids Res 1991; 19:4663-7. [PMID: 1891357 PMCID: PMC328706 DOI: 10.1093/nar/19.17.4663] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Pairwise comparison of long stretches of genomic DNA sequence can identify regions conserved across species, which often indicate functional significance. However, the novel insights frequently must be windowed from a flood of information; for instance, running an alignment program on two 50-kilobase sequences might yield over a hundred pages of alignments. Direct inspection of such a volume of printed output is infeasible, or at best highly undesirable, and computer tools are needed to summarize the information, to assist in its analysis, and to report the findings. This paper describes two such software tools. One tool prepares publication-quality pictorial representations of alignments, while another facilitates interactive browsing of pairwise alignment data. Their effectiveness is illustrated by comparing the beta-like globin gene clusters between humans and rabbits. A second example compares the chloroplast genomes of tobacco and liverwort.
Collapse
Affiliation(s)
- S Schwartz
- Department of Computer Science, Pennsylvania State University, University Park 16802
| | | | | | | |
Collapse
|
43
|
Fitch DH, Bailey WJ, Tagle DA, Goodman M, Sieu L, Slightom JL. Duplication of the gamma-globin gene mediated by L1 long interspersed repetitive elements in an early ancestor of simian primates. Proc Natl Acad Sci U S A 1991; 88:7396-400. [PMID: 1908094 PMCID: PMC52302 DOI: 10.1073/pnas.88.16.7396] [Citation(s) in RCA: 89] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Regions surrounding the single gamma-globin gene of galago and the duplicated gamma 1- and gamma 2-globin genes of gibbon, rhesus monkey, and spider monkey were sequenced and aligned with those from humans. Contrary to previous studies, spider monkey was found to have not one but two gamma-globin genes, only one of which (gamma 2) is functional. The reconstructed evolutionary history of the gamma-globin genes and their flanking sequences traces their origin to a tandem duplication of a DNA segment approximately 5.5 kilobases long that occurred before catarrhine primates (humans, apes, and Old World monkeys) diverged from platyrrhines (New World monkeys), much earlier than previously thought. This reconstructed molecular history also reveals that the duplication resulted from an unequal homologous crossover between two related L1 long interspersed repetitive elements, one upstream and one downstream of the single ancestral gamma-globin gene. Perhaps facilitated by the redundancy resulting from the duplication, the gamma-globin genes escaped the selective constraints of embryonically functioning genes and evolved into fetally functioning genes. This view is supported by the finding that a burst of nonsynonymous substitutions occurred in the gamma-globin genes while they became restructured for fetal expression in the common ancestor of platyrrhines and catarrhines.
Collapse
Affiliation(s)
- D H Fitch
- Department of Anatomy, Wayne State University School of Medicine, Detroit, MI 48201
| | | | | | | | | | | |
Collapse
|
44
|
Hanscombe O, Whyatt D, Fraser P, Yannoutsos N, Greaves D, Dillon N, Grosveld F. Importance of globin gene order for correct developmental expression. Genes Dev 1991; 5:1387-94. [PMID: 1714415 DOI: 10.1101/gad.5.8.1387] [Citation(s) in RCA: 243] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
We have used transgenic mice to study the influence of position of the human globin genes relative to the locus control region (LCR) on their expression pattern during development. The LCR, which is located 5' of the globin gene cluster, is normally required for the activation of all the genes. When the human beta-globin gene is linked as a single gene to the LCR it is activated prematurely in the embryonic yolk sac. We show that the correct timing of beta gene activation is restored when it is placed farther from the LCR than a competing human gamma- or alpha-globin gene. Correct timing is not restored when beta is the globin gene closest to the LCR. Similarly, the human gamma-globin gene is silenced earlier when present farthest from the LCR. On the basis of this result, we propose a model of developmental gene control based on stage-specific elements immediately flanking the genes and on polarity in the locus. We suggest that the difference in relative distance to the LCR, which is a consequence of the ordered arrangement of the genes, results in nonreciprocal competition between the genes for activation by the LCR.
Collapse
Affiliation(s)
- O Hanscombe
- Laboratory of Gene Structure and Expression, National Institute for Medical Research, The Ridgeway, London, UK
| | | | | | | | | | | | | |
Collapse
|
45
|
Vandenbergh DJ, James-Pederson M, Hardison RC. An apparent pause site in the transcription unit of the rabbit alpha-globin gene. J Mol Biol 1991; 220:255-70. [PMID: 1856860 DOI: 10.1016/0022-2836(91)90011-t] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Transcription of the rabbit alpha-globin gene begins primarily at the cap site, although some upstream start sites are also observed. Analysis by RNA polymerase run-on assays in nuclei shows that transcription continues at a high level past the polyadenylation site, after which the polymerase density actually increases in a region of about 400 nucleotides, followed by a gradual decline over the 700 nucleotides. These features are also observed in the transcription unit of the rabbit beta-globin gene. The region with the unexpectedly high nascent RNA hybridization signal in the 3' flank contains a conserved sequence, KGCAGCWGGR (K = G or T, W = A or T, R = A or G), followed by an inverted repeat. The inverted repeat (perhaps with the conserved sequence) may be a pause site for RNA polymerase II, thus accounting for the increase in polymerase density. This sequence and inverted repeat are found in the 3' flank of several globin genes and the simian virus 40 (SV40) early genes, as well as in the regions implicated in pausing or termination of transcription of eight different genes. Deletion of the conserved sequence and inverted repeat from the 3' flank of the SV40 early region causes a small increase in the levels of transcription downstream from this site. Replacement with the conserved sequence and inverted repeat from the rabbit alpha-globin gene causes an accumulation of polymerases, supporting the hypothesis that polymerases pause at this site. This proposed pause site may affect the efficiency of termination at some sites further downstream, perhaps by loss of a processivity factor.
Collapse
Affiliation(s)
- D J Vandenbergh
- Department of Molecular and Cell Biology, Pennsylvania State University, University Park 16802
| | | | | |
Collapse
|
46
|
Transcriptional role of a conserved GATA-1 site in the human epsilon-globin gene promoter. Mol Cell Biol 1991. [PMID: 2017165 DOI: 10.1128/mcb.11.5.2558] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The epsilon-globin gene is the first of the human beta-like globin genes to be expressed during development. We have analyzed protein-DNA interactions in the epsilon-globin promoter region by DNase I footprinting and electrophoretic mobility shift experiments using nuclear extracts from K562 human erythroid cells and from nonerythroid HeLa cells. A restricted set of ubiquitous proteins, including Sp1, bound to regions of the promoter including the CACCC and CCAAT sites. Three interactions, at positions -213, -165, and +3 relative to the transcription start site, were erythroid specific and corresponded to binding of GATA-1, a transcription factor highly restricted to the erythroid lineage. Interestingly, the GATA-1 site at -165 has been conserved in the promoters of 10 mammalian embryonic globin genes. Point mutations demonstrate that GATA-1 binding to this site is necessary for interaction with an erythroid-specific enhancer but that in the absence of an enhancer, GATA-1 does not increase transcription.
Collapse
|
47
|
Gong QH, Stern J, Dean A. Transcriptional role of a conserved GATA-1 site in the human epsilon-globin gene promoter. Mol Cell Biol 1991; 11:2558-66. [PMID: 2017165 PMCID: PMC360025 DOI: 10.1128/mcb.11.5.2558-2566.1991] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The epsilon-globin gene is the first of the human beta-like globin genes to be expressed during development. We have analyzed protein-DNA interactions in the epsilon-globin promoter region by DNase I footprinting and electrophoretic mobility shift experiments using nuclear extracts from K562 human erythroid cells and from nonerythroid HeLa cells. A restricted set of ubiquitous proteins, including Sp1, bound to regions of the promoter including the CACCC and CCAAT sites. Three interactions, at positions -213, -165, and +3 relative to the transcription start site, were erythroid specific and corresponded to binding of GATA-1, a transcription factor highly restricted to the erythroid lineage. Interestingly, the GATA-1 site at -165 has been conserved in the promoters of 10 mammalian embryonic globin genes. Point mutations demonstrate that GATA-1 binding to this site is necessary for interaction with an erythroid-specific enhancer but that in the absence of an enhancer, GATA-1 does not increase transcription.
Collapse
Affiliation(s)
- Q H Gong
- Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, Maryland 20892
| | | | | |
Collapse
|
48
|
Bacolla A, Wu FY. Mung bean nuclease cleavage pattern at a polypurine.polypyrimidine sequence upstream from the mouse metallothionein-I gene. Nucleic Acids Res 1991; 19:1639-47. [PMID: 2027772 PMCID: PMC333927 DOI: 10.1093/nar/19.7.1639] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Mung bean nuclease, an enzyme specific for single-stranded DNA, was used to probe a non-B DNA structure present in the mouse metallothionein-I gene. The region sensitive to the enzyme was constituted by a 128 base-pair long polypurine.polypyrimidine sequence located at 1.2-kb from the start of transcription. A detailed analysis of the mung bean nuclease cleavage pattern revealed that: (i) under conditions of supercoiling and low pH a triplex structure was formed, (ii) the triplex was flanked by a sequence with the potential of forming a Z-DNA structure, (iii) most of the enzymatic activity was localized at some of the junctions between double-stranded and triple-stranded DNA and at mismatches in the triplex, (iv) no unpaired bases were observed in the loop or outside the triplex, and (v) the triplex was present in more than one configuration.
Collapse
Affiliation(s)
- A Bacolla
- Department of Pharmacological Sciences, State University of New York, Stony Brook 11794
| | | |
Collapse
|
49
|
Tagle DA, Slightom JL, Jones RT, Goodman M. Concerted evolution led to high expression of a prosimian primate delta globin gene locus. J Biol Chem 1991. [DOI: 10.1016/s0021-9258(20)89470-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
50
|
Li Q, Zhou B, Powers P, Enver T, Stamatoyannopoulos G. Primary structure of the goat beta-globin locus control region. Genomics 1991; 9:488-99. [PMID: 2032720 DOI: 10.1016/0888-7543(91)90415-b] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The goat beta-globin cluster is composed of a triplicated four-gene set. A locus control region (LCR) containing elements homologous to 5'DNase I hypersensitive sites (HS) 1, 2, and 3 of the human beta-globin LCR has been identified at the 5' end of this locus. We determined 10.2 kb of nucleotide sequence from the goat beta-globin locus control region. Self-comparison of this sequence by dot matrix analysis revealed the presence of six complete and three incomplete artiodactyl repeats. A novel repeated element, termed D repeat, was also identified. Southern blotting analysis demonstrated that these elements exist in the goat genome as a low to medium frequency interspersed repeat family. The absence of any other large region of self-homology (direct or inverted) in the goat LCR suggests that 5'HSs 1, 2, and 3 did not arise through duplication, but rather evolved independently. By comparing goat 5'HS 1 to those of human, rabbit, and mouse, we show a greater than 80% conservation in sequence between the four species. This level of evolutionary conservation suggests that 5'HS 1 plays an important role in the regulation of beta-globin loci.
Collapse
Affiliation(s)
- Q Li
- Shanghai Institute of Biochemistry, Chinese Academy of Sciences
| | | | | | | | | |
Collapse
|