1
|
Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, Maskeri B, Hansen NF, Schwartz MS, Weber RJ, Kent WJ, Karolchik D, Bruen TC, Bevan R, Cutler DJ, Schwartz S, Elnitski L, Idol JR, Prasad AB, Lee-Lin SQ, Maduro VVB, Summers TJ, Portnoy ME, Dietrich NL, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley CP, Brooks SY, Granite S, Guan X, Gupta J, Haghighi P, Ho SL, Huang MC, Karlins E, Laric PL, Legaspi R, Lim MJ, Maduro QL, Masiello CA, Mastrian SD, McCloskey JC, Pearson R, Stantripop S, Tiongson EE, Tran JT, Tsurgeon C, Vogt JL, Walker MA, Wetherby KD, Wiggins LS, Young AC, Zhang LH, Osoegawa K, Zhu B, Zhao B, Shu CL, De Jong PJ, Lawrence CE, Smit AF, Chakravarti A, Haussler D, Green P, Miller W, Green ED. Comparative analyses of multi-species sequences from targeted genomic regions. Nature 2003; 424:788-93. [PMID: 12917688 DOI: 10.1038/nature01858] [Citation(s) in RCA: 482] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2003] [Accepted: 06/16/2003] [Indexed: 11/08/2022]
Abstract
The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.
Collapse
Affiliation(s)
- J W Thomas
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892,USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Hardison RC, Chiaromonte F, Kolbe D, Wang H, Petrykowska H, Elnitski L, Yang S, Giardine B, Zhang Y, Riemer C, Schwartz S, Haussler D, Roskin KM, Weber RJ, Diekhans M, Kent WJ, Weiss MJ, Welch J, Miller W. Global predictions and tests of erythroid regulatory regions. Cold Spring Harb Symp Quant Biol 2003; 68:335-44. [PMID: 15338635 DOI: 10.1101/sqb.2003.68.335] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Affiliation(s)
- R C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Chiaromonte F, Yang S, Elnitski L, Yap VB, Miller W, Hardison RC. Association between divergence and interspersed repeats in mammalian noncoding genomic DNA. Proc Natl Acad Sci U S A 2001; 98:14503-8. [PMID: 11717405 PMCID: PMC64711 DOI: 10.1073/pnas.251423898] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The amount of noncoding genomic DNA sequence that aligns between human and mouse varies substantially in different regions of their genomes, and the amount of repetitive DNA also varies. In this report, we show that divergence in noncoding nonrepetitive DNA is strongly correlated with the amount of repetitive DNA in a region. We investigated aligned DNA in four large genomic regions with finished human sequence and almost or completely finished mouse sequence. These regions, totaling 5.89 Mb of DNA, are on different chromosomes and vary in their base composition. An analysis based on sliding windows of 10 kb shows that the fraction of aligned noncoding nonrepetitive DNA and the fraction of repetitive DNA are negatively correlated, both at the level of an entire region and locally within it. This conclusion is strongly supported by a randomization study, in which repetitive elements are removed and randomly relocated along the sequences. Thus, regions of noncoding genomic DNA that accumulated fewer point mutations since the primate-rodent divergence also suffered fewer retrotransposition events. These results indicate that some regions of the genome are more "flexible" over the time scale of mammalian evolution, being able to accommodate many point mutations and insertions, whereas other regions are more "rigid" and accumulate fewer changes. Stronger conservation is generally interpreted as indicating more extensive or more important function. The evidence presented here of correlated variation in the rates of different evolutionary processes across noncoding DNA must be considered in assessing such conservation for evidence of selection.
Collapse
Affiliation(s)
- F Chiaromonte
- Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | |
Collapse
|
4
|
Elnitski L, Li J, Noguchi CT, Miller W, Hardison R. A negative cis-element regulates the level of enhancement by hypersensitive site 2 of the beta-globin locus control region. J Biol Chem 2001; 276:6289-98. [PMID: 11092897 DOI: 10.1074/jbc.m009624200] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The core of DNase hypersensitive site (HS) 2 from the beta-globin locus control region is a potent enhancer of globin gene expression. Although it has been considered to contain only positive cis-regulatory sequences, our study of the enhancement conferred by segments of HS2 in erythroid cells reveals a novel negative element. Individual cis-regulatory elements from HS2 such as E boxes or Maf-response elements produced as great or greater enhancement than the intact core in mouse erythroleukemia (MEL) cells, indicating the presence of negative elements within HS2. A deletion series through HS2 revealed negative elements at the 5' and 3' ends of the core. Analysis of constructs with and without the 5' negative element showed that the effect is exerted on the promoters of globin genes expressed at embryonic, fetal, or adult stages. The negative effect was observed in bipotential human cells (K562 and human erythroleukemia (HEL) cells), proerythroblastic mouse (MEL) cells, and normal adult human erythroid cells. The novel negative element also functions after stable integration into MEL chromosomes. Smaller deletions at the 5' end of the HS2 core map the negative element within a 20-base pair region containing two conserved sequences.
Collapse
Affiliation(s)
- L Elnitski
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | | | | | |
Collapse
|
5
|
Abstract
Preliminary studies of cis-regulatory elements are frequently performed in transiently transfected cells before further analysis in stably transfected cell lines and transgenic mice. However, not all cells are readily transfectable by routine means. For instance, mouse erythroleukemia (MEL) cells have been a valuable model system for studies of their endogenous globin genes, but introduction of DNA using common transfection methods such as electroporation has been very inefficient. This has allowed studies of stably transfected cells, after selection for the rare transfection events, but transient transfection analysis has been problematic. This report describes an efficient and reliable method for transient transfection of MEL cells using commercially available cationic lipids.
Collapse
Affiliation(s)
- L Elnitski
- Department of Biochemistry and Molecular Biology, The Center for Gene Regulation, The Pennsylvania State University, University Park, PA 16802, USA.
| | | |
Collapse
|
6
|
Elnitski L, Miller W, Hardison R. Conserved E boxes function as part of the enhancer in hypersensitive site 2 of the beta-globin locus control region. Role of basic helix-loop-helix proteins. J Biol Chem 1997; 272:369-78. [PMID: 8995271 DOI: 10.1074/jbc.272.1.369] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The human beta-globin gene cluster is regulated in part by a distal locus control region that is required for opening a chromatin domain in erythroid cells and enhancing expression of the beta-like globin genes at the correct developmental stages. One part of the locus control region, called hypersensitive site 2 (HS2), functions as a strong enhancer. Matches to the consensus binding sites for basic helix-loop-helix (bHLH) proteins (E boxes) are well conserved within the HS2 core. We show that mutations of the HS2 core that alter an invariant E box cause a 3.5-fold reduction in enhancement of expression of an epsilon-globin reporter gene in transiently transfected K562 cells, both before and after induction. Mutations of the HS2 core that alter a less-highly conserved E box cause a more modest reduction in enhancement. Footprint analysis shows binding of erythroid nuclear proteins in vitro to the invariant E box as well as an adjacent CAC/GTG box. Probes containing the E box regions form sequence-specific complexes with proteins from both K562 and MEL nuclear extracts; these are disrupted by the same mutations that decrease enhancement. Some of these latter complexes contain known bHLH proteins, as revealed by specific loss of individual complexes when treated with antibodies against TAL1 and USF. Interaction between the E boxes and the bHLH proteins, as well as other binding proteins, could account for the role of these sites in enhancement by HS2.
Collapse
Affiliation(s)
- L Elnitski
- Department of Biochemistry, The Pennsylvania State University, University Park 16802, USA
| | | | | |
Collapse
|