151
|
Blanchette M, Green ED, Miller W, Haussler D. Reconstructing large regions of an ancestral mammalian genome in silico. Genome Res 2004; 14:2412-23. [PMID: 15574820 PMCID: PMC534665 DOI: 10.1101/gr.2800104] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Accepted: 10/05/2004] [Indexed: 11/25/2022]
Abstract
It is believed that most modern mammalian lineages arose from a series of rapid speciation events near the Cretaceous-Tertiary boundary. It is shown that such a phylogeny makes the common ancestral genome sequence an ideal target for reconstruction. Simulations suggest that with methods currently available, we can expect to get 98% of the bases correct in reconstructing megabase-scale euchromatic regions of an eutherian ancestral genome from the genomes of approximately 20 optimally chosen modern mammals. Using actual genomic sequences from 19 extant mammals, we reconstruct 1.1 Mb of ancient genome sequence around the CFTR locus. Detailed examination suggests the reconstruction is accurate and that it allows us to identify features in modern species, such as remnants of ancient transposon insertions, that were not identified by direct analysis. Tracing the predicted evolutionary history of the bases in the reconstructed region, estimates are made of the amount of DNA turnover due to insertion, deletion, and substitution in the different placental mammalian lineages since the common eutherian ancestor, showing considerable variation between lineages. In coming years, such reconstructions may help in identifying and understanding the genetic features common to eutherian mammals and may shed light on the evolution of human or primate-specific traits.
Collapse
|
152
|
Abstract
The genomes from three mammals (human, mouse, and rat), two worms, and several yeasts have been sequenced, and more genomes will be completed in the near future for comparison with those of the major model organisms. Scientists have used various methods to align and compare the sequenced genomes to address critical issues in genome function and evolution. This review covers some of the major new insights about gene content, gene regulation, and the fraction of mammalian genomes that are under purifying selection and presumed functional. We review the evolutionary processes that shape genomes, with particular attention to variation in rates within genomes and along different lineages. Internet resources for accessing and analyzing the treasure trove of sequence alignments and annotations are reviewed, and we discuss critical problems to address in new bioinformatic developments in comparative genomics.
Collapse
|
153
|
She X, Horvath JE, Jiang Z, Liu G, Furey TS, Christ L, Clark R, Graves T, Gulden CL, Alkan C, Bailey JA, Sahinalp C, Rocchi M, Haussler D, Wilson RK, Miller W, Schwartz S, Eichler EE. The structure and evolution of centromeric transition regions within the human genome. Nature 2004; 430:857-64. [PMID: 15318213 DOI: 10.1038/nature02806] [Citation(s) in RCA: 144] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2004] [Accepted: 07/02/2004] [Indexed: 11/09/2022]
Abstract
An understanding of how centromeric transition regions are organized is a critical aspect of chromosome structure and function; however, the sequence context of these regions has been difficult to resolve on the basis of the draft genome sequence. We present a detailed analysis of the structure and assembly of all human pericentromeric regions (5 megabases). Most chromosome arms (35 out of 43) show a gradient of dwindling transcriptional diversity accompanied by an increasing number of interchromosomal duplications in proximity to the centromere. At least 30% of the centromeric transition region structure originates from euchromatic gene-containing segments of DNA that were duplicatively transposed towards pericentromeric regions at a rate of six-seven events per million years during primate evolution. This process has led to the formation of a minimum of 28 new transcripts by exon exaptation and exon shuffling, many of which are primarily expressed in the testis. The distribution of these duplicated segments is nonrandom among pericentromeric regions, suggesting that some regions have served as preferential acceptors of euchromatic DNA.
Collapse
|
154
|
Patt YZ, Liebmann J, Diamandidis D, Eckhardt SG, Javle M, Justice GR, Keiser W, Lee FC, Miller W, Lin E. Capecitabine (X) plus irinotecan (XELIRI) as first-line treatment for metastatic colorectal cancer (MCRC): Final safety findings from a phase II trial. J Clin Oncol 2004. [DOI: 10.1200/jco.2004.22.90140.3602] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
155
|
Rijnkels M, Elnitski L, Miller W, Rosen JM. Multispecies comparative analysis of a mammalian-specific genomic domain encoding secretory proteins. Genomics 2004; 82:417-32. [PMID: 13679022 DOI: 10.1016/s0888-7543(03)00114-9] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The mammalian-specific casein gene cluster comprises 3 or 4 evolutionarily related genes and 1 physically linked gene with a functional association. To gain a better understanding of the mechanisms regulating the entire casein cluster at the genomic level we initiated a multispecies comparative sequence analysis. Despite the high level of divergence at the coding level, these studies have identified uncharacterized family members within two species and the presence at orthologous positions of previously uncharacterized genes. Also the previous suggestion that the histatin/statherin gene family, located in this region, was primate specific was ruled out. All 11 genes identified in this region appear to encode secretory proteins. Conservation of a number of noncoding regions was observed; one coincides with an element previously suggested to be important for beta-casein gene expression in human and cow. The conserved regions might have biological importance for the regulation of genes in this genomic "neighborhood."
Collapse
|
156
|
Kolbe D, Taylor J, Elnitski L, Eswara P, Li J, Miller W, Hardison R, Chiaromonte F. Regulatory potential scores from genome-wide three-way alignments of human, mouse, and rat. Genome Res 2004; 14:700-7. [PMID: 15060013 PMCID: PMC383316 DOI: 10.1101/gr.1976004] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
We generalize the computation of the Regulatory Potential (RP) score from two-way alignments of human and mouse to three-way alignments of human, mouse, and rat. This requires overcoming technical challenges that arise because the complexity of the models underlying the score increases exponentially with the number of species. Despite the close evolutionary proximity of rat to mouse, we find that adding the rat sequence increases our ability to predict genomic sites that regulate gene transcription. A variant of the RP scoring scheme that accounts for local variation in neutral mutational patterns further improves our predictions.
Collapse
|
157
|
Yang S, Smit AF, Schwartz S, Chiaromonte F, Roskin KM, Haussler D, Miller W, Hardison RC. Patterns of insertions and their covariation with substitutions in the rat, mouse, and human genomes. Genome Res 2004; 14:517-27. [PMID: 15059992 PMCID: PMC383295 DOI: 10.1101/gr.1984404] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The rates at which human genomic DNA changes by neutral substitution and insertion of certain families of transposable elements covary in large, megabase-sized segments. We used the rat, mouse, and human genomic DNA sequences to examine these processes in more detail in comparisons over both shorter (rat-mouse) and longer (rodent-primate) times, and demonstrated the generality of the covariation. Different families of transposable elements show distinctive insertion preferences and patterns of variation with substitution rates. SINEs are more abundant in GC-rich DNA, but the regional GC preference for insertion (monitored in young SINEs) differs between rodents and humans. In contrast, insertions in the rodent genomes are predominantly LINEs, which prefer to insert into AT-rich DNA in all three mammals. The insertion frequency of repeats other than SINEs correlates strongly positively with the frequency of substitutions in all species. However, correlations with SINEs show the opposite effects. The correlations are explained only in part by the GC content, indicating that other factors also contribute to the inherent tendency of DNA segments to change over evolutionary time.
Collapse
|
158
|
Tufarelli C, Hardison R, Miller W, Hughes J, Clark K, Ventress N, Frischauf AM, Higgs DR. Comparative analysis of the alpha-like globin clusters in mouse, rat, and human chromosomes indicates a mechanism underlying breaks in conserved synteny. Genome Res 2004; 14:623-30. [PMID: 15060003 PMCID: PMC383306 DOI: 10.1101/gr.2143604] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have sequenced and fully annotated a 65,871-bp region of mouse Chromosome 17 including the Hba-ps4 alpha-globin pseudogene. Comparative sequence analysis with the functional alpha-globin loci at human Chromosome 16p13.3 and mouse Chromosome 11 shows that this segment of mouse Chromosome 17 contains a group of three alpha-like pseudogenes (Hba-psm-Hba-ps4-Hba-q3), similar to the duplicated sets found at the functional mouse cluster on Chromosome 11. In addition, exons 7 to 12 of the mLuc7L gene are present just downstream from the pseudogene cluster, indicating that this clone contains the region in which human 16p13.3 switches in synteny between mouse Chromosomes 11 and 17. Comparison of the sequences around the alpha-like clusters on the two mouse chromosomes reveals the presence of conserved tandem repeats. We propose that these repetitive elements have played a role in the fragmentation of the mouse alpha cluster during evolution.
Collapse
|
159
|
Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AFA, Roskin KM, Baertsch R, Rosenbloom K, Clawson H, Green ED, Haussler D, Miller W. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res 2004; 14:708-15. [PMID: 15060014 PMCID: PMC383317 DOI: 10.1101/gr.1933104] [Citation(s) in RCA: 1037] [Impact Index Per Article: 51.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We define a "threaded blockset," which is a novel generalization of the classic notion of a multiple alignment. A new computer program called TBA (for "threaded blockset aligner") builds a threaded blockset under the assumption that all matching segments occur in the same order and orientation in the given sequences; inversions and duplications are not addressed. TBA is designed to be appropriate for aligning many, but by no means all, megabase-sized regions of multiple mammalian genomes. The output of TBA can be projected onto any genome chosen as a reference, thus guaranteeing that different projections present consistent predictions of which genomic positions are orthologous. This capability is illustrated using a new visualization tool to view TBA-generated alignments of vertebrate Hox clusters from both the mammalian and fish perspectives. Experimental evaluation of alignment quality, using a program that simulates evolutionary change in genomic sequences, indicates that TBA is more accurate than earlier programs. To perform the dynamic-programming alignment step, TBA runs a stand-alone program called MULTIZ, which can be used to align highly rearranged or incompletely sequenced genomes. We describe our use of MULTIZ to produce the whole-genome multiple alignments at the Santa Cruz Genome Browser.
Collapse
|
160
|
Gibbs RA, Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, Scherer S, Scott G, Steffen D, Worley KC, Burch PE, Okwuonu G, Hines S, Lewis L, DeRamo C, Delgado O, Dugan-Rocha S, Miner G, Morgan M, Hawes A, Gill R, Celera, Holt RA, Adams MD, Amanatides PG, Baden-Tillson H, Barnstead M, Chin S, Evans CA, Ferriera S, Fosler C, Glodek A, Gu Z, Jennings D, Kraft CL, Nguyen T, Pfannkoch CM, Sitter C, Sutton GG, Venter JC, Woodage T, Smith D, Lee HM, Gustafson E, Cahill P, Kana A, Doucette-Stamm L, Weinstock K, Fechtel K, Weiss RB, Dunn DM, Green ED, Blakesley RW, Bouffard GG, De Jong PJ, Osoegawa K, Zhu B, Marra M, Schein J, Bosdet I, Fjell C, Jones S, Krzywinski M, Mathewson C, Siddiqui A, Wye N, McPherson J, Zhao S, Fraser CM, Shetty J, Shatsman S, Geer K, Chen Y, Abramzon S, Nierman WC, Havlak PH, Chen R, Durbin KJ, Simons R, Ren Y, Song XZ, Li B, Liu Y, Qin X, Cawley S, Worley KC, Cooney AJ, D'Souza LM, Martin K, Wu JQ, Gonzalez-Garay ML, Jackson AR, Kalafus KJ, McLeod MP, Milosavljevic A, Virk D, Volkov A, Wheeler DA, Zhang Z, Bailey JA, Eichler EE, Tuzun E, Birney E, Mongin E, Ureta-Vidal A, Woodwark C, Zdobnov E, Bork P, Suyama M, Torrents D, Alexandersson M, Trask BJ, Young JM, Huang H, Wang H, Xing H, Daniels S, Gietzen D, Schmidt J, Stevens K, Vitt U, Wingrove J, Camara F, Mar Albà M, Abril JF, Guigo R, Smit A, Dubchak I, Rubin EM, Couronne O, Poliakov A, Hübner N, Ganten D, Goesele C, Hummel O, Kreitler T, Lee YA, Monti J, Schulz H, Zimdahl H, Himmelbauer H, Lehrach H, Jacob HJ, Bromberg S, Gullings-Handley J, Jensen-Seaman MI, Kwitek AE, Lazar J, Pasko D, Tonellato PJ, Twigger S, Ponting CP, Duarte JM, Rice S, Goodstadt L, Beatson SA, Emes RD, Winter EE, Webber C, Brandt P, Nyakatura G, Adetobi M, Chiaromonte F, Elnitski L, Eswara P, Hardison RC, Hou M, Kolbe D, Makova K, Miller W, Nekrutenko A, Riemer C, Schwartz S, Taylor J, Yang S, Zhang Y, Lindpaintner K, Andrews TD, Caccamo M, Clamp M, Clarke L, Curwen V, Durbin R, Eyras E, Searle SM, Cooper GM, Batzoglou S, Brudno M, Sidow A, Stone EA, Venter JC, Payseur BA, Bourque G, López-Otín C, Puente XS, Chakrabarti K, Chatterji S, Dewey C, Pachter L, Bray N, Yap VB, Caspi A, Tesler G, Pevzner PA, Haussler D, Roskin KM, Baertsch R, Clawson H, Furey TS, Hinrichs AS, Karolchik D, Kent WJ, Rosenbloom KR, Trumbower H, Weirauch M, Cooper DN, Stenson PD, Ma B, Brent M, Arumugam M, Shteynberg D, Copley RR, Taylor MS, Riethman H, Mudunuri U, Peterson J, Guyer M, Felsenfeld A, Old S, Mockrin S, Collins F. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 2004; 428:493-521. [PMID: 15057822 DOI: 10.1038/nature02426] [Citation(s) in RCA: 1512] [Impact Index Per Article: 75.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2003] [Accepted: 02/20/2004] [Indexed: 01/16/2023]
Abstract
The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.
Collapse
|
161
|
Abstract
We develop techniques to estimate the statistical significance of gap-free alignments between two genomic DNA sequences, using human-mouse alignments as an example. The sequences are assumed to be sufficiently similar that some but not all of the neutrally evolving regions (i.e., those under no evolutionary constraint) can be reliably aligned. Our goal is to model the situation in which the neutral rate of evolution, and hence the extent of the aligning intervals, varies across the genome. In some cases, this permits the weaker of two matches to be judged as less likely to have arisen by chance, provided it lies in a genomic interval with a high level of background divergence. We employ a hidden Markov model to capture variations in divergence rates and assign probability values to gap-free alignments using techniques of Dembo and Karlin, which are related to those used for the same purpose by BLAST. Our methods are illustrated in detail using a 1.49 Mb genomic region. Results obtained from the analysis of human chromosome 22 using these techniques are also provided.
Collapse
|
162
|
Ovcharenko I, Loots GG, Hardison RC, Miller W, Stubbs L. zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. Genome Res 2004; 14:472-7. [PMID: 14993211 PMCID: PMC353235 DOI: 10.1101/gr.2129504] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Comparative sequence analysis has evolved as an essential technique for identifying functional coding and noncoding elements conserved throughout evolution. Here, we introduce zPicture, an interactive Web-based sequence alignment and visualization tool for dynamically generating conservation profiles and identifying evolutionarily conserved regions (ECRs). zPicture is highly flexible, because critical parameters can be modified interactively, allowing users to differentially predict ECRs in comparisons of sequences of different phylogenetic distances and evolutionary rates. We demonstrate the application of this module to identify a known regulatory element in the HOXD locus, in which functional ECRs are difficult to discern against the highly conserved genomic background. zPicture also facilitates transcription factor binding-site analysis via the rVista tool portal. We present an example of the HBB complex when zPicture/rVista combination specifically pinpoints to two ECRs containing GATA-1, NF-E2, and TAL1/E47 binding sites that were identified previously as transcriptional enhancers. In addition, zPicture is linked to the UCSC Genome Browser, allowing users to automatically extract sequences and gene annotations for any recorded locus. Finally, we describe how this tool can be efficiently applied to the analysis of nonvertebrate genomes, including those of microbial organisms.
Collapse
|
163
|
Miller W, Vingron M. Special RECOMB 2003 Issue. J Comput Biol 2004. [DOI: 10.1089/1066527041410427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
164
|
Yu CY, Chung EK, Yang Y, Blanchong CA, Jacobsen N, Saxena K, Yang Z, Miller W, Varga L, Fust G. Dancing with complement C4 and the RP-C4-CYP21-TNX (RCCX) modules of the major histocompatibility complex. ACTA ACUST UNITED AC 2004; 75:217-92. [PMID: 14604014 DOI: 10.1016/s0079-6603(03)75007-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The number of the complement component C4 genes varies from 2 to 8 in a diploid genome among different human individuals. Three quarters of the C4 genes in Caucasian populations have the endogenous retrovirus, HERV-K(C4), in the ninth intron. The remainder does not. The C4 serum proteins are highly polymorphic and their concentrations vary from 100 to approximately 1000 microg/ml. There are two distinct classes of C4 protein, C4A and C4B, which have diversified to fulfill (a) the opsonization/immunoclearance purposes and (b) the well-known complement function in the killing of microbes by lysis and neutralization, respectively. Many infectious and autoimmune diseases are associated with complete or partial deficiency of C4A and/or C4B. The adverse effects of high C4 gene dosages, however, are just emerging, as the concepts of human C4 genetics are revised and accurate techniques are applied to distinguish partial deficiencies from differential expression caused by unequal C4A and C4B gene dosages and gene sizes. This review attempts to dissect the sophisticated genetics of complement C4A and C4B. The emphases are on the qualitative and quantitative diversities of C4 genotypes and phenotypes. The many allotypic variants and the processed products of human and mouse C4 proteins are described. The modular variation of C4 genes together with the serine/threonine nuclear kinase gene RP, the steroid 21-hydroxylase CYP21, and extracellular matrix protein TNX (RCCX modules) are investigated for the effects on homogenization of C4 protein polymorphisms, and on the unequal genetic crossovers that knocked out the functions of CYP21 and/or TNX. Furthermore, the influence of the endogenous retrovirus HERV-K(C4) on C4 gene expression and the dispersal of HERV-K(C4) family members in the human genome are discussed.
Collapse
|
165
|
Patrinos GP, Giardine B, Riemer C, Miller W, Chui DHK, Anagnou NP, Wajcman H, Hardison RC. Improvements in the HbVar database of human hemoglobin variants and thalassemia mutations for population and sequence variation studies. Nucleic Acids Res 2004; 32:D537-41. [PMID: 14681476 PMCID: PMC308741 DOI: 10.1093/nar/gkh006] [Citation(s) in RCA: 250] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
HbVar (http://globin.cse.psu.edu/globin/hbvar/) is a relational database developed by a multi-center academic effort to provide up-to-date and high quality information on the genomic sequence changes leading to hemoglobin variants and all types of thalassemia and hemoglobinopathies. Extensive information is recorded for each variant and mutation, including sequence alterations, biochemical and hematological effects, associated pathology, ethnic occurrence and references. In addition to the regular updates to entries, we report two significant advances: (i) The frequencies for a large number of mutations causing beta-thalassemia in at-risk populations have been extracted from the published literature and made available for the user to query upon. (ii) HbVar has been linked with the GALA (Genome Alignment and Annotation database, available at http://globin.cse.psu.edu/gala/) so that users can combine information on hemoglobin variants and thalassemia mutations with a wide spectrum of genomic data. It also expands the capacity to view and analyze the data, using tools within GALA and the University of California at Santa Cruz (UCSC) Genome Browser.
Collapse
|
166
|
Sawatzky B, Slobogean GP, Chambers C, Miller W. 17 SHOULDER PAIN IN ADULT AND CONGENITAL SPINAL CORD INJURY. J Investig Med 2004. [DOI: 10.1136/jim-52-suppl1-17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
167
|
Chiaromonte F, Miller W, Bouhassira EE. Gene length and proximity to neighbors affect genome-wide expression levels. Genome Res 2003; 13:2602-8. [PMID: 14613975 PMCID: PMC403802 DOI: 10.1101/gr.1169203] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects. Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences. In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors. Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3'-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference. We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3'-UTRs. This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.
Collapse
|
168
|
Farhadi HF, Lepage P, Forghani R, Friedman HCH, Orfali W, Jasmin L, Miller W, Hudson TJ, Peterson AC. A combinatorial network of evolutionarily conserved myelin basic protein regulatory sequences confers distinct glial-specific phenotypes. J Neurosci 2003; 23:10214-23. [PMID: 14614079 PMCID: PMC6741010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/27/2023] Open
Abstract
Myelin basic protein (MBP) is required for normal myelin compaction and is implicated in both experimental and human demyelinating diseases. In this study, as an initial step in defining the regulatory network controlling MBP transcription, we located and characterized the function of evolutionarily conserved regulatory sequences. Long-range human-mouse sequence comparison revealed over 1 kb of conserved noncoding MBP 5' flanking sequence distributed into four widely spaced modules ranging from 0.1 to 0.4 kb. We demonstrate first that a controlled strategy of transgenesis provides an effective means to assign and compare qualitative and quantitative in vivo regulatory programs. Using this strategy, single-copy reporter constructs, designed to evaluate the regulatory significance of modular and intermodular sequences, were introduced by homologous recombination into the mouse hprt (hypoxanthine-guanine phosphoribosyltransferase) locus. The proximal modules M1 and M2 confer comparatively low-level oligodendrocyte expression primarily limited to early postnatal development, whereas the upstream M3 confers high-level oligodendrocyte expression extending throughout maturity. Furthermore, constructs devoid of M3 fail to target expression to newly myelinating oligodendrocytes in the mature CNS. Mutation of putative Nkx6.2/Gtx sites within M3, although not eliminating oligodendrocyte targeting, significantly decreases transgene expression levels. High-level and continuous expression is conferred to myelinating or remyelinating Schwann cells by M4. In addition, when isolated from surrounding MBP sequences, M3 confers transient expression to Schwann cells elaborating myelin. These observations define the in vivo regulatory roles played by conserved noncoding MBP sequences and lead to a combinatorial model in which different regulatory modules are engaged during primary myelination, myelin maintenance, and remyelination.
Collapse
|
169
|
Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D. Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci U S A 2003; 100:11484-9. [PMID: 14500911 PMCID: PMC208784 DOI: 10.1073/pnas.1932072100] [Citation(s) in RCA: 593] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2003] [Indexed: 11/18/2022] Open
Abstract
This study examines genomic duplications, deletions, and rearrangements that have happened at scales ranging from a single base to complete chromosomes by comparing the mouse and human genomes. From whole-genome sequence alignments, 344 large (>100-kb) blocks of conserved synteny are evident, but these are further fragmented by smaller-scale evolutionary events. Excluding transposon insertions, on average in each megabase of genomic alignment we observe two inversions, 17 duplications (five tandem or nearly tandem), seven transpositions, and 200 deletions of 100 bases or more. This includes 160 inversions and 75 duplications or transpositions of length >100 kb. The frequencies of these smaller events are not substantially higher in finished portions in the assembly. Many of the smaller transpositions are processed pseudogenes; we define a "syntenic" subset of the alignments that excludes these and other small-scale transpositions. These alignments provide evidence that approximately 2% of the genes in the human/mouse common ancestor have been deleted or partially deleted in the mouse. There also appears to be slightly less nontransposon-induced genome duplication in the mouse than in the human lineage. Although some of the events we detect are possibly due to misassemblies or missing data in the current genome sequence or to the limitations of our methods, most are likely to represent genuine evolutionary events. To make these observations, we developed new alignment techniques that can handle large gaps in a robust fashion and discriminate between orthologous and paralogous alignments.
Collapse
|
170
|
Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, Maskeri B, Hansen NF, Schwartz MS, Weber RJ, Kent WJ, Karolchik D, Bruen TC, Bevan R, Cutler DJ, Schwartz S, Elnitski L, Idol JR, Prasad AB, Lee-Lin SQ, Maduro VVB, Summers TJ, Portnoy ME, Dietrich NL, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley CP, Brooks SY, Granite S, Guan X, Gupta J, Haghighi P, Ho SL, Huang MC, Karlins E, Laric PL, Legaspi R, Lim MJ, Maduro QL, Masiello CA, Mastrian SD, McCloskey JC, Pearson R, Stantripop S, Tiongson EE, Tran JT, Tsurgeon C, Vogt JL, Walker MA, Wetherby KD, Wiggins LS, Young AC, Zhang LH, Osoegawa K, Zhu B, Zhao B, Shu CL, De Jong PJ, Lawrence CE, Smit AF, Chakravarti A, Haussler D, Green P, Miller W, Green ED. Comparative analyses of multi-species sequences from targeted genomic regions. Nature 2003; 424:788-93. [PMID: 12917688 DOI: 10.1038/nature01858] [Citation(s) in RCA: 482] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2003] [Accepted: 06/16/2003] [Indexed: 11/08/2022]
Abstract
The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.
Collapse
|
171
|
Schwartz S, Elnitski L, Li M, Weirauch M, Riemer C, Smit A, Green ED, Hardison RC, Miller W. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res 2003; 31:3518-24. [PMID: 12824357 PMCID: PMC168985 DOI: 10.1093/nar/gkg579] [Citation(s) in RCA: 169] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Analysis of multiple sequence alignments can generate important, testable hypotheses about the phylogenetic history and cellular function of genomic sequences. We describe the MultiPipMaker server, which aligns multiple, long genomic DNA sequences quickly and with good sensitivity (available at http://bio.cse.psu.edu/ since May 2001). Alignments are computed between a contiguous reference sequence and one or more secondary sequences, which can be finished or draft sequence. The outputs include a stacked set of percent identity plots, called a MultiPip, comparing the reference sequence with subsequent sequences, and a nucleotide-level multiple alignment. New tools are provided to search MultiPipMaker output for conserved matches to a user-specified pattern and for conserved matches to position weight matrices that describe transcription factor binding sites (singly and in clusters). We illustrate the use of MultiPipMaker to identify candidate regulatory regions in WNT2 and then demonstrate by transfection assays that they are functional. Analysis of the alignments also confirms the phylogenetic inference that horses are more closely related to cats than to cows.
Collapse
|
172
|
Florea L, McClelland M, Riemer C, Schwartz S, Miller W. EnteriX 2003: Visualization tools for genome alignments of Enterobacteriaceae. Nucleic Acids Res 2003; 31:3527-32. [PMID: 12824359 PMCID: PMC168958 DOI: 10.1093/nar/gkg551] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We describe EnteriX, a suite of three web-based visualization tools for graphically portraying alignment information from comparisons among several fixed and user-supplied sequences from related enterobacterial species, anchored on a reference genome (http://bio.cse.psu.edu/). The first visualization, Enteric, displays stacked pairwise alignments between a reference genome and each of the related bacteria, represented schematically as PIPs (Percent Identity Plots). Encoded in the views are large-scale genomic rearrangement events and functional landmarks. The second visualization, Menteric, computes and displays 1 Kb views of nucleotide-level multiple alignments of the sequences, together with annotations of genes, regulatory sites and conserved regions. The third, a Java-based tool named Maj, displays alignment information in two formats, corresponding roughly to the Enteric and Menteric views, and adds zoom-in capabilities. The uses of such tools are diverse, from examining the multiple sequence alignment to infer conserved sites with potential regulatory roles, to scrutinizing the commonalities and differences between the genomes for pathogenicity or phylogenetic studies. The EnteriX suite currently includes >15 enterobacterial genomes, generates views centered on four different anchor genomes and provides support for including user sequences in the alignments.
Collapse
|
173
|
Giardine B, Elnitski L, Riemer C, Makalowska I, Schwartz S, Miller W, Hardison RC. GALA, a database for genomic sequence alignments and annotations. Genome Res 2003; 13:732-41. [PMID: 12671007 PMCID: PMC430176 DOI: 10.1101/gr.603103] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2002] [Accepted: 01/24/2003] [Indexed: 01/26/2023]
Abstract
We have developed a relational database to contain whole genome sequence alignments between human and mouse with extensive annotations of the human sequence. Complex queries are supported on recorded features, both directly and on proximity among them. Searches can reveal a wide variety of relationships, such as finding all genes expressed in a designated tissue that have a highly conserved noncoding sequence 5' to the start site. Other examples are finding single nucleotide polymorphisms that occur in conserved noncoding regions upstream of genes and identifying CpG islands that overlap the 5' ends of divergently transcribed genes. The database is available online at http://globin.cse.psu.edu/ and http://bio.cse.psu.edu/.
Collapse
|
174
|
Green P, Ewing B, Miller W, Thomas PJ, Green ED. Transcription-associated mutational asymmetry in mammalian evolution. Nat Genet 2003; 33:514-7. [PMID: 12612582 DOI: 10.1038/ng1103] [Citation(s) in RCA: 202] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2002] [Accepted: 01/19/2003] [Indexed: 11/09/2022]
Abstract
Although mutation is commonly thought of as a random process, evolutionary studies show that different types of nucleotide substitution occur with widely varying rates that presumably reflect biases intrinsic to mutation and repair mechanisms. A strand asymmetry, the occurrence of particular substitution types at higher rates than their complementary types, that is associated with DNA replication has been found in bacteria and mitochondria. A strand asymmetry that is associated with transcription and attributable to higher rates of cytosine deamination on the coding strand has been observed in enterobacteria. Here, we describe a qualitatively different transcription-associated strand asymmetry in mammals, which may be a byproduct of transcription-coupled repair in germline cells. This mutational asymmetry has acted over long periods of time to produce a compositional asymmetry, an excess of G+T over A+C on the coding strand, in most genes. The mutational and compositional asymmetries can be used to detect the orientations and approximate extents of transcribed regions.
Collapse
|
175
|
Hardison RC, Chiaromonte F, Kolbe D, Wang H, Petrykowska H, Elnitski L, Yang S, Giardine B, Zhang Y, Riemer C, Schwartz S, Haussler D, Roskin KM, Weber RJ, Diekhans M, Kent WJ, Weiss MJ, Welch J, Miller W. Global predictions and tests of erythroid regulatory regions. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2003; 68:335-44. [PMID: 15338635 DOI: 10.1101/sqb.2003.68.335] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
|
176
|
Hardison RC, Roskin KM, Yang S, Diekhans M, Kent WJ, Weber R, Elnitski L, Li J, O'Connor M, Kolbe D, Schwartz S, Furey TS, Whelan S, Goldman N, Smit A, Miller W, Chiaromonte F, Haussler D. Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res 2003; 13:13-26. [PMID: 12529302 PMCID: PMC430971 DOI: 10.1101/gr.844103] [Citation(s) in RCA: 239] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2002] [Accepted: 11/14/2002] [Indexed: 11/24/2022]
Abstract
Six measures of evolutionary change in the human genome were studied, three derived from the aligned human and mouse genomes in conjunction with the Mouse Genome Sequencing Consortium, consisting of (1) nucleotide substitution per fourfold degenerate site in coding regions, (2) nucleotide substitution per site in relics of transposable elements active only before the human-mouse speciation, and (3) the nonaligning fraction of human DNA that is nonrepetitive or in ancestral repeats; and three derived from human genome data alone, consisting of (4) SNP density, (5) frequency of insertion of transposable elements, and (6) rate of recombination. Features 1 and 2 are measures of nucleotide substitutions at two classes of "neutral" sites, whereas 4 is a measure of recent mutations. Feature 3 is a measure dominated by deletions in mouse, whereas 5 represents insertions in human. It was found that all six vary significantly in megabase-sized regions genome-wide, and many vary together. This indicates that some regions of a genome change slowly by all processes that alter DNA, and others change faster. Regional variation in all processes is correlated with, but not completely accounted for, by GC content in human and the difference between GC content in human and mouse.
Collapse
|
177
|
Elnitski L, Hardison RC, Li J, Yang S, Kolbe D, Eswara P, O'Connor MJ, Schwartz S, Miller W, Chiaromonte F. Distinguishing regulatory DNA from neutral sites. Genome Res 2003; 13:64-72. [PMID: 12529307 PMCID: PMC430974 DOI: 10.1101/gr.817703] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
We explore several computational approaches to analyzing interspecies genomic sequence alignments, aiming to distinguish regulatory regions from neutrally evolving DNA. Human-mouse genomic alignments were collected for three sets of human regions: (1) experimentally defined gene regulatory regions, (2) well-characterized exons (coding sequences, as a positive control), and (3) interspersed repeats thought to have inserted before the human-mouse split (a good model for neutrally evolving DNA). Models that potentially could distinguish functional noncoding sequences from neutral DNA were evaluated on these three data sets, as well as bulk genome alignments. Our analyses show that discrimination based on frequencies of individual nucleotide pairs or gaps (i.e., of possible alignment columns) is only partially successful. In contrast, scoring procedures that include the alignment context, based on frequencies of short runs of alignment columns, dramatically improve separation between regulatory and neutral features. Such scoring functions should aid in the identification of putative regulatory regions throughout the human genome.
Collapse
|
178
|
Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W. Human-mouse alignments with BLASTZ. Genome Res 2003; 13:103-7. [PMID: 12529312 PMCID: PMC430961 DOI: 10.1101/gr.809403] [Citation(s) in RCA: 851] [Impact Index Per Article: 40.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Mouse Genome Analysis Consortium aligned the human and mouse genome sequences for a variety of purposes, using alignment programs that suited the various needs. For investigating issues regarding genome evolution, a particularly sensitive method was needed to permit alignment of a large proportion of the neutrally evolving regions. We selected a program called BLASTZ, an independent implementation of the Gapped BLAST algorithm specifically designed for aligning two long genomic sequences. BLASTZ was subsequently modified, both to attain efficiency adequate for aligning entire mammalian genomes and to increase its sensitivity. This work describes BLASTZ, its modifications, the hardware environment on which we run it, and several empirical studies to validate its results.
Collapse
|
179
|
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Carninci P, Cawley S, Chiaromonte F, Chinwalla AT, Church DM, Clamp M, Clee C, Collins FS, Cook LL, Copley RR, Coulson A, Couronne O, Cuff J, Curwen V, Cutts T, Daly M, David R, Davies J, Delehaunty KD, Deri J, Dermitzakis ET, Dewey C, Dickens NJ, Diekhans M, Dodge S, Dubchak I, Dunn DM, Eddy SR, Elnitski L, Emes RD, Eswara P, Eyras E, Felsenfeld A, Fewell GA, Flicek P, Foley K, Frankel WN, Fulton LA, Fulton RS, Furey TS, Gage D, Gibbs RA, Glusman G, Gnerre S, Goldman N, Goodstadt L, Grafham D, Graves TA, Green ED, Gregory S, Guigó R, Guyer M, Hardison RC, Haussler D, Hayashizaki Y, Hillier LW, Hinrichs A, Hlavina W, Holzer T, Hsu F, Hua A, Hubbard T, Hunt A, Jackson I, Jaffe DB, Johnson LS, Jones M, Jones TA, Joy A, Kamal M, Karlsson EK, Karolchik D, Kasprzyk A, Kawai J, Keibler E, Kells C, Kent WJ, Kirby A, Kolbe DL, Korf I, Kucherlapati RS, Kulbokas EJ, Kulp D, Landers T, Leger JP, Leonard S, Letunic I, Levine R, Li J, Li M, Lloyd C, Lucas S, Ma B, Maglott DR, Mardis ER, Matthews L, Mauceli E, Mayer JH, McCarthy M, McCombie WR, McLaren S, McLay K, McPherson JD, Meldrim J, Meredith B, Mesirov JP, Miller W, Miner TL, Mongin E, Montgomery KT, Morgan M, Mott R, Mullikin JC, Muzny DM, Nash WE, Nelson JO, Nhan MN, Nicol R, Ning Z, Nusbaum C, O'Connor MJ, Okazaki Y, Oliver K, Overton-Larty E, Pachter L, Parra G, Pepin KH, Peterson J, Pevzner P, Plumb R, Pohl CS, Poliakov A, Ponce TC, Ponting CP, Potter S, Quail M, Reymond A, Roe BA, Roskin KM, Rubin EM, Rust AG, Santos R, Sapojnikov V, Schultz B, Schultz J, Schwartz MS, Schwartz S, Scott C, Seaman S, Searle S, Sharpe T, Sheridan A, Shownkeen R, Sims S, Singer JB, Slater G, Smit A, Smith DR, Spencer B, Stabenau A, Stange-Thomann N, Sugnet C, Suyama M, Tesler G, Thompson J, Torrents D, Trevaskis E, Tromp J, Ucla C, Ureta-Vidal A, Vinson JP, Von Niederhausern AC, Wade CM, Wall M, Weber RJ, Weiss RB, Wendl MC, West AP, Wetterstrand K, Wheeler R, Whelan S, Wierzbowski J, Willey D, Williams S, Wilson RK, Winter E, Worley KC, Wyman D, Yang S, Yang SP, Zdobnov EM, Zody MC, Lander ES. Initial sequencing and comparative analysis of the mouse genome. Nature 2002; 420:520-62. [PMID: 12466850 DOI: 10.1038/nature01262] [Citation(s) in RCA: 4802] [Impact Index Per Article: 218.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2002] [Accepted: 10/31/2002] [Indexed: 12/18/2022]
Abstract
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
Collapse
MESH Headings
- Animals
- Base Composition
- Chromosomes, Mammalian/genetics
- Conserved Sequence/genetics
- CpG Islands/genetics
- Evolution, Molecular
- Gene Expression Regulation
- Genes/genetics
- Genetic Variation/genetics
- Genome
- Genome, Human
- Genomics
- Humans
- Mice/classification
- Mice/genetics
- Mice, Knockout
- Mice, Transgenic
- Models, Animal
- Multigene Family/genetics
- Mutagenesis
- Neoplasms/genetics
- Physical Chromosome Mapping
- Proteome/genetics
- Pseudogenes/genetics
- Quantitative Trait Loci/genetics
- RNA, Untranslated/genetics
- Repetitive Sequences, Nucleic Acid/genetics
- Selection, Genetic
- Sequence Analysis, DNA
- Sex Chromosomes/genetics
- Species Specificity
- Synteny
Collapse
|
180
|
Burns LJ, Miller W, Kandaswamy C, DeFor TE, MacMillan ML, Van Burik JA, Weisdorf DJ. Randomized clinical trial of ganciclovir vs acyclovir for prevention of cytomegalovirus antigenemia after allogeneic transplantation. Bone Marrow Transplant 2002; 30:945-51. [PMID: 12476289 DOI: 10.1038/sj.bmt.1703770] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2002] [Accepted: 09/16/2002] [Indexed: 11/08/2022]
Abstract
Cytomegalovirus (CMV) disease remains a major cause of morbidity following allogeneic stem cell transplantation (SCT). In a prospective randomized trial, we tested prophylactic therapy with ganciclovir or acyclovir for patients at high risk of disease. Ninety-one CMV seropositive recipients of related (n = 53) and unrelated (n = 38) donor transplants were enrolled. All patients received intravenous (i.v.) ganciclovir 5 mg/kg every 12 h days -7 to -2, followed by acyclovir 10 mg/kg i.v. every 8 h from day -1 until neutrophil engraftment. Patients were then randomly assigned to either ganciclovir (n = 45) or acyclovir (n = 46) until day 100 post transplant. Any degree of antigenemia was treated with ganciclovir 5 mg/kg i.v. twice a day for 2 weeks, followed by 5 mg/kg i.v. each weekday for 6 weeks. At day 100, the cumulative incidence of antigenemia was 31% (95% CI 17-45%) for ganciclovir and 41% (95% CI 26-56%) (P = 0.22) for acyclovir prophylaxis, respectively. The assigned prophylaxis cohort did not predict for CMV antigenemia. The cumulative incidence of CMV disease at 12 months was 13% (95% CI 3-23%) and 17% (95% CI 6-28%) (P = 0.59) for the ganciclovir- and acyclovir-treated groups, respectively. An absolute neutrophil count (ANC) <or=1500 x 10(6)/l at randomization (P < 0.01) and grade II-IV acute graft-versus-host-disease (P = 0.01), but not the assigned prophylaxis cohort (P = 0.62), were independent risk factors for CMV disease. The incidence of fungal infections and renal insufficiency was similar across treatment groups; however, bacterial infections and secondary neutropenia occurred more frequently in the ganciclovir group. With our study powered to detect a 60% reduction in antigenemia with ganciclovir prophylaxis, we did not find a statistically significant difference between ganciclovir and acyclovir when used as part of an overall strategy for prevention of CMV antigenemia and disease in SCT, although fewer side-effects occurred with acyclovir treatment.
Collapse
|
181
|
Elnitski L, Riemer C, Petrykowska H, Florea L, Schwartz S, Miller W, Hardison R. PipTools: a computational toolkit to annotate and analyze pairwise comparisons of genomic sequences. Genomics 2002; 80:681-90. [PMID: 12504859 DOI: 10.1006/geno.2002.7018] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Sequence conservation between species is useful both for locating coding regions of genes and for identifying functional noncoding segments. Hence interspecies alignment of genomic sequences is an important computational technique. However, its utility is limited without extensive annotation. We describe a suite of software tools, PipTools, and related programs that facilitate the annotation of genes and putative regulatory elements in pairwise alignments. The alignment server PipMaker uses the output of these tools to display detailed information needed to interpret alignments. These programs are provided in a portable format for use on common desktop computers and both the toolkit and the PipMaker server can be found at our Web site (http://bio.cse.psu.edu/). We illustrate the utility of the toolkit using annotation of a pairwise comparison of the mouse MHC class II and class III regions with orthologous human sequences and subsequently identify conserved, noncoding sequences that are DNase I hypersensitive sites in chromatin of mouse cells.
Collapse
|
182
|
Maul JE, Lilly JW, Cui L, dePamphilis CW, Miller W, Harris EH, Stern DB. The Chlamydomonas reinhardtii plastid chromosome: islands of genes in a sea of repeats. THE PLANT CELL 2002; 14:2659-79. [PMID: 12417694 PMCID: PMC153795 DOI: 10.1105/tpc.006155] [Citation(s) in RCA: 280] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2002] [Accepted: 09/10/2002] [Indexed: 05/17/2023]
Abstract
Chlamydomonas reinhardtii is a unicellular eukaryotic alga possessing a single chloroplast that is widely used as a model system for the study of photosynthetic processes. This report analyzes the surprising structural and evolutionary features of the completely sequenced 203,395-bp plastid chromosome. The genome is divided by 21.2-kb inverted repeats into two single-copy regions of approximately 80 kb and contains only 99 genes, including a full complement of tRNAs and atypical genes encoding the RNA polymerase. A remarkable feature is that >20% of the genome is repetitive DNA: the majority of intergenic regions consist of numerous classes of short dispersed repeats (SDRs), which may have structural or evolutionary significance. Among other sequenced chlorophyte plastid genomes, only that of the green alga Chlorella vulgaris appears to share this feature. The program MultiPipMaker was used to compare the genic complement of Chlamydomonas with those of other chloroplast genomes and to scan the genomes for sequence similarities and repetitive DNAs. Among the results was evidence that the SDRs were not derived from extant coding sequences, although some SDRs may have arisen from other genomic fragments. Phylogenetic reconstruction of changes in plastid genome content revealed that an accelerated rate of gene loss also characterized the Chlamydomonas/Chlorella lineage, a phenomenon that might be independent of the proliferation of SDRs. Together, our results reveal a dynamic and unusual plastid genome whose existence in a model organism will allow its features to be tested functionally.
Collapse
|
183
|
Chiaromonte F, Yap VB, Miller W. Scoring pairwise genomic sequence alignments. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2002:115-26. [PMID: 11928468 DOI: 10.1142/9789812799623_0012] [Citation(s) in RCA: 102] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The parameters by which alignments are scored can strongly affect sensitivity and specificity of alignment procedures. While appropriate parameter choices are well understood for protein alignments, much less is known for genomic DNA sequences. We describe a straightforward approach to scoring nucleotide substitutions in genomic sequence alignments, especially human-mouse comparisons. Scores are obtained from relative frequencies of aligned nucleotides observed in alignments of non-coding, non-repetitive genomic regions, and can be theoretically motivated through substitution models. Additional accuracy can be attained by down-weighting alignments characterized by low compositional complexity. We also describe an evaluation protocol that is relevant when alignments are intended to identify all and only the orthologous positions. One particular scoring matrix, called HOXD70, has proven to be generally effective for human-mouse comparisons, and has been used by the PipMaker server since July, 2000. We discuss but leave open the problem of effectively scoring regions of strongly biased nucleotide composition, such as low G + C content.
Collapse
|
184
|
Macomson SD, Brophy CM, Miller W, Harris VA, Shaver EG. Heat shock protein expression in cerebral vessels after subarachnoid hemorrhage. Neurosurgery 2002; 51:204-10; discussion 210-1. [PMID: 12182419 DOI: 10.1097/00006123-200207000-00029] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
OBJECTIVE The mechanisms of cerebral vasospasm after subarachnoid hemorrhage (SAH) remain controversial. Recent data have implicated two small heat shock proteins (HSPs), namely HSP20 and HSP27, in the regulation of vascular tone. Increases in the phosphorylation of HSP20 are associated with vasorelaxation, and increases in the phosphorylation of HSP27 are associated with impaired vasorelaxation. Therefore, we hypothesized that alterations in the expression and/or phosphorylation of these two small HSPs might play a role in cerebral vasospasm after SAH. METHODS A rat model of endovascular perforation was used to induce SAH. Middle cerebral arteries were harvested from control animals, sham-treated animals, and animals with SAH, 48 hours after SAH induction. Dose-response curves for endothelium-independent (sodium nitroprusside, 10(-8) to 10(-4) mol/L) and endothelium-dependent (bradykinin, 10(-10) to 10(-5) mol/L) relaxing agents were recorded ex vivo. Physiological responses were correlated with the expression and phosphorylation of HSP20 and HSP27 by using one- and two-dimensional immunoblots. RESULTS There was impaired endothelium-independent and endothelium-dependent relaxation in cerebral vessels after SAH. These changes were associated with decreased expression of both total and phosphorylated HSP20 and increases in the amount of phosphorylated HSP27. CONCLUSION In this model, impaired relaxation of cerebral vessels after SAH was associated with increases in the amount of phosphorylated HSP27 and decreases in the expression and phosphorylation of HSP20. These data are consistent with alterations in the expression and phosphorylation of these small HSPs in other models of vasospasm.
Collapse
|
185
|
Agron PG, Macht M, Radnedge L, Skowronski EW, Miller W, Andersen GL. Use of subtractive hybridization for comprehensive surveys of prokaryotic genome differences. FEMS Microbiol Lett 2002; 211:175-82. [PMID: 12076809 DOI: 10.1111/j.1574-6968.2002.tb11221.x] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Comparative bacterial genomics shows that even different isolates of the same bacterial species can vary significantly in gene content. An effective means to survey differences across whole genomes would be highly advantageous for understanding this variation. Here we show that suppression subtractive hybridization (SSH) provides high, representative coverage of regions that differ between similar genomes. Using Helicobacter pylori strains 26695 and J99 as a model, SSH identified approximately 95% of the unique open reading frames in each strain, showing that the approach is effective. Furthermore, combining data from parallel SSH experiments using different restriction enzymes significantly increased coverage compared to using a single enzyme. These results suggest a powerful approach for assessing genome differences among closely related strains when one member of the group has been completely sequenced.
Collapse
|
186
|
Hardison RC, Chui DHK, Giardine B, Riemer C, Patrinos GP, Anagnou N, Miller W, Wajcman H. HbVar: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server. Hum Mutat 2002; 19:225-33. [PMID: 11857738 DOI: 10.1002/humu.10044] [Citation(s) in RCA: 282] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We have constructed a relational database of hemoglobin variants and thalassemia mutations, called HbVar, which can be accessed on the web at http://globin.cse.psu.edu. Extensive information is recorded for each variant and mutation, including a description of the variant and associated pathology, hematology, electrophoretic mobility, methods of isolation, stability information, ethnic occurrence, structure studies, functional studies, and references. The initial information was derived from books by Dr. Titus Huisman and colleagues [Huisman et al., 1996, 1997, 1998]. The current database is updated regularly with the addition of new data and corrections to previous data. Queries can be formulated based on fields in the database. Tables of common categories of variants, such as all those involving the alpha1-globin gene (HBA1) or all those that result in high oxygen affinity, are maintained by automated queries on the database. Users can formulate more precise queries, such as identifying "all beta-globin variants associated with instability and found in Scottish populations." This new database should be useful for clinical diagnosis as well as in fundamental studies of hemoglobin biochemistry, globin gene regulation, and human sequence variation at these loci.
Collapse
|
187
|
Kurihara LJ, Semenova E, Miller W, Ingram RS, Guan XJ, Tilghman SM. Candidate genes required for embryonic development: a comparative analysis of distal mouse chromosome 14 and human chromosome 13q22. Genomics 2002; 79:154-61. [PMID: 11829485 DOI: 10.1006/geno.2002.6692] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Mice homozygous for the Ednrb(s-1Acrg) deletion arrest at embryonic day 8.5 from defects associated with mesoderm development. To determine the molecular basis of this phenotype, we initiated a positional cloning of the Acrg minimal region. This region was predicted to be gene-poor by several criteria. From comparative analysis with the syntenic human locus at 13q22 and gene prediction program analysis, we found a single cluster of four genes within the 1.4-to 2-Mb contig over the Acrg minimal region that is flanked by a gene desert. We also found 130 highly conserved nonexonic sequences that were distributed over the gene cluster and desert. The four genes encode the TBC (Tre-2, BUB2, CDC16) domain-containing protein KIAA0603, the ubiquitin carboxy-terminal hydrolase L3 (UCHL3), the F-box/PDZ/LIM domain protein LMO7,and a novel gene. On the basis of their expression profile during development, all four genes are candidates for the Ednrb(s-1Acrg) embryonic lethality. Because we determined that a mutant of Uchl3 was viable, three candidate genes remain within the region.
Collapse
|
188
|
Molete JM, Petrykowska H, Sigg M, Miller W, Hardison R. Functional and binding studies of HS3.2 of the beta-globin locus control region. Gene 2002; 283:185-97. [PMID: 11867225 DOI: 10.1016/s0378-1119(01)00858-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The distal locus control region (LCR) is required for high-level expression of the complex of genes (HBBC) encoding the beta-like globins of mammals in erythroid cells. Several major DNase hypersensitive sites (HSs 1-5) mark the LCR. Sequence conservation and direct experimental evidence have implicated sequences within and between the HS cores in function of the LCR. In this report we confirm the mapping of a minor HS between HS3 and HS4, called HS3.2, and show that sequences including it increase the number of random integration sites at which a drug resistance gene is expressed. We also show that nuclear proteins including GATA1 and Oct1 bind specifically to sequences within HS3.2. However, the protein Pbx1, whose binding site is the best match to one highly conserved sequence, does not bind strongly. GATA1 and Oct1 also bind in the HS cores of the LCR and to promoters in HBBC. Their binding to this minor HS suggests that they may be used in assembly of a large complex containing multiple regulatory sequences.
Collapse
|
189
|
Sumiyama K, Irvine SQ, Stock DW, Weiss KM, Kawasaki K, Shimizu N, Shashikant CS, Miller W, Ruddle FH. Genomic structure and functional control of the Dlx3-7 bigene cluster. Proc Natl Acad Sci U S A 2002; 99:780-5. [PMID: 11792834 PMCID: PMC117382 DOI: 10.1073/pnas.012584999] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The Dlx genes are involved in early vertebrate morphogenesis, notably of the head. The six Dlx genes of mammals are arranged in three convergently transcribed bigene clusters. In this study, we examine the regulation of the Dlx3-7 cluster of the mouse. We obtained and sequenced human and mouse P1 clones covering the entire Dlx3-7 cluster. Comparative analysis of the human and mouse sequences revealed several highly conserved noncoding regions within 30 kb of the Dlx3-7-coding regions. These conserved elements were located both 5' of the coding exons of each gene and in the intergenic region 3' of the exons, suggesting that some enhancers might be shared between genes. We also found that the protein sequence of Dlx7 is evolving more rapidly than that of Dlx3. We conducted a functional study of the 79-kb mouse genomic clone to locate cis-element activity able to reproduce the endogenous expression pattern by using transgenic mice. We inserted a lacZ reporter gene into the first exon of the Dlx3 gene by using homologous recombination in yeast. Strong lacZ expression in embryonic (E) stage E9.5 and E10.5 mouse embryos was found in the limb buds and first and second visceral arches, consistent with the endogenous Dlx3 expression pattern. This result shows that the 79-kb region contains the major cis-elements required to direct the endogenous expression of Dlx3 at stage E10.5. To test for enhancer location, we divided the construct in the mid-intergenic region and injected the Dlx3 gene portion. This shortened fragment lacking Dlx7-flanking sequences is able to drive expression in the limb buds but not in the visceral arches. This observation is consistent with a cis-regulatory enhancer-sharing model within the Dlx bigene cluster.
Collapse
|
190
|
Beavers MP, Gunnet JW, Hageman W, Miller W, Moore JB, Zhou L, Chen RH, Xiang A, Urbanski M, Combs DW, Mayo KH, Demarest KT. Discovery of the first non-peptide antagonist of the motilin receptor. DRUG DESIGN AND DISCOVERY 2002; 17:243-51. [PMID: 11469754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/20/2023]
Abstract
A first-in-class non-peptide antagonist of the motilin receptor was identified through electronic screening of our corporate database against a 3D pharmacophore. The pharmacophore was developed from the motilin 22 residue endogenous peptide using NMR structural data, principles of peptide folding, and peptide structure activity relationships. The NMR data supported helical content within the peptide, and both the hydrophobic staple and N-capping box motifs were identified in the motilin sequence. The conformational features of these motifs were imposed on the peptide structure, providing a constrained conformer as a starting point for database searching. A trisubstituted cyclopentene lead was identified directly from the electronic search. Compounds in this series inhibit the binding of 125I-motilin to human antral smooth muscle membrane and antagonize motilin-induced intracellular calcium mobilization in cells expressing the human motilin receptor. A potent compound developed through optimization, RWJ 68023, is active in binding and cell-based functional assays and is also effective in inhibiting motilin-induced contractility in segments of rabbit duodenum. This orally active compound is currently undergoing clinical evaluation for the treatment of gastrointestinal disorders associated with altered motility.
Collapse
|
191
|
DeSilva U, Elnitski L, Idol JR, Doyle JL, Gan W, Thomas JW, Schwartz S, Dietrich NL, Beckstrom-Sternberg SM, McDowell JC, Blakesley RW, Bouffard GG, Thomas PJ, Touchman JW, Miller W, Green ED. Generation and comparative analysis of approximately 3.3 Mb of mouse genomic sequence orthologous to the region of human chromosome 7q11.23 implicated in Williams syndrome. Genome Res 2002; 12:3-15. [PMID: 11779826 PMCID: PMC155257 DOI: 10.1101/gr.214802] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Williams syndrome is a complex developmental disorder that results from the heterozygous deletion of a approximately 1.6-Mb segment of human chromosome 7q11.23. These deletions are mediated by large (approximately 300 kb) duplicated blocks of DNA of near-identical sequence. Previously, we showed that the orthologous region of the mouse genome is devoid of such duplicated segments. Here, we extend our studies to include the generation of approximately 3.3 Mb of genomic sequence from the mouse Williams syndrome region, of which just over 1.4 Mb is finished to high accuracy. Comparative analyses of the mouse and human sequences within and immediately flanking the interval commonly deleted in Williams syndrome have facilitated the identification of nine previously unreported genes, provided detailed sequence-based information regarding 30 genes residing in the region, and revealed a number of potentially interesting conserved noncoding sequences. Finally, to facilitate comparative sequence analysis, we implemented several enhancements to the program, including the addition of links from annotated features within a generated percent-identity plot to specific records in public databases. Taken together, the results reported here provide an important comparative sequence resource that should catalyze additional studies of Williams syndrome, including those that aim to characterize genes within the commonly deleted interval and to develop mouse models of the disorder.
Collapse
|
192
|
Chiaromonte F, Yang S, Elnitski L, Yap VB, Miller W, Hardison RC. Association between divergence and interspersed repeats in mammalian noncoding genomic DNA. Proc Natl Acad Sci U S A 2001; 98:14503-8. [PMID: 11717405 PMCID: PMC64711 DOI: 10.1073/pnas.251423898] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The amount of noncoding genomic DNA sequence that aligns between human and mouse varies substantially in different regions of their genomes, and the amount of repetitive DNA also varies. In this report, we show that divergence in noncoding nonrepetitive DNA is strongly correlated with the amount of repetitive DNA in a region. We investigated aligned DNA in four large genomic regions with finished human sequence and almost or completely finished mouse sequence. These regions, totaling 5.89 Mb of DNA, are on different chromosomes and vary in their base composition. An analysis based on sliding windows of 10 kb shows that the fraction of aligned noncoding nonrepetitive DNA and the fraction of repetitive DNA are negatively correlated, both at the level of an entire region and locally within it. This conclusion is strongly supported by a randomization study, in which repetitive elements are removed and randomly relocated along the sequences. Thus, regions of noncoding genomic DNA that accumulated fewer point mutations since the primate-rodent divergence also suffered fewer retrotransposition events. These results indicate that some regions of the genome are more "flexible" over the time scale of mammalian evolution, being able to accommodate many point mutations and insertions, whereas other regions are more "rigid" and accumulate fewer changes. Stronger conservation is generally interpreted as indicating more extensive or more important function. The evidence presented here of correlated variation in the rates of different evolutionary processes across noncoding DNA must be considered in assessing such conservation for evidence of selection.
Collapse
|
193
|
Chiaramonte M, Koviach JL, Moore C, Iyer VV, Wagner CR, Halcomb RL, Miller W, Melançon P, Kuchta RD. Inhibition of CMP-sialic acid transport into Golgi vesicles by nucleoside monophosphates. Biochemistry 2001; 40:14260-7. [PMID: 11714280 DOI: 10.1021/bi011262w] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We examined the interactions of nucleotides with the CMP-sialic acid transporter in order to better understand which features play a role in binding and to investigate the relationship between binding and subsequent transport. With respect to the sugar, the transporter requires a complete ribose ring for tight binding, and the 2'-ara hydrogen makes an important contact. The enzyme exhibits little specificity with respect to the 2'- and 3'-hydroxyls, as it tolerated substitutions ranging from fluorine to an azido group. In the base, the C4 amine and C2 carbonyl groups make important contacts, while the N3 nitrogen does not. However, adding a methyl group to N3 dramatically reduced binding, indicating that mass at this position sterically hinders binding. Adding a group at C5 had either no effect or slightly enhanced binding. To determine if the transporter recognizes these CMP analogues as substrates, we assayed them for their ability to trans stimulate CMP-sialic acid import. These data suggest that the enzyme transports a wide variety of NMPs, and the rate of transport is inversely proportional to the K(I) of the analogue. The importance of our findings for understanding the specificities of the different nucleotide-sugar tranlocators and the design of novel glycosylation inhibitors are discussed.
Collapse
|
194
|
McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney L, Porwollik S, Ali J, Dante M, Du F, Hou S, Layman D, Leonard S, Nguyen C, Scott K, Holmes A, Grewal N, Mulvaney E, Ryan E, Sun H, Florea L, Miller W, Stoneking T, Nhan M, Waterston R, Wilson RK. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature 2001; 413:852-6. [PMID: 11677609 DOI: 10.1038/35101614] [Citation(s) in RCA: 1407] [Impact Index Per Article: 61.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Salmonella enterica subspecies I, serovar Typhimurium (S. typhimurium), is a leading cause of human gastroenteritis, and is used as a mouse model of human typhoid fever. The incidence of non-typhoid salmonellosis is increasing worldwide, causing millions of infections and many deaths in the human population each year. Here we sequenced the 4,857-kilobase (kb) chromosome and 94-kb virulence plasmid of S. typhimurium strain LT2. The distribution of close homologues of S. typhimurium LT2 genes in eight related enterobacteria was determined using previously completed genomes of three related bacteria, sample sequencing of both S. enterica serovar Paratyphi A (S. paratyphi A) and Klebsiella pneumoniae, and hybridization of three unsequenced genomes to a microarray of S. typhimurium LT2 genes. Lateral transfer of genes is frequent, with 11% of the S. typhimurium LT2 genes missing from S. enterica serovar Typhi (S. typhi), and 29% missing from Escherichia coli K12. The 352 gene homologues of S. typhimurium LT2 confined to subspecies I of S. enterica-containing most mammalian and bird pathogens-are useful for studies of epidemiology, host specificity and pathogenesis. Most of these homologues were previously unknown, and 50 may be exported to the periplasm or outer membrane, rendering them accessible as therapeutic or vaccine targets.
Collapse
|
195
|
Miller W. Promising strategies help employers integrate pharmacy and medical programs--and reap cost, quality advantages. EMPLOYEE BENEFITS JOURNAL 2001; 26:23-7. [PMID: 11534217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
Providing quality pharmacy benefits while managing costs is an escalating challenge for employers. Easy solutions such as higher copays and standard three-tier systems will not provide effective, long-lasting results that include improved clinical outcomes. Employers must concentrate on educating employees on the real cost and value of their pharmacy benefits and they must also take an integrated view of medical and pharmacy benefits.
Collapse
|
196
|
Keeler LK, Finkelstein LH, Miller W, Fernhall B. Early-phase adaptations of traditional-speed vs. superslow resistance training on strength and aerobic capacity in sedentary individuals. J Strength Cond Res 2001; 15:309-14. [PMID: 11710656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
We performed a randomized exercise training study to assess the effects of traditional Nautilus-style (TR) or superslow (SS) strength training on muscular strength, body composition, aerobic capacity, and cardiovascular endurance. Subjects were 14 healthy, sedentary women, 19-45 years of age (mean +/- SD age, 32.7 +/- 8.9 years), randomized to either the SS or TR training protocols and trained 3 times per week for 10 weeks. Measurements were taken both before and after training, which included a maximal incremental exercise test on a cycle ergometer, body composition, and 1 repetition maximum (1RM) tests on 8 Nautilus machines. Both groups increased their strength significantly on all 8 exercises, whereas the TR group increased significantly more than the SS group on bench press (34% vs. 11%), torso arm (anterior lateral pull-down) (27% vs. 12%), leg press (33% vs. 7%), leg extension (56% vs. 24%), and leg curl (40% vs. 15%). Thus, the TR group's improvement in total exercise weight lifted was significantly greater than that of the SS group after testing (39% vs. 15%). Exercise duration on the cycle ergometer and work rate significantly improved for both groups, but there was no group-by-training interaction. No significant differences were found for body composition or additional aerobic variables measured. Both strength training protocols produced a significant improvement in strength during a 10-week training period, but the TR protocol produced better gains in the absence of changes in percentage of body fat, body mass index, lean body mass, and body weight. In addition, strength training alone did not improve Vo2max, yet short-term endurance increased.
Collapse
|
197
|
Miller W. Using a Phase-field-like Approach for the Calculation of Melt Flow and Interface Shape during Czochralski Growth. CRYSTAL RESEARCH AND TECHNOLOGY 2001. [DOI: 10.1002/1521-4079(200108)36:7<675::aid-crat675>3.0.co;2-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
198
|
Miller W, Rehse U. Numerical Simulation of Temperature and Flow Field in the Melt for the Vapour‐pressure‐ controlled Czochralski Growth of GaAs. CRYSTAL RESEARCH AND TECHNOLOGY 2001. [DOI: 10.1002/1521-4079(200108)36:7<685::aid-crat685>3.0.co;2-#] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
199
|
Benjamin DK, Miller W, Garges H, Benjamin DK, McKinney RE, Cotton M, Fisher RG, Alexander KA. Bacteremia, central catheters, and neonates: when to pull the line. Pediatrics 2001; 107:1272-6. [PMID: 11389242 DOI: 10.1542/peds.107.6.1272] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
OBJECTIVES Physicians who treat neonates who become bacteremic while dependent on central venous catheters face a serious and common dilemma. We sought 1) to evaluate the relationship between central venous catheter removal and outcome in bacteremic neonates, 2) to determine species of bacteria that are associated with an increased risk of infectious complications if the central catheter is not removed promptly, and 3) to provide evidence-based recommendations for central catheter management. METHOD A retrospective cohort study of all neonates who had central venous access and developed bacteremia between July 1, 1995, and July 31, 1999, was conducted in the Duke University neonatal intensive care unit. RESULTS The outcome for patients in whom the central catheter was not removed within 24 hours of organism identification was significantly worse (odds ratio = 9.8) than it was for those whose catheters were removed promptly. For patients who were infected with Staphylococcus aureus or with nonenteric Gram-negative rods, delayed removal of the central catheter was associated with complicated bacteremia. Catheter sterilization was attempted in 27 neonates who were infected with enteric Gram-negative rods; only 10 of these infants retained their catheters without infection-related complications. Infants who had 4 consecutive blood cultures that were positive for coagulase-negative staphylococcus (CoNS) were at significantly increased risk for end-organ damage and death, compared with infants who had 3 or fewer positive blood culture for CoNS (odds ratio = 29.58). CONCLUSIONS Bacteremic infants experienced fewer infection-related complications when the central catheter was removed promptly. One positive blood culture for S aureus or a Gram-negative rod warrants central line removal in a neonate. Clinicians who are faced with a neonate who has 1 positive culture for CoNS may attempt medical management without central catheter removal, but documentation of subsequent negative blood cultures is crucial. Once a neonate has 3 positive blood cultures for CoNS, the central catheter should be removed.central line, neonate, bacteremia, bacteria, umbilical catheter, Broviac, percutaneous.
Collapse
|
200
|
Shiraishi T, Druck T, Mimori K, Flomenberg J, Berk L, Alder H, Miller W, Huebner K, Croce CM. Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit. Proc Natl Acad Sci U S A 2001; 98:5722-7. [PMID: 11320209 PMCID: PMC33280 DOI: 10.1073/pnas.091095898] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It has been suggested that delayed DNA replication underlies fragility at common human fragile sites, but specific sequences responsible for expression of these inducible fragile sites have not been identified. One approach to identify such cis-acting sequences within the large nonexonic regions of fragile sites would be to identify conserved functional elements within orthologous fragile sites by interspecies sequence comparison. This study describes a comparison of orthologous fragile regions, the human FRA3B/FHIT and the murine Fra14A2/Fhit locus. We sequenced over 600 kbp of the mouse Fra14A2, covering the region orthologous to the fragile epicenter of FRA3B, and determined the Fhit deletion break points in a mouse kidney cancer cell line (RENCA). The murine Fra14A2 locus, like the human FRA3B, was characterized by a high AT content. Alignment of the two sequences showed that this fragile region was stable in evolution despite its susceptibility to mitotic recombination on inhibition of DNA replication. There were also several unusual highly conserved regions (HCRs). The positions of predicted matrix attachment regions (MARs), possibly related to replication origins, were not conserved. Of known fragile region landmarks, five cancer cell break points, one viral integration site, and one aphidicolin break cluster were located within or near HCRs. Thus, comparison of orthologous fragile regions has identified highly conserved sequences with possible functional roles in maintenance of fragility.
Collapse
|