1
|
Zhang W, Bouffard GG, Wallace SS, Bond JP. Estimation of DNA sequence context-dependent mutation rates using primate genomic sequences. J Mol Evol 2007; 65:207-14. [PMID: 17676366 DOI: 10.1007/s00239-007-9000-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2004] [Accepted: 11/30/2006] [Indexed: 10/23/2022]
Abstract
It is understood that DNA and amino acid substitution rates are highly sequence context-dependent, e.g., C --> T substitutions in vertebrates may occur much more frequently at CpG sites and that cysteine substitution rates may depend on support of the context for participation in a disulfide bond. Furthermore, many applications rely on quantitative models of nucleotide or amino acid substitution, including phylogenetic inference and identification of amino acid sequence positions involved in functional specificity. We describe quantification of the context dependence of nucleotide substitution rates using baboon, chimpanzee, and human genomic sequence data generated by the NISC Comparative Sequencing Program. Relative mutation rates are reported for the 96 classes of mutations of the form 5' alphabetagamma 3' --> 5' alphadeltagamma 3', where alpha, beta, gamma, and delta are nucleotides and beta not equal delta, based on maximum likelihood calculations. Our results confirm that C --> T substitutions are enhanced at CpG sites compared with other transitions, relatively independent of the identity of the preceding nucleotide. While, as expected, transitions generally occur more frequently than transversions, we find that the most frequent transversions involve the C at CpG sites (CpG transversions) and that their rate is comparable to the rate of transitions at non-CpG sites. A four-class model of the rates of context-dependent evolution of primate DNA sequences, CpG transitions > non-CpG transitions approximately CpG transversions > non-CpG transversions, captures qualitative features of the mutation spectrum. We find that despite qualitative similarity of mutation rates among different genomic regions, there are statistically significant differences.
Collapse
Affiliation(s)
- Wei Zhang
- Department of Medicine, University of Chicago, 515 CLSC, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
2
|
|
3
|
Klein PE, Klein RR, Vrebalov J, Mullet JE. Sequence-based alignment of sorghum chromosome 3 and rice chromosome 1 reveals extensive conservation of gene order and one major chromosomal rearrangement. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2003; 34:605-621. [PMID: 12787243 DOI: 10.1046/j.1365-313x.2003.01751.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The completed rice genome sequence will accelerate progress on the identification and functional classification of biologically important genes and serve as an invaluable resource for the comparative analysis of grass genomes. In this study, methods were developed for sequence-based alignment of sorghum and rice chromosomes and for refining the sorghum genetic/physical map based on the rice genome sequence. A framework of 135 BAC contigs spanning approximately 33 Mbp was anchored to sorghum chromosome 3. A limited number of sequences were collected from 118 of the BACs and subjected to BLASTX analysis to identify putative genes and BLASTN analysis to identify sequence matches to the rice genome. Extensive conservation of gene content and order between sorghum chromosome 3 and the homeologous rice chromosome 1 was observed. One large-scale rearrangement was detected involving the inversion of an approximately 59 cM block of the short arm of sorghum chromosome 3. Several small-scale changes in gene collinearity were detected, indicating that single genes and/or small clusters of genes have moved since the divergence of sorghum and rice. Additionally, the alignment of the sorghum physical map to the rice genome sequence allowed sequence-assisted assembly of an approximately 1.6 Mbp sorghum BAC contig. This streamlined approach to high-resolution genome alignment and map building will yield important information about the relationships between rice and sorghum genes and genomic segments and ultimately enhance our understanding of cereal genome structure and evolution.
Collapse
Affiliation(s)
- Patricia E Klein
- Institute for Plant Genomics and Biotechnology, Texas A&M University, College Station, TX 77843, USA.
| | | | | | | |
Collapse
|
4
|
Chapman MA, Charchar FJ, Kinston S, Bird CP, Grafham D, Rogers J, Grützner F, Graves JAM, Green AR, Göttgens B. Comparative and functional analyses of LYL1 loci establish marsupial sequences as a model for phylogenetic footprinting. Genomics 2003; 81:249-59. [PMID: 12659809 DOI: 10.1016/s0888-7543(03)00005-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Comparative genomic sequence analysis is a powerful technique for identifying regulatory regions in genomic DNA. However, its utility largely depends on the evolutionary distances between the species involved. Here we describe the screening of a genomic BAC library from the stripe-faced dunnart, Sminthopsis macroura, formerly known as the narrow-footed marsupial mouse. We isolated a clone containing the LYL1 locus, completely sequenced the 60.6-kb insert, and compared it with orthologous human and mouse sequences. Noncoding homology was substantially reduced in the human/dunnart analysis compared with human/mouse, yet we could readily identify all promoters and exons. Human/mouse/dunnart alignments of the LYL1 candidate promoter allowed us to identify putative transcription factor binding sites, revealing a pattern highly reminiscent of critical regulatory regions of the LYL1 paralogue, SCL. This newly identified LYL1 promoter showed strong activity in myeloid progenitor cells and was bound in vivo by Fli1, Elf1, and Gata2-transcription factors all previously shown to bind to the SCL stem cell enhancer. This study represents the first large-scale comparative analysis involving marsupial genomic sequence and demonstrates that such comparisons provide a powerful approach to characterizing mammalian regulatory elements.
Collapse
Affiliation(s)
- Michael A Chapman
- Department of Haematology, Cambridge Institute for Medical Research, Cambridge University, Hills Road, Cambridge CB2 2XY, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Pevzner P, Tesler G. Genome rearrangements in mammalian evolution: lessons from human and mouse genomes. Genome Res 2003; 13:37-45. [PMID: 12529304 PMCID: PMC430962 DOI: 10.1101/gr.757503] [Citation(s) in RCA: 212] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Although analysis of genome rearrangements was pioneered by Dobzhansky and Sturtevant 65 years ago, we still know very little about the rearrangement events that produced the existing varieties of genomic architectures. The genomic sequences of human and mouse provide evidence for a larger number of rearrangements than previously thought and shed some light on previously unknown features of mammalian evolution. In particular, they reveal that a large number of microrearrangements is required to explain the differences in draft human and mouse sequences. Here we describe a new algorithm for constructing synteny blocks, study arrangements of synteny blocks in human and mouse, derive a most parsimonious human-mouse rearrangement scenario, and provide evidence that intrachromosomal rearrangements are more frequent than interchromosomal rearrangements. Our analysis is based on the human-mouse breakpoint graph, which reveals related breakpoints and allows one to find a most parsimonious scenario. Because these graphs provide important insights into rearrangement scenarios, we introduce a new visualization tool that allows one to view breakpoint graphs superimposed with genomic dot-plots.
Collapse
Affiliation(s)
- Pavel Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093-0114, USA.
| | | |
Collapse
|
6
|
Thomas JW, Schueler MG, Summers TJ, Blakesley RW, McDowell JC, Thomas PJ, Idol JR, Maduro VVB, Lee-Lin SQ, Touchman JW, Bouffard GG, Beckstrom-Sternberg SM, Green ED. Pericentromeric duplications in the laboratory mouse. Genome Res 2003; 13:55-63. [PMID: 12529306 PMCID: PMC430956 DOI: 10.1101/gr.791403] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Duplications have long been postulated to be an important mechanism by which genomes evolve. Interspecies genomic comparisons are one method by which the origin and molecular mechanism of duplications can be inferred. By comparative mapping in human, mouse, and rat, we previously found evidence for a recent chromosome-fission event that occurred in the mouse lineage. Cytogenetic mapping revealed that the genomic segments flanking the fission site appeared to be duplicated, with copies residing near the centromere of multiple mouse chromosomes. Here we report the mapping and sequencing of the regions of mouse chromosomes 5 and 6 involved in this chromosome-fission event as well as the results of comparative sequence analysis with the orthologous human and rat genomic regions. Our data indicate that the duplications associated with mouse chromosomes 5 and 6 are recent and that the resulting duplicated segments share significant sequence similarity with a series of regions near the centromeres of the mouse chromosomes previously identified by cytogenetic mapping. We also identified pericentromeric duplicated segments shared between mouse chromosomes 5 and 1. Finally, novel mouse satellite sequences as well as putative chimeric transcripts were found to be associated with the duplicated segments. Together, these findings demonstrate that pericentromeric duplications are not restricted to primates and may be a common mechanism for genome evolution in mammals.
Collapse
Affiliation(s)
- James W Thomas
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Frazer KA, Elnitski L, Church DM, Dubchak I, Hardison RC. Cross-species sequence comparisons: a review of methods and available resources. Genome Res 2003; 13:1-12. [PMID: 12529301 PMCID: PMC430969 DOI: 10.1101/gr.222003] [Citation(s) in RCA: 155] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
With the availability of whole-genome sequences for an increasing number of species, we are now faced with the challenge of decoding the information contained within these DNA sequences. Comparative analysis of DNA sequences from multiple species at varying evolutionary distances is a powerful approach for identifying coding and functional noncoding sequences, as well as sequences that are unique for a given organism. In this review, we outline the strategy for choosing DNA sequences from different species for comparative analyses and describe the methods used and the resources publicly available for these studies.
Collapse
Affiliation(s)
- Kelly A Frazer
- Perlegen Sciences, Mountain View, California 94043, USA.
| | | | | | | | | |
Collapse
|
8
|
Gregory SG, Sekhon M, Schein J, Zhao S, Osoegawa K, Scott CE, Evans RS, Burridge PW, Cox TV, Fox CA, Hutton RD, Mullenger IR, Phillips KJ, Smith J, Stalker J, Threadgold GJ, Birney E, Wylie K, Chinwalla A, Wallis J, Hillier L, Carter J, Gaige T, Jaeger S, Kremitzki C, Layman D, Maas J, McGrane R, Mead K, Walker R, Jones S, Smith M, Asano J, Bosdet I, Chan S, Chittaranjan S, Chiu R, Fjell C, Fuhrmann D, Girn N, Gray C, Guin R, Hsiao L, Krzywinski M, Kutsche R, Lee SS, Mathewson C, McLeavy C, Messervier S, Ness S, Pandoh P, Prabhu AL, Saeedi P, Smailus D, Spence L, Stott J, Taylor S, Terpstra W, Tsai M, Vardy J, Wye N, Yang G, Shatsman S, Ayodeji B, Geer K, Tsegaye G, Shvartsbeyn A, Gebregeorgis E, Krol M, Russell D, Overton L, Malek JA, Holmes M, Heaney M, Shetty J, Feldblyum T, Nierman WC, Catanese JJ, Hubbard T, Waterston RH, Rogers J, de Jong PJ, Fraser CM, Marra M, McPherson JD, Bentley DR. A physical map of the mouse genome. Nature 2002; 418:743-50. [PMID: 12181558 DOI: 10.1038/nature00957] [Citation(s) in RCA: 251] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy.
Collapse
Affiliation(s)
- Simon G Gregory
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Thomas JW, Prasad AB, Summers TJ, Lee-Lin SQ, Maduro VVB, Idol JR, Ryan JF, Thomas PJ, McDowell JC, Green ED. Parallel construction of orthologous sequence-ready clone contig maps in multiple species. Genome Res 2002; 12:1277-85. [PMID: 12176935 PMCID: PMC186643 DOI: 10.1101/gr.283202] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Comparison is a fundamental tool for analyzing DNA sequence. Interspecies sequence comparison is particularly powerful for inferring genome function and is based on the simple premise that conserved sequences are likely to be important. Thus, the comparison of a genomic sequence with its orthologous counterpart from another species is increasingly becoming an integral component of genome analysis. In ideal situations, such comparisons are performed with orthologous sequences from multiple species. To facilitate multispecies comparative sequence analysis, a robust and scalable strategy for simultaneously constructing sequence-ready bacterial artificial chromosome (BAC) contig maps from targeted genomic regions has been developed. Central to this approach is the generation and utilization of "universal" oligonucleotide-based hybridization probes ("overgo" probes), which are designed from sequences that are highly conserved between distantly related species. Large collections of these probes are used en masse to screen BAC libraries from multiple species in parallel, with the isolated clones assembled into physical contig maps. To validate the effectiveness of this strategy, efforts were focused on the construction of BAC-based physical maps from multiple mammalian species (chimpanzee, baboon, cat, dog, cow, and pig). Using available human and mouse genomic sequence and a newly developed computer program to design the requisite probes, sequence-ready maps were constructed in all species for a series of targeted regions totaling approximately 16 Mb in the human genome. The described approach can be used to facilitate the multispecies comparative sequencing of targeted genomic regions and can be adapted for constructing BAC contig maps in other vertebrates.
Collapse
Affiliation(s)
- James W Thomas
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
The human genome sequence provides a reference point from which we can compare ourselves with other organisms. Interspecies comparison is a powerful tool for inferring function from genomic sequence and could ultimately lead to the discovery of what makes humans unique. To date, most comparative sequencing has focused on pair-wise comparisons between human and a limited number of other vertebrates, such as mouse. Targeted approaches now exist for mapping and sequencing vertebrate bacterial artificial chromosomes (BACs) from numerous species, allowing rapid and detailed molecular and phylogenetic investigation of multi-megabase loci. Such targeted sequencing is complementary to current whole-genome sequencing projects, and would benefit greatly from the creation of BAC libraries from a diverse range of vertebrates.
Collapse
Affiliation(s)
- James W Thomas
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | |
Collapse
|
11
|
DeSilva U, Elnitski L, Idol JR, Doyle JL, Gan W, Thomas JW, Schwartz S, Dietrich NL, Beckstrom-Sternberg SM, McDowell JC, Blakesley RW, Bouffard GG, Thomas PJ, Touchman JW, Miller W, Green ED. Generation and comparative analysis of approximately 3.3 Mb of mouse genomic sequence orthologous to the region of human chromosome 7q11.23 implicated in Williams syndrome. Genome Res 2002; 12:3-15. [PMID: 11779826 PMCID: PMC155257 DOI: 10.1101/gr.214802] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Williams syndrome is a complex developmental disorder that results from the heterozygous deletion of a approximately 1.6-Mb segment of human chromosome 7q11.23. These deletions are mediated by large (approximately 300 kb) duplicated blocks of DNA of near-identical sequence. Previously, we showed that the orthologous region of the mouse genome is devoid of such duplicated segments. Here, we extend our studies to include the generation of approximately 3.3 Mb of genomic sequence from the mouse Williams syndrome region, of which just over 1.4 Mb is finished to high accuracy. Comparative analyses of the mouse and human sequences within and immediately flanking the interval commonly deleted in Williams syndrome have facilitated the identification of nine previously unreported genes, provided detailed sequence-based information regarding 30 genes residing in the region, and revealed a number of potentially interesting conserved noncoding sequences. Finally, to facilitate comparative sequence analysis, we implemented several enhancements to the program, including the addition of links from annotated features within a generated percent-identity plot to specific records in public databases. Taken together, the results reported here provide an important comparative sequence resource that should catalyze additional studies of Williams syndrome, including those that aim to characterize genes within the commonly deleted interval and to develop mouse models of the disorder.
Collapse
Affiliation(s)
- Udaya DeSilva
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Kwitek AE, Tonellato PJ, Chen D, Gullings-Handley J, Cheng YS, Twigger S, Scheetz TE, Casavant TL, Stoll M, Nobrega MA, Shiozawa M, Soares MB, Sheffield VC, Jacob HJ. Automated construction of high-density comparative maps between rat, human, and mouse. Genome Res 2001; 11:1935-43. [PMID: 11691858 PMCID: PMC311144 DOI: 10.1101/gr.173701] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Animal models have been used primarily as surrogates for humans, having similar disease-based phenotypes. Genomic organization also tends to be conserved between species, leading to the generation of comparative genome maps. The emergence of radiation hybrid (RH) maps, coupled with the large numbers of available Expressed Sequence Tags (ESTs), has revolutionized the way comparative maps can be built. We used publicly available rat, mouse, and human data to identify genes and ESTs with interspecies sequence identity (homology), identified their UniGene relationships, and incorporated their RH map positions to build integrated comparative maps with >2100 homologous UniGenes mapped in more than one species (approximately 6% of all mammalian genes). The generation of these maps is iterative and labor intensive; therefore, we developed a series of computer tools (not described here) based on our algorithm that identifies anchors between species and produces printable and on-line clickable comparative maps that link to a wide variety of useful tools and databases. The maps were constructed using sequence-based comparisons, thus creating "hooks" for further sequence-based annotation of human, mouse, and rat sequences. Currently, this map enables investigators to link the physiology of the rat with the genetics of the mouse and the clinical significance of the human.
Collapse
Affiliation(s)
- A E Kwitek
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Kim J, Gordon L, Dehal P, Badri H, Christensen M, Groza M, Ha C, Hammond S, Vargas M, Wehri E, Wagner M, Olsen A, Stubbs L. Homology-driven assembly of a sequence-ready mouse BAC contig map spanning regions related to the 46-Mb gene-rich euchromatic segments of human chromosome 19. Genomics 2001; 74:129-41. [PMID: 11386749 DOI: 10.1006/geno.2001.6521] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Draft sequence derived from the 46-Mb gene-rich euchromatic portion of human chromosome 19 (HSA19) was utilized to generate a sequence-ready physical map spanning homologous regions of mouse chromosomes. Sequence similarity searches with the human sequence identified more than 1000 individual orthologous mouse genes from which 382 overgo probes were developed for hybridization. Using human gene order and spacing as a model, these probes were used to isolate and assemble bacterial artificial chromosome (BAC) clone contigs spanning homologous mouse regions. Each contig was verified, extended, and joined to neighboring contigs by restriction enzyme fingerprinting analysis. Approximately 3000 mouse BACs were analyzed and assembled into 44 contigs with a combined length of 41.4 Mb. These BAC contigs, covering 90% of HSA19-related mouse DNA, are distributed throughout 15 homology segments derived from different regions of mouse chromosomes 7, 8, 9, 10, and 17. The alignment of the HSA19 map with the ordered mouse BAC contigs revealed a number of structural differences in several overtly conserved homologous regions and more precisely defined the borders of the known regions of HSA19-syntenic homology. Our results demonstrate that given a human draft sequence, BAC contig maps can be constructed quickly for comparative sequencing without the need for preestablished mouse-specific genetic or physical markers and indicate that similar strategies can be applied with equal success to genomes of other vertebrate species.
Collapse
Affiliation(s)
- J Kim
- Genomics Division, Lawrence Livermore National Laboratory, 7000 East Avenue, L-441, Livermore, California 94550, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Pletcher MT, Wiltshire T, Cabin DE, Villanueva M, Reeves RH. Use of Comparative Physical and Sequence Mapping to Annotate Mouse Chromosome 16 and Human Chromosome 21. Genomics 2001; 74:45-54. [PMID: 11374901 DOI: 10.1006/geno.2001.6533] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Distal mouse chromosome 16 (MMU16) shares conserved linkage with human chromosome 21 (HSA21), trisomy for which causes Down syndrome (DS). A 4.5-Mb physical map extending from Cbr1 to Tmprss2 on MMU16 provides a minimal tiling path of P1 artificial chromosomes (PACs) for comparative mapping and genomic sequencing. Thirty-four expressed sequences were positioned on the mouse map, including 19 that were not physically mapped previously. This region of the mouse:human comparative map shows a high degree of evolutionary conservation of gene order and content, which differs only by insertion of one gene (in mouse) and a small inversion involving two adjacent genes. "Low-pass" (2.2x) mouse sequence from a portion of the contig was ordered and oriented along 510 kb of finished HSA21 sequence. In combination with 68 kb of unique PAC end sequence, the comparison provided confirmation of genes predicted by comparative mapping, indicated gene predictions that are likely to be incorrect, and identified three candidate genes in mouse and human that were not observed in the initial HSA21 sequence annotation. This comparative map and sequence derived from it are powerful tools for identifying genes and regulatory regions, information that will in turn provide insights into the genetic mechanisms by which trisomy 21 results in DS.
Collapse
Affiliation(s)
- M T Pletcher
- Department of Physiology, Johns Hopkins School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | |
Collapse
|
15
|
Wilson MD, Riemer C, Martindale DW, Schnupf P, Boright AP, Cheung TL, Hardy DM, Schwartz S, Scherer SW, Tsui LC, Miller W, Koop BF. Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5. Nucleic Acids Res 2001; 29:1352-65. [PMID: 11239002 PMCID: PMC29746 DOI: 10.1093/nar/29.6.1352] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Chromosome 7q22 has been the focus of many cytogenetic and molecular studies aimed at delineating regions commonly deleted in myeloid leukemias and myelodysplastic syndromes. We have compared a gene-dense, GC-rich sub-region of 7q22 with the orthologous region on mouse chromosome 5. A physical map of 640 kb of genomic DNA from mouse chromosome 5 was derived from a series of overlapping bacterial artificial chromosomes. A 296 kb segment from the physical map, spanning ACHE: to Tfr2, was compared with 267 kb of human sequence. We identified a conserved linkage of 12 genes including an open reading frame flanked by ACHE: and Asr2, a novel cation-chloride cotransporter interacting protein Cip1, Ephb4, Zan and Perq1. While some of these genes have been previously described, in each case we present new data derived from our comparative sequence analysis. Adjacent unfinished sequence data from the mouse contains an orthologous block of 10 additional genes including three novel cDNA sequences that we subsequently mapped to human 7q22. Methods for displaying comparative genomic information, including unfinished sequence data, are becoming increasingly important. We supplement our printed comparative analysis with a new, Web-based program called Laj (local alignments with java). Laj provides interactive access to archived pairwise sequence alignments via the WWW. It displays synchronized views of a dot-plot, a percent identity plot, a nucleotide-level local alignment and a variety of relevant annotations. Our mouse-human comparison can be viewed at http://web.uvic.ca/~bioweb/laj.html. Laj is available at http://bio.cse.psu.edu/, along with online documentation and additional examples of annotated genomic regions.
Collapse
Affiliation(s)
- M D Wilson
- Department of Biology, Centre for Environmental Health, PO Box 3020, University of Victoria, Victoria, British Columbia V8W 3N5, Canada
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14684] [Impact Index Per Article: 638.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Ranz JM, Casals F, Ruiz A. How malleable is the eukaryotic genome? Extreme rate of chromosomal rearrangement in the genus Drosophila. Genome Res 2001; 11:230-9. [PMID: 11157786 PMCID: PMC311025 DOI: 10.1101/gr.162901] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2000] [Accepted: 11/21/2000] [Indexed: 11/24/2022]
Abstract
During the evolution of the genus Drosophila, the molecular organization of the major chromosomal elements has been repeatedly rearranged via the fixation of paracentric inversions. Little detailed information is available, however, on the extent and effect of these changes at the molecular level. In principle, a full description of the rate and pattern of change could reveal the limits, if any, to which the eukaryotic genome can accommodate reorganizations. We have constructed a high-density physical map of the largest chromosomal element in Drosophila repleta (chromosome 2) and compared the order and distances between the markers with those on the homologous chromosomal element (3R) in Drosophila melanogaster. The two species belong to different subgenera (Drosophila and Sophophora, respectively), which diverged 40-62 million years (Myr) ago and represent, thus, the farthest lineages within the Drosophila genus. The comparison reveals extensive reshuffling of gene order from centromere to telomere. Using a maximum likelihood method, we estimate that 114 +/- 14 paracentric inversions have been fixed in this chromosomal element since the divergence of the two species, that is, 0.9-1.4 inversions fixed per Myr. Comparison with available rates of chromosomal evolution, taking into account genome size, indicates that the Drosophila genome shows the highest rate found so far in any eukaryote. Twenty-one small segments (23-599 kb) comprising at least two independent (nonoverlapping) markers appear to be conserved between D. melanogaster and D. repleta. These results are consistent with the random breakage model and do not provide significant evidence of functional constraint of any kind. They support the notion that the Drosophila genome is extraordinarily malleable and has a modular organization. The high rate of chromosomal change also suggests a very limited transferability of the positional information from the Drosophila genome to other insects.
Collapse
Affiliation(s)
- J M Ranz
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain.
| | | | | |
Collapse
|
18
|
Current awareness on comparative and functional genomics. Yeast 2000; 17. [PMID: 11119313 PMCID: PMC2448380 DOI: 10.1002/1097-0061(200012)17:4<339::aid-yea10>3.0.co;2-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
|
19
|
Current awareness on comparative and functional genomics. Yeast 2000; 17:339-46. [PMID: 11119313 PMCID: PMC2448380 DOI: 10.1002/1097-0061(200012)17:4<339::aid-yea10>3.0.co;2-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|