1
|
Waterston RH, Moerman DG. John Sulston (1942-2018): a personal perspective. J Neurogenet 2021; 34:238-246. [PMID: 33446017 DOI: 10.1080/01677063.2020.1833008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
John Sulston changed the way we do science, not once, but three times - initially with the complete cell lineage of the nematode Caenorhabditis elegans, next with completion of the genome sequences of the worm and human genomes and finally with his strong and active advocacy for open data sharing. His contributions were widely recognized and in 2002 he received the Nobel Prize in Physiology and Medicine.
Collapse
Affiliation(s)
- Robert H Waterston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Donald G Moerman
- Department of Zoology, University of British Columbia, Vancouver, BC, USA
| |
Collapse
|
2
|
Maxson Jones K, Ankeny RA, Cook-Deegan R. The Bermuda Triangle: The Pragmatics, Policies, and Principles for Data Sharing in the History of the Human Genome Project. JOURNAL OF THE HISTORY OF BIOLOGY 2018; 51:693-805. [PMID: 30390178 PMCID: PMC7307446 DOI: 10.1007/s10739-018-9538-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The Bermuda Principles for DNA sequence data sharing are an enduring legacy of the Human Genome Project (HGP). They were adopted by the HGP at a strategy meeting in Bermuda in February of 1996 and implemented in formal policies by early 1998, mandating daily release of HGP-funded DNA sequences into the public domain. The idea of daily sharing, we argue, emanated directly from strategies for large, goal-directed molecular biology projects first tested within the "community" of C. elegans researchers, and were introduced and defended for the HGP by the nematode biologists John Sulston and Robert Waterston. In the C. elegans community, and subsequently in the HGP, daily sharing served the pragmatic goals of quality control and project coordination. Yet in the HGP human genome, we also argue, the Bermuda Principles addressed concerns about gene patents impeding scientific advancement, and were aspirational and flexible in implementation and justification. They endured as an archetype for how rapid data sharing could be realized and rationalized, and permitted adaptation to the needs of various scientific communities. Yet in addition to the support of Sulston and Waterston, their adoption also depended on the clout of administrators at the US National Institutes of Health (NIH) and the UK nonprofit charity the Wellcome Trust, which together funded 90% of the HGP human sequencing effort. The other nations wishing to remain in the HGP consortium had to accommodate to the Bermuda Principles, requiring exceptions from incompatible existing or pending data access policies for publicly funded research in Germany, Japan, and France. We begin this story in 1963, with the biologist Sydney Brenner's proposal for a nematode research program at the Laboratory of Molecular Biology (LMB) at the University of Cambridge. We continue through 2003, with the completion of the HGP human reference genome, and conclude with observations about policy and the historiography of molecular biology.
Collapse
Affiliation(s)
- Kathryn Maxson Jones
- Department of History, Princeton University, Princeton, NJ, USA.
- MBL McDonnell Foundation Scholar, Marine Biological Laboratory, Woods Hole, MA, USA.
| | - Rachel A Ankeny
- School of Humanities, The University of Adelaide, Adelaide, Australia
| | - Robert Cook-Deegan
- School for the Future of Innovation in Society, Consortium for Science, Policy & Outcomes, Arizona State University, Barrett & O'Connor Washington Center, Washington, D.C., USA
| |
Collapse
|
3
|
A Behind-the-Scenes Story of Precision Medicine. GENOMICS, PROTEOMICS & BIOINFORMATICS 2017; 15:3-10. [PMID: 28188874 PMCID: PMC5339399 DOI: 10.1016/j.gpb.2017.01.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 01/25/2017] [Indexed: 12/24/2022]
|
4
|
Louis EJ. Historical Evolution of Laboratory Strains of Saccharomyces cerevisiae. Cold Spring Harb Protoc 2016; 2016:2016/7/pdb.top077750. [PMID: 27371602 DOI: 10.1101/pdb.top077750] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Budding yeast strains used in the laboratory have had a checkered past. Historically, the choice of strain for any particular experiment depended on the suitability of the strain for the topic of study (e.g., cell cycle vs. meiosis). Many laboratory strains had poor fermentation properties and were not representative of the robust strains used for domestic purposes. Most strains were related to each other, but investigators usually had only vague notions about the extent of their relationships. Isogenicity was difficult to confirm before the advent of molecular genetic techniques. However, their ease of growth and manipulation in laboratory conditions made them "the model" model organism, and they still provided a great deal of fundamental knowledge. Indeed, more than one Nobel Prize has been won using them. Most of these strains continue to be powerful tools, and isogenic derivatives of many of them-including entire collections of deletions, overexpression constructs, and tagged gene products-are now available. Furthermore, many of these strains are now sequenced, providing intimate knowledge of their relationships. Recent collections, new isolates, and the creation of genetically tractable derivatives have expanded the available strains for experiments. But even still, these laboratory strains represent a small fraction of the diversity of yeast. The continued development of new laboratory strains will broaden the potential questions that can be posed. We are now poised to take advantage of this diversity, rather than viewing it as a detriment to controlled experiments.
Collapse
Affiliation(s)
- Edward J Louis
- Centre for Genetic Architecture of Complex Traits, Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| |
Collapse
|
5
|
Hoffman CS, Wood V, Fantes PA. An Ancient Yeast for Young Geneticists: A Primer on the Schizosaccharomyces pombe Model System. Genetics 2015; 201:403-23. [PMID: 26447128 PMCID: PMC4596657 DOI: 10.1534/genetics.115.181503] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The fission yeast Schizosaccharomyces pombe is an important model organism for the study of eukaryotic molecular and cellular biology. Studies of S. pombe, together with studies of its distant cousin, Saccharomyces cerevisiae, have led to the discovery of genes involved in fundamental mechanisms of transcription, translation, DNA replication, cell cycle control, and signal transduction, to name but a few processes. However, since the divergence of the two species approximately 350 million years ago, S. pombe appears to have evolved less rapidly than S. cerevisiae so that it retains more characteristics of the common ancient yeast ancestor, causing it to share more features with metazoan cells. This Primer introduces S. pombe by describing the yeast itself, providing a brief description of the origins of fission yeast research, and illustrating some genetic and bioinformatics tools used to study protein function in fission yeast. In addition, a section on some key differences between S. pombe and S. cerevisiae is included for readers with some familiarity with budding yeast research but who may have an interest in developing research projects using S. pombe.
Collapse
Affiliation(s)
- Charles S Hoffman
- Biology Department, Boston College, Chestnut Hill, Massachusetts 02467
| | - Valerie Wood
- Cambridge Systems Biology Centre and Department of Biochemistry, University of Cambridge, CB2 1GA Cambridge, United Kingdom
| | - Peter A Fantes
- School of Biological Sciences, College of Science and Engineering, University of Edinburgh EH9 3JR Edinburgh, United Kingdom
| |
Collapse
|
6
|
Dujon B. Basic principles of yeast genomics, a personal recollection: Graphical Abstract Figure. FEMS Yeast Res 2015; 15:fov047. [DOI: 10.1093/femsyr/fov047] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2015] [Indexed: 12/12/2022] Open
|
7
|
Abstract
Genomic information reported as haplotypes rather than genotypes will be increasingly important for personalized medicine. Current technologies generate diploid sequence data that is rarely resolved into its constituent haplotypes. Furthermore, paradigms for thinking about genomic information are based on interpreting genotypes rather than haplotypes. Nevertheless, haplotypes have historically been useful in contexts ranging from population genetics to disease-gene mapping efforts. The main approaches for phasing genomic sequence data are molecular haplotyping, genetic haplotyping, and population-based inference. Long-read sequencing technologies are enabling longer molecular haplotypes, and decreases in the cost of whole-genome sequencing are enabling the sequencing of whole-chromosome genetic haplotypes. Hybrid approaches combining high-throughput short-read assembly with strategic approaches that enable physical or virtual binning of reads into haplotypes are enabling multi-gene haplotypes to be generated from single individuals. These techniques can be further combined with genetic and population approaches. Here, we review advances in whole-genome haplotyping approaches and discuss the importance of haplotypes for genomic medicine. Clinical applications include diagnosis by recognition of compound heterozygosity and by phasing regulatory variation to coding variation. Haplotypes, which are more specific than less complex variants such as single nucleotide variants, also have applications in prognostics and diagnostics, in the analysis of tumors, and in typing tissue for transplantation. Future advances will include technological innovations, the application of standard metrics for evaluating haplotype quality, and the development of databases that link haplotypes to disease.
Collapse
Affiliation(s)
- Gustavo Glusman
- Institute for Systems Biology, Terry Avenue North, Seattle, WA 98109 USA
| | - Hannah C Cox
- Institute for Systems Biology, Terry Avenue North, Seattle, WA 98109 USA
| | - Jared C Roach
- Institute for Systems Biology, Terry Avenue North, Seattle, WA 98109 USA
| |
Collapse
|
8
|
Engel SR, Dietrich FS, Fisk DG, Binkley G, Balakrishnan R, Costanzo MC, Dwight SS, Hitz BC, Karra K, Nash RS, Weng S, Wong ED, Lloyd P, Skrzypek MS, Miyasato SR, Simison M, Cherry JM. The reference genome sequence of Saccharomyces cerevisiae: then and now. G3 (BETHESDA, MD.) 2014; 4:389-98. [PMID: 24374639 PMCID: PMC3962479 DOI: 10.1534/g3.113.008995] [Citation(s) in RCA: 255] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2013] [Accepted: 12/21/2013] [Indexed: 11/18/2022]
Abstract
The genome of the budding yeast Saccharomyces cerevisiae was the first completely sequenced from a eukaryote. It was released in 1996 as the work of a worldwide effort of hundreds of researchers. In the time since, the yeast genome has been intensively studied by geneticists, molecular biologists, and computational scientists all over the world. Maintenance and annotation of the genome sequence have long been provided by the Saccharomyces Genome Database, one of the original model organism databases. To deepen our understanding of the eukaryotic genome, the S. cerevisiae strain S288C reference genome sequence was updated recently in its first major update since 1996. The new version, called "S288C 2010," was determined from a single yeast colony using modern sequencing technologies and serves as the anchor for further innovations in yeast genomic science.
Collapse
Affiliation(s)
- Stacia R. Engel
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Fred S. Dietrich
- Department of Molecular Genetics and Microbiology, Duke University, Durham, North Carolina 27710
| | - Dianna G. Fisk
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Gail Binkley
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Rama Balakrishnan
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Maria C. Costanzo
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Selina S. Dwight
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Benjamin C. Hitz
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Kalpana Karra
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Robert S. Nash
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Shuai Weng
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Edith D. Wong
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Paul Lloyd
- Department of Genetics, Stanford University, Stanford, California 94305
| | - Marek S. Skrzypek
- Department of Genetics, Stanford University, Stanford, California 94305
| | | | - Matt Simison
- Department of Genetics, Stanford University, Stanford, California 94305
| | - J. Michael Cherry
- Department of Genetics, Stanford University, Stanford, California 94305
| |
Collapse
|
9
|
Frenkel Z, Paux E, Mester D, Feuillet C, Korol A. LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes. BMC Bioinformatics 2010; 11:584. [PMID: 21118513 PMCID: PMC3098104 DOI: 10.1186/1471-2105-11-584] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/30/2010] [Indexed: 11/25/2022] Open
Abstract
Background Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that are then anchored to genetic maps with molecular markers. High Information Content Fingerprinting has become the method of choice for large and repetitive genomes such as those of maize, barley, and wheat. However, the high level of repeated DNA present in these genomes requires the application of very stringent criteria to ensure a reliable assembly with the FingerPrinted Contig (FPC) software, which often results in short contig lengths (of 3-5 clones before merging) as well as an unreliable assembly in some difficult regions. Difficulties can originate from a non-linear topological structure of clone overlaps, low power of clone ordering algorithms, and the absence of tools to identify sources of gaps in Minimal Tiling Paths (MTPs). Results To address these problems, we propose a novel approach that: (i) reduces the rate of false connections and Q-clones by using a new cutoff calculation method; (ii) obtains reliable clusters robust to the exclusion of single clone or clone overlap; (iii) explores the topological contig structure by considering contigs as networks of clones connected by significant overlaps; (iv) performs iterative clone clustering combined with ordering and order verification using re-sampling methods; and (v) uses global optimization methods for clone ordering and Band Map construction. The elements of this new analytical framework called Linear Topological Contig (LTC) were applied on datasets used previously for the construction of the physical map of wheat chromosome 3B with FPC. The performance of LTC vs. FPC was compared also on the simulated BAC libraries based on the known genome sequences for chromosome 1 of rice and chromosome 1 of maize. Conclusions The results show that compared to other methods, LTC enables the construction of highly reliable and longer contigs (5-12 clones before merging), the detection of "weak" connections in contigs and their "repair", and the elongation of contigs obtained by other assembly methods.
Collapse
Affiliation(s)
- Zeev Frenkel
- University of Haifa, Institute of Evolution, Haifa 31905, Israel.
| | | | | | | | | |
Collapse
|
10
|
Coulson A, Sulston J, Brenner S, Karn J. Toward a physical map of the genome of the nematode Caenorhabditis elegans. Proc Natl Acad Sci U S A 2010; 83:7821-5. [PMID: 16593771 PMCID: PMC386814 DOI: 10.1073/pnas.83.20.7821] [Citation(s) in RCA: 420] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A technique for digital characterization and comparison of DNA fragments, using restriction enzymes, is described. The technique is being applied to fragments from the nematode Caenorhabditis elegans (i) to facilitate cross-indexing of clones emanating from different laboratories and (ii) to construct a physical map of the genome. Eight hundred sixty clusters of clones, from 35 to 350 kilobases long and totaling about 60% of the genome, have been characterized.
Collapse
Affiliation(s)
- A Coulson
- Medical Research Council Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, England
| | | | | | | |
Collapse
|
11
|
Mathewson CA, Schein JE, Marra MA. Large-scale BAC clone restriction digest fingerprinting. ACTA ACUST UNITED AC 2008; Chapter 5:Unit 5.19. [PMID: 18428413 DOI: 10.1002/0471142905.hg0519s53] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Restriction digest fingerprinting is a common method for characterizing large insert genomic clones, e.g., bacterial artificial chromosome (BAC), P1 artificial chromosome (PAC) and Fosmid clones. This clone fingerprinting method has been widely applied in the construction of clone-based physical maps, which have been used as positional cloning resources as well as to support directed and genome-wide sequencing efforts. This unit describes a robust, large-scale procedure for generation of agarose gel-based clone fingerprints from BAC clones.
Collapse
Affiliation(s)
- Carrie A Mathewson
- Canada's Michael Smith Genome Sciences Center Vancouver, British Columbia, Canada
| | | | | |
Collapse
|
12
|
Hayden HS, Gillett W, Saenphimmachak C, Lim R, Zhou Y, Jacobs MA, Chang J, Rohmer L, D'Argenio DA, Palmieri A, Levy R, Haugen E, Wong GKS, Brittnacher MJ, Burns JL, Miller SI, Olson MV, Kaul R. Large-insert genome analysis technology detects structural variation in Pseudomonas aeruginosa clinical strains from cystic fibrosis patients. Genomics 2008; 91:530-7. [PMID: 18445516 DOI: 10.1016/j.ygeno.2008.02.005] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2007] [Revised: 02/26/2008] [Accepted: 02/28/2008] [Indexed: 11/16/2022]
Abstract
Large-insert genome analysis (LIGAN) is a broadly applicable, high-throughput technology designed to characterize genome-scale structural variation. Fosmid paired-end sequences and DNA fingerprints from a query genome are compared to a reference sequence using the Genomic Variation Analysis (GenVal) suite of software tools to pinpoint locations of insertions, deletions, and rearrangements. Fosmids spanning regions that contain new structural variants can then be sequenced. Clonal pairs of Pseudomonas aeruginosa isolates from four cystic fibrosis patients were used to validate the LIGAN technology. Approximately 1.5 Mb of inserted sequences were identified, including 743 kb containing 615 ORFs that are absent from published P. aeruginosa genomes. Six rearrangement breakpoints and 220 kb of deleted sequences were also identified. Our study expands the "genome universe" of P. aeruginosa and validates a technology that complements emerging, short-read sequencing methods that are better suited to characterizing single-nucleotide polymorphisms than structural variation.
Collapse
Affiliation(s)
- Hillary S Hayden
- Genome Center, University of Washington, Seattle, WA 98195, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Foote S, Marshall V, Munroe DJ, Segre JA. Constructing contigs from large-insert clones. CURRENT PROTOCOLS IN HUMAN GENETICS 2008; Chapter 5:Unit 5.10. [PMID: 18428284 DOI: 10.1002/0471142905.hg0510s15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
This unit describes three approaches that are widely used to define alignments between overlapping clones bearing large-insert genomic DNA and to generate extensive contiguous overlapping sets of clones (contigs). The three approaches are sequence-tagged site (STS) content mapping, repetitive-element hybridization fingerprinting, and Alu-PCR fingerprinting. Methods for isolating the necessary BAC DNA suitable for automated fluorescent sequencing and generating new STS markers are discussed in support protocols. An alternate protocol presents repetitive-element hybridization fingerprinting to detect overlaps and build contigs with full-genomic YAC libraries.
Collapse
Affiliation(s)
- S Foote
- Walter and Eliza Hall Institute, Victoria, Australia
| | | | | | | |
Collapse
|
14
|
Brenner S. The human genome: the nature of the enterprise. CIBA FOUNDATION SYMPOSIUM 2007; 149:6-12; discussion 12-7. [PMID: 2335125 DOI: 10.1002/9780470513903.ch2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The development of methods of cloning and sequencing DNA has liberated genetics from the constraints of breeding experiments, and all organisms are now accessible to genetical analysis by the new methods. The analysis of the human genome is likely to produce a revolution in biomedical science, and it is argued that high priority should be given to gene identification. The strategy advocated is one which identifies the expressed genes and pursues mapping in parallel.
Collapse
Affiliation(s)
- S Brenner
- MRC Molecular Genetics Unit, Cambridge, UK
| |
Collapse
|
15
|
Kim H, San Miguel P, Nelson W, Collura K, Wissotski M, Walling JG, Kim JP, Jackson SA, Soderlund C, Wing RA. Comparative physical mapping between Oryza sativa (AA genome type) and O. punctata (BB genome type). Genetics 2007; 176:379-90. [PMID: 17339227 PMCID: PMC1893071 DOI: 10.1534/genetics.106.068783] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2006] [Accepted: 02/09/2007] [Indexed: 11/18/2022] Open
Abstract
A comparative physical map of the AA genome (Oryza sativa) and the BB genome (O. punctata) was constructed by aligning a physical map of O. punctata, deduced from 63,942 BAC end sequences (BESs) and 34,224 fingerprints, onto the O. sativa genome sequence. The level of conservation of each chromosome between the two species was determined by calculating a ratio of BES alignments. The alignment result suggests more divergence of intergenic and repeat regions in comparison to gene-rich regions. Further, this characteristic enabled localization of heterochromatic and euchromatic regions for each chromosome of both species. The alignment identified 16 locations containing expansions, contractions, inversions, and transpositions. By aligning 40% of the punctata BES on the map, 87% of the punctata FPC map covered 98% of the O. sativa genome sequence. The genome size of O. punctata was estimated to be 8% larger than that of O. sativa with individual chromosome differences of 1.5-16.5%. The sum of expansions and contractions observed in regions >500 kb were similar, suggesting that most of the contractions/expansions contributing to the genome size difference between the two species are small, thus preserving the macro-collinearity between these species, which diverged approximately 2 million years ago.
Collapse
Affiliation(s)
- HyeRan Kim
- Arizona Genomics Institute, University of Arizona, Tucson, Arizona 85721, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Wendl MC. Algebraic correction methods for computational assessment of clone overlaps in DNA fingerprint mapping. BMC Bioinformatics 2007; 8:127. [PMID: 17442113 PMCID: PMC1868038 DOI: 10.1186/1471-2105-8-127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2007] [Accepted: 04/18/2007] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND The Sulston score is a well-established, though approximate metric for probabilistically evaluating postulated clone overlaps in DNA fingerprint mapping. It is known to systematically over-predict match probabilities by various orders of magnitude, depending upon project-specific parameters. Although the exact probability distribution is also available for the comparison problem, it is rather difficult to compute and cannot be used directly in most cases. A methodology providing both improved accuracy and computational economy is required. RESULTS We propose a straightforward algebraic correction procedure, which takes the Sulston score as a provisional value and applies a power-law equation to obtain an improved result. Numerical comparisons indicate dramatically increased accuracy over the range of parameters typical of traditional agarose fingerprint mapping. Issues with extrapolating the method into parameter ranges characteristic of newer capillary electrophoresis-based projects are also discussed. CONCLUSION Although only marginally more expensive to compute than the raw Sulston score, the correction provides a vastly improved probabilistic description of hypothesized clone overlaps. This will clearly be important in overlap assessment and perhaps for other tasks as well, for example in using the ranking of overlap probabilities to assist in clone ordering.
Collapse
Affiliation(s)
- Michael C Wendl
- Genome Sequencing Center, Washington University, St. Louis, MO 63108, USA.
| |
Collapse
|
17
|
García-Sancho M. Mapping and sequencing information: the social context for the genomics revolution. ENDEAVOUR 2007; 31:18-23. [PMID: 17336383 DOI: 10.1016/j.endeavour.2007.01.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2006] [Revised: 01/25/2007] [Accepted: 01/31/2007] [Indexed: 05/14/2023]
Abstract
In 1983, after devoting some eight years of his life to the description of how a nematode worm develops from an embryo into an adult, molecular biologist John Sulston embarked on a remarkably different project: he decided to map the worm's genome. Sulston's impulsive desire to characterise this creature's DNA from start to finish offers only a partial explanation for this transition. Instead, a close examination of the wider social context for this 'moment' in molecular biology gives a more rewarding explanation of Sulston's intellectual leap. This reveals a world in which biotechnology gradually adapted to and integrated into an 'information society' increasingly dependent on the creation, distribution and manipulation of information. The application of computing to DNA during the first half of the 1980s was crucial for this integration, fostering the emergence of genomics and ultimately the Human Genome Project.
Collapse
|
18
|
Kim S, Wolyniak MJ, Staab JF, Sundstrom P. A 368-base-pair cis-acting HWP1 promoter region, HCR, of Candida albicans confers hypha-specific gene regulation and binds architectural transcription factors Nhp6 and Gcf1p. EUKARYOTIC CELL 2007; 6:693-709. [PMID: 17220463 PMCID: PMC1865660 DOI: 10.1128/ec.00341-06] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
To elucidate the molecular mechanisms controlling the expression of the hypha-specific adhesin gene HWP1 of Candida albicans, its promoter was dissected and analyzed using a green fluorescent protein reporter gene. A 368-bp region, the HWP1 control region (HCR), was critical for activation under hypha-inducing conditions and conferred developmental regulation to a heterologous ENO1 promoter. A more distal region of the promoter served to amplify the level of promoter activation. Using gel mobility shift assays, a 249-bp subregion of HCR, HCRa, was found to bind at least four proteins from crude extracts of yeasts and hyphae with differing binding patterns dependent on cell morphology. Four proteins with DNA binding activities were identified by using sodium dodecyl sulfate-polyacrylamide gel electrophoresis after separation by anion-exchange and heparin-Sepharose chromatography. One protein with high similarity to Nhp6, an HMG1 family member in Saccharomyces cerevisiae, and another with weak similarity to an HMG-like condensation factor from Physarum polycephalum implicated changes in chromatin structure as a critical process in hypha-specific gene regulation. Proteins with strong homology to histones were also found. These studies are the first to identify proteins that bind to a DNA segment that confers developmental gene regulation in C. albicans and suggest a new model for hypha-specific gene regulation.
Collapse
Affiliation(s)
- Samin Kim
- Microbiology and Molecular Pathogenesis Program, Dartmouth Medical School, Hanover, New Hampshire, USA
| | | | | | | |
Collapse
|
19
|
|
20
|
Jann OC, Aerts J, Jones M, Hastings N, Law A, McKay S, Marques E, Prasad A, Yu J, Moore SS, Floriot S, Mahé MF, Eggen A, Silveri L, Negrini R, Milanesi E, Ajmone-Marsan P, Valentini A, Marchitelli C, Savarese MC, Janitz M, Herwig R, Hennig S, Gorni C, Connor EE, Sonstegard TS, Smith T, Drögemüller C, Williams JL. A second generation radiation hybrid map to aid the assembly of the bovine genome sequence. BMC Genomics 2006; 7:283. [PMID: 17087818 PMCID: PMC1636650 DOI: 10.1186/1471-2164-7-283] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2006] [Accepted: 11/06/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Several approaches can be used to determine the order of loci on chromosomes and hence develop maps of the genome. However, all mapping approaches are prone to errors either arising from technical deficiencies or lack of statistical support to distinguish between alternative orders of loci. The accuracy of the genome maps could be improved, in principle, if information from different sources was combined to produce integrated maps. The publicly available bovine genomic sequence assembly with 6x coverage (Btau_2.0) is based on whole genome shotgun sequence data and limited mapping data however, it is recognised that this assembly is a draft that contains errors. Correcting the sequence assembly requires extensive additional mapping information to improve the reliability of the ordering of sequence scaffolds on chromosomes. The radiation hybrid (RH) map described here has been contributed to the international sequencing project to aid this process. RESULTS An RH map for the 30 bovine chromosomes is presented. The map was built using the Roslin 3000-rad RH panel (BovGen RH map) and contains 3966 markers including 2473 new loci in addition to 262 amplified fragment-length polymorphisms (AFLP) and 1231 markers previously published with the first generation RH map. Sequences of the mapped loci were aligned with published bovine genome maps to identify inconsistencies. In addition to differences in the order of loci, several cases were observed where the chromosomal assignment of loci differed between maps. All the chromosome maps were aligned with the current 6x bovine assembly (Btau_2.0) and 2898 loci were unambiguously located in the bovine sequence. The order of loci on the RH map for BTA 5, 7, 16, 22, 25 and 29 differed substantially from the assembled bovine sequence. From the 2898 loci unambiguously identified in the bovine sequence assembly, 131 mapped to different chromosomes in the BovGen RH map. CONCLUSION Alignment of the BovGen RH map with other published RH and genetic maps showed higher consistency in marker order and chromosome assignment than with the current 6x sequence assembly. This suggests that the bovine sequence assembly could be significantly improved by incorporating additional independent mapping information.
Collapse
Affiliation(s)
- Oliver C Jann
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
| | - Jan Aerts
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
| | - Michelle Jones
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
| | - Nicola Hastings
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
| | - Andy Law
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
| | | | - Elisa Marques
- University of Alberta, Edmonton, AB, T6G 2P5, Canada
| | - Aparna Prasad
- University of Alberta, Edmonton, AB, T6G 2P5, Canada
| | - Jody Yu
- University of Alberta, Edmonton, AB, T6G 2P5, Canada
| | | | - Sandrine Floriot
- Laboratoire de Génétique Biochimique et Cytogénétique, INRA-CRJ, 78350 Jouy-en-Josas, France
| | - Marie-Françoise Mahé
- Laboratoire de Génétique Biochimique et Cytogénétique, INRA-CRJ, 78350 Jouy-en-Josas, France
| | - André Eggen
- Laboratoire de Génétique Biochimique et Cytogénétique, INRA-CRJ, 78350 Jouy-en-Josas, France
| | - Licia Silveri
- Laboratoire de Génétique Biochimique et Cytogénétique, INRA-CRJ, 78350 Jouy-en-Josas, France
- Istituto di Zootecnica, Università Cattolica del S. Cuore via E. Parmense 84, 29100 Piacenza, Italy
| | - Riccardo Negrini
- Istituto di Zootecnica, Università Cattolica del S. Cuore via E. Parmense 84, 29100 Piacenza, Italy
| | - Elisabetta Milanesi
- Istituto di Zootecnica, Università Cattolica del S. Cuore via E. Parmense 84, 29100 Piacenza, Italy
| | - Paolo Ajmone-Marsan
- Istituto di Zootecnica, Università Cattolica del S. Cuore via E. Parmense 84, 29100 Piacenza, Italy
| | - Alessio Valentini
- Department of Animal Productions, University of Tuscia, Viterbo, Italy
| | | | - Maria C Savarese
- Department of Animal Productions, University of Tuscia, Viterbo, Italy
| | - Michal Janitz
- Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Ralf Herwig
- Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Steffen Hennig
- RZPD German Resource Center for Genome Research, 14059 Berlin, Germany
| | - Chiara Gorni
- Istituto di Zootecnica, Università Cattolica del S. Cuore via E. Parmense 84, 29100 Piacenza, Italy
- Parco Tecnologico Padano, via Einstein, Polo Universitario, Lodi 26900, Italy
| | - Erin E Connor
- USDA-ARS, Beltsville Agricultural Research Center, 10300 Baltimore Avenue, Beltsville, MD 20705, USA
| | - Tad S Sonstegard
- USDA-ARS, Beltsville Agricultural Research Center, 10300 Baltimore Avenue, Beltsville, MD 20705, USA
| | - Timothy Smith
- USDA-ARS U.S. Meat Animal Research Center P.O. Box 166 Clay Center, NE 68933-0166, USA
| | - Cord Drögemüller
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Bünteweg 17p, 30559 Hannover, Germany
| | - John L Williams
- Division of Genetics & Genomics, Roslin Institute, Roslin, Midlothian, Edinburgh, EH25 9PS, UK
- Parco Tecnologico Padano, via Einstein, Polo Universitario, Lodi 26900, Italy
| |
Collapse
|
21
|
Nelson WM, Dvorak J, Luo MC, Messing J, Wing RA, Soderlund C. Efficacy of clone fingerprinting methodologies. Genomics 2006; 89:160-5. [PMID: 17011744 DOI: 10.1016/j.ygeno.2006.08.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2006] [Revised: 08/15/2006] [Accepted: 08/18/2006] [Indexed: 10/24/2022]
Abstract
With the development of new high-information content fingerprinting techniques for constructing BAC-based physical maps, physical map construction is accelerating and it is important to determine which methodologies work best. In a recent publication (Z. Xu et al., 2004, Genomics 84:941-951), Xu et al. evaluated five different techniques (one agarose-based and four using multiple enzymes) and concluded that a two-enzyme technique was superior. In addition, they found that no benefit was gained from fingerprinting more than 10x coverage. In this paper we report our own extensive simulation results, which lead to contrasting conclusions. Our data indicate that the five-enzyme method known as SNaPshot is the most effective and that the assembly can in fact be significantly improved with greater than 10x coverage.
Collapse
Affiliation(s)
- William M Nelson
- Arizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, AZ 85721, USA
| | | | | | | | | | | |
Collapse
|
22
|
Nelson WM, Bharti AK, Butler E, Wei F, Fuks G, Kim H, Wing RA, Messing J, Soderlund C. Whole-genome validation of high-information-content fingerprinting. PLANT PHYSIOLOGY 2005; 139:27-38. [PMID: 16166258 PMCID: PMC1203355 DOI: 10.1104/pp.105.061978] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Fluorescent-based high-information-content fingerprinting (HICF) techniques have recently been developed for physical mapping. These techniques make use of automated capillary DNA sequencing instruments to enable both high-resolution and high-throughput fingerprinting. In this article, we report the construction of a whole-genome HICF FPC map for maize (Zea mays subsp. mays cv B73), using a variant of HICF in which a type IIS restriction enzyme is used to generate the fluorescently labeled fragments. The HICF maize map was constructed from the same three maize bacterial artificial chromosome libraries as previously used for the whole-genome agarose FPC map, providing a unique opportunity for direct comparison of the agarose and HICF methods; as a result, it was found that HICF has substantially greater sensitivity in forming contigs. An improved assembly procedure is also described that uses automatic end-merging of contigs to reduce the effects of contamination and repetitive bands. Several new features in FPC v7.2 are presented, including shared-memory multiprocessing, which allows dramatically faster assemblies, and automatic end-merging, which permits more accurate assemblies. It is further shown that sequenced clones may be digested in silico and located accurately on the HICF assembly, despite size deviations that prevent the precise prediction of experimental fingerprints. Finally, repetitive bands are isolated, and their effect on the assembly is studied.
Collapse
Affiliation(s)
- William M Nelson
- Arizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, 85721, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Milosavljevic A, Harris RA, Sodergren EJ, Jackson AR, Kalafus KJ, Hodgson A, Cree A, Dai W, Csuros M, Zhu B, de Jong PJ, Weinstock GM, Gibbs RA. Pooled genomic indexing of rhesus macaque. Genome Res 2005; 15:292-301. [PMID: 15687293 PMCID: PMC546531 DOI: 10.1101/gr.3162505] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Pooled genomic indexing (PGI) is a method for mapping collections of bacterial artificial chromosome (BAC) clones between species by using a combination of clone pooling and DNA sequencing. PGI has been used to map a total of 3858 BAC clones covering approximately 24% of the rhesus macaque (Macaca mulatta) genome onto 4178 homologous loci in the human genome. A number of intrachromosomal rearrangements were detected by mapping multiple segments within the individual rhesus BACs onto multiple disjoined loci in the human genome. Transversal pooling designs involving shuffled BAC arrays were employed for robust mapping even with modest DNA sequence read coverage. A further innovation, short-tag pooled genomic indexing (ST-PGI), was also introduced to further improve the economy of mapping by sequencing multiple, short, mapable tags within a single sequencing reaction.
Collapse
Affiliation(s)
- Aleksandar Milosavljevic
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Wendl MC. Probabilistic Assessment of Clone Overlaps in DNA Fingerprint Mapping via a Priori Models. J Comput Biol 2005; 12:283-97. [PMID: 15857243 DOI: 10.1089/cmb.2005.12.283] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We construct a combinatorially exact probability distribution for the problem of a priori clone overlap assessment in DNA fingerprint mapping. It is implemented as a general enumeration methodology using Bell's exponential polynomials. Established computational metrics do not consider the conditional nature of the problem. They dramatically overpredict actual match probabilities as a consequence. This elevates the rate of false-negative overlap declarations, which is consistent with previous validation studies. Increased measurement resolution does not significantly improve accuracy. We describe general trends in error behavior for intermediate-size DNA clones and discuss corresponding limits on the ability to assess certain overlaps.
Collapse
Affiliation(s)
- Michael C Wendl
- Genome Sequencing Center, Washington University, St. Louis, MO 63108, USA.
| |
Collapse
|
25
|
Xu Z, Sun S, Covaleda L, Ding K, Zhang A, Wu C, Scheuring C, Zhang HB. Genome physical mapping with large-insert bacterial clones by fingerprint analysis: methodologies, source clone genome coverage, and contig map quality. Genomics 2005; 84:941-51. [PMID: 15533711 DOI: 10.1016/j.ygeno.2004.08.014] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2004] [Accepted: 08/18/2004] [Indexed: 11/19/2022]
Abstract
Genome physical mapping with large-insert clones by fingerprint analysis is becoming an active area of genomics research. Here, we report two new capillary electrophoresis-based fingerprinting methods for genome physical mapping and the effects of different fingerprinting methods and source clone genome coverage on quality physical map construction revealed by computer simulations and laboratory experiments. It was shown that the manual sequencing gel-based two-enzyme fingerprinting method consistently generated larger and more accurate contigs, followed by the new capillary electrophoresis-based three-enzyme method, the new capillary electrophoresis-based five-enzyme (SNaPshot) method, the agarose gel-based one-enzyme method, and the automatic sequencing gel-based four-enzyme method, in descending order, when 1% or fewer questionable clones were allowed. Analysis of clones equivalent to 5x, 8x, 10x, and 15x genomes using the fingerprinting methods revealed that as the number of clones increased from 5x to 10x, the contig length rapidly increased for all methods. However, when the number of clones was increased from 10x to 15x coverage, the contig length at best increased at a lower rate or even decreased. The results will provide useful knowledge and strategies for effective construction of quality genome physical maps for advanced genomics research.
Collapse
Affiliation(s)
- Zhanyou Xu
- Department of Soil and Crop Sciences and Institute for Plant Genomics and Biotechnology, 2123 TAMU, Texas A&M University, College Station, TX 77843-2123, USA
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Takeda M, Katayama H, Satoh T, Mabuchi T. Three copies of the ATP2 gene are arranged in tandem on chromosome X in the yeast Saccharomyces cerevisiae. Curr Genet 2005; 47:265-72. [PMID: 15776236 DOI: 10.1007/s00294-005-0565-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2004] [Revised: 12/27/2004] [Accepted: 12/31/2004] [Indexed: 10/25/2022]
Abstract
We previously reported that there were three copies of ATP1 coding for F1-alpha and two copies of ATP3 coding for F1-gamma on the left and right arm of chromosome II, respectively. In this study, we present evidence that there are three closely linked copies of ATP2 encoding the beta subunit of the F1F0-ATPase complex on the right arm of chromosome X in several laboratory strains, including Saccharomyces cerevisiae strain S288C, although it was reported by the yeast genome project that ATP2 is a single-copy gene. Chromosome X fragmentation, long-PCR, chromosome-walking and ATP2-disruption analysis using haploid wild-type strains and prime clone 70645 showed that the three copies of ATP2 are present on the right arm of chromosome X, like those of ATP1 on chromosome II. Each was estimated to be approximately 4 kb apart. We designated the ATP2 proximal to the centromere as ATP2a, the middle one as ATP2b and the distal one as ATP2c. The region containing the three ATP2s is composed of two repeated units of approximately 7 kb; that is, both ends (ATP2a, ATP2c) accompanying the ATP2-neighboring ORFs are the same. A part of YJR119c, YJR120w, YJR122w (CAF17) and YJR123w (RP55), which were reported by the yeast genome project, are contained in the ATP2 repeated units; and the middle ATP2 of the three ATP2s, ATP2b, is located between the two repeated units. Expression of all three copies of ATP2 (ATP2a, ATP2b, ATP2c) was confirmed because a single or double ATP2-disruptant could grow on glycerol, but a triple ATP2-disruptant could not. In addition, of the three copies of ATP1 and ATP2, even if only one copy of the ATP1 and ATP2 genes remained, the cells grew on glycerol.
Collapse
Affiliation(s)
- Masaharu Takeda
- Department of Applied Life Science, Sojo University, 4-22-1 Ikeda, Kumamoto, 860-0082, Japan.
| | | | | | | |
Collapse
|
27
|
Xu Z, van den Berg MA, Scheuring C, Covaleda L, Lu H, Santos FA, Uhm T, Lee MK, Wu C, Liu S, Zhang HB. Genome physical mapping from large-insert clones by fingerprint analysis with capillary electrophoresis: a robust physical map of Penicillium chrysogenum. Nucleic Acids Res 2005; 33:e50. [PMID: 15767275 PMCID: PMC1065262 DOI: 10.1093/nar/gni037] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Physical mapping with large-insert clones is becoming an active area of genomics research, and capillary electrophoresis (CE) promises to revolutionize the physical mapping technology. Here, we demonstrate the utility of the CE technology for genome physical mapping with large-insert clones by constructing a robust, binary bacterial artificial chromosome (BIBAC)-based physical map of Penicillium chrysogenum. We fingerprinted 23.1x coverage BIBAC clones with five restriction enzymes and the SNaPshot kit containing four fluorescent-ddNTPs using the CE technology, and explored various strategies to construct quality physical maps. It was shown that the fingerprints labeled with one or two colors, resulting in 40-70 bands per clone, were assembled into much better quality maps than those labeled with three or four colors. The selection of fingerprinting enzymes was crucial to quality map construction. From the dataset labeled with ddTTP-dROX, we assembled a physical map for P.chrysogenum, with 2-3 contigs per chromosome and anchored the map to its chromosomes. This map represents the first physical map constructed using the CE technology, thus providing not only a platform for genomic studies of the penicillin-producing species, but also strategies for efficient use of the CE technology for genome physical mapping of plants, animals and microbes.
Collapse
Affiliation(s)
| | | | | | | | - Hong Lu
- Department of Computer Science, Texas A&M UniversityCollege Station, TX 77843, USA
| | | | | | | | | | - Steve Liu
- Department of Computer Science, Texas A&M UniversityCollege Station, TX 77843, USA
| | - Hong-Bin Zhang
- To whom correspondence should be addressed. Tel: +1 979 862 2244; Fax: +1 979 862 4790;
| |
Collapse
|
28
|
Gerbasi VR, Weaver CM, Hill S, Friedman DB, Link AJ. Yeast Asc1p and mammalian RACK1 are functionally orthologous core 40S ribosomal proteins that repress gene expression. Mol Cell Biol 2004; 24:8276-87. [PMID: 15340087 PMCID: PMC515043 DOI: 10.1128/mcb.24.18.8276-8287.2004] [Citation(s) in RCA: 134] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Translation of mRNA into protein is a fundamental step in eukaryotic gene expression requiring the large (60S) and small (40S) ribosome subunits and associated proteins. By modern proteomic approaches, we previously identified a novel 40S-associated protein named Asc1p in budding yeast and RACK1 in mammals. The goals of this study were to establish Asc1p or RACK1 as a core conserved eukaryotic ribosomal protein and to determine the role of Asc1p or RACK1 in translational control. We provide biochemical, evolutionary, genetic, and functional evidence showing that Asc1p or RACK1 is indeed a conserved core component of the eukaryotic ribosome. We also show that purified Asc1p-deficient ribosomes have increased translational activity compared to that of wild-type yeast ribosomes. Further, we demonstrate that asc1Delta null strains have increased levels of specific proteins in vivo and that this molecular phenotype is complemented by either Asc1p or RACK1. Our data suggest that one of Asc1p's or RACK1's functions is to repress gene expression.
Collapse
Affiliation(s)
- Vincent R Gerbasi
- Department of Microbiology and Immunology, Vanderbilt University School of Medicine, 1161 21st Ave. South, Nashville, TN 37232-2363, USA
| | | | | | | | | |
Collapse
|
29
|
Magrini V, Warren WC, Wallis J, Goldman WE, Xu J, Mardis ER, McPherson JD. Fosmid-based physical mapping of the Histoplasma capsulatum genome. Genome Res 2004; 14:1603-9. [PMID: 15289478 PMCID: PMC509269 DOI: 10.1101/gr.2361404] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A fosmid library representing 10-fold coverage of the Histoplasma capsulatum G217B genome was used to construct a restriction-based physical map. The data obtained from three restriction endonuclease fingerprints, generated from each clone using BamHI, HindIII, and PstI endonucleases, were combined and used in FPC for automatic and manual contig assembly builds. Concomitantly, a whole-genome shotgun (WGS) sequencing of paired-end reads from plasmids and fosmids were assembled with PCAP, providing a predicted genome size of up to 43.5 Mbp and 17% repetitive DNA. Fosmid paired-end sequences in the WGS assembly provide anchoring information to the physical map and result in joining of existing physical map contigs into 84 clusters containing 9551 fosmid clones. Here, we detail mapping the Histoplasma capsulatum genome comprehensively in fosmids, resulting in an efficient paradigm for de novo sequencing that uses a map-assisted whole genome shotgun approach.
Collapse
Affiliation(s)
- Vincent Magrini
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | | | | | | | | | | | | |
Collapse
|
30
|
Meyers BC, Scalabrin S, Morgante M. Mapping and sequencing complex genomes: let's get physical! Nat Rev Genet 2004; 5:578-88. [PMID: 15266340 DOI: 10.1038/nrg1404] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Affiliation(s)
- Blake C Meyers
- Department of Plant and Soil Sciences and Delaware Biotechnology Institute, University of Delaware, Newark, Delaware 19711, USA
| | | | | |
Collapse
|
31
|
Luo MC, Thomas C, You FM, Hsiao J, Ouyang S, Buell CR, Malandro M, McGuire PE, Anderson OD, Dvorak J. High-throughput fingerprinting of bacterial artificial chromosomes using the snapshot labeling kit and sizing of restriction fragments by capillary electrophoresis. Genomics 2003; 82:378-89. [PMID: 12906862 DOI: 10.1016/s0888-7543(03)00128-9] [Citation(s) in RCA: 213] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have developed an automated, high-throughput fingerprinting technique for large genomic DNA fragments suitable for the construction of physical maps of large genomes. In the technique described here, BAC DNA is isolated in a 96-well plate format and simultaneously digested with four 6-bp-recognizing restriction endonucleases that generate 3' recessed ends and one 4-bp-recognizing restriction endonuclease that generates a blunt end. Each of the four recessed 3' ends is labeled with a different fluorescent dye, and restriction fragments are sized on a capillary DNA analyzer. The resulting fingerprints are edited with a fingerprint-editing computer program and contigs are assembled with the FPC computer program. The technique was evaluated by repeated fingerprinting of several BACs included as controls in plates during routine fingerprinting of a BAC library and by reconstruction of contigs of rice BAC clones with known positions on rice chromosome 10.
Collapse
Affiliation(s)
- Ming-Cheng Luo
- Department of Agronomy and Range Science, University of California at Davis, Davis, CA, 95616, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Fjell CD, Bosdet I, Schein JE, Jones SJM, Marra MA. Internet Contig Explorer (iCE)--a tool for visualizing clone fingerprint maps. Genome Res 2003; 13:1244-9. [PMID: 12799356 PMCID: PMC403654 DOI: 10.1101/gr.819303] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Fingerprinted clone physical maps have proven useful in various applications, supporting both whole-genome and region-specific DNA sequencing as well as gene cloning studies. Fingerprint maps have been generated for several genomes, including those of human, mouse, rat, the nematodes Caenorhabditis elegans and Caenorhabditis briggsae, Arabidopsis thaliana and rice. Fingerprint maps of other genomes, including those of fungi, bacteria, poplar, and the cow, are being generated. The increasing use of fingerprint maps in genomic research has spawned a need in the research community for intuitive computer tools that facilitate viewing of the maps and the underlying fingerprint data. In this report we describe a new Java-based application called iCE (Internet Contig Explorer) that has been designed to provide views of fingerprint maps and associated data. Users can search for and display individual clones, contigs, clone fingerprints, clone insert sizes and markers. Users can also load into the software lists of particular clones of interest and view their fingerprints. iCE is being used at our Genome Centre to offer up to the research community views of the mouse, rat, bovine, C. briggsae, and several fungal genome bacterial artificial chromosome (BAC) fingerprint maps we have either completed or are currently constructing. We are also using iCE as part of the Rat Genome Sequencing Project to manage our provision of rat BAC clones for sequencing at the Human Genome Sequencing Center at the Baylor College of Medicine.
Collapse
Affiliation(s)
- Christopher D Fjell
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4E6, Canada
| | | | | | | | | |
Collapse
|
33
|
Fuhrmann DR, Krzywinski MI, Chiu R, Saeedi P, Schein JE, Bosdet IE, Chinwalla A, Hillier LW, Waterston RH, McPherson JD, Jones SJM, Marra MA. Software for automated analysis of DNA fingerprinting gels. Genome Res 2003; 13:940-53. [PMID: 12727910 PMCID: PMC430903 DOI: 10.1101/gr.904303] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2002] [Accepted: 02/26/2003] [Indexed: 11/24/2022]
Abstract
Here we describe software tools for the automated detection of DNA restriction fragments resolved on agarose fingerprinting gels. We present a mathematical model for the location and shape of the restriction fragments as a function of fragment size, with model parameters determined empirically from "marker" lanes containing molecular size standards. Automated identification of restriction fragments involves several steps, including: image preprocessing, to put the data in a form consistent with a linear model; marker lane analysis, for determination of the model parameters; and data lane analysis, a procedure for detecting restriction fragment multiplets while simultaneously determining the amplitude curve that describes restriction fragment amplitude as a function of mobility. In validation experiments conducted on fingerprinted and sequenced Bacterial Artificial Chromosome (BAC) clones, sensitivity and specificity of restriction fragment identification exceeded 96% on restriction fragments ranging in size from 600 base pairs (bp) to 30,000 bp. The integrated suite of software tools, written in MATLAB and collectively called BandLeader, is in use at the BC Cancer Agency Genome Sciences Centre (GSC) and the Washington University Genome Sequencing Center, and has been provided to the Wellcome Trust Sanger Institute and the Whitehead Institute. Employed in a production mode at the GSC, BandLeader has been used to perform automated restriction fragment identification for more than 850,000 BAC clones for mouse, rat, bovine, and poplar fingerprint mapping projects.
Collapse
Affiliation(s)
- Daniel R Fuhrmann
- Department of Electrical Engineering, Washington University, St. Louis, Missouri 63130, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Xu Z, Lance B, Vargas C, Arpinar B, Bhandarkar S, Kraemer E, Kochut KJ, Miller JA, Wagner JR, Weise MJ, Wunderlich JK, Stringer J, Smulian G, Cushion MT, Arnold J. Mapping by sequencing the Pneumocystis genome using the ordering DNA sequences V3 tool. Genetics 2003; 163:1299-313. [PMID: 12702676 PMCID: PMC1462508 DOI: 10.1093/genetics/163.4.1299] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A bioinformatics tool called ODS3 has been created for mapping by sequencing. The tool allows the creation of integrated genomic maps from genetic, physical mapping, and sequencing data and permits an integrated genome map to be stored, retrieved, viewed, and queried in a stand-alone capacity, in a client/server relationship with the Fungal Genome Database (FGDB), and as a web-browsing tool for the FGDB. In that ODS3 is programmed in Java, the tool promotes platform independence and supports export of integrated genome-mapping data in the extensible markup language (XML) for data interchange with other genome information systems. The tool ODS3 is used to create an initial integrated genome map of the AIDS-related fungal pathogen, Pneumocystis carinii. Contig dynamics would indicate that this physical map is approximately 50% complete with approximately 200 contigs. A total of 10 putative multigene families were found. Two of these putative families were previously characterized in P. carinii, namely the major surface glycoproteins (MSGs) and HSP70 proteins; three of these putative families (not previously characterized in P. carinii) were found to be similar to families encoding the HSP60 in Schizosaccharomyces pombe, the heat-shock psi protein in S. pombe, and the RNA synthetase family (i.e., MES1) in Saccharomyces cerevisiae. Physical mapping data are consistent with the 16S, 5.8S, and 26S rDNA genes being single copy in P. carinii. No other fungus outside this genus is known to have the rDNA genes in single copy.
Collapse
Affiliation(s)
- Zheng Xu
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Gregory SG, Sekhon M, Schein J, Zhao S, Osoegawa K, Scott CE, Evans RS, Burridge PW, Cox TV, Fox CA, Hutton RD, Mullenger IR, Phillips KJ, Smith J, Stalker J, Threadgold GJ, Birney E, Wylie K, Chinwalla A, Wallis J, Hillier L, Carter J, Gaige T, Jaeger S, Kremitzki C, Layman D, Maas J, McGrane R, Mead K, Walker R, Jones S, Smith M, Asano J, Bosdet I, Chan S, Chittaranjan S, Chiu R, Fjell C, Fuhrmann D, Girn N, Gray C, Guin R, Hsiao L, Krzywinski M, Kutsche R, Lee SS, Mathewson C, McLeavy C, Messervier S, Ness S, Pandoh P, Prabhu AL, Saeedi P, Smailus D, Spence L, Stott J, Taylor S, Terpstra W, Tsai M, Vardy J, Wye N, Yang G, Shatsman S, Ayodeji B, Geer K, Tsegaye G, Shvartsbeyn A, Gebregeorgis E, Krol M, Russell D, Overton L, Malek JA, Holmes M, Heaney M, Shetty J, Feldblyum T, Nierman WC, Catanese JJ, Hubbard T, Waterston RH, Rogers J, de Jong PJ, Fraser CM, Marra M, McPherson JD, Bentley DR. A physical map of the mouse genome. Nature 2002; 418:743-50. [PMID: 12181558 DOI: 10.1038/nature00957] [Citation(s) in RCA: 251] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy.
Collapse
Affiliation(s)
- Simon G Gregory
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Foster-Hartnett D, Mudge J, Larsen D, Danesh D, Yan H, Denny R, Peñuela S, Young ND. Comparative genomic analysis of sequences sampled from a small region on soybean (Glycine max) molecular linkage group G. Genome 2002; 45:634-45. [PMID: 12175066 DOI: 10.1139/g02-027] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Eight DNA markers spanning an interval of approximately 10 centimorgans (cM) on soybean (Glycine max) molecular linkage group G (MLG-G) were used to identify bacterial artificial chromosome (BAC) clones. Twenty-eight BAC clones in eight distinct contiguous groups (contigs) were isolated from this genome region, along with 59 BAC clones on 17 contigs homoeologous to those on MLG-G. BAC clones in four of the MLG-G contigs were also digested to produce subclones and detailed physical maps. All of the BAC-ends were sequenced, as were the subclones, to estimate proportions in different sequence categories, compare similarities among homoeologs, and explore microsynteny with Arabidopsis. Homoeologous BAC contigs were enriched in repetitive sequences compared with those on MLG-G or the soybean genome as a whole. Fingerprint and cross-hybridization comparisons between MLG-G and homoeologous contigs revealed cases of highly similar physical organization between soybean duplicates, as did DNA sequence comparisons. Twenty-seven out of 78 total sequences on soybean MLG-G showed significant similarity to Arabidopsis. The homologs mapped to six compact genome segments in Arabidopsis, with the longest containing seven homologs spanning two million base pairs. These results extend previous observations of large-scale duplication and selective gene loss in Arabidopsis, suggesting that networks of conserved synteny between Arabidopsis and other angiosperm families can stretch over long physical distances.
Collapse
|
37
|
Luyten K, Riou C, Blondin B. The hexose transporters of Saccharomyces cerevisiae play different roles during enological fermentation. Yeast 2002; 19:713-26. [PMID: 12185841 DOI: 10.1002/yea.869] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
We investigated the role of hexose transporters in a Saccharomyces cerevisiae strain derived from an industrial wine strain by carrying out a functional analysis of HXT genes 1-7 under enological conditions. A strain in which the sugar carrier genes HXT1-HXT7 were deleted was constructed and the HXT genes were expressed individually or in combination to evaluate their role under wine alcoholic fermentation conditions. No growth or fermentation was observed in winemaking conditions for the hxt1-7 delta strain. The low-affinity carriers Hxt1 and Hxt3 were the only carriers giving complete fermentation of sugars when expressed alone, indicating that these carriers play a predominant role in wine fermentation. However, these two carriers have different functions. The Hxt3 transporter is thought to play a major role, as it was the only carrier that gave an almost normal fermentation profile when produced alone. The hxt1 carrier was much less effective during the stationary phase and its role is thought to be restricted to the beginning of fermentation. The high-affinity carriers Hxt2, Hxt6 and/or Hxt7 were also required for normal fermentation. These high-affinity transporters have different functions: hxt2 is involved in growth initiation, whereas Hxt6 and/or Hxt7 are required at the end of alcoholic fermentation. This work shows that the successful alcoholic fermentation of wine involves at least four or five hexose carriers, playing different roles at various stages in the fermentation cycle.
Collapse
Affiliation(s)
- Kattie Luyten
- UMR Sciences Pour 1' CEnologie, INRA-ENSAM-UMI, 2 place Viala, F-34060 Montpellier, France
| | | | | |
Collapse
|
38
|
Tao Q, Chang YL, Wang J, Chen H, Islam-Faridi MN, Scheuring C, Wang B, Stelly DM, Zhang HB. Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis. Genetics 2001; 158:1711-24. [PMID: 11514457 PMCID: PMC1461754 DOI: 10.1093/genetics/158.4.1711] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genome-wide physical mapping with bacteria-based large-insert clones (e.g., BACs, PACs, and PBCs) promises to revolutionize genomics of large, complex genomes. To accelerate rice and other grass species genome research, we developed a genome-wide BAC-based map of the rice genome. The map consists of 298 BAC contigs and covers 419 Mb of the 430-Mb rice genome. Subsequent analysis indicated that the contigs constituting the map are accurate and reliable. Particularly important to proficiency were (1) a high-resolution, high-throughput DNA sequencing gel-based electrophoretic method for BAC fingerprinting, (2) the use of several complementary large-insert BAC libraries, and (3) computer-aided contig assembly. It has been demonstrated that the fingerprinting method is not significantly influenced by repeated sequences, genome size, and genome complexity. Use of several complementary libraries developed with different restriction enzymes minimized the "gaps" in the physical map. In contrast to previous estimates, a clonal coverage of 6.0-8.0 genome equivalents seems to be sufficient for development of a genome-wide physical map of approximately 95% genome coverage. This study indicates that genome-wide BAC-based physical maps can be developed quickly and economically for a variety of plant and animal species by restriction fingerprint analysis via DNA sequencing gel-based electrophoresis.
Collapse
Affiliation(s)
- Q Tao
- Department of Soil and Crop Sciences and Crop Biotechnology Center, Texas A&M University, College Station, TX 77843-2123, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Abstract
Recent spectacular advances in the technologies and strategies for DNA sequencing have profoundly accelerated the detailed analysis of genomes from myriad organisms. The past few years alone have seen the publication of near-complete or draft versions of the genome sequence of several well-studied, multicellular organisms - most notably, the human. As well as providing data of fundamental biological significance, these landmark accomplishments have yielded important strategic insights that are guiding current and future genome-sequencing projects.
Collapse
Affiliation(s)
- E D Green
- Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
| |
Collapse
|
40
|
Hall D, Bhandarkar SM, Wang J. ODS2: a multiplatform software application for creating integrated physical and genetic maps. Genetics 2001; 157:1045-56. [PMID: 11238393 PMCID: PMC1461558 DOI: 10.1093/genetics/157.3.1045] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A contig map is a physical map that shows the native order of a library of overlapping genomic clones. One common method for creating such maps involves using hybridization to detect clone overlaps. False- positive and false-negative hybridization errors, the presence of chimeric clones, and gaps in library coverage lead to ambiguity and error in the clone order. Genomes with good genetic maps, such as Neurospora crassa, provide a means for reducing ambiguities and errors when constructing contig maps if clones can be anchored with genetic markers to the genetic map. A software application called ODS2 for creating contig maps based on clone-clone hybridization data is presented. This application is also designed to exploit partial ordering information provided by anchorage of clones to a genetic map. This information, along with clone-clone hybridization data, is used by a clone ordering algorithm and is represented graphically, allowing users to interactively align physical and genetic maps. ODS2 has a graphical user interface and is implemented entirely in Java, so it runs on multiple platforms. Other features include the flexibility of storing data in a local file or relational database and the ability to create full or minimum tiling contig maps.
Collapse
Affiliation(s)
- D Hall
- Department of Computer Science, The University of Georgia, Athens, Georgia 30602-7404, USA
| | | | | |
Collapse
|
41
|
|
42
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14712] [Impact Index Per Article: 639.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
McPherson JD, Marra M, Hillier L, Waterston RH, Chinwalla A, Wallis J, Sekhon M, Wylie K, Mardis ER, Wilson RK, Fulton R, Kucaba TA, Wagner-McPherson C, Barbazuk WB, Gregory SG, Humphray SJ, French L, Evans RS, Bethel G, Whittaker A, Holden JL, McCann OT, Dunham A, Soderlund C, Scott CE, Bentley DR, Schuler G, Chen HC, Jang W, Green ED, Idol JR, Maduro VV, Montgomery KT, Lee E, Miller A, Emerling S, Gibbs R, Scherer S, Gorrell JH, Sodergren E, Clerc-Blankenburg K, Tabor P, Naylor S, Garcia D, de Jong PJ, Catanese JJ, Nowak N, Osoegawa K, Qin S, Rowen L, Madan A, Dors M, Hood L, Trask B, Friedman C, Massa H, Cheung VG, Kirsch IR, Reid T, Yonescu R, Weissenbach J, Bruls T, Heilig R, Branscomb E, Olsen A, Doggett N, Cheng JF, Hawkins T, Myers RM, Shang J, Ramirez L, Schmutz J, Velasquez O, Dixon K, Stone NE, Cox DR, Haussler D, Kent WJ, Furey T, Rogic S, Kennedy S, Jones S, Rosenthal A, Wen G, Schilhabel M, Gloeckner G, Nyakatura G, Siebert R, Schlegelberger B, Korenberg J, Chen XN, Fujiyama A, Hattori M, Toyoda A, Yada T, Park HS, Sakaki Y, Shimizu N, Asakawa S, Kawasaki K, Sasaki T, Shintani A, Shimizu A, Shibuya K, Kudoh J, Minoshima S, Ramser J, Seranski P, Hoff C, Poustka A, Reinhardt R, Lehrach H. A physical map of the human genome. Nature 2001; 409:934-41. [PMID: 11237014 DOI: 10.1038/35057157] [Citation(s) in RCA: 549] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The human genome is by far the largest genome to be sequenced, and its size and complexity present many challenges for sequence assembly. The International Human Genome Sequencing Consortium constructed a map of the whole genome to enable the selection of clones for sequencing and for the accurate assembly of the genome sequence. Here we report the construction of the whole-genome bacterial artificial chromosome (BAC) map and its integration with previous landmark maps and information from mapping efforts focused on specific chromosomal regions. We also describe the integration of sequence data with the map.
Collapse
Affiliation(s)
- J D McPherson
- Washington University School of Medicine, Genome Sequencing Center, Department of Genetics, St. Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Abstract
Having the complete genome sequence of Saccharomyces cerevisiae makes us aware of the ultimate goal of yeast molecular biology: the 'solution' of the cell, that is, an understanding of the function of all approximately 6000 proteins (and a few RNAs) and how they interact with each other and the environment. The recent development of 'genomic' approaches for studying gene function makes this goal seem reachable in the foreseeable future. When this is accomplished, we will have entered a Golden Age, when we will have the information necessary for designing truly incisive experiments to reveal biological function.
Collapse
Affiliation(s)
- M Johnston
- Department of Genetics, Box 8232, Washington University School of Medicine, 660 Euclid Avenue, St Louis, Missouri 63113, USA.
| |
Collapse
|
45
|
Soderlund C, Humphray S, Dunham A, French L. Contigs built with fingerprints, markers, and FPC V4.7. Genome Res 2000. [PMID: 11076862 DOI: 10.1101/gr.gr‐1375r] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Contigs have been assembled, and over 2800 clones selected for sequencing for human chromosomes 9, 10 and 13. Using the FPC (FingerPrinted Contig) software, the contigs are assembled with markers and complete digest fingerprints, and the contigs are ordered and localised by a global framework. Publicly available resources have been used, such as, the 1998 International Gene Map for the framework and the GSC Human BAC fingerprint database for the majority of the fingerprints. Additional markers and fingerprints are generated in-house to supplement this data. To support the scale up of building maps, FPC V4.7 has been extended to use markers with the fingerprints for assembly of contigs, new clones and markers can be automatically added to existing contigs, and poorly assembled contigs are marked accordingly. To test the automatic assembly, a simulated complete digest of 110 Mb of concatenated human sequence was used to create datasets with varying coverage, length of clones, and types of error. When no error was introduced and a tolerance of 7 was used in assembly, the largest contig with no false positive overlaps has 9534 clones with 37 out-of-order clones, that is, the starting coordinates of adjacent clones are in the wrong order. This paper describes the new features in FPC, the scenario for building the maps of chromosomes 9, 10 and 13, and the results from the simulation.
Collapse
Affiliation(s)
- C Soderlund
- Clemson University Genomic Institute, Clemson, South Carolina 29634-5808, USA.
| | | | | | | |
Collapse
|
46
|
Soderlund C, Humphray S, Dunham A, French L. Contigs built with fingerprints, markers, and FPC V4.7. Genome Res 2000; 10:1772-87. [PMID: 11076862 PMCID: PMC310962 DOI: 10.1101/gr.gr-1375r] [Citation(s) in RCA: 267] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Contigs have been assembled, and over 2800 clones selected for sequencing for human chromosomes 9, 10 and 13. Using the FPC (FingerPrinted Contig) software, the contigs are assembled with markers and complete digest fingerprints, and the contigs are ordered and localised by a global framework. Publicly available resources have been used, such as, the 1998 International Gene Map for the framework and the GSC Human BAC fingerprint database for the majority of the fingerprints. Additional markers and fingerprints are generated in-house to supplement this data. To support the scale up of building maps, FPC V4.7 has been extended to use markers with the fingerprints for assembly of contigs, new clones and markers can be automatically added to existing contigs, and poorly assembled contigs are marked accordingly. To test the automatic assembly, a simulated complete digest of 110 Mb of concatenated human sequence was used to create datasets with varying coverage, length of clones, and types of error. When no error was introduced and a tolerance of 7 was used in assembly, the largest contig with no false positive overlaps has 9534 clones with 37 out-of-order clones, that is, the starting coordinates of adjacent clones are in the wrong order. This paper describes the new features in FPC, the scenario for building the maps of chromosomes 9, 10 and 13, and the results from the simulation.
Collapse
Affiliation(s)
- C Soderlund
- Clemson University Genomic Institute, Clemson, South Carolina 29634-5808, USA.
| | | | | | | |
Collapse
|
47
|
Abstract
The end of the beginning of the Human Genome Project was announced on 26 June when the working draft or first assembly was announced. Here, Ian Dunham who led the group at the Sanger Centre that produced the first complete sequence of a human chromosome reflects on how it felt to be with the genome project from the beginning.
Collapse
Affiliation(s)
- I Dunham
- The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, Cambridge, UK
| |
Collapse
|
48
|
Costello CA, Payson RA, Menke MA, Larson JL, Brown KA, Tanner JE, Kaiser RE, Hershberger CL, Zmijewski MJ. Purification, characterization, cDNA cloning and expression of a novel ketoreductase from Zygosaccharomyces rouxii. EUROPEAN JOURNAL OF BIOCHEMISTRY 2000; 267:5493-501. [PMID: 10951208 DOI: 10.1046/j.1432-1327.2000.01608.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
A novel ketoreductase isolated from Zygosaccharomyces rouxii catalyzes the asymmetric reduction of selected ketone substrates of commercial importance. The 37.8-kDa ketoreductase was purified more than 300-fold to > 95% homogeneity from whole cells with a 30% activity yield. The ketoreductase functions as a monomer with an apparent Km for 3,4-methylenedioxyphenyl acetone of 2.9 mM and a Km for NADPH of 23.5 microM. The enzyme is able to effectively reduce alpha-ketolactones, alpha-ketolactams, and diketones. Inhibition is observed in the presence of diethyl pyrocarbonate, suggesting that a histidine is crucial for catalysis. The 1.0-kb ketoreductase gene was cloned and sequenced from a Z. rouxii cDNA library using a degenerate primer to the N-terminal sequence of the purified protein. Furthermore, it was expressed in both Escherichia coli and Pichia pastoris and shown to be active. Substrate specificity, lack of a catalytic metal, and extent of protein sequence identity to known reductases suggests that the enzyme falls into the carbonyl reductase enzyme class.
Collapse
Affiliation(s)
- C A Costello
- Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, IN 46285, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Tran HG, Steger DJ, Iyer VR, Johnson AD. The chromo domain protein chd1p from budding yeast is an ATP-dependent chromatin-modifying factor. EMBO J 2000; 19:2323-31. [PMID: 10811623 PMCID: PMC384354 DOI: 10.1093/emboj/19.10.2323] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
CHD proteins are members of the chromo domain family, a class of proteins involved in transcription, DNA degradation and chromatin structure. In higher eukaryotes, there are two distinct subfamilies of CHD proteins: CHD1 and CHD3/4. Analyses carried out in vitro indicate that the CHD3/4 proteins may regulate transcription via alteration of chromatin structure. However, little is known about the role of CHD proteins in vivo, particularly the CHD1 subfamily. To understand better the cellular function of CHD proteins, we initiated a study on the Chd1p protein from budding yeast. Using genomic DNA arrays, we identified genes whose expression is affected by the absence of Chd1p. A synthetic-lethal screen uncovered genetic interactions between SWI/SNF genes and CHD1. Biochemical experiments using Chd1p purified from yeast showed that it reconfigures the structure of nucleosome core particles in a manner distinct from the SWI-SNF complex. Taken together, these results suggest that Chd1p functions as a nucleosome remodeling factor, and that Chd1p may share overlapping roles with the SWI-SNF complex to regulate transcription.
Collapse
Affiliation(s)
- H G Tran
- Department of Biochemistry and Biophysics and Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, CA 94143, USA
| | | | | | | |
Collapse
|
50
|
Abstract
Here we report an evaluation of the Aspergillus nidulans physical map (a cosmid contig map) emphasizing quantification and description of obvious mapping errors. Classification and appraisal of mapping errors should be helpful to researchers working on particular regions of the map. We estimate between 47 (4.1%) and 63 (5.4%) probe/clone-linking errors. The majority of identified false links (38) permit reciprocal exchanges among linking clones located on disconnected mapping regions. The order of adjacent clones or probes on the affected contigs remains unchanged. In addition we describe an Internet-accessible resource in which genetic and physical maps were integrated through a graphic interface. A simple search engine allows retrieval of cosmids from redundant clone lists and provides links to the minimal clone order. Integration of genetic and physical maps provides an additional level of accountability in which mapping discrepancies are visually located.
Collapse
Affiliation(s)
- R A Prade
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, Oklahoma, 74078, USA
| |
Collapse
|