1
|
Heilig R, Eckenberg R, Petit JL, Fonknechten N, Da Silva C, Cattolico L, Levy M, Barbe V, de Berardinis V, Ureta-Vidal A, Pelletier E, Vico V, Anthouard V, Rowen L, Madan A, Qin S, Sun H, Du H, Pepin K, Artiguenave F, Robert C, Cruaud C, Brüls T, Jaillon O, Friedlander L, Samson G, Brottier P, Cure S, Ségurens B, Anière F, Samain S, Crespeau H, Abbasi N, Aiach N, Boscus D, Dickhoff R, Dors M, Dubois I, Friedman C, Gouyvenoux M, James R, Madan A, Mairey-Estrada B, Mangenot S, Martins N, Ménard M, Oztas S, Ratcliffe A, Shaffer T, Trask B, Vacherie B, Bellemere C, Belser C, Besnard-Gonnet M, Bartol-Mavel D, Boutard M, Briez-Silla S, Combette S, Dufossé-Laurent V, Ferron C, Lechaplais C, Louesse C, Muselet D, Magdelenat G, Pateau E, Petit E, Sirvain-Trukniewicz P, Trybou A, Vega-Czarny N, Bataille E, Bluet E, Bordelais I, Dubois M, Dumont C, Guérin T, Haffray S, Hammadi R, Muanga J, Pellouin V, Robert D, Wunderle E, Gauguet G, Roy A, Sainte-Marthe L, Verdier J, Verdier-Discala C, Hillier L, Fulton L, McPherson J, Matsuda F, Wilson R, Scarpelli C, Gyapay G, Wincker P, Saurin W, Quétier F, Waterston R, Hood L, Weissenbach J. Publisher Correction: The DNA sequence and analysis of human chromosome 14. Nature 2023; 620:E17. [PMID: 37491470 DOI: 10.1038/s41586-023-06403-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Affiliation(s)
- Roland Heilig
- Genoscope-Centre National de Séquençage, 91000, Evry, France.
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France.
| | - Ralph Eckenberg
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Jean-Louis Petit
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Núria Fonknechten
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Corinne Da Silva
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Michaël Levy
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Valérie Barbe
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Eric Pelletier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Virginie Vico
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Lee Rowen
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Anup Madan
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Shizhen Qin
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Hui Sun
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Hui Du
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Kymberlie Pepin
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | | | - Corinne Cruaud
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Thomas Brüls
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Olivier Jaillon
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Gaelle Samson
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Susan Cure
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Franck Anière
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Sylvie Samain
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Hervé Crespeau
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Nissa Abbasi
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Nathalie Aiach
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Didier Boscus
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Rachel Dickhoff
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Monica Dors
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Ivan Dubois
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Rose James
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Anuradha Madan
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | | | - Sophie Mangenot
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Manuela Ménard
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Sophie Oztas
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Amber Ratcliffe
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Tristan Shaffer
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Barbara Trask
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Benoit Vacherie
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Caroline Belser
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Magali Boutard
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | - Carolyne Ferron
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | | | - Emilie Pateau
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Arnaud Trybou
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Elodie Bataille
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Elodie Bluet
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Maria Dubois
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Corinne Dumont
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Thomas Guérin
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Rachid Hammadi
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | - Edith Wunderle
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Gilbert Gauguet
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Alice Roy
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Jean Verdier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - LaDeana Hillier
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Lucinda Fulton
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - John McPherson
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | - Richard Wilson
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | - Gábor Gyapay
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Patrick Wincker
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - William Saurin
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Francis Quétier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Robert Waterston
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Leroy Hood
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Jean Weissenbach
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| |
Collapse
|
2
|
Ghaddar A, Armingol E, Huynh C, Gevirtzman L, Lewis NE, Waterston R, O’Rourke EJ. Whole-body gene expression atlas of an adult metazoan. Sci Adv 2023; 9:eadg0506. [PMID: 37352352 PMCID: PMC10289653 DOI: 10.1126/sciadv.adg0506] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 05/17/2023] [Indexed: 06/25/2023]
Abstract
Gene activity defines cell identity, drives intercellular communication, and underlies the functioning of multicellular organisms. We present the single-cell resolution atlas of gene activity of a fertile adult metazoan: Caenorhabditis elegans. This compendium comprises 180 distinct cell types and 19,657 expressed genes. We predict 7541 transcription factor expression profile associations likely responsible for defining cellular identity. We predict thousands of intercellular interactions across the C. elegans body and the ligand-receptor pairs that mediate them, some of which we experimentally validate. We identify 172 genes that show consistent expression across cell types, are involved in basic and essential functions, and are conserved across phyla; therefore, we present them as experimentally validated housekeeping genes. We developed the WormSeq application to explore these data. In addition to the integrated gene-to-systems biology, we present genome-scale single-cell resolution testable hypotheses that we anticipate will advance our understanding of the molecular mechanisms, underlying the functioning of a multicellular organism and the perturbations that lead to its malfunction.
Collapse
Affiliation(s)
- Abbas Ghaddar
- Department of Biology, College of Arts and Sciences, University of Virginia, Charlottesville, VA 22903, USA
| | - Erick Armingol
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Chau Huynh
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Louis Gevirtzman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Nathan E. Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Robert Waterston
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Eyleen J. O’Rourke
- Department of Biology, College of Arts and Sciences, University of Virginia, Charlottesville, VA 22903, USA
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA 22903, USA
- Robert M. Berne Cardiovascular Research Center, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|
3
|
Gerstein MB, Rozowsky J, Yan KK, Wang D, Cheng C, Brown JB, Davis CA, Hillier L, Sisu C, Li JJ, Pei B, Harmanci AO, Duff MO, Djebali S, Alexander RP, Alver BH, Auerbach R, Bell K, Bickel PJ, Boeck ME, Boley NP, Booth BW, Cherbas L, Cherbas P, Di C, Dobin A, Drenkow J, Ewing B, Fang G, Fastuca M, Feingold EA, Frankish A, Gao G, Good PJ, Guigó R, Hammonds A, Harrow J, Hoskins RA, Howald C, Hu L, Huang H, Hubbard TJP, Huynh C, Jha S, Kasper D, Kato M, Kaufman TC, Kitchen RR, Ladewig E, Lagarde J, Lai E, Leng J, Lu Z, MacCoss M, May G, McWhirter R, Merrihew G, Miller DM, Mortazavi A, Murad R, Oliver B, Olson S, Park PJ, Pazin MJ, Perrimon N, Pervouchine D, Reinke V, Reymond A, Robinson G, Samsonova A, Saunders GI, Schlesinger F, Sethi A, Slack FJ, Spencer WC, Stoiber MH, Strasbourger P, Tanzer A, Thompson OA, Wan KH, Wang G, Wang H, Watkins KL, Wen J, Wen K, Xue C, Yang L, Yip K, Zaleski C, Zhang Y, Zheng H, Brenner SE, Graveley BR, Celniker SE, Gingeras TR, Waterston R. Comparative analysis of the transcriptome across distant species. Nature 2014; 512:445-8. [PMID: 25164755 PMCID: PMC4155737 DOI: 10.1038/nature13424] [Citation(s) in RCA: 239] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 04/30/2014] [Indexed: 12/30/2022]
Abstract
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
Collapse
Affiliation(s)
- Mark B Gerstein
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3] Department of Computer Science, Yale University, 51 Prospect Street, New Haven, Connecticut 06511, USA [4] [5]
| | - Joel Rozowsky
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Koon-Kiu Yan
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Daifeng Wang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Chao Cheng
- 1] Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire 03755, USA [2] Institute for Quantitative Biomedical Sciences, Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire 03766, USA [3]
| | - James B Brown
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA [3]
| | - Carrie A Davis
- 1] Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA [2]
| | - LaDeana Hillier
- 1] Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA [2]
| | - Cristina Sisu
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Jingyi Jessica Li
- 1] Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA [2] Department of Statistics, University of California, Los Angeles, California 90095-1554, USA [3] Department of Human Genetics, University of California, Los Angeles, California 90095-7088, USA [4]
| | - Baikang Pei
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Arif O Harmanci
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Michael O Duff
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2]
| | - Sarah Djebali
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain [3]
| | - Roger P Alexander
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Burak H Alver
- Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, Massachusetts 02115, USA
| | - Raymond Auerbach
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Kimberly Bell
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Peter J Bickel
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Max E Boeck
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Nathan P Boley
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Biostatistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Benjamin W Booth
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Lucy Cherbas
- 1] Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA [2] Center for Genomics and Bioinformatics, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Peter Cherbas
- 1] Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA [2] Center for Genomics and Bioinformatics, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Chao Di
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Alex Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Brent Ewing
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Gang Fang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Megan Fastuca
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Elise A Feingold
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Adam Frankish
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Guanjun Gao
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Peter J Good
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Roderic Guigó
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Ann Hammonds
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Jen Harrow
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Roger A Hoskins
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Cédric Howald
- 1] Center for Integrative Genomics, University of Lausanne, Genopode building, Lausanne 1015, Switzerland [2] Swiss Institute of Bioinformatics, Genopode building, Lausanne 1015, Switzerland
| | - Long Hu
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haiyan Huang
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Tim J P Hubbard
- 1] Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK [2] Medical and Molecular Genetics, King's College London, London WC2R 2LS, UK
| | - Chau Huynh
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Sonali Jha
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Dionna Kasper
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Masaomi Kato
- Department of Molecular, Cellular and Developmental Biology, PO Box 208103, Yale University, New Haven, Connecticut 06520, USA
| | - Thomas C Kaufman
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Robert R Kitchen
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Erik Ladewig
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Julien Lagarde
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Eric Lai
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Jing Leng
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Zhi Lu
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Michael MacCoss
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Gemma May
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2] Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213 USA
| | - Rebecca McWhirter
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Gennifer Merrihew
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - David M Miller
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Ali Mortazavi
- 1] Developmental and Cell Biology, University of California, Irvine, California 92697, USA [2] Center for Complex Biological Systems, University of California, Irvine, California 92697, USA
| | - Rabi Murad
- 1] Developmental and Cell Biology, University of California, Irvine, California 92697, USA [2] Center for Complex Biological Systems, University of California, Irvine, California 92697, USA
| | - Brian Oliver
- Section of Developmental Genomics, Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Sara Olson
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA
| | - Peter J Park
- Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, Massachusetts 02115, USA
| | - Michael J Pazin
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Norbert Perrimon
- 1] Department of Genetics and Drosophila RNAi Screening Center, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA [2] Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA
| | - Dmitri Pervouchine
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Valerie Reinke
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, Genopode building, Lausanne 1015, Switzerland
| | - Garrett Robinson
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Anastasia Samsonova
- 1] Department of Genetics and Drosophila RNAi Screening Center, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA [2] Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA
| | - Gary I Saunders
- 1] Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK [2] European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Felix Schlesinger
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Anurag Sethi
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Frank J Slack
- Department of Molecular, Cellular and Developmental Biology, PO Box 208103, Yale University, New Haven, Connecticut 06520, USA
| | - William C Spencer
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Marcus H Stoiber
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Biostatistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Pnina Strasbourger
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Andrea Tanzer
- 1] Bioinformatics and Genomics Programme, Center for Genomic Regulation, Universitat Pompeu Fabra (CRG-UPF), 08003 Barcelona, Catalonia, Spain [2] Institute for Theoretical Chemistry, Theoretical Biochemistry Group (TBI), University of Vienna, Währingerstrasse 17/3/303, A-1090 Vienna, Austria
| | - Owen A Thompson
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Kenneth H Wan
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Guilin Wang
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Huaien Wang
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Kathie L Watkins
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Jiayu Wen
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Kejia Wen
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Chenghai Xue
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Li Yang
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2] Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Kevin Yip
- 1] Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong [2] 5 CUHK-BGI Innovation Institute of Trans-omics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Chris Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Yan Zhang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Henry Zheng
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Steven E Brenner
- 1] Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA [2] Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA [3]
| | - Brenton R Graveley
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2]
| | - Susan E Celniker
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2]
| | - Thomas R Gingeras
- 1] Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA [2]
| | - Robert Waterston
- 1] Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA [2]
| |
Collapse
|
4
|
Boeck ME, Boyle T, Bao Z, Murray J, Mericle B, Waterston R. Specific roles for the GATA transcription factors end-1 and end-3 during C. elegans E-lineage development. Dev Biol 2011; 358:345-55. [PMID: 21854766 DOI: 10.1016/j.ydbio.2011.08.002] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Revised: 08/02/2011] [Accepted: 08/03/2011] [Indexed: 12/21/2022]
Abstract
end-1 and end-3 are GATA transcription factors important for specifying endoderm cell fate in Caenorhabditis elegans. Deletion of both factors together results in larval arrest, 0% survival and a fate change in the endoderm-specifying E lineage. Individual deletions of either factor, however, result in the development of viable, fertile adults, with 100% of worms developing to adults for end-1(-) and 95% for end-3(-). We sought to quantify the variable phenotypes seen in both deletions using automated cell lineaging. We quantified defects in cell lifetime, cell movement and division axis in end-3(-) embryos, while quantifying perturbations in downstream reporter gene expression in strains with homozygous deletions for either gene, showing that each deletion leads to a unique profile of downstream perturbations in gene expression and cellular phenotypes with a high correlation between early and late defects. Combining observations in both cellular and gene expression defects we found that misaligned divisions at the E2 stage resulted in ectopic expression of the Notch target ref-1 in end-3(-) embryos. Using a maximum likelihood phylogenetic approach we found end-1 and end-3 split to form two distinct clades within the Caenorhabditis lineage with distinct DNA-binding structures. These results indicate that end-1 and end-3 have each evolved into genes with unique functions during endoderm development, that end-3(-) embryos have a delay in the onset of E lineage cell fate and that end-1 has only a partially penetrant ability to activate E lineage fate.
Collapse
Affiliation(s)
- Max E Boeck
- Department of Genome Sciences, University of Washington School of Medicine, 1705 NE Pacific Street, Seattle, WA 98195, USA.
| | | | | | | | | | | |
Collapse
|
5
|
Tian C, Shi H, Colledge C, Stern M, Waterston R, Liu J. The C. elegans SoxC protein SEM-2 opposes differentiation factors to promote a proliferative blast cell fate in the postembryonic mesoderm. Development 2011; 138:1033-43. [PMID: 21307099 DOI: 10.1242/dev.062240] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The proper development of multicellular organisms requires precise regulation and coordination of cell fate specification, cell proliferation and differentiation. Abnormal regulation and coordination of these processes could lead to disease, including cancer. We have examined the function of the sole C. elegans SoxC protein, SEM-2, in the M lineage, which produces the postembryonic mesoderm. We found that SEM-2/SoxC is both necessary and sufficient to promote a proliferating blast cell fate, the sex myoblast fate, over a differentiated striated bodywall muscle fate. A number of factors control the specific expression of sem-2 in the sex myoblast precursors and their descendants. This includes direct control of sem-2 expression by a Hox-PBC complex. The crucial nature of the HOX/PBC factors in directly enhancing expression of this proliferative factor in the C. elegans M lineage suggests a possible more general link between Hox-PBC factors and SoxC proteins in regulating cell proliferation.
Collapse
Affiliation(s)
- Chenxi Tian
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | | | |
Collapse
|
6
|
Spradling A, Ganetsky B, Hieter P, Johnston M, Olson M, Orr-Weaver T, Rossant J, Sanchez A, Waterston R. New roles for model genetic organisms in understanding and treating human disease: report from the 2006 Genetics Society of America meeting. Genetics 2006; 172:2025-32. [PMID: 16636111 PMCID: PMC1456383 DOI: 10.1093/genetics/172.4.2025] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Fundamental biological knowledge and the technology to acquire it have been immeasurably advanced by past efforts to understand and manipulate the genomes of model organisms. Has the utility of bacteria, yeast, worms, flies, mice, plants, and other models now peaked and are humans poised to become the model organism of the future? The Genetics Society of America recently convened its 2006 meeting entitled "Genetic Analysis: Model Organisms to Human Biology" to examine the future role of genetic research. (Because of time limitations, the meeting was unable to cover the substantial contributions and future potential of research on model prokaryotic organisms.) In fact, the potential of model-organism-based studies has grown substantially in recent years. The genomics revolution has revealed an underlying unity between the cells and tissues of eukaryotic organisms from yeast to humans. No uniquely human biological mechanisms have yet come to light. This common evolutionary heritage makes it possible to use genetically tractable organisms to model important aspects of human medical disorders such as cancer, birth defects, neurological dysfunction, reproductive failure, malnutrition, and aging in systems amenable to rapid and powerful experimentation. Applying model systems in this way will allow us to identify common genes, proteins, and processes that underlie human medical conditions. It will allow us to systematically decipher the gene-gene and gene-environment interactions that influence complex multigenic disorders. Above all, disease models have the potential to address a growing gap between our ability to collect human genetic data and to productively interpret and apply it. If model organism research is supported with these goals in mind, we can look forward to diagnosing and treating human disease using information from multiple systems and to a medical science built on the unified history of life on earth.
Collapse
Affiliation(s)
- Allan Spradling
- Department of Embryology/HHMI, Carnegie Institution of Washington, Baltimore, MD 21218, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
7
|
|
8
|
Cliften P, Sudarsanam P, Desikan A, Fulton L, Fulton B, Majors J, Waterston R, Cohen BA, Johnston M. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 2003; 301:71-6. [PMID: 12775844 DOI: 10.1126/science.1084337] [Citation(s) in RCA: 634] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Collapse
Affiliation(s)
- Paul Cliften
- Department of Genetics, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO 63110, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Heilig R, Eckenberg R, Petit JL, Fonknechten N, Da Silva C, Cattolico L, Levy M, Barbe V, de Berardinis V, Ureta-Vidal A, Pelletier E, Vico V, Anthouard V, Rowen L, Madan A, Qin S, Sun H, Du H, Pepin K, Artiguenave F, Robert C, Cruaud C, Brüls T, Jaillon O, Friedlander L, Samson G, Brottier P, Cure S, Ségurens B, Anière F, Samain S, Crespeau H, Abbasi N, Aiach N, Boscus D, Dickhoff R, Dors M, Dubois I, Friedman C, Gouyvenoux M, James R, Madan A, Mairey-Estrada B, Mangenot S, Martins N, Ménard M, Oztas S, Ratcliffe A, Shaffer T, Trask B, Vacherie B, Bellemere C, Belser C, Besnard-Gonnet M, Bartol-Mavel D, Boutard M, Briez-Silla S, Combette S, Dufossé-Laurent V, Ferron C, Lechaplais C, Louesse C, Muselet D, Magdelenat G, Pateau E, Petit E, Sirvain-Trukniewicz P, Trybou A, Vega-Czarny N, Bataille E, Bluet E, Bordelais I, Dubois M, Dumont C, Guérin T, Haffray S, Hammadi R, Muanga J, Pellouin V, Robert D, Wunderle E, Gauguet G, Roy A, Sainte-Marthe L, Verdier J, Verdier-Discala C, Hillier L, Fulton L, McPherson J, Matsuda F, Wilson R, Scarpelli C, Gyapay G, Wincker P, Saurin W, Quétier F, Waterston R, Hood L, Weissenbach J. The DNA sequence and analysis of human chromosome 14. Nature 2003; 421:601-7. [PMID: 12508121 DOI: 10.1038/nature01348] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2002] [Accepted: 12/03/2002] [Indexed: 11/08/2022]
Abstract
Chromosome 14 is one of five acrocentric chromosomes in the human genome. These chromosomes are characterized by a heterochromatic short arm that contains essentially ribosomal RNA genes, and a euchromatic long arm in which most, if not all, of the protein-coding genes are located. The finished sequence of human chromosome 14 comprises 87,410,661 base pairs, representing 100% of its euchromatic portion, in a single continuous segment covering the entire long arm with no gaps. Two loci of crucial importance for the immune system, as well as more than 60 disease genes, have been localized so far on chromosome 14. We identified 1,050 genes and gene fragments, and 393 pseudogenes. On the basis of comparisons with other vertebrate genomes, we estimate that more than 96% of the chromosome 14 genes have been annotated. From an analysis of the CpG island occurrences, we estimate that 70% of these annotated genes are complete at their 5' end.
Collapse
Affiliation(s)
- Roland Heilig
- Genoscope-Centre National de Séquençage, 91000, Evry, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Okazaki Y, Furuno M, Kasukawa T, Adachi J, Bono H, Kondo S, Nikaido I, Osato N, Saito R, Suzuki H, Yamanaka I, Kiyosawa H, Yagi K, Tomaru Y, Hasegawa Y, Nogami A, Schönbach C, Gojobori T, Baldarelli R, Hill DP, Bult C, Hume DA, Quackenbush J, Schriml LM, Kanapin A, Matsuda H, Batalov S, Beisel KW, Blake JA, Bradt D, Brusic V, Chothia C, Corbani LE, Cousins S, Dalla E, Dragani TA, Fletcher CF, Forrest A, Frazer KS, Gaasterland T, Gariboldi M, Gissi C, Godzik A, Gough J, Grimmond S, Gustincich S, Hirokawa N, Jackson IJ, Jarvis ED, Kanai A, Kawaji H, Kawasawa Y, Kedzierski RM, King BL, Konagaya A, Kurochkin IV, Lee Y, Lenhard B, Lyons PA, Maglott DR, Maltais L, Marchionni L, McKenzie L, Miki H, Nagashima T, Numata K, Okido T, Pavan WJ, Pertea G, Pesole G, Petrovsky N, Pillai R, Pontius JU, Qi D, Ramachandran S, Ravasi T, Reed JC, Reed DJ, Reid J, Ring BZ, Ringwald M, Sandelin A, Schneider C, Semple CAM, Setou M, Shimada K, Sultana R, Takenaka Y, Taylor MS, Teasdale RD, Tomita M, Verardo R, Wagner L, Wahlestedt C, Wang Y, Watanabe Y, Wells C, Wilming LG, Wynshaw-Boris A, Yanagisawa M, Yang I, Yang L, Yuan Z, Zavolan M, Zhu Y, Zimmer A, Carninci P, Hayatsu N, Hirozane-Kishikawa T, Konno H, Nakamura M, Sakazume N, Sato K, Shiraki T, Waki K, Kawai J, Aizawa K, Arakawa T, Fukuda S, Hara A, Hashizume W, Imotani K, Ishii Y, Itoh M, Kagawa I, Miyazaki A, Sakai K, Sasaki D, Shibata K, Shinagawa A, Yasunishi A, Yoshino M, Waterston R, Lander ES, Rogers J, Birney E, Hayashizaki Y. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 2002; 420:563-73. [PMID: 12466851 DOI: 10.1038/nature01266] [Citation(s) in RCA: 1226] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2002] [Accepted: 10/28/2002] [Indexed: 01/10/2023]
Abstract
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Collapse
MESH Headings
- Alternative Splicing/genetics
- Amino Acid Motifs
- Animals
- Chromosomes, Mammalian/genetics
- Cloning, Molecular
- DNA, Complementary/genetics
- Databases, Genetic
- Expressed Sequence Tags
- Genes/genetics
- Genomics/methods
- Humans
- Membrane Proteins/genetics
- Mice/genetics
- Physical Chromosome Mapping
- Protein Structure, Tertiary
- Proteome/chemistry
- Proteome/genetics
- RNA, Antisense/genetics
- RNA, Messenger/analysis
- RNA, Messenger/genetics
- RNA, Untranslated/analysis
- RNA, Untranslated/genetics
- Transcription Initiation Site
- Transcription, Genetic/genetics
Collapse
Affiliation(s)
- Y Okazaki
- [1] Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Shoemaker R, Keim P, Vodkin L, Retzel E, Clifton SW, Waterston R, Smoller D, Coryell V, Khanna A, Erpelding J, Gai X, Brendel V, Raph-Schmidt C, Shoop EG, Vielweber CJ, Schmatz M, Pape D, Bowers Y, Theising B, Martin J, Dante M, Wylie T, Granger C. A compilation of soybean ESTs: generation and analysis. Genome 2002. [PMID: 11962630 DOI: 10.1139/g01‐150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Whole-genome sequencing is fundamental to understanding the genetic composition of an organism. Given the size and complexity of the soybean genome, an alternative approach is targeted random-gene sequencing, which provides an immediate and productive method of gene discovery. In this study, more than 120000 soybean expressed sequence tags (ESTs) generated from more than 50 cDNA libraries were evaluated. These ESTs coalesced into 16928 contigs and 17336 singletons. On average, each contig was composed of 6 ESTs and spanned 788 bases. The average sequence length submitted to dbEST was 414 bases. Using only those libraries generating more than 800 ESTs each and only those contigs with 10 or more ESTs each, correlated patterns of gene expression among libraries and genes were discerned. Two-dimensional qualitative representations of contig and library similarities were generated based on expression profiles. Genes with similar expression patterns and, potentially, similar functions were identified. These studies provide a rich source of publicly available gene sequences as well as valuable insight into the structure, function, and evolution of a model crop legume genome.
Collapse
Affiliation(s)
- Randy Shoemaker
- USDA-ARS, Corn Insect and Crop Genetics Research Unit, and Department of Agronomy, Iowa State University, Ames 50011, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Shoemaker R, Keim P, Vodkin L, Retzel E, Clifton SW, Waterston R, Smoller D, Coryell V, Khanna A, Erpelding J, Gai X, Brendel V, Raph-Schmidt C, Shoop EG, Vielweber CJ, Schmatz M, Pape D, Bowers Y, Theising B, Martin J, Dante M, Wylie T, Granger C. A compilation of soybean ESTs: generation and analysis. Genome 2002; 45:329-38. [PMID: 11962630 DOI: 10.1139/g01-150] [Citation(s) in RCA: 116] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Whole-genome sequencing is fundamental to understanding the genetic composition of an organism. Given the size and complexity of the soybean genome, an alternative approach is targeted random-gene sequencing, which provides an immediate and productive method of gene discovery. In this study, more than 120000 soybean expressed sequence tags (ESTs) generated from more than 50 cDNA libraries were evaluated. These ESTs coalesced into 16928 contigs and 17336 singletons. On average, each contig was composed of 6 ESTs and spanned 788 bases. The average sequence length submitted to dbEST was 414 bases. Using only those libraries generating more than 800 ESTs each and only those contigs with 10 or more ESTs each, correlated patterns of gene expression among libraries and genes were discerned. Two-dimensional qualitative representations of contig and library similarities were generated based on expression profiles. Genes with similar expression patterns and, potentially, similar functions were identified. These studies provide a rich source of publicly available gene sequences as well as valuable insight into the structure, function, and evolution of a model crop legume genome.
Collapse
Affiliation(s)
- Randy Shoemaker
- USDA-ARS, Corn Insect and Crop Genetics Research Unit, and Department of Agronomy, Iowa State University, Ames 50011, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney L, Porwollik S, Ali J, Dante M, Du F, Hou S, Layman D, Leonard S, Nguyen C, Scott K, Holmes A, Grewal N, Mulvaney E, Ryan E, Sun H, Florea L, Miller W, Stoneking T, Nhan M, Waterston R, Wilson RK. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature 2001; 413:852-6. [PMID: 11677609 DOI: 10.1038/35101614] [Citation(s) in RCA: 1400] [Impact Index Per Article: 60.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Salmonella enterica subspecies I, serovar Typhimurium (S. typhimurium), is a leading cause of human gastroenteritis, and is used as a mouse model of human typhoid fever. The incidence of non-typhoid salmonellosis is increasing worldwide, causing millions of infections and many deaths in the human population each year. Here we sequenced the 4,857-kilobase (kb) chromosome and 94-kb virulence plasmid of S. typhimurium strain LT2. The distribution of close homologues of S. typhimurium LT2 genes in eight related enterobacteria was determined using previously completed genomes of three related bacteria, sample sequencing of both S. enterica serovar Paratyphi A (S. paratyphi A) and Klebsiella pneumoniae, and hybridization of three unsequenced genomes to a microarray of S. typhimurium LT2 genes. Lateral transfer of genes is frequent, with 11% of the S. typhimurium LT2 genes missing from S. enterica serovar Typhi (S. typhi), and 29% missing from Escherichia coli K12. The 352 gene homologues of S. typhimurium LT2 confined to subspecies I of S. enterica-containing most mammalian and bird pathogens-are useful for studies of epidemiology, host specificity and pathogenesis. Most of these homologues were previously unknown, and 50 may be exported to the periplasm or outer membrane, rendering them accessible as therapeutic or vaccine targets.
Collapse
Affiliation(s)
- M McClelland
- Sidney Kimmel Cancer Center, 10835 Altman Row, San Diego, California 92121, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Jones SJ, Riddle DL, Pouzyrev AT, Velculescu VE, Hillier L, Eddy SR, Stricklin SL, Baillie DL, Waterston R, Marra MA. Changes in gene expression associated with developmental arrest and longevity in Caenorhabditis elegans. Genome Res 2001; 11:1346-52. [PMID: 11483575 DOI: 10.1101/gr.184401] [Citation(s) in RCA: 187] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Gene expression in a developmentally arrested, long-lived dauer population of Caenorhabditis elegans was compared with a nondauer (mixed-stage) population by using serial analysis of gene expression (SAGE). Dauer (152,314) and nondauer (148,324) SAGE tags identified 11,130 of the predicted 19,100 C. elegans genes. Genes implicated previously in longevity were expressed abundantly in the dauer library, and new genes potentially important in dauer biology were discovered. Two thousand six hundred eighteen genes were detected only in the nondauer population, whereas 2016 genes were detected only in the dauer, showing that dauer larvae show a surprisingly complex gene expression profile. Evidence for differentially expressed gene transcript isoforms was obtained for 162 genes. H1 histones were differentially expressed, raising the possibility of alternative chromatin packaging. The most abundant tag from dauer larvae (20-fold more abundant than in the nondauer profile) corresponds to a new, unpredicted gene we have named tts-1 (transcribed telomere-like sequence), which may interact with telomeres or telomere-associated proteins. Abundant antisense mitochondrial transcripts (2% of all tags), suggest the existence of an antisense-mediated regulatory mechanism in C. elegans mitochondria. In addition to providing a robust tool for gene expression studies, the SAGE approach already has provided the advantage of new gene/transcript discovery in a metazoan.
Collapse
Affiliation(s)
- S J Jones
- Genome Sequence Centre, British Columbia Cancer Research Centre, Vancouver, British Columbia V5Z 4E6, Canada.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Abstract
Internet-based learning is a growing trend in continuing education for health professionals. We have developed an on-line version of four workshops for practising physicians, tracing a patient's course from the initial diagnosis of dementia to end-of-life care. We have also developed an on-line dementia toolkit that could evolve with changing knowledge. The toolkit consisted of reference material collected from a variety of sources and vetted by our team of experts to address the common complaint that the sources of on-line reference materials are often unauthenticated. The Geriatric Forum provides an array of resources for health professionals.
Collapse
Affiliation(s)
- D Ryan
- Regional Geriatric Program of Toronto, Faculty of Medicine, University of Toronto, Canada.
| | | |
Collapse
|
16
|
Marra M, Kucaba T, Sekhon M, Hillier L, Martienssen R, Chinwalla A, Crockett J, Fedele J, Grover H, Gund C, McCombie WR, McDonald K, McPherson J, Mudd N, Parnell L, Schein J, Seim R, Shelby P, Waterston R, Wilson R. zA map for sequence analysis of the Arabidopsis thaliana genome. Nat Genet 1999; 22:265-70. [PMID: 10391214 DOI: 10.1038/10327] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Arabidopsis thaliana has emerged as a model system for studies of plant genetics and development, and its genome has been targeted for sequencing by an international consortium (the Arabidopsis Genome Initiative; http://genome-www. stanford.edu/Arabidopsis/agi.html). To support the genome-sequencing effort, we fingerprinted more than 20,000 BACs (ref. 2) from two high-quality publicly available libraries, generating an estimated 17-fold redundant coverage of the genome, and used the fingerprints to nucleate assembly of the data by computer. Subsequent manual revision of the assemblies resulted in the incorporation of 19,661 fingerprinted BACs into 169 ordered sets of overlapping clones ('contigs'), each containing at least 3 clones. These contigs are ideal for parallel selection of BACs for large-scale sequencing and have supported the generation of more than 5.8 Mb of finished genome sequence submitted to GenBank; analysis of the sequence has confirmed the integrity of contigs constructed using this fingerprint data. Placement of contigs onto chromosomes can now be performed, and is being pursued by groups involved in both sequencing and positional cloning studies. To our knowledge, these data provide the first example of whole-genome random BAC fingerprint analysis of a eucaryote, and have provided a model essential to efforts aimed at generating similar databases of fingerprint contigs to support sequencing of other complex genomes, including that of human.
Collapse
Affiliation(s)
- M Marra
- Washington University Genome Sequencing Center, St Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Marra M, Hillier L, Kucaba T, Allen M, Barstead R, Beck C, Blistain A, Bonaldo M, Bowers Y, Bowles L, Cardenas M, Chamberlain A, Chappell J, Clifton S, Favello A, Geisel S, Gibbons M, Harvey N, Hill F, Jackson Y, Kohn S, Lennon G, Mardis E, Martin J, Mila L, McCann R, Morales R, Pape D, Person B, Prange C, Ritter E, Soares M, Schurk R, Shin T, Steptoe M, Swaller T, Theising B, Underwood K, Wylie T, Yount T, Wilson R, Waterston R. An encyclopedia of mouse genes. Nat Genet 1999; 21:191-4. [PMID: 9988271 DOI: 10.1038/5976] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The laboratory mouse is the premier model system for studies of mammalian development due to the powerful classical genetic analysis possible (see also the Jackson Laboratory web site, http://www.jax.org/) and the ever-expanding collection of molecular tools. To enhance the utility of the mouse system, we initiated a program to generate a large database of expressed sequence tags (ESTs) that can provide rapid access to genes. Of particular significance was the possibility that cDNA libraries could be prepared from very early stages of development, a situation unrealized in human EST projects. We report here the development of a comprehensive database of ESTs for the mouse. The project, initiated in March 1996, has focused on 5' end sequences from directionally cloned, oligo-dT primed cDNA libraries. As of 23 October 1998, 352,040 sequences had been generated, annotated and deposited in dbEST, where they comprised 93% of the total ESTs available for mouse. EST data are versatile and have been applied to gene identification, comparative sequence analysis, comparative gene mapping and candidate disease gene identification, genome sequence annotation, microarray development and the development of gene-based map resources.
Collapse
Affiliation(s)
- M Marra
- Washington University Genome Sequencing Center, St. Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Affiliation(s)
- R Waterston
- Genome Sequencing Center, Washington University School of Medicine, St. Louis, MO 63108, USA.
| | | |
Collapse
|
19
|
Manger ID, Hehl A, Parmley S, Sibley LD, Marra M, Hillier L, Waterston R, Boothroyd JC. Expressed sequence tag analysis of the bradyzoite stage of Toxoplasma gondii: identification of developmentally regulated genes. Infect Immun 1998; 66:1632-7. [PMID: 9529091 PMCID: PMC108098 DOI: 10.1128/iai.66.4.1632-1637.1998] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/1997] [Accepted: 01/11/1998] [Indexed: 02/07/2023] Open
Abstract
Toxoplasma gondii is a protozoan parasite responsible for widespread infections in humans and animals. Two major asexual forms are produced during the life cycle of this parasite: the rapidly dividing tachyzoite and the more slowly dividing, encysted bradyzoite. To further study the differentiation between these two forms, we have generated a large number of expressed sequence tags (ESTs) from both asexual stages. Previously, we obtained data on approximately 7,400 ESTs from tachyzoites (J. Ajioka et al., Genome Res. 8:18-28, 1998). Here, we report the results from analysis of approximately 2,500 ESTs from bradyzoites purified from the cysts of infected mice. We also report the results from analysis of 760 ESTs from parasites induced to differentiate from tachyzoites to bradyzoites in vitro. Comparison of the data sets from bradyzoites and tachyzoites reveals many previously uncharacterized sequence clusters which are largely or completely specific to one or other developmental stage. This class includes a bradyzoite-specific form of enolase. Combined with the previously identified bradyzoite-specific form of lactate dehydrogenase, this finding suggests significant differences in flux through the lower end of the glycolytic pathway in this stage. Thus, the generation of this data set provides valuable insights into the metabolism and growth of the parasite in the encysted form and represents a substantial body of information for further study of development in Toxoplasma.
Collapse
Affiliation(s)
- I D Manger
- Department of Microbiology and Immunology, Stanford University, California 94305-5124, USA
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Ajioka JW, Boothroyd JC, Brunk BP, Hehl A, Hillier L, Manger ID, Marra M, Overton GC, Roos DS, Wan KL, Waterston R, Sibley LD. Gene discovery by EST sequencing in Toxoplasma gondii reveals sequences restricted to the Apicomplexa. Genome Res 1998; 8:18-28. [PMID: 9445484 DOI: 10.1101/gr.8.1.18] [Citation(s) in RCA: 151] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
To accelerate gene discovery and facilitate genetic mapping in the protozoan parasite Toxoplasma gondii, we have generated >7000 new ESTs from the 5' ends of randomly selected tachyzoite cDNAs. Comparison of the ESTs with the existing gene databases identified possible functions for more than 500 new T. gondii genes by virtue of sequence motifs shared with conserved protein families, including factors involved in transcription, translation, protein secretion, signal transduction, cytoskeleton organization, and metabolism. Despite this success in identifying new genes, more than 50% of the ESTs correspond to genes of unknown function, reflecting the divergent evolutionary status of this parasite. A newly recognized class of genes was identified based on its similarity to sequences known only from other members of the same phylum, therefore identifying sequences that are apparently restricted to the Apicomplexa. Such genes may underlie pathways common to this group of medically important parasites, therefore identifying potential targets for intervention.
Collapse
Affiliation(s)
- J W Ajioka
- Department of Pathology, Cambridge University, UK
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Hillier LD, Lennon G, Becker M, Bonaldo MF, Chiapelli B, Chissoe S, Dietrich N, DuBuque T, Favello A, Gish W, Hawkins M, Hultman M, Kucaba T, Lacy M, Le M, Le N, Mardis E, Moore B, Morris M, Parsons J, Prange C, Rifkin L, Rohlfing T, Schellenberg K, Bento Soares M, Tan F, Thierry-Meg J, Trevaskis E, Underwood K, Wohldman P, Waterston R, Wilson R, Marra M. Generation and analysis of 280,000 human expressed sequence tags. Genome Res 1996; 6:807-28. [PMID: 8889549 DOI: 10.1101/gr.6.9.807] [Citation(s) in RCA: 327] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
We report the generation of 319,311 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' and 3' ends of 194,031 human cDNA clones. Our goal has been to obtain tag sequences from many different genes and to deposit these in the publicly accessible Data Base for Expressed Sequence Tags. Highly efficient automatic screening of the data allows deposition of the annotated sequences without delay. Sequences have been generated from 26 oligo(dT) primed directionally cloned libraries, of which 18 were normalized. The libraries were constructed using mRNA isolated from 17 different tissues representing three developmental states. Comparisons of a subset of our data with nonredundant human mRNA and protein data bases show that the ESTs represent many known sequences and contain many that are novel. Analysis of protein families using Hidden Markov Models confirms this observation and supports the contention that although normalization reduces significantly the relative abundance of redundant cDNA clones, it does not result in the complete removal of members of gene families.
Collapse
Affiliation(s)
- L D Hillier
- Genome Sequencing Center, Washington University School of Medicine, St. Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Abstract
The physical map of the 100-Mb Caenorhabditis elegans genome consists of 17,500 cosmids and 3500 yeast artificial chromosomes (YACs). A total of 22.5 Mb has been sequenced, with the remainder expected by 1998. A further 15.5 Mb of unfinished sequence is freely available online: because the areas sequenced so far are relatively gene rich, about half the 13,000 genes can now be scanned. More than a quarter of the genes are represented by expressed sequence tags (ESTs). All information pertaining to the genome is publicly available in the ACeDB data base.
Collapse
Affiliation(s)
- R Waterston
- Department of Genetics and Genome Sequencing Center, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | |
Collapse
|
23
|
Abstract
Sets of new gene sequences from human, nematode, and yeast were compared with each other and with a set of Escherichia coli genes in order to detect ancient evolutionarily conserved regions (ACRs) in the encoded proteins. Nearly all of the ACRs so identified were found to be homologous to sequences in the protein databases. This suggests that currently known proteins may already include representatives of most ACRs and that new sequences not similar to any database sequence are unlikely to contain ACRs. Preliminary analyses indicate that moderately expressed genes may be more likely to contain ACRs than rarely expressed genes. It is estimated that there are fewer than 900 ACRs in all.
Collapse
Affiliation(s)
- P Green
- Genetics Department, Washington University Medical School, St. Louis, MO 63110
| | | | | | | | | | | |
Collapse
|
24
|
Waterston R, Ainscough R, Anderson K, Berks M, Blair D, Connell M, Cooper J, Coulson A, Craxton M, Dear S. The genome of the nematode Caenorhabditis elegans. Cold Spring Harb Symp Quant Biol 1993; 58:367-76. [PMID: 7956049 DOI: 10.1101/sqb.1993.058.01.043] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
- R Waterston
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110
| | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Waterston R, Martin C, Craxton M, Huynh C, Coulson A, Hillier L, Durbin R, Green P, Shownkeen R, Halloran N. A survey of expressed genes in Caenorhabditis elegans. Nat Genet 1992; 1:114-23. [PMID: 1302004 DOI: 10.1038/ng0592-114] [Citation(s) in RCA: 292] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
As an adjunct to the genomic sequencing of Caenorhabditis elegans, we have investigated a representative cDNA library of 1,517 clones. A single sequence read has been obtained from the 5' end of each clone, allowing its characterization with respect to the public databases, and the clones are being localized on the genome map. The result is the identification of about 1,200 of the estimated 15,000 genes of C. elegans. More than 30% of the inferred protein sequences have significant similarity to existing sequences in the databases, providing a route towards in vivo analysis of known genes in the nematode. These clones also provide material for assessing the accuracy of predicted exons and splicing patterns and will lead to a more accurate estimate of the total number of genes in the organism than has hitherto been available.
Collapse
Affiliation(s)
- R Waterston
- Department of Genetics, Washington University School of Medicine, St Louis, Missouri 63110
| | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Abstract
During the past decade, it has become apparent that it is within our grasp to understand fully the development and functioning of complex organisms. It is widely accepted that this undertaking must include the elucidation of the genetic blueprint - the genome sequence - of a number of model organisms. As a prelude to the determination of these sequences, clone-based physical maps of the genomes of a number of multicellular animals and plants are being constructed. Yeast artificial chromosome (YAC) vectors, by virtue of their relatively unbiased cloning capabilities and capacity to carry large inserts, have come to play a central role in the construction of these maps. The application of YACs to the physical map of the Caenorhabditis elegans genome has enabled cosmid clone 'islands' to be linked together in an efficient manner. The long-range continuity has improved the linkage between the genetic and physical maps, greatly increasing its utility. Since the genome can be represented by a relatively small number of YACs, it has been possible to make replica filters of genomically ordered YACs available to the community at large.
Collapse
Affiliation(s)
- A Coulson
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | | | | | | | | |
Collapse
|
27
|
Abstract
A method for linking genomic sequences cloned in yeast artificial chromosomes (YACs) has been tested using Caenorhabditis elegans as a model system. Yeast clones carrying YACs with repeated sequences were selected from a C. elegans genomic library, total DNA was digested with restriction enzymes, transferred to nylon membranes and probed with a variety of repetitive DNA probes. YAC clones that overlap share common bands with one or more repetitive DNA probes. In 159 YAC clones tested with one restriction enzyme and six probes 28 overlapping clones were detected. The advantages and limitations of this method for construction of YAC physical maps is discussed.
Collapse
Affiliation(s)
- G Cangiano
- International Institute of Genetics and Biophysics, CNR, Naples, Italy
| | | | | | | |
Collapse
|
28
|
Affiliation(s)
- A Fire
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | | |
Collapse
|
29
|
Abstract
We describe a general strategy for the genetic mapping in parallel of multiple restriction fragment length polymorphism (RFLP) loci. This approach allows the systematic identification for cloning of physical genetic loci within about 100 kb of any gene in Caenorhabditis elegans. We have used this strategy of parallel RFLP mapping to clone the heterochronic gene lin-14, which controls the timing and sequence of many C. elegans postembryonic developmental events. We found that of about 400 polymorphic loci in the C. elegans genome associated with the Tc1 family of repetitive elements, six are within 0.3 map unit of lin-14. The three closest lin-14-linked Tc1-containing restriction fragments were cloned and used to identify by hybridization an 830-kb region of contiguous cloned DNA fragments assembled from cosmid and yeast artificial chromosome libraries. A lin-14 intragenic recombinant that separated a previously cryptic lin-14 semidominant mutation from a cis-acting lin-14 suppressor mutation was used to map the location of the lin-14 gene to a 25-kb region of this 830-kb contig. DNA probes from this region detected lin-14 allele-specific DNA alterations and a lin-14 mRNA. Two lin-14 semi-dominant alleles, which cause temporally inappropriate lin-14 gene activity and lead to the reiterated expression of specific early developmental events, were shown to delete sequences from the lin-14 gene and mRNA. These deletions may define cis-acting sequences responsible for the temporal regulation of lin-14.
Collapse
Affiliation(s)
- G Ruvkun
- Department of Molecular Biology, Massachusetts General Hospital, Boston
| | | | | | | | | | | |
Collapse
|
30
|
Abstract
The haploid genome of Caenorhabditis elegans consists of some 80 x 10(6) base pairs of DNA contained in six chromosomes. The large number of interesting loci that have been recognized by mutation, and the accuracy of the genetic map, mean that a physical map of the genome is highly desirable, because it will facilitate the molecular cloning of chosen loci. The first steps towards such a map used a fingerprinting method to link cosmid clones together. This approach reached its practical limit last year, when 90-95% of the genome had been cloned into 17,500 cosmids assembled into some 700 clusters (contigs), but the linking clones needed were either non-existent or extremely rare. Anticipating this, we had planned to link by physical means--probably by hybridization to NotI fragments separated by pulse field gel electrophoresis. NotI recognizes an eight base sequence of GC pairs; thus the fragments should be large enough to bridge regions that clone poorly in cosmids, and, with no selective step involved, would necessarily be fully representative. However, with the availability of a yeast artificial chromosome (YAC) vector, we decided to use this alternative source of large DNA fragments to obtain linkage. The technique involves the ligation of large (50-1,000 kilobase) genomic fragments into a vector that provides centromeric, telomeric and selective functions; the constructs are then introduced into Saccharomyces cerevisiae, and replicate in the same manner as the host chromosomes.
Collapse
Affiliation(s)
- A Coulson
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | | | | | | |
Collapse
|
31
|
Hoffman EP, Fischbeck KH, Brown RH, Johnson M, Medori R, Loike JD, Harris JB, Waterston R, Brooke M, Specht L. Characterization of dystrophin in muscle-biopsy specimens from patients with Duchenne's or Becker's muscular dystrophy. N Engl J Med 1988; 318:1363-8. [PMID: 3285207 DOI: 10.1056/nejm198805263182104] [Citation(s) in RCA: 646] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
A deficiency of the protein dystrophin has recently been shown to be the probable cause of Duchenne's muscular dystrophy. We sought to determine the relation between the clinical phenotype and the status of dystrophin in muscle-biopsy specimens from 103 patients with various neuromuscular disorders. We found very low levels (less than 3 percent of normal levels) or no dystrophin in the severe Duchenne phenotype (35 of 38 patients), low concentrations of dystrophin in the intermediate (outlier) phenotype (4 of 7), and dystrophin of abnormal molecular weight in the mild Becker phenotype (12 of 18). Normal levels of dystrophin of normal molecular weight were found in nearly all the patients (38 of 40) with 20 other neuromuscular disorders we studied. These data show the clinical consequences of both quantitative alterations (in Duchenne's and intermediate dystrophy) in a single protein. The biochemical assay for dystrophin should prove helpful in delineating myopathies that overlap clinically with Duchenne's and Becker's dystrophies, and it shows promise as an accurate diagnostic tool.
Collapse
Affiliation(s)
- E P Hoffman
- Division of Genetics, Children's Hospital, Boston, MA
| | | | | | | | | | | | | | | | | | | |
Collapse
|