1
|
Adler A, Bader JS, Basnight B, Booth BW, Cai J, Cho E, Collins JH, Ge Y, Grothendieck J, Keating K, Marshall T, Persikov A, Scott H, Siegelmann R, Singh M, Taggart A, Toll B, Wan KH, Wyschogrod D, Yaman F, Young EM, Celniker SE, Roehner N. Ensemble Detection of DNA Engineering Signatures. ACS Synth Biol 2024; 13:1105-1115. [PMID: 38468602 DOI: 10.1021/acssynbio.3c00398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Synthetic biology is creating genetically engineered organisms at an increasing rate for many potentially valuable applications, but this potential comes with the risk of misuse or accidental release. To begin to address this issue, we have developed a system called GUARDIAN that can automatically detect signatures of engineering in DNA sequencing data, and we have conducted a blinded test of this system using a curated Test and Evaluation (T&E) data set. GUARDIAN uses an ensemble approach based on the guiding principle that no single approach is likely to be able to detect engineering with perfect accuracy. Critically, ensembling enables GUARDIAN to detect sequence inserts in 13 target organisms with a high degree of specificity that requires no subject matter expert (SME) review.
Collapse
Affiliation(s)
- Aaron Adler
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Joel S Bader
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Brian Basnight
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Benjamin W Booth
- Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Jitong Cai
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Elizabeth Cho
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Joseph H Collins
- Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01609, United States
| | - Yuchen Ge
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | | | - Kevin Keating
- Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01609, United States
| | - Tyler Marshall
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Anton Persikov
- Department of Computer Science, Princeton University, Princeton, New Jersey 08544, United States
| | - Helen Scott
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Roy Siegelmann
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey 08544, United States
| | | | - Benjamin Toll
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Kenneth H Wan
- Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | | | - Fusun Yaman
- Raytheon BBN, Cambridge, Massachusetts 02138, United States
| | - Eric M Young
- Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01609, United States
| | - Susan E Celniker
- Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | | |
Collapse
|
2
|
Inman JL, Wu Y, Chen L, Brydon E, Ghosh D, Wan KH, De Chant J, Obst-Huebl L, Nakamura K, Ralston CY, Celniker SE, Mao JH, Zwart PH, Holman HYN, Chang H, Brown JB, Snijders AM. Long-term, non-invasive FTIR detection of low-dose ionizing radiation exposure. Sci Rep 2024; 14:6119. [PMID: 38480827 PMCID: PMC10937999 DOI: 10.1038/s41598-024-56491-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 03/07/2024] [Indexed: 03/17/2024] Open
Abstract
Non-invasive methods of detecting radiation exposure show promise to improve upon current approaches to biological dosimetry in ease, speed, and accuracy. Here we developed a pipeline that employs Fourier transform infrared (FTIR) spectroscopy in the mid-infrared spectrum to identify a signature of low dose ionizing radiation exposure in mouse ear pinnae over time. Mice exposed to 0.1 to 2 Gy total body irradiation were repeatedly measured by FTIR at the stratum corneum of the ear pinnae. We found significant discriminative power for all doses and time-points out to 90 days after exposure. Classification accuracy was maximized when testing 14 days after exposure (specificity > 0.9 with a sensitivity threshold of 0.9) and dropped by roughly 30% sensitivity at 90 days. Infrared frequencies point towards biological changes in DNA conformation, lipid oxidation and accumulation and shifts in protein secondary structure. Since only hundreds of samples were used to learn the highly discriminative signature, developing human-relevant diagnostic capabilities is likely feasible and this non-invasive procedure points toward rapid, non-invasive, and reagent-free biodosimetry applications at population scales.
Collapse
Affiliation(s)
- Jamie L Inman
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Yulun Wu
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
- Department of Statistics, University of California, Berkeley, CA, 94720, USA
| | - Liang Chen
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Ella Brydon
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Dhruba Ghosh
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, USA
| | - Kenneth H Wan
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Jared De Chant
- Accelerator Technology and Applied Physics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Lieselotte Obst-Huebl
- Accelerator Technology and Applied Physics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Kei Nakamura
- Accelerator Technology and Applied Physics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Corie Y Ralston
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Peter H Zwart
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Hoi-Ying N Holman
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.
| | - Hang Chang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.
| | - James B Brown
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.
- Department of Statistics, University of California, Berkeley, CA, 94720, USA.
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.
| |
Collapse
|
3
|
Bosch JA, Keith N, Escobedo F, Fisher WW, LaGraff JT, Rabasco J, Wan KH, Weiszmann R, Wu Y, Hu Y, Kondo S, Brown JB, Perrimon N, Celniker SE. Molecular and functional characterization of the Drosophila melanogaster conserved smORFome. Cell Rep 2024; 43:113737. [PMID: 38265936 DOI: 10.1016/j.celrep.2024.113737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2024] Open
|
4
|
Kudron M, Gevirtzman L, Victorsen A, Lear BC, Gao J, Xu J, Samanta S, Frink E, Tran-Pearson A, Huynh C, Vafeados D, Hammonds A, Fisher W, Wall M, Wesseling G, Hernandez V, Lin Z, Kasparian M, White K, Allada R, Gerstein M, Hillier L, Celniker SE, Reinke V, Waterston RH. Binding profiles for 954 Drosophila and C. elegans transcription factors reveal tissue specific regulatory relationships. bioRxiv 2024:2024.01.18.576242. [PMID: 38293065 PMCID: PMC10827215 DOI: 10.1101/2024.01.18.576242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here we present the culmination of the modERN (model organism Encyclopedia of Regulatory Networks) consortium that systematically assayed TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). We describe key features of these datasets, comprising 604 TFs identifying 3.6M sites in the fly and 350 TFs identifying 0.9 M sites in the worm. Applying a machine learning model to these data identifies sets of TFs with a prominent role in promoting target gene expression in specific cell types. TF binding data are available through the ENCODE Data Coordinating Center and at https://epic.gs.washington.edu/modERNresource, which provides access to processed and summary data, as well as widgets to probe cell type-specific TF-target relationships. These data are a rich resource that should fuel investigations into TF function during development.
Collapse
Affiliation(s)
- Michelle Kudron
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Louis Gevirtzman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Alec Victorsen
- Department of Laboratory Medicine & Pathology, University of Minnesota, Minneapolis, MN 55455
| | - Bridget C. Lear
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Jiahao Gao
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520
| | - Jinrui Xu
- Department of Biology, Howard University, Washington, District of Columbia 20059, USA
- Center for Applied Data Science and Analytics, Howard University, Washington, District of Columbia 20059, USA
| | - Swapna Samanta
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Emily Frink
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Adri Tran-Pearson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Chau Huynh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Dionne Vafeados
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Ann Hammonds
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - William Fisher
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Martha Wall
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Greg Wesseling
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Vanessa Hernandez
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Zhichun Lin
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Mary Kasparian
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Kevin White
- Department of Biochemistry and Precision Medicine Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597
| | - Ravi Allada
- Department of Neurobiology, Northwestern University, Evanston IL 60208
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520
- Department of Statistics and Data Science, Yale University, New Haven, Connecticut 06520, USA
| | - LaDeana Hillier
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Susan E. Celniker
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Valerie Reinke
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Robert H. Waterston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| |
Collapse
|
5
|
Peng D, Jackson D, Palicha B, Kernfeld E, Laughner N, Shoemaker A, Celniker SE, Loganathan R, Cahan P, Andrew DJ. Organogenetic transcriptomes of the Drosophila embryo at single cell resolution. Development 2024; 151:dev202097. [PMID: 38174902 PMCID: PMC10820837 DOI: 10.1242/dev.202097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 12/19/2023] [Indexed: 01/05/2024]
Abstract
To gain insight into the transcription programs activated during the formation of Drosophila larval structures, we carried out single cell RNA sequencing during two periods of Drosophila embryogenesis: stages 10-12, when most organs are first specified and initiate morphological and physiological specialization; and stages 13-16, when organs achieve their final mature architectures and begin to function. Our data confirm previous findings with regards to functional specialization of some organs - the salivary gland and trachea - and clarify the embryonic functions of another - the plasmatocytes. We also identify two early developmental trajectories in germ cells and uncover a potential role for proteolysis during germline stem cell specialization. We identify the likely cell type of origin for key components of the Drosophila matrisome and several commonly used Drosophila embryonic cell culture lines. Finally, we compare our findings with other recent related studies and with other modalities for identifying tissue-specific gene expression patterns. These data provide a useful community resource for identifying many new players in tissue-specific morphogenesis and functional specialization of developing organs.
Collapse
Affiliation(s)
- Da Peng
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Dorian Jackson
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Bianca Palicha
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Eric Kernfeld
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Nathaniel Laughner
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Ashleigh Shoemaker
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Susan E. Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Rajprasad Loganathan
- Department of Biological Sciences, Wichita State University, Wichita, KS 67260, USA
| | - Patrick Cahan
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Deborah J. Andrew
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| |
Collapse
|
6
|
Li D, Zhong C, Yang M, He L, Chang H, Zhu N, Celniker SE, Threadgill DW, Snijders AM, Mao JH, Yuan Y. Genetic and microbial determinants of azoxymethane-induced colorectal tumor susceptibility in Collaborative Cross mice and their implication in human cancer. Gut Microbes 2024; 16:2341647. [PMID: 38659246 PMCID: PMC11057575 DOI: 10.1080/19490976.2024.2341647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 04/08/2024] [Indexed: 04/26/2024] Open
Abstract
The insights into interactions between host genetics and gut microbiome (GM) in colorectal tumor susceptibility (CTS) remains lacking. We used Collaborative Cross mouse population model to identify genetic and microbial determinants of Azoxymethane-induced CTS. We identified 4417 CTS-associated single nucleotide polymorphisms (SNPs) containing 334 genes that were transcriptionally altered in human colorectal cancers (CRCs) and consistently clustered independent human CRC cohorts into two subgroups with different prognosis. We discovered a set of genera in early-life associated with CTS and defined a 16-genus signature that accurately predicted CTS, the majority of which were correlated with human CRCs. We identified 547 SNPs associated with abundances of these genera. Mediation analysis revealed GM as mediators partially exerting the effect of SNP UNC3869242 within Duox2 on CTS. Intestine cell-specific depletion of Duox2 altered GM composition and contribution of Duox2 depletion to CTS was significantly influenced by GM. Our findings provide potential novel targets for personalized CRC prevention and treatment.
Collapse
Affiliation(s)
- Dan Li
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Key Laboratory of Molecular Biology in Medical Sciences, Hangzhou, ZJ, China
- Department of Medical Oncology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, ZJ, China
| | - Chenhan Zhong
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Key Laboratory of Molecular Biology in Medical Sciences, Hangzhou, ZJ, China
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Mengyuan Yang
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Key Laboratory of Molecular Biology in Medical Sciences, Hangzhou, ZJ, China
| | - Li He
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Hematology, Zhongnan Hospital, Wuhan University, Wuhan, China
| | - Hang Chang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ning Zhu
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Key Laboratory of Molecular Biology in Medical Sciences, Hangzhou, ZJ, China
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David W Threadgill
- Texas A&M Institute for Genome Sciences and Society, Texas A&M University, College Station, TX, USA
- Department of Molecular and Cellular Medicine and Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX, USA
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ying Yuan
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Key Laboratory of Molecular Biology in Medical Sciences, Hangzhou, ZJ, China
- Zhejiang Provincial Clinical Research Center for CANCER, Hangzhou, ZJ, China
- Cancer Center, Zhejiang University, Hangzhou, ZJ, China
| |
Collapse
|
7
|
Bosch JA, Keith N, Escobedo F, Fisher WW, LaGraff JT, Rabasco J, Wan KH, Weiszmann R, Hu Y, Kondo S, Brown JB, Perrimon N, Celniker SE. Molecular and functional characterization of the Drosophila melanogaster conserved smORFome. Cell Rep 2023; 42:113311. [PMID: 37889754 PMCID: PMC10843857 DOI: 10.1016/j.celrep.2023.113311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 08/24/2023] [Accepted: 10/04/2023] [Indexed: 10/29/2023] Open
Abstract
Short polypeptides encoded by small open reading frames (smORFs) are ubiquitously found in eukaryotic genomes and are important regulators of physiology, development, and mitochondrial processes. Here, we focus on a subset of 298 smORFs that are evolutionarily conserved between Drosophila melanogaster and humans. Many of these smORFs are conserved broadly in the bilaterian lineage, and ∼182 are conserved in plants. We observe remarkably heterogeneous spatial and temporal expression patterns of smORF transcripts-indicating wide-spread tissue-specific and stage-specific mitochondrial architectures. In addition, an analysis of annotated functional domains reveals a predicted enrichment of smORF polypeptides localizing to mitochondria. We conduct an embryonic ribosome profiling experiment and find support for translation of 137 of these smORFs during embryogenesis. We further embark on functional characterization using CRISPR knockout/activation, RNAi knockdown, and cDNA overexpression, revealing diverse phenotypes. This study underscores the importance of identifying smORF function in disease and phenotypic diversity.
Collapse
Affiliation(s)
- Justin A Bosch
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Nathan Keith
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Felipe Escobedo
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - William W Fisher
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - James Thai LaGraff
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Jorden Rabasco
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Kenneth H Wan
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Richard Weiszmann
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Yanhui Hu
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Shu Kondo
- Laboratory of Invertebrate Genetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - James B Brown
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | - Norbert Perrimon
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Harvard Medical School, Boston, MA 02115, USA.
| | - Susan E Celniker
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| |
Collapse
|
8
|
He L, Zhong C, Chang H, Inman JL, Celniker SE, Ioakeim-Ioannidou M, Liu KX, Haas-Kogan D, MacDonald SM, Threadgill DW, Kogan SC, Mao JH, Snijders AM. Genetic architecture of the acute and persistent immune cell response after radiation exposure. Cell Genom 2023; 3:100422. [PMID: 38020972 PMCID: PMC10667298 DOI: 10.1016/j.xgen.2023.100422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 08/19/2023] [Accepted: 09/12/2023] [Indexed: 12/01/2023]
Abstract
Hematologic toxicity is a common side effect of multimodal cancer therapy. Nearly all animal studies investigating the causes of radiotherapy-induced hematologic toxicity use inbred strains with limited genetic diversity and do not reflect the diverse responses observed in humans. We used the population-based Collaborative Cross (CC) mouse resource to investigate the genetic architecture of the acute and persistent immune response after radiation exposure by measuring 22 immune parameters in 1,720 CC mice representing 35 strains. We determined relative acute and persistent radiation resistance scores at the individual strain level considering contributions from all immune parameters. Genome-wide association analysis identified quantitative trait loci associated with baseline and radiation responses. A cross-species radiation resistance score predicted recurrence-free survival in medulloblastoma patients. We present a community resource of immune parameters and genome-wide association analyses before and after radiation exposure for future investigations of the contributions of host genetics on radiosensitivity.
Collapse
Affiliation(s)
- Li He
- Department of Hematology, Zhongnan Hospital, Wuhan University, Wuhan, Hubei 430079, China
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Chenhan Zhong
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Medical Oncology, Key Laboratory of Cancer Prevention and Intervention, Ministry of Education, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310009, China
| | - Hang Chang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jamie L. Inman
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Susan E. Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Comparative Biochemistry Program, University of California Berkeley, Berkeley, CA 94720, USA
| | | | - Kevin X. Liu
- Department of Radiation Oncology, Brigham and Women’s Hospital, Dana-Farber Cancer Institute, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Daphne Haas-Kogan
- Department of Radiation Oncology, Brigham and Women’s Hospital, Dana-Farber Cancer Institute, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Shannon M. MacDonald
- Department of Radiation Oncology, Massachusetts General Hospital, Boston, MA 02114, USA
| | - David W. Threadgill
- Texas A&M Institute for Genome Sciences and Society, Texas A&M University, College Station, TX 77843, USA
- Departments of Nutrition and Cell Biology and Genetics, Texas A&M University, College Station, TX 77843, USA
| | - Scott C. Kogan
- Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Comparative Biochemistry Program, University of California Berkeley, Berkeley, CA 94720, USA
| | - Antoine M. Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Comparative Biochemistry Program, University of California Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
9
|
Wan KH, Park S, Booth BW, Brydon EC, Eichenberger J, Inman JL, Mao JH, Snijders AM, Celniker SE. Complete genome sequence of the Microbacterium sp. strain BDGP8. Microbiol Resour Announc 2023; 12:e0038423. [PMID: 37607064 PMCID: PMC10508165 DOI: 10.1128/mra.00384-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 06/16/2023] [Indexed: 08/24/2023] Open
Abstract
Microbacterium sp. BDGP8 is a species of facultative anaerobic gram-positive bacterium of the family Microbacteriaceae. The complete genome consists of a single circular chromosome of 3,293,567 bp with a G + C content of 69.84% and two plasmids of 49,365 bp and 32,884 bp.
Collapse
Affiliation(s)
- Kenneth H. Wan
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Soo Park
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Benjamin W. Booth
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Ella C. Brydon
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Joanne Eichenberger
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Jamie L. Inman
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Antoine M. Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Susan E. Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| |
Collapse
|
10
|
Guichard A, Lu S, Kanca O, Bressan D, Huang Y, Ma M, Sanz Juste S, Andrews JC, Jay KL, Sneider M, Schwartz R, Huang MC, Bei D, Pan H, Ma L, Lin WW, Auradkar A, Bhagwat P, Park S, Wan KH, Ohsako T, Takano-Shimizu T, Celniker SE, Wangler MF, Yamamoto S, Bellen HJ, Bier E. A comprehensive Drosophila resource to identify key functional interactions between SARS-CoV-2 factors and host proteins. Cell Rep 2023; 42:112842. [PMID: 37480566 PMCID: PMC10962759 DOI: 10.1016/j.celrep.2023.112842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 05/18/2023] [Accepted: 07/05/2023] [Indexed: 07/24/2023] Open
Abstract
Development of effective therapies against SARS-CoV-2 infections relies on mechanistic knowledge of virus-host interface. Abundant physical interactions between viral and host proteins have been identified, but few have been functionally characterized. Harnessing the power of fly genetics, we develop a comprehensive Drosophila COVID-19 resource (DCR) consisting of publicly available strains for conditional tissue-specific expression of all SARS-CoV-2 encoded proteins, UAS-human cDNA transgenic lines encoding established host-viral interacting factors, and GAL4 insertion lines disrupting fly homologs of SARS-CoV-2 human interacting proteins. We demonstrate the utility of the DCR to functionally assess SARS-CoV-2 genes and candidate human binding partners. We show that NSP8 engages in strong genetic interactions with several human candidates, most prominently with the ATE1 arginyltransferase to induce actin arginylation and cytoskeletal disorganization, and that two ATE1 inhibitors can reverse NSP8 phenotypes. The DCR enables parallel global-scale functional analysis of SARS-CoV-2 components in a prime genetic model system.
Collapse
Affiliation(s)
- Annabel Guichard
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Shenzhao Lu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Oguz Kanca
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Daniel Bressan
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA; Instituto de Ciências Biomédicas (ICB), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro 21941-902, Brazil
| | - Yan Huang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Mengqi Ma
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Sara Sanz Juste
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA; Department of Epigenetics & Molecular Carcinogenesis at MD Anderson, The University of Texas MD Anderson Cancer Center, Houston, TX 77054, USA; Center for Cancer Epigenetics, MD Anderson Cancer Center, Houston, TX, USA
| | - Jonathan C Andrews
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Kristy L Jay
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Marketta Sneider
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Ruth Schwartz
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Mei-Chu Huang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Danqing Bei
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Hongling Pan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Liwen Ma
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Wen-Wen Lin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Ankush Auradkar
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Pranjali Bhagwat
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Soo Park
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Kenneth H Wan
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Takashi Ohsako
- Advanced Technology Center, Kyoto Institute of Technology, Kyoto 606-8585, Japan
| | - Toshiyuki Takano-Shimizu
- Kyoto Drosophila Stock Center and Faculty of Applied Biology, Kyoto Institute of Technology, Kyoto 616-8354, Japan
| | - Susan E Celniker
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Michael F Wangler
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
| | - Shinya Yamamoto
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030, USA; Development, Disease Models & Therapeutics Graduate Program, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Hugo J Bellen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Ethan Bier
- Section of Cell and Developmental Biology, University of California, San Diego (UCSD), La Jolla, CA 92093, USA; Tata Institute for Genetics and Society - UCSD, La Jolla, CA 92093, USA.
| |
Collapse
|
11
|
Tang HW, Spirohn K, Hu Y, Hao T, Kovács IA, Gao Y, Binari R, Yang-Zhou D, Wan KH, Bader JS, Balcha D, Bian W, Booth BW, Coté AG, de Rouck S, Desbuleux A, Goh KY, Kim DK, Knapp JJ, Lee WX, Lemmens I, Li C, Li M, Li R, Lim HJ, Liu Y, Luck K, Markey D, Pollis C, Rangarajan S, Rodiger J, Schlabach S, Shen Y, Sheykhkarimli D, TeeKing B, Roth FP, Tavernier J, Calderwood MA, Hill DE, Celniker SE, Vidal M, Perrimon N, Mohr SE. Next-generation large-scale binary protein interaction network for Drosophila melanogaster. Nat Commun 2023; 14:2162. [PMID: 37061542 PMCID: PMC10105736 DOI: 10.1038/s41467-023-37876-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 04/04/2023] [Indexed: 04/17/2023] Open
Abstract
Generating reference maps of interactome networks illuminates genetic studies by providing a protein-centric approach to finding new components of existing pathways, complexes, and processes. We apply state-of-the-art methods to identify binary protein-protein interactions (PPIs) for Drosophila melanogaster. Four all-by-all yeast two-hybrid (Y2H) screens of > 10,000 Drosophila proteins result in the 'FlyBi' dataset of 8723 PPIs among 2939 proteins. Testing subsets of data from FlyBi and previous PPI studies using an orthogonal assay allows for normalization of data quality; subsequent integration of FlyBi and previous data results in an expanded binary Drosophila reference interaction network, DroRI, comprising 17,232 interactions among 6511 proteins. We use FlyBi data to generate an autophagy network, then validate in vivo using autophagy-related assays. The deformed wings (dwg) gene encodes a protein that is both a regulator and a target of autophagy. Altogether, these resources provide a foundation for building new hypotheses regarding protein networks and function.
Collapse
Affiliation(s)
- Hong-Wen Tang
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
- Division of Cellular & Molecular Research, Humphrey Oei Institute of Cancer Research, National Cancer Centre Singapore, Singapore, 169610, Singapore
| | - Kerstin Spirohn
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Yanhui Hu
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Tong Hao
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - István A Kovács
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
- Department of Physics and Astronomy, Northwestern University, 633 Clark Street, Evanston, IL, 60208, USA
- Northwestern Institute on Complex Systems, Chambers Hall, Northwestern University, 600 Foster St, Evanston, IL, 60208, USA
| | - Yue Gao
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Richard Binari
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Howard Hughes Medical Institute, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Donghui Yang-Zhou
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Kenneth H Wan
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Joel S Bader
- Department of Biomedical Engineering, Whiting School of Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
- High-Throughput Biology Center, Institute of Basic Biological Sciences, Johns Hopkins School of Medicine, 733 North Broadway, Baltimore, MD, 21205, USA
| | - Dawit Balcha
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Wenting Bian
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Benjamin W Booth
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Atina G Coté
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
- Donnelly Centre for Cellular and Biomolecular Research and Department of Molecular Genetics, University of Toronto, 160 College St, Toronto, ON, M5S 3E1, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health, 600 University Ave, Toronto, ON, M5G 1×5, Canada
| | - Steffi de Rouck
- Cytokine Receptor Lab, VIB Center for Medical Biotechnology, Albert Baertsoenkaai 3, 9000, Ghent, Belgium
| | - Alice Desbuleux
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Kah Yong Goh
- Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
| | - Dae-Kyum Kim
- Department of Cancer Genetics and Genomics, Roswell Park Comprehensive Cancer Center, 665 Elm St., Buffalo, NY, 14203, USA
| | - Jennifer J Knapp
- Donnelly Centre for Cellular and Biomolecular Research and Department of Molecular Genetics, University of Toronto, 160 College St, Toronto, ON, M5S 3E1, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health, 600 University Ave, Toronto, ON, M5G 1×5, Canada
| | - Wen Xing Lee
- Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
| | - Irma Lemmens
- Cytokine Receptor Lab, VIB Center for Medical Biotechnology, Albert Baertsoenkaai 3, 9000, Ghent, Belgium
| | - Cathleen Li
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Mian Li
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Roujia Li
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
- Donnelly Centre for Cellular and Biomolecular Research and Department of Molecular Genetics, University of Toronto, 160 College St, Toronto, ON, M5S 3E1, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health, 600 University Ave, Toronto, ON, M5G 1×5, Canada
| | - Hyobin Julianne Lim
- Department of Cancer Genetics and Genomics, Roswell Park Comprehensive Cancer Center, 665 Elm St., Buffalo, NY, 14203, USA
| | - Yifang Liu
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Katja Luck
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Dylan Markey
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Carl Pollis
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Sudharshan Rangarajan
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Jonathan Rodiger
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
| | - Sadie Schlabach
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Yun Shen
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Dayag Sheykhkarimli
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
- Donnelly Centre for Cellular and Biomolecular Research and Department of Molecular Genetics, University of Toronto, 160 College St, Toronto, ON, M5S 3E1, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health, 600 University Ave, Toronto, ON, M5G 1×5, Canada
| | - Bridget TeeKing
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Frederick P Roth
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
- Donnelly Centre for Cellular and Biomolecular Research and Department of Molecular Genetics, University of Toronto, 160 College St, Toronto, ON, M5S 3E1, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health, 600 University Ave, Toronto, ON, M5G 1×5, Canada
- Department of Computer Science, University of Toronto, 40 St George St, Toronto, ON, M5S 2E4, Canada
| | - Jan Tavernier
- Cytokine Receptor Lab, VIB Center for Medical Biotechnology, Albert Baertsoenkaai 3, 9000, Ghent, Belgium
| | - Michael A Calderwood
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - David E Hill
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA
| | - Susan E Celniker
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.
| | - Marc Vidal
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA.
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, 450 Brookline Avenue, Boston, MA, 02215, USA.
| | - Norbert Perrimon
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA.
- Howard Hughes Medical Institute, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA.
| | - Stephanie E Mohr
- Department of Genetics, Blavatnik Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA.
| |
Collapse
|
12
|
Fisher WW, Hammonds AS, Weiszmann R, Booth BW, Gevirtzman L, Patton JEJ, Kubo CA, Waterston RH, Celniker SE. A modERN resource: identification of Drosophila transcription factor candidate target genes using RNAi. Genetics 2023; 223:iyad004. [PMID: 36652461 PMCID: PMC10078917 DOI: 10.1093/genetics/iyad004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 11/18/2022] [Accepted: 12/22/2022] [Indexed: 01/19/2023] Open
Abstract
Transcription factors (TFs) play a key role in development and in cellular responses to the environment by activating or repressing the transcription of target genes in precise spatial and temporal patterns. In order to develop a catalog of target genes of Drosophila melanogaster TFs, the modERN consortium systematically knocked down the expression of TFs using RNAi in whole embryos followed by RNA-seq. We generated data for 45 TFs which have 18 different DNA-binding domains and are expressed in 15 of the 16 organ systems. The range of inactivation of the targeted TFs by RNAi ranged from log2fold change -3.52 to +0.49. The TFs also showed remarkable heterogeneity in the numbers of candidate target genes identified, with some generating thousands of candidates and others only tens. We present detailed analysis from five experiments, including those for three TFs that have been the focus of previous functional studies (ERR, sens, and zfh2) and two previously uncharacterized TFs (sens-2 and CG32006), as well as short vignettes for selected additional experiments to illustrate the utility of this resource. The RNA-seq datasets are available through the ENCODE DCC (http://encodeproject.org) and the Sequence Read Archive (SRA). TF and target gene expression patterns can be found here: https://insitu.fruitfly.org. These studies provide data that facilitate scientific inquiries into the functions of individual TFs in key developmental, metabolic, defensive, and homeostatic regulatory pathways, as well as provide a broader perspective on how individual TFs work together in local networks during embryogenesis.
Collapse
Affiliation(s)
- William W Fisher
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ann S Hammonds
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Richard Weiszmann
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Benjamin W Booth
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Louis Gevirtzman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Jaeda E J Patton
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Connor A Kubo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Robert H Waterston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Susan E Celniker
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
13
|
Kenneally R, Lawrence Q, Brydon E, Wan KH, Mao JH, Verma SC, Khazaieli A, Celniker SE, Snijders AM. Inactivation of multiple human pathogens by Fathhome's dry sanitizer device: Rapid and eco-friendly ozone-based disinfection. Medicine in Microecology 2022; 14:100059. [PMID: 35945946 PMCID: PMC9354387 DOI: 10.1016/j.medmic.2022.100059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 06/30/2022] [Accepted: 07/24/2022] [Indexed: 11/25/2022] Open
Abstract
SARS-CoV-2 spread rapidly, causing millions of deaths across the globe. As a result, demand for medical supplies and personal protective equipment (PPE) surged and supplies dwindled. Separate entirely, hospital-acquired infections have become commonplace and challenging to treat. To explore the potential of novel sterilization techniques, this study evaluated the disinfection efficacy of Fathhome's ozone-based, dry-sanitizing device by dose and time response. Inactivation of human pathogens was tested on non-porous (plastic) surfaces. 95.42–100% inactivation was observed across all types of vegetative microorganisms and 27.36% inactivation of bacterial endospores tested, with no residual ozone detectable after completion. These results strongly support the hypothesis that Fathhome's commercial implementation of gas-based disinfection is suitable for rapid decontamination of a wide variety of pathogens on PPE and other industrially relevant materials.
Collapse
|
14
|
He L, Zhou Y, Zhang Y, Hang B, Chang H, Schick SF, Celniker SE, Xia Y, Snijders AM, Mao J. Thirdhand cigarette smoke leads to age-dependent and persistent alterations in the cecal microbiome of mice. Microbiologyopen 2021; 10:e1198. [PMID: 34180593 PMCID: PMC8123915 DOI: 10.1002/mbo3.1198] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 04/26/2021] [Accepted: 04/27/2021] [Indexed: 01/11/2023] Open
Abstract
The gut microbiome composition is influenced by many factors including environmental exposures. Here, we investigated the effect of thirdhand cigarette smoke (THS) and exposure age on gut microbiome diversity. C57BL/6 mice were exposed to THS at human exposure relevant levels for three weeks during three different life stages: postnatal (0-3 weeks of age), pubescent (4-7 weeks of age), and adult (9-12 weeks of age), respectively. Cecal microbiome profiles were assessed at 17 weeks of age by 16S rRNA gene sequencing. We found that age at THS exposure strongly influenced the cecal microbiome composition. Although postnatal THS exposure significantly influenced the microbial composition, pubescent and adulthood exposures only had minor effects. The microbiome of postnatally THS-exposed mice significantly increased several degradation pathways that regulate glycolysis and pyruvate decarboxylation, and significantly decreased coenzyme A biosynthesis and pyrimidine deoxyribonucleoside salvage. Our results indicate that mouse postnatal development is particularly susceptible to persistent THS exposure effects on the gut microbiome.
Collapse
Affiliation(s)
- Li He
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
- Department of HematologyZhongnan HospitalWuhan UniversityWuhanChina
| | - Yan‐Xia Zhou
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
- Marine CollegeShandong UniversityWeihaiChina
| | - Yuqing Zhang
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
- State Key Laboratory of Reproductive MedicineInstitute of ToxicologyNanjing Medical UniversityNanjingChina
| | - Bo Hang
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
| | - Hang Chang
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
| | - Suzaynn F. Schick
- Department of MedicineDivision of Occupational and Environmental MedicineUniversity of CaliforniaSan FranciscoCAUSA
| | - Susan E. Celniker
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
| | - Yankai Xia
- State Key Laboratory of Reproductive MedicineInstitute of ToxicologyNanjing Medical UniversityNanjingChina
| | - Antoine M. Snijders
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
| | - Jian‐Hua Mao
- Biological Systems and Engineering DivisionLawrence Berkeley National LaboratoryBerkeleyCAUSA
| |
Collapse
|
15
|
Dachet F, Brown JB, Valyi-Nagy T, Narayan KD, Serafini A, Boley N, Gingeras TR, Celniker SE, Mohapatra G, Loeb JA. Selective time-dependent changes in activity and cell-specific gene expression in human postmortem brain. Sci Rep 2021; 11:6078. [PMID: 33758256 PMCID: PMC7988150 DOI: 10.1038/s41598-021-85801-6] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 02/24/2021] [Indexed: 12/15/2022] Open
Abstract
As a means to understand human neuropsychiatric disorders from human brain samples, we compared the transcription patterns and histological features of postmortem brain to fresh human neocortex isolated immediately following surgical removal. Compared to a number of neuropsychiatric disease-associated postmortem transcriptomes, the fresh human brain transcriptome had an entirely unique transcriptional pattern. To understand this difference, we measured genome-wide transcription as a function of time after fresh tissue removal to mimic the postmortem interval. Within a few hours, a selective reduction in the number of neuronal activity-dependent transcripts occurred with relative preservation of housekeeping genes commonly used as a reference for RNA normalization. Gene clustering indicated a rapid reduction in neuronal gene expression with a reciprocal time-dependent increase in astroglial and microglial gene expression that continued to increase for at least 24 h after tissue resection. Predicted transcriptional changes were confirmed histologically on the same tissue demonstrating that while neurons were degenerating, glial cells underwent an outgrowth of their processes. The rapid loss of neuronal genes and reciprocal expression of glial genes highlights highly dynamic transcriptional and cellular changes that occur during the postmortem interval. Understanding these time-dependent changes in gene expression in post mortem brain samples is critical for the interpretation of research studies on human brain disorders.
Collapse
Affiliation(s)
- Fabien Dachet
- University of Illinois at Chicago, Chicago, IL, 60612, USA.
| | - James B Brown
- Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | | | | | - Anna Serafini
- University of Illinois at Chicago, Chicago, IL, 60612, USA
| | - Nathan Boley
- University of California, Berkeley, CA, 94720, USA
| | | | | | | | - Jeffrey A Loeb
- University of Illinois at Chicago, Chicago, IL, 60612, USA.
| |
Collapse
|
16
|
Mao JH, Kim YM, Zhou YX, Hu D, Zhong C, Chang H, Brislawn CJ, Fansler S, Langley S, Wang Y, Peisl BYL, Celniker SE, Threadgill DW, Wilmes P, Orr G, Metz TO, Jansson JK, Snijders AM. Correction to: Genetic and metabolic links between the murine microbiome and memory. Microbiome 2020; 8:73. [PMID: 32466793 PMCID: PMC7257159 DOI: 10.1186/s40168-020-00870-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
An amendment to this paper has been published and can be accessed via the original article.
Collapse
Affiliation(s)
- Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Young-Mo Kim
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Yan-Xia Zhou
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Marine College, Shandong University, Weihai, 264209, China
| | - Dehong Hu
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Chenhan Zhong
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Hang Chang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Colin J Brislawn
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Sarah Fansler
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Sasha Langley
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Yunshan Wang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Department of Clinical Laboratory, The Second Hospital of Shandong University, Jinan, 250033, Shandong, China
| | - B Y Loulou Peisl
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts Fourneaux, L-4362, Esch-sur-Alzette, Luxembourg
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - David W Threadgill
- Department of Veterinary Pathobiology, A&M University, College Station, Texas, USA
- Department of Molecular and Cellular Medicine Texas, A&M University, College Station, Texas, USA
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts Fourneaux, L-4362, Esch-sur-Alzette, Luxembourg
| | - Galya Orr
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Janet K Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA.
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
17
|
Mao JH, Kim YM, Zhou YX, Hu D, Zhong C, Chang H, Brislawn CJ, Fansler S, Langley S, Wang Y, Peisl BYL, Celniker SE, Threadgill DW, Wilmes P, Orr G, Metz TO, Jansson JK, Snijders AM. Genetic and metabolic links between the murine microbiome and memory. Microbiome 2020; 8:53. [PMID: 32299497 PMCID: PMC7164142 DOI: 10.1186/s40168-020-00817-w] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 03/02/2020] [Indexed: 05/04/2023]
Abstract
BACKGROUND Recent evidence has linked the gut microbiome to host behavior via the gut-brain axis [1-3]; however, the underlying mechanisms remain unexplored. Here, we determined the links between host genetics, the gut microbiome and memory using the genetically defined Collaborative Cross (CC) mouse cohort, complemented with microbiome and metabolomic analyses in conventional and germ-free (GF) mice. RESULTS A genome-wide association analysis (GWAS) identified 715 of 76,080 single-nucleotide polymorphisms (SNPs) that were significantly associated with short-term memory using the passive avoidance model. The identified SNPs were enriched in genes known to be involved in learning and memory functions. By 16S rRNA gene sequencing of the gut microbial community in the same CC cohort, we identified specific microorganisms that were significantly correlated with longer latencies in our retention test, including a positive correlation with Lactobacillus. Inoculation of GF mice with individual species of Lactobacillus (L. reuteri F275, L. plantarum BDGP2 or L. brevis BDGP6) resulted in significantly improved memory compared to uninoculated or E. coli DH10B inoculated controls. Untargeted metabolomics analysis revealed significantly higher levels of several metabolites, including lactate, in the stools of Lactobacillus-colonized mice, when compared to GF control mice. Moreover, we demonstrate that dietary lactate treatment alone boosted memory in conventional mice. Mechanistically, we show that both inoculation with Lactobacillus or lactate treatment significantly increased the levels of the neurotransmitter, gamma-aminobutyric acid (GABA), in the hippocampus of the mice. CONCLUSION Together, this study provides new evidence for a link between Lactobacillus and memory and our results open possible new avenues for treating memory impairment disorders using specific gut microbial inoculants and/or metabolites. Video Abstract.
Collapse
Affiliation(s)
- Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Young-Mo Kim
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Yan-Xia Zhou
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
- Marine College, Shandong University, Weihai, 264209 China
| | - Dehong Hu
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Chenhan Zhong
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Hang Chang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Colin J. Brislawn
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Sarah Fansler
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Sasha Langley
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Yunshan Wang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
- Department of Clinical Laboratory, The Second Hospital of Shandong University, Jinan, 250033 Shandong China
| | - B. Y. Loulou Peisl
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Susan E. Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - David W. Threadgill
- Department of Veterinary Pathobiology, A&M University, College Station, Texas, USA
- Department of Molecular and Cellular Medicine Texas, A&M University, College Station, Texas, USA
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Galya Orr
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Thomas O. Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Janet K. Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA USA
| | - Antoine M. Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| |
Collapse
|
18
|
Wang P, Wang Y, Langley SA, Zhou YX, Jen KY, Sun Q, Brislawn C, Rojas CM, Wahl KL, Wang T, Fan X, Jansson JK, Celniker SE, Zou X, Threadgill DW, Snijders AM, Mao JH. Diverse tumour susceptibility in Collaborative Cross mice: identification of a new mouse model for human gastric tumourigenesis. Gut 2019; 68:1942-1952. [PMID: 30842212 PMCID: PMC6839736 DOI: 10.1136/gutjnl-2018-316691] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Revised: 02/11/2019] [Accepted: 02/12/2019] [Indexed: 12/20/2022]
Abstract
OBJECTIVE The Collaborative Cross (CC) is a mouse population model with diverse and reproducible genetic backgrounds used to identify novel disease models and genes that contribute to human disease. Since spontaneous tumour susceptibility in CC mice remains unexplored, we assessed tumour incidence and spectrum. DESIGN We monitored 293 mice from 18 CC strains for tumour development. Genetic association analysis and RNA sequencing were used to identify susceptibility loci and candidate genes. We analysed genomes of patients with gastric cancer to evaluate the relevance of genes identified in the CC mouse model and measured the expression levels of ISG15 by immunohistochemical staining using a gastric adenocarcinoma tissue microarray. Association of gene expression with overall survival (OS) was assessed by Kaplan-Meier analysis. RESULTS CC mice displayed a wide range in the incidence and types of spontaneous tumours. More than 40% of CC036 mice developed gastric tumours within 1 year. Genetic association analysis identified Nfκb1 as a candidate susceptibility gene, while RNA sequencing analysis of non-tumour gastric tissues from CC036 mice showed significantly higher expression of inflammatory response genes. In human gastric cancers, the majority of human orthologues of the 166 mouse genes were preferentially altered by amplification or deletion and were significantly associated with OS. Higher expression of the CC036 inflammatory response gene signature is associated with poor OS. Finally, ISG15 protein is elevated in gastric adenocarcinomas and correlated with shortened patient OS. CONCLUSIONS CC strains exhibit tremendous variation in tumour susceptibility, and we present CC036 as a spontaneous laboratory mouse model for studying human gastric tumourigenesis.
Collapse
Affiliation(s)
- Pin Wang
- Department of Gastroenterology, Nanjing Drum Tower Hospital, Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China,Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Yunshan Wang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA,Clinical Laboratory, Second Hospital of Shandong University, Jinan, China
| | - Sasha A Langley
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Yan-Xia Zhou
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA,College of Marine Science, Shandong University, Weihai, China
| | - Kuang-Yu Jen
- Department of Pathology, University of California Davis Medical Center, Sacramento, California, USA
| | - Qi Sun
- Department of Pathology, Nanjing Drum Tower Hospital, Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Colin Brislawn
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Carolina M Rojas
- Department of Veterinary Pathobiology, Texas A&M University, College Station, Texas, USA,Department of Molecular and Cellular Medicine, Texas A&M University, College Station, Texas, USA
| | - Kimberly L Wahl
- Department of Veterinary Pathobiology, Texas A&M University, College Station, Texas, USA,Department of Molecular and Cellular Medicine, Texas A&M University, College Station, Texas, USA
| | - Ting Wang
- Department of Pathology, Nanjing Drum Tower Hospital, Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Xiangshan Fan
- Department of Pathology, Nanjing Drum Tower Hospital, Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Janet K Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Xiaoping Zou
- Department of Gastroenterology, Nanjing Drum Tower Hospital, Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China
| | - David W Threadgill
- Department of Veterinary Pathobiology, Texas A&M University, College Station, Texas, USA,Department of Molecular and Cellular Medicine, Texas A&M University, College Station, Texas, USA
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| |
Collapse
|
19
|
Chen K, Luan X, Liu Q, Wang J, Chang X, Snijders AM, Mao JH, Secombe J, Dan Z, Chen JH, Wang Z, Dong X, Qiu C, Chang X, Zhang D, Celniker SE, Liu X. Drosophila Histone Demethylase KDM5 Regulates Social Behavior through Immune Control and Gut Microbiota Maintenance. Cell Host Microbe 2019; 25:537-552.e8. [PMID: 30902578 DOI: 10.1016/j.chom.2019.02.003] [Citation(s) in RCA: 99] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2018] [Revised: 12/05/2018] [Accepted: 02/15/2019] [Indexed: 12/20/2022]
Abstract
Loss-of-function mutations in the histone demethylases KDM5A, KDM5B, or KDM5C are found in intellectual disability (ID) and autism spectrum disorders (ASD) patients. Here, we use the model organism Drosophila melanogaster to delineate how KDM5 contributes to ID and ASD. We show that reducing KDM5 causes intestinal barrier dysfunction and changes in social behavior that correlates with compositional changes in the gut microbiota. Therapeutic alteration of the dysbiotic microbiota through antibiotic administration or feeding with a probiotic Lactobacillus strain partially rescues the behavioral, lifespan, and cellular phenotypes observed in kdm5-deficient flies. Mechanistically, KDM5 was found to transcriptionally regulate component genes of the immune deficiency (IMD) signaling pathway and subsequent maintenance of host-commensal bacteria homeostasis in a demethylase-dependent manner. Together, our study uses a genetic approach to dissect the role of KDM5 in the gut-microbiome-brain axis and suggests that modifying the gut microbiome may provide therapeutic benefits for ID and ASD patients.
Collapse
Affiliation(s)
- Kun Chen
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Pathogen of Jiangsu Province, Center of Global Health, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Human Functional Genomics of Jiangsu Province, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Holistic Integrative Enterology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing 210011, China
| | - Xiaoting Luan
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Qisha Liu
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Pathogen of Jiangsu Province, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Jianwei Wang
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Pathogen of Jiangsu Province, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Xinxia Chang
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Pathogen of Jiangsu Province, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Julie Secombe
- Departments of Genetics and Neuroscience, Albert Einstein College of Medicine, Bronx, New York, NY 10461, USA
| | - Zhou Dan
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Jian-Huan Chen
- Genomic and Precision Medicine Laboratory, Department of Public Health, Wuxi School of Medicine, Jiangnan University, Wuxi 214122, China
| | - Zibin Wang
- Center for Analysis and Testing, Nanjing Medical University, Nanjing 211166, China
| | - Xiao Dong
- Departments of Genetics and Neuroscience, Albert Einstein College of Medicine, Bronx, New York, NY 10461, USA
| | - Chen Qiu
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Xiaoai Chang
- Key Laboratory of Human Functional Genomics of Jiangsu Province, Nanjing Medical University, Nanjing 211166, China
| | - Dong Zhang
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Xingyin Liu
- Department of Pathogen Biology-Microbiology Division, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Pathogen of Jiangsu Province, Center of Global Health, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Human Functional Genomics of Jiangsu Province, Nanjing Medical University, Nanjing 211166, China; Key Laboratory of Holistic Integrative Enterology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing 210011, China.
| |
Collapse
|
20
|
Lee H, McManus CJ, Cho DY, Eaton M, Renda F, Somma MP, Cherbas L, May G, Powell S, Zhang D, Zhan L, Resch A, Andrews J, Celniker SE, Cherbas P, Przytycka TM, Gatti M, Oliver B, Graveley B, MacAlpine D. Correction to: DNA copy number evolution in Drosophila cell lines. Genome Biol 2019; 20:53. [PMID: 30857560 PMCID: PMC6410492 DOI: 10.1186/s13059-019-1668-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 03/04/2019] [Indexed: 12/02/2022] Open
Affiliation(s)
- Hangnoh Lee
- National Institute of Diabetes, Digestive, and Kidney Diseases, National Institutes of Health, 50 South Drive, Bethesda, MD, 20892, USA
| | - C Joel McManus
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT, 06030, USA.,Present addresses: Department of Biological Sciences, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA, 15213, USA
| | - Dong-Yeon Cho
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20892, USA
| | - Matthew Eaton
- Department of Pharmacology and Cancer Biology, Levine Science Research Center, Duke University Medical Center, 308 Research Drive, Durham, NC, 27708, USA
| | - Fioranna Renda
- Istituto di Biologia e Patologia Molecolari (IBPM) del CNR and Dipartimento di Biologia e Biotecnologie, Sapienza, Università di Roma, 5 Aldo Moro Piazzale, 00185, Rome, Italy
| | - Maria Patrizia Somma
- Istituto di Biologia e Patologia Molecolari (IBPM) del CNR and Dipartimento di Biologia e Biotecnologie, Sapienza, Università di Roma, 5 Aldo Moro Piazzale, 00185, Rome, Italy
| | - Lucy Cherbas
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, IN, 47405, USA
| | - Gemma May
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT, 06030, USA.,Present addresses: Department of Biological Sciences, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA, 15213, USA
| | - Sara Powell
- Department of Pharmacology and Cancer Biology, Levine Science Research Center, Duke University Medical Center, 308 Research Drive, Durham, NC, 27708, USA
| | - Dayu Zhang
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, IN, 47405, USA.,School of Agricultural and Food Science, Zhejiang A and F University, 88 Huan Cheng Bei Road, Lin'an, Zhejiang, 311300, China
| | - Lijun Zhan
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT, 06030, USA
| | - Alissa Resch
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT, 06030, USA
| | - Justen Andrews
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, IN, 47405, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
| | - Peter Cherbas
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, IN, 47405, USA
| | - Teresa M Przytycka
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20892, USA
| | - Maurizio Gatti
- Istituto di Biologia e Patologia Molecolari (IBPM) del CNR and Dipartimento di Biologia e Biotecnologie, Sapienza, Università di Roma, 5 Aldo Moro Piazzale, 00185, Rome, Italy
| | - Brian Oliver
- National Institute of Diabetes, Digestive, and Kidney Diseases, National Institutes of Health, 50 South Drive, Bethesda, MD, 20892, USA.
| | - Brenton Graveley
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT, 06030, USA.
| | - David MacAlpine
- Department of Pharmacology and Cancer Biology, Levine Science Research Center, Duke University Medical Center, 308 Research Drive, Durham, NC, 27708, USA.
| |
Collapse
|
21
|
Zhou YX, Fuentes-Creollo G, Ponce F, Langley SA, Jen KY, Celniker SE, Mao JH, Snijders AM. No difference in 4-nitroquinoline induced tumorigenesis between germ-free and colonized mice. Mol Carcinog 2019; 58:627-632. [PMID: 30632250 DOI: 10.1002/mc.22972] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Revised: 12/14/2018] [Accepted: 01/03/2019] [Indexed: 12/24/2022]
Abstract
Variations in oral bacterial communities have been linked to oral cancer suggesting that the oral microbiome is an etiological factor that can influence oral cancer development. The 4-nitroquinoline 1-oxide (4-NQO)-induced murine oral and esophageal cancer model is frequently used to assess the effects of preventive and/or therapeutic agents. We used this model to assess the impact of the microbiome on tumorigenesis using axenic (germ-free) and conventionally housed mice. Increased toxicity was observed in germ-free mice, however, no difference in tumor incidence, multiplicity, and size was observed. Transcriptional profiling of liver tissue from germ-free and conventionally housed mice identified 254 differentially expressed genes including ten cytochrome p450 enzymes, the largest family of phase-1 drug metabolizing enzymes in the liver. Gene ontology revealed that differentially expressed genes were enriched for liver steatosis, inflammation, and oxidative stress in livers of germ-free mice. Our observations emphasize the importance of the microbiome in mediating chemical toxicity at least in part by altering host gene expression. Studies on the role of the microbiome in chemical-induced cancer using germ-free animal models should consider the potential difference in dose due to the microbiome-mediated changes in host metabolizing capacity, which might influence the ability to draw conclusions especially for tumor induction models that are dose dependent.
Collapse
Affiliation(s)
- Yan-Xia Zhou
- College of Marine Science, Shandong University at Weihai, Weihai, China.,Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Gabriela Fuentes-Creollo
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Frank Ponce
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Sasha A Langley
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Kuang-Yu Jen
- Department of Pathology and Laboratory Medicine, University of California Davis, Sacramento, California
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California.,Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, California.,School of BioEngineering and BioInformatics, Nanjing Medical University, Nanjing, China
| | - Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California.,Berkeley Biomedical Data Science Center, Lawrence Berkeley National Laboratory, Berkeley, California.,School of BioEngineering and BioInformatics, Nanjing Medical University, Nanjing, China
| |
Collapse
|
22
|
Parra M, Booth BW, Weiszmann R, Yee B, Yeo GW, Brown JB, Celniker SE, Conboy JG. An important class of intron retention events in human erythroblasts is regulated by cryptic exons proposed to function as splicing decoys. RNA 2018; 24:1255-1265. [PMID: 29959282 PMCID: PMC6097662 DOI: 10.1261/rna.066951.118] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Accepted: 06/26/2018] [Indexed: 06/08/2023]
Abstract
During terminal erythropoiesis, the splicing machinery in differentiating erythroblasts executes a robust intron retention (IR) program that impacts expression of hundreds of genes. We studied IR mechanisms in the SF3B1 splicing factor gene, which expresses ∼50% of its transcripts in late erythroblasts as a nuclear isoform that retains intron 4. RNA-seq analysis of nonsense-mediated decay (NMD)-inhibited cells revealed previously undescribed splice junctions, rare or not detected in normal cells, that connect constitutive exons 4 and 5 to highly conserved cryptic cassette exons within the intron. Minigene splicing reporter assays showed that these cassettes promote IR. Genome-wide analysis of splice junction reads demonstrated that cryptic noncoding cassettes are much more common in large (>1 kb) retained introns than they are in small retained introns or in nonretained introns. Functional assays showed that heterologous cassettes can promote retention of intron 4 in the SF3B1 splicing reporter. Although many of these cryptic exons were spliced inefficiently, they exhibited substantial binding of U2AF1 and U2AF2 adjacent to their splice acceptor sites. We propose that these exons function as decoys that engage the intron-terminal splice sites, thereby blocking cross-intron interactions required for excision. Developmental regulation of decoy function underlies a major component of the erythroblast IR program.
Collapse
Affiliation(s)
- Marilyn Parra
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Ben W Booth
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Richard Weiszmann
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brian Yee
- Department of Cellular and Molecular Medicine, University of California at San Diego, La Jolla, California 92037, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California at San Diego, La Jolla, California 92037, USA
- Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597
| | - James B Brown
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - John G Conboy
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| |
Collapse
|
23
|
Booth BW, McParland C, Beattie K, Fisher WW, Hammonds AS, Celniker SE, Frise E. OpenHiCAMM: High-Content Screening Software for Complex Microscope Imaging Workflows. iScience 2018; 2:136-140. [PMID: 29888763 PMCID: PMC5993205 DOI: 10.1016/j.isci.2018.03.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
High-content image acquisition is generally limited to cells grown in culture, requiring complex hardware and preset imaging modalities. Here we report an open source software package, OpenHiCAMM (Open Hi Content Acquisition for μManager), that provides a flexible framework for integration of generic microscope-associated robotics and image processing with sequential workflows. As an example, we imaged Drosophila embryos, detecting the embryos at low resolution, followed by re-imaging the detected embryos at high resolution, suitable for computational analysis and screening. The OpenHiCAMM package is easy to use and adapt for automating complex microscope image tasks. It expands our abilities for high-throughput image-based screens to a new range of biological samples, such as organoids, and will provide a foundation for bioimaging systems biology. OpenHiCAMM is an open source software that integrates robotics and image acquisition OpenHiCAMM is a module for the Fiji/ImageJ and μManager software packages OpenHiCAMM tracks images on multiple slides and controls an optional slide loader OpenHiCAMM is easily adapted for high-throughput screening and image analysis
Collapse
Affiliation(s)
- Benjamin W Booth
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Charles McParland
- Computer Science Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Keith Beattie
- Computer Science Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - William W Fisher
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ann S Hammonds
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Susan E Celniker
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Erwin Frise
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| |
Collapse
|
24
|
Kudron MM, Victorsen A, Gevirtzman L, Hillier LW, Fisher WW, Vafeados D, Kirkey M, Hammonds AS, Gersch J, Ammouri H, Wall ML, Moran J, Steffen D, Szynkarek M, Seabrook-Sturgis S, Jameel N, Kadaba M, Patton J, Terrell R, Corson M, Durham TJ, Park S, Samanta S, Han M, Xu J, Yan KK, Celniker SE, White KP, Ma L, Gerstein M, Reinke V, Waterston RH. The ModERN Resource: Genome-Wide Binding Profiles for Hundreds of Drosophila and Caenorhabditis elegans Transcription Factors. Genetics 2018; 208:937-949. [PMID: 29284660 PMCID: PMC5844342 DOI: 10.1534/genetics.117.300657] [Citation(s) in RCA: 105] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Accepted: 12/08/2017] [Indexed: 12/22/2022] Open
Abstract
To develop a catalog of regulatory sites in two major model organisms, Drosophila melanogaster and Caenorhabditis elegans, the modERN (model organism Encyclopedia of Regulatory Networks) consortium has systematically assayed the binding sites of transcription factors (TFs). Combined with data produced by our predecessor, modENCODE (Model Organism ENCyclopedia Of DNA Elements), we now have data for 262 TFs identifying 1.23 M sites in the fly genome and 217 TFs identifying 0.67 M sites in the worm genome. Because sites from different TFs are often overlapping and tightly clustered, they fall into 91,011 and 59,150 regions in the fly and worm, respectively, and these binding sites span as little as 8.7 and 5.8 Mb in the two organisms. Clusters with large numbers of sites (so-called high occupancy target, or HOT regions) predominantly associate with broadly expressed genes, whereas clusters containing sites from just a few factors are associated with genes expressed in tissue-specific patterns. All of the strains expressing GFP-tagged TFs are available at the stock centers, and the chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center and also through a simple interface (http://epic.gs.washington.edu/modERN/) that facilitates rapid accessibility of processed data sets. These data will facilitate a vast number of scientific inquiries into the function of individual TFs in key developmental, metabolic, and defense and homeostatic regulatory pathways, as well as provide a broader perspective on how individual TFs work together in local networks and globally across the life spans of these two key model organisms.
Collapse
Affiliation(s)
- Michelle M Kudron
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Alec Victorsen
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Louis Gevirtzman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - LaDeana W Hillier
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - William W Fisher
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Dionne Vafeados
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Matt Kirkey
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Ann S Hammonds
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Jeffery Gersch
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Haneen Ammouri
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Martha L Wall
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Jennifer Moran
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - David Steffen
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Matt Szynkarek
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Samantha Seabrook-Sturgis
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Nader Jameel
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Madhura Kadaba
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Jaeda Patton
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Robert Terrell
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Mitch Corson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Timothy J Durham
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| | - Soo Park
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Swapna Samanta
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Mei Han
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Jinrui Xu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520
| | - Koon-Kiu Yan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520
| | - Susan E Celniker
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Kevin P White
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Lijia Ma
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Illinois 60637
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520
- Department of Computer Science, Yale University, New Haven, Connecticut 06520
| | - Valerie Reinke
- Department of Genetics, Yale University, New Haven, Connecticut 06520
| | - Robert H Waterston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195
| |
Collapse
|
25
|
Snijders AM, Langley SA, Kim YM, Brislawn CJ, Noecker C, Zink EM, Fansler SJ, Casey CP, Miller DR, Huang Y, Karpen GH, Celniker SE, Brown JB, Borenstein E, Jansson JK, Metz TO, Mao JH. Influence of early life exposure, host genetics and diet on the mouse gut microbiome and metabolome. Nat Microbiol 2016; 2:16221. [PMID: 27892936 DOI: 10.1038/nmicrobiol.2016.221] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Accepted: 10/07/2016] [Indexed: 12/22/2022]
Abstract
Although the gut microbiome plays important roles in host physiology, health and disease1, we lack understanding of the complex interplay between host genetics and early life environment on the microbial and metabolic composition of the gut. We used the genetically diverse Collaborative Cross mouse system2 to discover that early life history impacts the microbiome composition, whereas dietary changes have only a moderate effect. By contrast, the gut metabolome was shaped mostly by diet, with specific non-dietary metabolites explained by microbial metabolism. Quantitative trait analysis identified mouse genetic trait loci (QTL) that impact the abundances of specific microbes. Human orthologues of genes in the mouse QTL are implicated in gastrointestinal cancer. Additionally, genes located in mouse QTL for Lactobacillales abundance are implicated in arthritis, rheumatic disease and diabetes. Furthermore, Lactobacillales abundance was predictive of higher host T-helper cell counts, suggesting an important link between Lactobacillales and host adaptive immunity.
Collapse
Affiliation(s)
- Antoine M Snijders
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Sasha A Langley
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Young-Mo Kim
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Colin J Brislawn
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Cecilia Noecker
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, USA
| | - Erika M Zink
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Sarah J Fansler
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Cameron P Casey
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Darla R Miller
- Systems Genetics Core Facility, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Yurong Huang
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Gary H Karpen
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA.,Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Susan E Celniker
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - James B Brown
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Elhanan Borenstein
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, USA.,Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA.,Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | - Janet K Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Jian-Hua Mao
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| |
Collapse
|
26
|
Stoiber MH, Olson S, May GE, Duff MO, Manent J, Obar R, Guruharsha KG, Bickel PJ, Artavanis-Tsakonas S, Brown JB, Graveley BR, Celniker SE. Extensive cross-regulation of post-transcriptional regulatory networks in Drosophila. Genome Res 2015; 25:1692-702. [PMID: 26294687 PMCID: PMC4617965 DOI: 10.1101/gr.182675.114] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Accepted: 06/10/2015] [Indexed: 01/01/2023]
Abstract
In eukaryotic cells, RNAs exist as ribonucleoprotein particles (RNPs). Despite the importance of these complexes in many biological processes, including splicing, polyadenylation, stability, transportation, localization, and translation, their compositions are largely unknown. We affinity-purified 20 distinct RNA-binding proteins (RBPs) from cultured Drosophila melanogaster cells under native conditions and identified both the RNA and protein compositions of these RNP complexes. We identified “high occupancy target” (HOT) RNAs that interact with the majority of the RBPs we surveyed. HOT RNAs encode components of the nonsense-mediated decay and splicing machinery, as well as RNA-binding and translation initiation proteins. The RNP complexes contain proteins and mRNAs involved in RNA binding and post-transcriptional regulation. Genes with the capacity to produce hundreds of mRNA isoforms, ultracomplex genes, interact extensively with heterogeneous nuclear ribonuclear proteins (hnRNPs). Our data are consistent with a model in which subsets of RNPs include mRNA and protein products from the same gene, indicating the widespread existence of auto-regulatory RNPs. From the simultaneous acquisition and integrative analysis of protein and RNA constituents of RNPs, we identify extensive cross-regulatory and hierarchical interactions in post-transcriptional control.
Collapse
Affiliation(s)
- Marcus H Stoiber
- Department of Biostatistics, University of California Berkeley, Berkeley, California 94720, USA; Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Sara Olson
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Gemma E May
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Michael O Duff
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Jan Manent
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Robert Obar
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - K G Guruharsha
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA; Biogen Incorporated, Cambridge, Massachusetts 02142, USA
| | - Peter J Bickel
- Department of Biostatistics, University of California Berkeley, Berkeley, California 94720, USA
| | - Spyros Artavanis-Tsakonas
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA; Biogen Incorporated, Cambridge, Massachusetts 02142, USA
| | - James B Brown
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; Department of Statistics, University of California Berkeley, Berkeley, California 94720, USA
| | - Brenton R Graveley
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| |
Collapse
|
27
|
Brooks AN, Duff MO, May G, Yang L, Bolisetty M, Landolin J, Wan K, Sandler J, Booth BW, Celniker SE, Graveley BR, Brenner SE. Regulation of alternative splicing in Drosophila by 56 RNA binding proteins. Genome Res 2015; 25:1771-80. [PMID: 26294686 PMCID: PMC4617972 DOI: 10.1101/gr.192518.115] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2015] [Accepted: 08/19/2015] [Indexed: 12/26/2022]
Abstract
Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected the splicing of pre-mRNAs encoding other splicing regulators. This large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.
Collapse
Affiliation(s)
- Angela N Brooks
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA; Broad Institute, Cambridge, Massachusetts 02142, USA; Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| | - Michael O Duff
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Gemma May
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Li Yang
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Mohan Bolisetty
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Jane Landolin
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Ken Wan
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Jeremy Sandler
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Benjamin W Booth
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brenton R Graveley
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Steven E Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA; Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
| |
Collapse
|
28
|
Abstract
The modENCODE (Model Organism Encyclopedia of DNA Elements) Consortium aimed to map functional elements-including transcripts, chromatin marks, regulatory factor binding sites, and origins of DNA replication-in the model organisms Drosophila melanogaster and Caenorhabditis elegans. During its five-year span, the consortium conducted more than 2,000 genome-wide assays in developmentally staged animals, dissected tissues, and homogeneous cell lines. Analysis of these data sets provided foundational insights into genome, epigenome, and transcriptome structure and the evolutionary turnover of regulatory pathways. These studies facilitated a comparative analysis with similar data types produced by the ENCODE Consortium for human cells. Genome organization differs drastically in these distant species, and yet quantitative relationships among chromatin state, transcription, and cotranscriptional RNA processing are deeply conserved. Of the many biological discoveries of the modENCODE Consortium, we highlight insights that emerged from integrative studies. We focus on operational and scientific lessons that may aid future projects of similar scale or aims in other, emerging model systems.
Collapse
Affiliation(s)
- James B Brown
- Department of Statistics, University of California, Berkeley, California 94720;
| | | |
Collapse
|
29
|
Chen ZX, Sturgill D, Qu J, Jiang H, Park S, Boley N, Suzuki AM, Fletcher AR, Plachetzki DC, FitzGerald PC, Artieri CG, Atallah J, Barmina O, Brown JB, Blankenburg KP, Clough E, Dasgupta A, Gubbala S, Han Y, Jayaseelan JC, Kalra D, Kim YA, Kovar CL, Lee SL, Li M, Malley JD, Malone JH, Mathew T, Mattiuzzo NR, Munidasa M, Muzny DM, Ongeri F, Perales L, Przytycka TM, Pu LL, Robinson G, Thornton RL, Saada N, Scherer SE, Smith HE, Vinson C, Warner CB, Worley KC, Wu YQ, Zou X, Cherbas P, Kellis M, Eisen MB, Piano F, Kionte K, Fitch DH, Sternberg PW, Cutter AD, Duff MO, Hoskins RA, Graveley BR, Gibbs RA, Bickel PJ, Kopp A, Carninci P, Celniker SE, Oliver B, Richards S. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res 2015; 24:1209-23. [PMID: 24985915 PMCID: PMC4079975 DOI: 10.1101/gr.159384.113] [Citation(s) in RCA: 111] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phylogeny by generating draft genomes for eight new species. For comparative transcriptome analysis, we generated mRNA expression profiles on 81 samples from multiple tissues and developmental stages of 15 Drosophila species, and we performed cap analysis of gene expression in D. melanogaster and D. pseudoobscura. We also describe conservation of four distinct core promoter structures composed of combinations of elements at three positions. Overall, each type of genomic feature shows a characteristic divergence rate relative to neutral models, highlighting the value of multispecies alignment in annotating a target genome that should prove useful in the annotation of other high priority genomes, especially human and other mammalian genomes that are rich in noncoding sequences. We report that the vast majority of elements in the annotation are evolutionarily conserved, indicating that the annotation will be an important springboard for functional genetic testing by the Drosophila community.
Collapse
Affiliation(s)
- Zhen-Xia Chen
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - David Sturgill
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Jiaxin Qu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Huaiyang Jiang
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Soo Park
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Nathan Boley
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Ana Maria Suzuki
- Technology Development Group, RIKEN Omics Science Center and RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama City, Kanagawa, Japan 230-0045
| | - Anthony R Fletcher
- Division of Computational Bioscience, Center For Information Technology, National Institutes of Health, Bethesda, Maryland 20814, USA
| | - David C Plachetzki
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Peter C FitzGerald
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Carlo G Artieri
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Joel Atallah
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Olga Barmina
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - James B Brown
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Kerstin P Blankenburg
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Emily Clough
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Abhijit Dasgupta
- Clinical Trials and Outcomes Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Sai Gubbala
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yi Han
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Joy C Jayaseelan
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Divya Kalra
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yoo-Ah Kim
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Christie L Kovar
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Sandra L Lee
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Mingmei Li
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - James D Malley
- Division of Computational Bioscience, Center For Information Technology, National Institutes of Health, Bethesda, Maryland 20814, USA
| | - John H Malone
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Tittu Mathew
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Nicolas R Mattiuzzo
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Mala Munidasa
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Fiona Ongeri
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Lora Perales
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Teresa M Przytycka
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Ling-Ling Pu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Garrett Robinson
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Rebecca L Thornton
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Nehad Saada
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Steven E Scherer
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Harold E Smith
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Charles Vinson
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Crystal B Warner
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Kim C Worley
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yuan-Qing Wu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Xiaoyan Zou
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Peter Cherbas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 20139, USA
| | - Michael B Eisen
- Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Fabio Piano
- Department of Biology, New York University, New York, New York 10003, USA
| | - Karin Kionte
- Department of Biology, New York University, New York, New York 10003, USA
| | - David H Fitch
- Department of Biology, New York University, New York, New York 10003, USA
| | - Paul W Sternberg
- HHMI and Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Asher D Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada
| | - Michael O Duff
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030-6403, USA
| | - Roger A Hoskins
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brenton R Graveley
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030-6403, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Peter J Bickel
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Piero Carninci
- Technology Development Group, RIKEN Omics Science Center and RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama City, Kanagawa, Japan 230-0045
| | - Susan E Celniker
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brian Oliver
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Stephen Richards
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
30
|
Hoskins RA, Carlson JW, Wan KH, Park S, Mendez I, Galle SE, Booth BW, Pfeiffer BD, George RA, Svirskas R, Krzywinski M, Schein J, Accardo MC, Damia E, Messina G, Méndez-Lago M, de Pablos B, Demakova OV, Andreyeva EN, Boldyreva LV, Marra M, Carvalho AB, Dimitri P, Villasante A, Zhimulev IF, Rubin GM, Karpen GH, Celniker SE. The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Res 2015; 25:445-58. [PMID: 25589440 PMCID: PMC4352887 DOI: 10.1101/gr.185579.114] [Citation(s) in RCA: 255] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Drosophila melanogaster plays an important role in molecular,
genetic, and genomic studies of heredity, development, metabolism, behavior, and
human disease. The initial reference genome sequence reported more than a decade ago
had a profound impact on progress in Drosophila research, and
improving the accuracy and completeness of this sequence continues to be important to
further progress. We previously described improvement of the 117-Mb sequence in the
euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a
whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here,
we report an improved reference sequence of the single-copy and middle-repetitive
regions of the genome, produced using cytogenetic mapping to mitotic and polytene
chromosomes, clone-based finishing and BAC fingerprint verification, ordering of
scaffolds by alignment to cDNA sequences, incorporation of other map and sequence
data, and validation by whole-genome optical restriction mapping. These data
substantially improve the accuracy and completeness of the reference sequence and the
order and orientation of sequence scaffolds into chromosome arm assemblies.
Representation of the Y chromosome and other heterochromatic regions
is particularly improved. The new 143.9-Mb reference sequence, designated Release 6,
effectively exhausts clone-based technologies for mapping and sequencing. Highly
repeat-rich regions, including large satellite blocks and functional elements such as
the ribosomal RNA genes and the centromeres, are largely inaccessible to current
sequencing and assembly methods and remain poorly represented. Further significant
improvements will require sequencing technologies that do not depend on molecular
cloning and that produce very long reads.
Collapse
Affiliation(s)
- Roger A Hoskins
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA;
| | - Joseph W Carlson
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Kenneth H Wan
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Soo Park
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Ivonne Mendez
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Samuel E Galle
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Benjamin W Booth
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Barret D Pfeiffer
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Reed A George
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Robert Svirskas
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Martin Krzywinski
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - Jacqueline Schein
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - Maria Carmela Accardo
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Elisabetta Damia
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Giovanni Messina
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - María Méndez-Lago
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Beatriz de Pablos
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Olga V Demakova
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Evgeniya N Andreyeva
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Lidiya V Boldyreva
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Marco Marra
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - A Bernardo Carvalho
- Departamento de Genética, Universidade Federal do Rio de Janeiro, CEP 21944-970, Rio de Janeiro, Brazil
| | - Patrizio Dimitri
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Alfredo Villasante
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Igor F Zhimulev
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia; Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Gerald M Rubin
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Gary H Karpen
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA;
| |
Collapse
|
31
|
Westholm JO, Miura P, Olson S, Shenker S, Joseph B, Sanfilippo P, Celniker SE, Graveley BR, Lai EC. Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Rep 2014; 9:1966-1980. [PMID: 25544350 PMCID: PMC4279448 DOI: 10.1016/j.celrep.2014.10.062] [Citation(s) in RCA: 707] [Impact Index Per Article: 70.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Revised: 09/03/2014] [Accepted: 10/27/2014] [Indexed: 12/17/2022] Open
Abstract
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.
Collapse
Affiliation(s)
- Jakub O Westholm
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA
| | - Pedro Miura
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Department of Biology, University of Nevada, Reno, Nevada 89557, USA
| | - Sara Olson
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT 06032, USA
| | - Sol Shenker
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Medical College, New York, NY 10065, USA
| | - Brian Joseph
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Gerstner Sloan-Kettering Graduate Program of Biomedical Sciences, 417 East 68th Street, New York, NY 10065, USA
| | - Piero Sanfilippo
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Gerstner Sloan-Kettering Graduate Program of Biomedical Sciences, 417 East 68th Street, New York, NY 10065, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, CA 94701, USA
| | - Brenton R Graveley
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT 06032, USA
| | - Eric C Lai
- Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA.
| |
Collapse
|
32
|
Kim KE, Peluso P, Babayan P, Yeadon PJ, Yu C, Fisher WW, Chin CS, Rapicavoli NA, Rank DR, Li J, Catcheside DEA, Celniker SE, Phillippy AM, Bergman CM, Landolin JM. Long-read, whole-genome shotgun sequence data for five model organisms. Sci Data 2014; 1:140045. [PMID: 25977796 PMCID: PMC4365909 DOI: 10.1038/sdata.2014.45] [Citation(s) in RCA: 105] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Accepted: 10/03/2014] [Indexed: 12/22/2022] Open
Abstract
Single molecule, real-time (SMRT) sequencing from Pacific Biosciences is increasingly used in many areas of biological research including de novo genome assembly, structural-variant identification, haplotype phasing, mRNA isoform discovery, and base-modification analyses. High-quality, public datasets of SMRT sequences can spur development of analytic tools that can accommodate unique characteristics of SMRT data (long read lengths, lack of GC or amplification bias, and a random error profile leading to high consensus accuracy). In this paper, we describe eight high-coverage SMRT sequence datasets from five organisms (Escherichia coli, Saccharomyces cerevisiae, Neurospora crassa, Arabidopsis thaliana, and Drosophila melanogaster) that have been publicly released to the general scientific community (NCBI Sequence Read Archive ID SRP040522). Data were generated using two sequencing chemistries (P4C2 and P5C3) on the PacBio RS II instrument. The datasets reported here can be used without restriction by the research community to generate whole-genome assemblies, test new algorithms, investigate genome structure and evolution, and identify base modifications in some of the most widely-studied model systems in biological research.
Collapse
Affiliation(s)
- Kristi E Kim
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - Paul Peluso
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - Primo Babayan
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - P. Jane Yeadon
- Flinders University, School of Biological Sciences, PO Box 2100, Adelaide, South Australia 5001, Australia
| | - Charles Yu
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA
| | - William W Fisher
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA
| | - Chen-Shan Chin
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - Nicole A Rapicavoli
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - David R Rank
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| | - Joachim Li
- Department of Microbiology and Immunology, UCSF, San Francisco, California 94158, USA
| | - David E. A Catcheside
- Flinders University, School of Biological Sciences, PO Box 2100, Adelaide, South Australia 5001, Australia
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA
| | - Adam M Phillippy
- National Biodefense Analysis and Countermeasures Center, 110 Thomas Johnson Drive, Frederick, Maryland 21702, USA
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Jane M Landolin
- Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA
| |
Collapse
|
33
|
Rhee DY, Cho DY, Zhai B, Slattery M, Ma L, Mintseris J, Wong CY, White KP, Celniker SE, Przytycka TM, Gygi SP, Obar RA, Artavanis-Tsakonas S. Transcription factor networks in Drosophila melanogaster. Cell Rep 2014; 8:2031-2043. [PMID: 25242320 DOI: 10.1016/j.celrep.2014.08.038] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2014] [Revised: 06/09/2014] [Accepted: 08/16/2014] [Indexed: 11/15/2022] Open
Abstract
Specific cellular fates and functions depend on differential gene expression, which occurs primarily at the transcriptional level and is controlled by complex regulatory networks of transcription factors (TFs). TFs act through combinatorial interactions with other TFs, cofactors, and chromatin-remodeling proteins. Here, we define protein-protein interactions using a coaffinity purification/mass spectrometry method and study 459 Drosophila melanogaster transcription-related factors, representing approximately half of the established catalog of TFs. We probe this network in vivo, demonstrating functional interactions for many interacting proteins, and test the predictive value of our data set. Building on these analyses, we combine regulatory network inference models with physical interactions to define an integrated network that connects combinatorial TF protein interactions to the transcriptional regulatory network of the cell. We use this integrated network as a tool to connect the functional network of genetic modifiers related to mastermind, a transcriptional cofactor of the Notch pathway.
Collapse
Affiliation(s)
- David Y Rhee
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Dong-Yeon Cho
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Bo Zhai
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Matthew Slattery
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Lijia Ma
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Julian Mintseris
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Christina Y Wong
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Kevin P White
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Susan E Celniker
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Teresa M Przytycka
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Steven P Gygi
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Robert A Obar
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Spyros Artavanis-Tsakonas
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA; Biogen Idec, Inc., Cambridge, MA 02142, USA.
| |
Collapse
|
34
|
Gerstein MB, Rozowsky J, Yan KK, Wang D, Cheng C, Brown JB, Davis CA, Hillier L, Sisu C, Li JJ, Pei B, Harmanci AO, Duff MO, Djebali S, Alexander RP, Alver BH, Auerbach R, Bell K, Bickel PJ, Boeck ME, Boley NP, Booth BW, Cherbas L, Cherbas P, Di C, Dobin A, Drenkow J, Ewing B, Fang G, Fastuca M, Feingold EA, Frankish A, Gao G, Good PJ, Guigó R, Hammonds A, Harrow J, Hoskins RA, Howald C, Hu L, Huang H, Hubbard TJP, Huynh C, Jha S, Kasper D, Kato M, Kaufman TC, Kitchen RR, Ladewig E, Lagarde J, Lai E, Leng J, Lu Z, MacCoss M, May G, McWhirter R, Merrihew G, Miller DM, Mortazavi A, Murad R, Oliver B, Olson S, Park PJ, Pazin MJ, Perrimon N, Pervouchine D, Reinke V, Reymond A, Robinson G, Samsonova A, Saunders GI, Schlesinger F, Sethi A, Slack FJ, Spencer WC, Stoiber MH, Strasbourger P, Tanzer A, Thompson OA, Wan KH, Wang G, Wang H, Watkins KL, Wen J, Wen K, Xue C, Yang L, Yip K, Zaleski C, Zhang Y, Zheng H, Brenner SE, Graveley BR, Celniker SE, Gingeras TR, Waterston R. Comparative analysis of the transcriptome across distant species. Nature 2014; 512:445-8. [PMID: 25164755 PMCID: PMC4155737 DOI: 10.1038/nature13424] [Citation(s) in RCA: 239] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 04/30/2014] [Indexed: 12/30/2022]
Abstract
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
Collapse
Affiliation(s)
- Mark B Gerstein
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3] Department of Computer Science, Yale University, 51 Prospect Street, New Haven, Connecticut 06511, USA [4] [5]
| | - Joel Rozowsky
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Koon-Kiu Yan
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Daifeng Wang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Chao Cheng
- 1] Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire 03755, USA [2] Institute for Quantitative Biomedical Sciences, Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire 03766, USA [3]
| | - James B Brown
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA [3]
| | - Carrie A Davis
- 1] Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA [2]
| | - LaDeana Hillier
- 1] Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA [2]
| | - Cristina Sisu
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Jingyi Jessica Li
- 1] Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA [2] Department of Statistics, University of California, Los Angeles, California 90095-1554, USA [3] Department of Human Genetics, University of California, Los Angeles, California 90095-7088, USA [4]
| | - Baikang Pei
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Arif O Harmanci
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [3]
| | - Michael O Duff
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2]
| | - Sarah Djebali
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain [3]
| | - Roger P Alexander
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Burak H Alver
- Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, Massachusetts 02115, USA
| | - Raymond Auerbach
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Kimberly Bell
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Peter J Bickel
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Max E Boeck
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Nathan P Boley
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Biostatistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Benjamin W Booth
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Lucy Cherbas
- 1] Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA [2] Center for Genomics and Bioinformatics, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Peter Cherbas
- 1] Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA [2] Center for Genomics and Bioinformatics, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Chao Di
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Alex Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Brent Ewing
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Gang Fang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Megan Fastuca
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Elise A Feingold
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Adam Frankish
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Guanjun Gao
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Peter J Good
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Roderic Guigó
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Ann Hammonds
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Jen Harrow
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Roger A Hoskins
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Cédric Howald
- 1] Center for Integrative Genomics, University of Lausanne, Genopode building, Lausanne 1015, Switzerland [2] Swiss Institute of Bioinformatics, Genopode building, Lausanne 1015, Switzerland
| | - Long Hu
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haiyan Huang
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Tim J P Hubbard
- 1] Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK [2] Medical and Molecular Genetics, King's College London, London WC2R 2LS, UK
| | - Chau Huynh
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Sonali Jha
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Dionna Kasper
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Masaomi Kato
- Department of Molecular, Cellular and Developmental Biology, PO Box 208103, Yale University, New Haven, Connecticut 06520, USA
| | - Thomas C Kaufman
- Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405-7005, USA
| | - Robert R Kitchen
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Erik Ladewig
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Julien Lagarde
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Eric Lai
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Jing Leng
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Zhi Lu
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Michael MacCoss
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Gemma May
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2] Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213 USA
| | - Rebecca McWhirter
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Gennifer Merrihew
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - David M Miller
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Ali Mortazavi
- 1] Developmental and Cell Biology, University of California, Irvine, California 92697, USA [2] Center for Complex Biological Systems, University of California, Irvine, California 92697, USA
| | - Rabi Murad
- 1] Developmental and Cell Biology, University of California, Irvine, California 92697, USA [2] Center for Complex Biological Systems, University of California, Irvine, California 92697, USA
| | - Brian Oliver
- Section of Developmental Genomics, Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Sara Olson
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA
| | - Peter J Park
- Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, Massachusetts 02115, USA
| | - Michael J Pazin
- National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Bethesda, Maryland 20892-9307, USA
| | - Norbert Perrimon
- 1] Department of Genetics and Drosophila RNAi Screening Center, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA [2] Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA
| | - Dmitri Pervouchine
- 1] Centre for Genomic Regulation, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain [2] Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | - Valerie Reinke
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, Genopode building, Lausanne 1015, Switzerland
| | - Garrett Robinson
- Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Anastasia Samsonova
- 1] Department of Genetics and Drosophila RNAi Screening Center, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA [2] Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA
| | - Gary I Saunders
- 1] Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK [2] European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Felix Schlesinger
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Anurag Sethi
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Frank J Slack
- Department of Molecular, Cellular and Developmental Biology, PO Box 208103, Yale University, New Haven, Connecticut 06520, USA
| | - William C Spencer
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Marcus H Stoiber
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2] Department of Biostatistics, University of California, Berkeley, 367 Evans Hall, Berkeley, California 94720-3860, USA
| | - Pnina Strasbourger
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Andrea Tanzer
- 1] Bioinformatics and Genomics Programme, Center for Genomic Regulation, Universitat Pompeu Fabra (CRG-UPF), 08003 Barcelona, Catalonia, Spain [2] Institute for Theoretical Chemistry, Theoretical Biochemistry Group (TBI), University of Vienna, Währingerstrasse 17/3/303, A-1090 Vienna, Austria
| | - Owen A Thompson
- Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA
| | - Kenneth H Wan
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Guilin Wang
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520-8005, USA
| | - Huaien Wang
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Kathie L Watkins
- Department of Cell and Developmental Biology, Vanderbilt University, 465 21st Avenue South, Nashville, Tennessee 37232-8240, USA
| | - Jiayu Wen
- Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, New York 10065, USA
| | - Kejia Wen
- MOE Key Lab of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Chenghai Xue
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Li Yang
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2] Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Kevin Yip
- 1] Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong [2] 5 CUHK-BGI Innovation Institute of Trans-omics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Chris Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Yan Zhang
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Henry Zheng
- 1] Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA [2] Department of Molecular Biophysics and Biochemistry, Yale University, Bass 432, 266 Whitney Avenue, New Haven, Connecticut 06520, USA
| | - Steven E Brenner
- 1] Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA [2] Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA [3]
| | - Brenton R Graveley
- 1] Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA [2]
| | - Susan E Celniker
- 1] Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA [2]
| | - Thomas R Gingeras
- 1] Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA [2]
| | - Robert Waterston
- 1] Department of Genome Sciences and University of Washington School of Medicine, William H. Foege Building S350D, 1705 Northeast Pacific Street, Box 355065 Seattle, Washington 98195-5065, USA [2]
| |
Collapse
|
35
|
Brown JB, Boley N, Eisman R, May GE, Stoiber MH, Duff MO, Booth BW, Wen J, Park S, Suzuki AM, Wan KH, Yu C, Zhang D, Carlson JW, Cherbas L, Eads BD, Miller D, Mockaitis K, Roberts J, Davis CA, Frise E, Hammonds AS, Olson S, Shenker S, Sturgill D, Samsonova AA, Weiszmann R, Robinson G, Hernandez J, Andrews J, Bickel PJ, Carninci P, Cherbas P, Gingeras TR, Hoskins RA, Kaufman TC, Lai EC, Oliver B, Perrimon N, Graveley BR, Celniker SE. Diversity and dynamics of the Drosophila transcriptome. Nature 2014; 512:393-9. [PMID: 24670639 PMCID: PMC4152413 DOI: 10.1038/nature12962] [Citation(s) in RCA: 470] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2013] [Accepted: 12/18/2013] [Indexed: 01/10/2023]
Abstract
Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)+ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long non-coding RNAs (lncRNAs), some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and polyadenylation sites.
Collapse
|
36
|
Lee H, McManus CJ, Cho DY, Eaton M, Renda F, Somma MP, Cherbas L, May G, Powell S, Zhang D, Zhan L, Resch A, Andrews J, Celniker SE, Cherbas P, Przytycka TM, Gatti M, Oliver B, Graveley B, MacAlpine D. DNA copy number evolution in Drosophila cell lines. Genome Biol 2014; 15:R70. [PMID: 25262759 PMCID: PMC4289277 DOI: 10.1186/gb-2014-15-8-r70] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2013] [Accepted: 07/01/2014] [Indexed: 12/19/2022] Open
Abstract
Background Structural rearrangements of the genome resulting in genic imbalance due to copy number change are often deleterious at the organismal level, but are common in immortalized cell lines and tumors, where they may be an advantage to cells. In order to explore the biological consequences of copy number changes in the Drosophila genome, we resequenced the genomes of 19 tissue-culture cell lines and generated RNA-Seq profiles. Results Our work revealed dramatic duplications and deletions in all cell lines. We found three lines of evidence indicating that copy number changes were due to selection during tissue culture. First, we found that copy numbers correlated to maintain stoichiometric balance in protein complexes and biochemical pathways, consistent with the gene balance hypothesis. Second, while most copy number changes were cell line-specific, we identified some copy number changes shared by many of the independent cell lines. These included dramatic recurrence of increased copy number of the PDGF/VEGF receptor, which is also over-expressed in many cancer cells, and of bantam, an anti-apoptosis miRNA. Third, even when copy number changes seemed distinct between lines, there was strong evidence that they supported a common phenotypic outcome. For example, we found that proto-oncogenes were over-represented in one cell line (S2-DRSC), whereas tumor suppressor genes were under-represented in another (Kc167). Conclusion Our study illustrates how genome structure changes may contribute to selection of cell lines in vitro. This has implications for other cell-level natural selection progressions, including tumorigenesis. Electronic supplementary material The online version of this article (doi:10.1186/gb-2014-15-8-r70) contains supplementary material, which is available to authorized users.
Collapse
|
37
|
Boley N, Stoiber MH, Booth BW, Wan KH, Hoskins RA, Bickel PJ, Celniker SE, Brown JB. Genome-guided transcript assembly by integrative analysis of RNA sequence data. Nat Biotechnol 2014; 32:341-6. [PMID: 24633242 PMCID: PMC4037530 DOI: 10.1038/nbt.2850] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 02/11/2014] [Indexed: 01/31/2023]
Abstract
The identification of full length transcripts entirely from short-read RNA sequencing data (RNA-seq) remains a challenge in the annotation of genomes. Here we describe an automated pipeline for genome annotation that integrates RNA-seq and gene-boundary data sets, which we call Generalized RNA Integration Tool, or GRIT. Applying GRIT to Drosophila melanogaster short-read RNA-seq, cap analysis of gene expression (CAGE) and poly(A)-site-seq data collected for the modENCODE project, we recovered the vast majority of previously annotated transcripts and doubled the total number of transcripts cataloged. We found that 20% of protein coding genes encode multiple protein-localization signals and that, in 20-d-old adult fly heads, genes with multiple polyadenylation sites are more common than genes with alternative splicing or alternative promoters. GRIT demonstrates 30% higher precision and recall than the most widely used transcript assembly tools. GRIT will facilitate the automated generation of high-quality genome annotations without the need for extensive manual annotation.
Collapse
Affiliation(s)
- Nathan Boley
- Department of Biostatistics, University of California at Berkeley, Berkeley, CA, USA
| | - Marcus H. Stoiber
- Department of Biostatistics, University of California at Berkeley, Berkeley, CA, USA
| | - Benjamin W. Booth
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Kenneth H. Wan
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Roger A. Hoskins
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Peter J. Bickel
- Department of Statistics, University of California at Berkeley, Berkeley, CA, USA
| | - Susan E. Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - James B. Brown
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Statistics, University of California at Berkeley, Berkeley, CA, USA
| |
Collapse
|
38
|
Abstract
modENCODE was a 5year NHGRI funded project (2007-2012) to map the function of every base in the genomes of worms and flies characterizing positions of modified histones and other chromatin marks, origins of DNA replication, RNA transcripts and the transcription factor binding sites that control gene expression. Here we describe the Drosophila modENCODE datasets and how best to access and use them for genome wide and individual gene studies.
Collapse
Affiliation(s)
- Nathan Boley
- Department of Biostatistics, University of California Berkeley, Berkeley, CA, United States
| | - Kenneth H Wan
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Peter J Bickel
- Department of Statistics, University of California Berkeley, Berkeley, CA, United States
| | - Susan E Celniker
- Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.
| |
Collapse
|
39
|
Hammonds AS, Bristow CA, Fisher WW, Weiszmann R, Wu S, Hartenstein V, Kellis M, Yu B, Frise E, Celniker SE. Spatial expression of transcription factors in Drosophila embryonic organ development. Genome Biol 2013; 14:R140. [PMID: 24359758 PMCID: PMC4053779 DOI: 10.1186/gb-2013-14-12-r140] [Citation(s) in RCA: 94] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Accepted: 12/20/2013] [Indexed: 11/29/2022] Open
Abstract
Background Site-specific transcription factors (TFs) bind DNA regulatory elements to control expression of target genes, forming the core of gene regulatory networks. Despite decades of research, most studies focus on only a small number of TFs and the roles of many remain unknown. Results We present a systematic characterization of spatiotemporal gene expression patterns for all known or predicted Drosophila TFs throughout embryogenesis, the first such comprehensive study for any metazoan animal. We generated RNA expression patterns for all 708 TFs by in situ hybridization, annotated the patterns using an anatomical controlled vocabulary, and analyzed TF expression in the context of organ system development. Nearly all TFs are expressed during embryogenesis and more than half are specifically expressed in the central nervous system. Compared to other genes, TFs are enriched early in the development of most organ systems, and throughout the development of the nervous system. Of the 535 TFs with spatially restricted expression, 79% are dynamically expressed in multiple organ systems while 21% show single-organ specificity. Of those expressed in multiple organ systems, 77 TFs are restricted to a single organ system either early or late in development. Expression patterns for 354 TFs are characterized for the first time in this study. Conclusions We produced a reference TF dataset for the investigation of gene regulatory networks in embryogenesis, and gained insight into the expression dynamics of the full complement of TFs controlling the development of each organ system.
Collapse
|
40
|
Cheng Q, Kazemian M, Pham H, Blatti C, Celniker SE, Wolfe SA, Brodsky MH, Sinha S. Computational identification of diverse mechanisms underlying transcription factor-DNA occupancy. PLoS Genet 2013; 9:e1003571. [PMID: 23935523 PMCID: PMC3731213 DOI: 10.1371/journal.pgen.1003571] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Accepted: 05/02/2013] [Indexed: 12/13/2022] Open
Abstract
ChIP-based genome-wide assays of transcription factor (TF) occupancy have emerged as a powerful, high-throughput method to understand transcriptional regulation, especially on a global scale. This has led to great interest in the underlying biochemical mechanisms that direct TF-DNA binding, with the ultimate goal of computationally predicting a TF's occupancy profile in any cellular condition. In this study, we examined the influence of various potential determinants of TF-DNA binding on a much larger scale than previously undertaken. We used a thermodynamics-based model of TF-DNA binding, called “STAP,” to analyze 45 TF-ChIP data sets from Drosophila embryonic development. We built a cross-validation framework that compares a baseline model, based on the ChIP'ed (“primary”) TF's motif, to more complex models where binding by secondary TFs is hypothesized to influence the primary TF's occupancy. Candidates interacting TFs were chosen based on RNA-SEQ expression data from the time point of the ChIP experiment. We found widespread evidence of both cooperative and antagonistic effects by secondary TFs, and explicitly quantified these effects. We were able to identify multiple classes of interactions, including (1) long-range interactions between primary and secondary motifs (separated by ≤150 bp), suggestive of indirect effects such as chromatin remodeling, (2) short-range interactions with specific inter-site spacing biases, suggestive of direct physical interactions, and (3) overlapping binding sites suggesting competitive binding. Furthermore, by factoring out the previously reported strong correlation between TF occupancy and DNA accessibility, we were able to categorize the effects into those that are likely to be mediated by the secondary TF's effect on local accessibility and those that utilize accessibility-independent mechanisms. Finally, we conducted in vitro pull-down assays to test model-based predictions of short-range cooperative interactions, and found that seven of the eight TF pairs tested physically interact and that some of these interactions mediate cooperative binding to DNA. Chromatin Immunoprecipitation (ChIP)-based genome-wide assays of transcription factor (TF) occupancy have emerged as a powerful, high throughput method to understand transcriptional regulation, especially on a global scale. Here, we utilize 45 ChIP-chip and ChIP-SEQ data sets from Drosophila to explore the underlying mechanisms of TF-DNA binding. For this, we employ a biophysically motivated computational model, in conjunction with over 300 TF motifs (binding specificities) as well as gene expression and DNA accessibility data from different developmental stages in Drosophila embryos. Our findings provide robust statistical evidence of the role played by TF-TF interactions in shaping genome-wide TF-DNA binding profiles, and thus in directing gene regulation. Our method allows us to go beyond simply recognizing the existence of such interactions, to quantifying their effects on TF occupancy. We are able to categorize the probable mechanisms of these effects as involving direct physical interactions versus accessibility-mediated indirect interactions, long-range versus short-range interactions, and cooperative versus antagonistic interactions. Our analysis reveals widespread evidence of combinatorial regulation present in recently generated ChIP data sets, and sets the stage for rich integrative models of the future that will predict cell type-specific TF occupancy values from sequence and expression data.
Collapse
Affiliation(s)
- Qiong Cheng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Majid Kazemian
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Hannah Pham
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Charles Blatti
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Susan E. Celniker
- Department of Genome Dynamics, Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Scot A. Wolfe
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Michael H. Brodsky
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- * E-mail: (MHB); (SS)
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail: (MHB); (SS)
| |
Collapse
|
41
|
Özkan E, Carrillo RA, Eastman CL, Weiszmann R, Waghray D, Johnson KG, Zinn K, Celniker SE, Garcia KC. An extracellular interactome of immunoglobulin and LRR proteins reveals receptor-ligand networks. Cell 2013; 154:228-39. [PMID: 23827685 PMCID: PMC3756661 DOI: 10.1016/j.cell.2013.06.006] [Citation(s) in RCA: 156] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Revised: 04/02/2013] [Accepted: 06/05/2013] [Indexed: 10/26/2022]
Abstract
Extracellular domains of cell surface receptors and ligands mediate cell-cell communication, adhesion, and initiation of signaling events, but most existing protein-protein "interactome" data sets lack information for extracellular interactions. We probed interactions between receptor extracellular domains, focusing on a set of 202 proteins composed of the Drosophila melanogaster immunoglobulin superfamily (IgSF), fibronectin type III (FnIII), and leucine-rich repeat (LRR) families, which are known to be important in neuronal and developmental functions. Out of 20,503 candidate protein pairs tested, we observed 106 interactions, 83 of which were previously unknown. We "deorphanized" the 20 member subfamily of defective-in-proboscis-response IgSF proteins, showing that they selectively interact with an 11 member subfamily of previously uncharacterized IgSF proteins. Both subfamilies interact with a single common "orphan" LRR protein. We also observed interactions between Hedgehog and EGFR pathway components. Several of these interactions could be visualized in live-dissected embryos, demonstrating that this approach can identify physiologically relevant receptor-ligand pairs.
Collapse
Affiliation(s)
- Engin Özkan
- Department of Molecular and Cellular Physiology, and Structural Biology, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Robert A. Carrillo
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA
| | - Catharine L. Eastman
- Department of Molecular and Cellular Physiology, and Structural Biology, Stanford, CA 94305, USA
| | - Richard Weiszmann
- Department of Genome Dynamics, Berkeley Genome Project, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Deepa Waghray
- Department of Molecular and Cellular Physiology, and Structural Biology, Stanford, CA 94305, USA
| | - Karl G. Johnson
- Department of Biology, and Neuroscience, Pomona College, Claremont, CA 91711, USA
| | - Kai Zinn
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA
| | - Susan E. Celniker
- Department of Genome Dynamics, Berkeley Genome Project, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - K. Christopher Garcia
- Department of Molecular and Cellular Physiology, and Structural Biology, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| |
Collapse
|
42
|
Smibert P, Miura P, Westholm JO, Shenker S, May G, Duff MO, Zhang D, Eads BD, Carlson J, Brown JB, Eisman RC, Andrews J, Kaufman T, Cherbas P, Celniker SE, Graveley BR, Lai EC. Global patterns of tissue-specific alternative polyadenylation in Drosophila. Cell Rep 2013; 1:277-89. [PMID: 22685694 DOI: 10.1016/j.celrep.2012.01.001] [Citation(s) in RCA: 182] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
We analyzed the usage and consequences of alternative cleavage and polyadenylation (APA) in Drosophila melanogaster by using >1 billion reads of stranded mRNA-seq across a variety of dissected tissues. Beyond demonstrating that a majority of fly transcripts are subject to APA, we observed broad trends for 3' untranslated region (UTR) shortening in the testis and lengthening in the central nervous system (CNS); the latter included hundreds of unannotated extensions ranging up to 18 kb. Extensive northern analyses validated the accumulation of full-length neural extended transcripts, and in situ hybridization indicated their spatial restriction to the CNS. Genes encoding RNA binding proteins (RBPs) and transcription factors were preferentially subject to 3' UTR extensions. Motif analysis indicated enrichment of miRNA and RBP sites in the neural extensions, and their termini were enriched in canonical cis elements that promote cleavage and polyadenylation. Altogether, we reveal broad tissue-specific patterns of APA in Drosophila and transcripts with unprecedented 3' UTR length in the nervous system.
Collapse
Affiliation(s)
- Peter Smibert
- Department of Developmental Biology, Sloan-Kettering Institute, New York, NY 10065, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Enuameh MS, Asriyan Y, Richards A, Christensen RG, Hall VL, Kazemian M, Zhu C, Pham H, Cheng Q, Blatti C, Brasefield JA, Basciotta MD, Ou J, McNulty JC, Zhu LJ, Celniker SE, Sinha S, Stormo GD, Brodsky MH, Wolfe SA. Global analysis of Drosophila Cys₂-His₂ zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants. Genome Res 2013; 23:928-40. [PMID: 23471540 PMCID: PMC3668361 DOI: 10.1101/gr.151472.112] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have characterized the DNA-binding specificities of 129 zinc finger sets from Drosophila using a bacterial one-hybrid system. This data set contains the DNA-binding specificities for at least one encoded ZFP from 70 unique genes and 23 alternate splice isoforms representing the largest set of characterized ZFPs from any organism described to date. These recognition motifs can be used to predict genomic binding sites for these factors within the fruit fly genome. Subsets of fingers from these ZFPs were characterized to define their orientation and register on their recognition sequences, thereby allowing us to define the recognition diversity within this finger set. We find that the characterized fingers can specify 47 of the 64 possible DNA triplets. To confirm the utility of our finger recognition models, we employed subsets of Drosophila fingers in combination with an existing archive of artificial zinc finger modules to create ZFPs with novel DNA-binding specificity. These hybrids of natural and artificial fingers can be used to create functional zinc finger nucleases for editing vertebrate genomes.
Collapse
Affiliation(s)
- Metewo Selase Enuameh
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Guruharsha KG, Rual JF, Zhai B, Mintseris J, Vaidya P, Vaidya N, Beekman C, Wong C, Rhee DY, Cenaj O, McKillip E, Shah S, Stapleton M, Wan KH, Yu C, Parsa B, Carlson JW, Chen X, Kapadia B, VijayRaghavan K, Gygi SP, Celniker SE, Obar RA, Artavanis-Tsakonas S. A protein complex network of Drosophila melanogaster. Cell 2011; 147:690-703. [PMID: 22036573 DOI: 10.1016/j.cell.2011.08.047] [Citation(s) in RCA: 465] [Impact Index Per Article: 35.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2011] [Revised: 07/14/2011] [Accepted: 08/19/2011] [Indexed: 01/24/2023]
Abstract
Determining the composition of protein complexes is an essential step toward understanding the cell as an integrated system. Using coaffinity purification coupled to mass spectrometry analysis, we examined protein associations involving nearly 5,000 individual, FLAG-HA epitope-tagged Drosophila proteins. Stringent analysis of these data, based on a statistical framework designed to define individual protein-protein interactions, led to the generation of a Drosophila protein interaction map (DPiM) encompassing 556 protein complexes. The high quality of the DPiM and its usefulness as a paradigm for metazoan proteomes are apparent from the recovery of many known complexes, significant enrichment for shared functional attributes, and validation in human cells. The DPiM defines potential novel members for several important protein complexes and assigns functional links to 586 protein-coding genes lacking previous experimental annotation. The DPiM represents, to our knowledge, the largest metazoan protein complex map and provides a valuable resource for analysis of protein complex evolution.
Collapse
Affiliation(s)
- K G Guruharsha
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Atanesyan L, Günther V, Celniker SE, Georgiev O, Schaffner W. Characterization of MtnE, the fifth metallothionein member in Drosophila. J Biol Inorg Chem 2011; 16:1047-56. [PMID: 21870250 DOI: 10.1007/s00775-011-0825-4] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Accepted: 07/12/2011] [Indexed: 11/30/2022]
Abstract
Metallothioneins (MTs) constitute a family of cysteine-rich, low molecular weight metal-binding proteins which occur in almost all forms of life. They bind physiological metals, such as zinc and copper, as well as nonessential, toxic heavy metals, such as cadmium, mercury, and silver. MT expression is regulated at the transcriptional level by metal-regulatory transcription factor 1 (MTF-1), which binds to the metal-response elements (MREs) in the enhancer/promoter regions of MT genes. Drosophila was thought to have four MT genes, namely, MtnA, MtnB, MtnC, and MtnD. Here we characterize a new fifth member of Drosophila MT gene family, coding for metallothionein E (MtnE). The MtnE transcription unit is located head-to-head with the one of MtnD. The intervening sequence contains four MREs which bind, with different affinities, to MTF-1. Both of the divergently transcribed MT genes are completely dependent on MTF-1, whereby MtnE is consistently more strongly transcribed. MtnE expression is induced in response to heavy metals, notably copper, mercury, and silver, and is upregulated in a genetic background where the other four MTs are missing.
Collapse
Affiliation(s)
- Lilit Atanesyan
- Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | | | | | | | | |
Collapse
|
46
|
Thomas S, Li XY, Sabo PJ, Sandstrom R, Thurman RE, Canfield TK, Giste E, Fisher W, Hammonds A, Celniker SE, Biggin MD, Stamatoyannopoulos JA. Dynamic reprogramming of chromatin accessibility during Drosophila embryo development. Genome Biol 2011; 12:R43. [PMID: 21569360 PMCID: PMC3219966 DOI: 10.1186/gb-2011-12-5-r43] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2011] [Revised: 03/21/2011] [Accepted: 05/11/2011] [Indexed: 12/22/2022] Open
Abstract
Background The development of complex organisms is believed to involve progressive restrictions in cellular fate. Understanding the scope and features of chromatin dynamics during embryogenesis, and identifying regulatory elements important for directing developmental processes remain key goals of developmental biology. Results We used in vivo DNaseI sensitivity to map the locations of regulatory elements, and explore the changing chromatin landscape during the first 11 hours of Drosophila embryonic development. We identified thousands of conserved, developmentally dynamic, distal DNaseI hypersensitive sites associated with spatial and temporal expression patterning of linked genes and with large regions of chromatin plasticity. We observed a nearly uniform balance between developmentally up- and down-regulated DNaseI hypersensitive sites. Analysis of promoter chromatin architecture revealed a novel role for classical core promoter sequence elements in directing temporally regulated chromatin remodeling. Another unexpected feature of the chromatin landscape was the presence of localized accessibility over many protein-coding regions, subsets of which were developmentally regulated or associated with the transcription of genes with prominent maternal RNA contributions in the blastoderm. Conclusions Our results provide a global view of the rich and dynamic chromatin landscape of early animal development, as well as novel insights into the organization of developmentally regulated chromatin features.
Collapse
Affiliation(s)
- Sean Thomas
- Department of Genome Sciences, University of Washington, Foege S310A, 1705 NE Pacific Street, Box 355065, Seattle, WA 98195, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Hoskins RA, Landolin JM, Brown JB, Sandler JE, Takahashi H, Lassmann T, Yu C, Booth BW, Zhang D, Wan KH, Yang L, Boley N, Andrews J, Kaufman TC, Graveley BR, Bickel PJ, Carninci P, Carlson JW, Celniker SE. Genome-wide analysis of promoter architecture in Drosophila melanogaster. Genome Res 2010; 21:182-92. [PMID: 21177961 DOI: 10.1101/gr.112466.110] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.
Collapse
Affiliation(s)
- Roger A Hoskins
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 97420, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Roy S, Ernst J, Kharchenko PV, Kheradpour P, Negre N, Eaton ML, Landolin JM, Bristow CA, Ma L, Lin MF, Washietl S, Arshinoff BI, Ay F, Meyer PE, Robine N, Washington NL, Di Stefano L, Berezikov E, Brown CD, Candeias R, Carlson JW, Carr A, Jungreis I, Marbach D, Sealfon R, Tolstorukov MY, Will S, Alekseyenko AA, Artieri C, Booth BW, Brooks AN, Dai Q, Davis CA, Duff MO, Feng X, Gorchakov AA, Gu T, Henikoff JG, Kapranov P, Li R, MacAlpine HK, Malone J, Minoda A, Nordman J, Okamura K, Perry M, Powell SK, Riddle NC, Sakai A, Samsonova A, Sandler JE, Schwartz YB, Sher N, Spokony R, Sturgill D, van Baren M, Wan KH, Yang L, Yu C, Feingold E, Good P, Guyer M, Lowdon R, Ahmad K, Andrews J, Berger B, Brenner SE, Brent MR, Cherbas L, Elgin SCR, Gingeras TR, Grossman R, Hoskins RA, Kaufman TC, Kent W, Kuroda MI, Orr-Weaver T, Perrimon N, Pirrotta V, Posakony JW, Ren B, Russell S, Cherbas P, Graveley BR, Lewis S, Micklem G, Oliver B, Park PJ, Celniker SE, Henikoff S, Karpen GH, Lai EC, MacAlpine DM, Stein LD, White KP, Kellis M. Identification of functional elements and regulatory circuits by Drosophila modENCODE. Science 2010; 330:1787-97. [PMID: 21177974 DOI: 10.1126/science.1198374] [Citation(s) in RCA: 899] [Impact Index Per Article: 64.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation.
Collapse
Affiliation(s)
-
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Cherbas L, Willingham A, Zhang D, Yang L, Zou Y, Eads BD, Carlson JW, Landolin JM, Kapranov P, Dumais J, Samsonova A, Choi JH, Roberts J, Davis CA, Tang H, van Baren MJ, Ghosh S, Dobin A, Bell K, Lin W, Langton L, Duff MO, Tenney AE, Zaleski C, Brent MR, Hoskins RA, Kaufman TC, Andrews J, Graveley BR, Perrimon N, Celniker SE, Gingeras TR, Cherbas P. The transcriptional diversity of 25 Drosophila cell lines. Genome Res 2010; 21:301-14. [PMID: 21177962 DOI: 10.1101/gr.112961.110] [Citation(s) in RCA: 213] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Drosophila melanogaster cell lines are important resources for cell biologists. Here, we catalog the expression of exons, genes, and unannotated transcriptional signals for 25 lines. Unannotated transcription is substantial (typically 19% of euchromatic signal). Conservatively, we identify 1405 novel transcribed regions; 684 of these appear to be new exons of neighboring, often distant, genes. Sixty-four percent of genes are expressed detectably in at least one line, but only 21% are detected in all lines. Each cell line expresses, on average, 5885 genes, including a common set of 3109. Expression levels vary over several orders of magnitude. Major signaling pathways are well represented: most differentiation pathways are "off" and survival/growth pathways "on." Roughly 50% of the genes expressed by each line are not part of the common set, and these show considerable individuality. Thirty-one percent are expressed at a higher level in at least one cell line than in any single developmental stage, suggesting that each line is enriched for genes characteristic of small sets of cells. Most remarkable is that imaginal disc-derived lines can generally be assigned, on the basis of expression, to small territories within developing discs. These mappings reveal unexpected stability of even fine-grained spatial determination. No two cell lines show identical transcription factor expression. We conclude that each line has retained features of an individual founder cell superimposed on a common "cell line" gene expression pattern.
Collapse
Affiliation(s)
- Lucy Cherbas
- Center for Genomics and Bioinformatics, Indiana University, Bloomington, Indiana 47405, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Kazemian M, Blatti C, Richards A, McCutchan M, Wakabayashi-Ito N, Hammonds AS, Celniker SE, Kumar S, Wolfe SA, Brodsky MH, Sinha S. Quantitative analysis of the Drosophila segmentation regulatory network using pattern generating potentials. PLoS Biol 2010; 8. [PMID: 20808951 PMCID: PMC2923081 DOI: 10.1371/journal.pbio.1000456] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2009] [Accepted: 07/07/2010] [Indexed: 01/05/2023] Open
Abstract
A new computational method uses gene expression databases and transcription factor binding specificities to describe regulatory elements in the Drosophila A/P patterning network in unprecedented detail. Cis-regulatory modules that drive precise spatial-temporal patterns of gene expression are central to the process of metazoan development. We describe a new computational strategy to annotate genomic sequences based on their “pattern generating potential” and to produce quantitative descriptions of transcriptional regulatory networks at the level of individual protein-module interactions. We use this approach to convert the qualitative understanding of interactions that regulate Drosophila segmentation into a network model in which a confidence value is associated with each transcription factor-module interaction. Sequence information from multiple Drosophila species is integrated with transcription factor binding specificities to determine conserved binding site frequencies across the genome. These binding site profiles are combined with transcription factor expression information to create a model to predict module activity patterns. This model is used to scan genomic sequences for the potential to generate all or part of the expression pattern of a nearby gene, obtained from available gene expression databases. Interactions between individual transcription factors and modules are inferred by a statistical method to quantify a factor's contribution to the module's pattern generating potential. We use these pattern generating potentials to systematically describe the location and function of known and novel cis-regulatory modules in the segmentation network, identifying many examples of modules predicted to have overlapping expression activities. Surprisingly, conserved transcription factor binding site frequencies were as effective as experimental measurements of occupancy in predicting module expression patterns or factor-module interactions. Thus, unlike previous module prediction methods, this method predicts not only the location of modules but also their spatial activity pattern and the factors that directly determine this pattern. As databases of transcription factor specificities and in vivo gene expression patterns grow, analysis of pattern generating potentials provides a general method to decode transcriptional regulatory sequences and networks. The developmental program specifying segmentation along the anterior-posterior axis of the Drosophila embryo is one of the best studied examples of transcriptional regulatory networks. Previous work has identified the location and function of dozens of DNA segments called cis-regulatory “modules” that regulate several genes in precise spatial patterns in the early embryo. In many cases, transcription factors that interact with such modules have also been identified. We present a novel computational framework that turns a qualitative and fragmented understanding of modules and factor-module interactions into a quantitative, systems-level view. The formalism utilizes experimentally characterized binding specificities of transcription factors and gene expression patterns to describe how multiple transcription factors (working as activators or repressors) act together in a module to determine its regulatory activity. This formalism can explain the expression patterns of known modules, infer factor-module interactions and quantify the potential of an arbitrary DNA segment to drive a gene's expression. We have also employed databases of gene expression patterns to find novel modules of the regulatory network. As databases of binding motifs and gene expression patterns grow, this new approach provides a general method to decode transcriptional regulatory sequences and networks.
Collapse
Affiliation(s)
- Majid Kazemian
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, Illinois, United States of America
| | - Charles Blatti
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, Illinois, United States of America
| | - Adam Richards
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Michael McCutchan
- Center for Evolutionary Functional Genomics, Biodesign Institute, Arizona State University, Tempe, Arizona, United States of America
| | - Noriko Wakabayashi-Ito
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Ann S. Hammonds
- Department of Genome Dynamics, Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Susan E. Celniker
- Department of Genome Dynamics, Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Sudhir Kumar
- Center for Evolutionary Functional Genomics, Biodesign Institute, Arizona State University, Tempe, Arizona, United States of America
| | - Scot A. Wolfe
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Michael H. Brodsky
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- * E-mail: (SS); (MHB)
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, Illinois, United States of America
- Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana-Champaign, Illinois, United States of America
- * E-mail: (SS); (MHB)
| |
Collapse
|