1
|
Abou Nader N, Charrier L, Meisnsohn MC, Banville L, Deffrennes B, St-Jean G, Boerboom D, Zamberlam G, Brind'Amour J, Pépin D, Boyer A. Lats1 and Lats2 regulate YAP and TAZ activity to control the development of mouse Sertoli cells. FASEB J 2024; 38:e23633. [PMID: 38690712 DOI: 10.1096/fj.202400346r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 04/11/2024] [Accepted: 04/16/2024] [Indexed: 05/02/2024]
Abstract
Recent reports suggest that the Hippo signaling pathway regulates testis development, though its exact roles in Sertoli cell differentiation remain unknown. Here, we examined the functions of the main Hippo pathway kinases, large tumor suppressor homolog kinases 1 and 2 (Lats1 and Lats2) in developing mouse Sertoli cells. Conditional inactivation of Lats1/2 in Sertoli cells resulted in the disorganization and overgrowth of the testis cords, the induction of a testicular inflammatory response and germ cell apoptosis. Stimulated by retinoic acid 8 (STRA8) expression in germ cells additionally suggested that germ cells may have been preparing to enter meiosis prior to their loss. Gene expression analyses of the developing testes of conditional knockout animals further suggested impaired Sertoli cell differentiation, epithelial-to-mesenchymal transition, and the induction of a specific set of genes associated with Yes-associated protein (YAP) and transcriptional coactivator with PDZ-binding motif (TAZ)-mediated integrin signaling. Finally, the involvement of YAP/TAZ in Sertoli cell differentiation was confirmed by concomitantly inactivating Yap/Taz in Lats1/2 conditional knockout model, which resulted in a partial rescue of the testicular phenotypic changes. Taken together, these results identify Hippo signaling as a crucial pathway for Sertoli cell development and provide novel insight into Sertoli cell fate maintenance.
Collapse
Affiliation(s)
- Nour Abou Nader
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Laureline Charrier
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Marie-Charlotte Meisnsohn
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Laurence Banville
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Bérengère Deffrennes
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
- École Nationale Vétérinaire d'Alfort, Maisons-Alfort, France
| | - Guillaume St-Jean
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Derek Boerboom
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Gustavo Zamberlam
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - Julie Brind'Amour
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| | - David Pépin
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Alexandre Boyer
- Centre de Recherche en Reproduction et Fertilité, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Quebec, Canada
| |
Collapse
|
2
|
Zhou H, Li I, Bramlett CS, Wang B, Hao J, Yen DP, Ando Y, Fraser SE, Lu R, Shen K. Label-free metabolic optical biomarkers track stem cell fate transition in real time. SCIENCE ADVANCES 2024; 10:eadi6770. [PMID: 38718114 PMCID: PMC11078180 DOI: 10.1126/sciadv.adi6770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 04/04/2024] [Indexed: 05/12/2024]
Abstract
Tracking stem cell fate transition is crucial for understanding their development and optimizing biomanufacturing. Destructive single-cell methods provide a pseudotemporal landscape of stem cell differentiation but cannot monitor stem cell fate in real time. We established a metabolic optical metric using label-free fluorescence lifetime imaging microscopy (FLIM), feature extraction and machine learning-assisted analysis, for real-time cell fate tracking. From a library of 205 metabolic optical biomarker (MOB) features, we identified 56 associated with hematopoietic stem cell (HSC) differentiation. These features collectively describe HSC fate transition and detect its bifurcate lineage choice. We further derived a MOB score measuring the "metabolic stemness" of single cells and distinguishing their division patterns. This score reveals a distinct role of asymmetric division in rescuing stem cells with compromised metabolic stemness and a unique mechanism of PI3K inhibition in promoting ex vivo HSC maintenance. MOB profiling is a powerful tool for tracking stem cell fate transition and improving their biomanufacturing from a single-cell perspective.
Collapse
Affiliation(s)
- Hao Zhou
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Irene Li
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Charles S. Bramlett
- Department of Stem Cell Biology and Regenerative Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Bowen Wang
- Department of Stem Cell Biology and Regenerative Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Jia Hao
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Daniel P. Yen
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Yuta Ando
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Scott E. Fraser
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
- Department of Stem Cell Biology and Regenerative Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Translational Imaging Center, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Rong Lu
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
- Department of Stem Cell Biology and Regenerative Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- Department of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Keyue Shen
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- USC Stem Cell, University of Southern California, Los Angeles, CA 90033, USA
| |
Collapse
|
3
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Cherry JM, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Gramates LS, Grove CA, Hale P, Harris T, Hayman GT, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Van Auken K, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources central infrastructure. Genetics 2024; 227:iyae049. [PMID: 38552170 PMCID: PMC11075569 DOI: 10.1093/genetics/iyae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/09/2024] Open
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, Caenorhabditis elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and application programming interfaces (APIs). Here, we focus on developments over the last 2 years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse software. We describe our progress toward a central persistent database to support curation, the data modeling that underpins harmonization, and progress toward a state-of-the-art literature curation system with integrated artificial intelligence and machine learning (AI/ML).
Collapse
Affiliation(s)
| | | | | | - Giulia Antonazzo
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Helen Attrill
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Susan M Bello
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Olin Blodgett
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | | | - Carol J Bult
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Brian R Calvi
- Department of Biology, Indiana University , Bloomington, IN 47408 , USA
| | - Seth Carbon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - J Michael Cherry
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Madeline A Crosby
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Jeffrey L De Pons
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | | | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Mary E Dolan
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gilberto dos Santos
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Dustin Ebert
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Stacia R Engel
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - David Fashena
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Malcolm Fisher
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Saoirse Foley
- Department of Biological Sciences, Carnegie Mellon University , 5000 Forbes Ave, Pittsburgh, PA 15203
| | - Adam C Gibson
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Varun R Gollapally
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - L Sian Gramates
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Paul Hale
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Todd Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - G Thomas Hayman
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Yanhui Hu
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christina James-Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Kamran Karimi
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Kalpana Karra
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Anne E Kwitek
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Stanley J F Laulederkind
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Raymond Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ian Longden
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Nicholas Markarian
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Beverley Matthews
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Monica S McAndrews
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gillian Millburn
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Stuart Miyasato
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Howie Motenko
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Sierra Moxon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Hans-Michael Muller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Anushya Muruganujan
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Tremayne Mushayahama
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Robert S Nash
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Holly Paddock
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Troy Pells
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Norbert Perrimon
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christian Pich
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | | | | | - Susan Russo Gelbart
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Leyla Ruzicka
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - David R Shaw
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ajay Shrivatsav
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Amy Singer
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Constance M Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Cynthia L Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Jennifer R Smith
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Tabone
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Ketaki Thorat
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jyothi Thota
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Monika Tomczuk
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Vitor Trovisco
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Marek A Tutaj
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jose-Maria Urbano
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ceri E Van Slyke
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Peter D Vize
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Shuai Weng
- Department of Genetics, Stanford University , Stanford, CA 94305
| | | | - Laurens G Wilming
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Edith D Wong
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Pinglei Zhou
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Aaron Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Mark Zytkovicz
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| |
Collapse
|
4
|
Rahit KMTH, Avramovic V, Chong JX, Tarailo-Graovac M. GPAD: a natural language processing-based application to extract the gene-disease association discovery information from OMIM. BMC Bioinformatics 2024; 25:84. [PMID: 38413851 PMCID: PMC10898068 DOI: 10.1186/s12859-024-05693-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 02/09/2024] [Indexed: 02/29/2024] Open
Abstract
BACKGROUND Thousands of genes have been associated with different Mendelian conditions. One of the valuable sources to track these gene-disease associations (GDAs) is the Online Mendelian Inheritance in Man (OMIM) database. However, most of the information in OMIM is textual, and heterogeneous (e.g. summarized by different experts), which complicates automated reading and understanding of the data. Here, we used Natural Language Processing (NLP) to make a tool (Gene-Phenotype Association Discovery (GPAD)) that could syntactically process OMIM text and extract the data of interest. RESULTS GPAD applies a series of language-based techniques to the text obtained from OMIM API to extract GDA discovery-related information. GPAD can inform when a particular gene was associated with a specific phenotype, as well as the type of validation-whether through model organisms or cohort-based patient-matching approaches-for such an association. GPAD extracted data was validated with published reports and was compared with large language model. Utilizing GPAD's extracted data, we analysed trends in GDA discoveries, noting a significant increase in their rate after the introduction of exome sequencing, rising from an average of about 150-250 discoveries each year. Contrary to hopes of resolving most GDAs for Mendelian disorders by now, our data indicate a substantial decline in discovery rates over the past five years (2017-2022). This decline appears to be linked to the increasing necessity for larger cohorts to substantiate GDAs. The rising use of zebrafish and Drosophila as model organisms in providing evidential support for GDAs is also observed. CONCLUSIONS GPAD's real-time analyzing capacity offers an up-to-date view of GDA discovery and could help in planning and managing the research strategies. In future, this solution can be extended or modified to capture other information in OMIM and scientific literature.
Collapse
Affiliation(s)
- K M Tahsin Hassan Rahit
- Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB, T2N 4N1, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, T2N 4N1, Canada
| | - Vladimir Avramovic
- Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB, T2N 4N1, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, T2N 4N1, Canada
| | - Jessica X Chong
- Division of Genetic Medicine, Department of Pediatrics, University of Washington, Seattle, WA, 98195, USA
- Brotman-Baty Institute, Seattle, WA, 98195, USA
| | - Maja Tarailo-Graovac
- Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB, T2N 4N1, Canada.
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, T2N 4N1, Canada.
| |
Collapse
|
5
|
Kousnetsov R, Bourque J, Surnov A, Fallahee I, Hawiger D. Single-cell sequencing analysis within biologically relevant dimensions. Cell Syst 2024; 15:83-103.e11. [PMID: 38198894 DOI: 10.1016/j.cels.2023.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 05/23/2023] [Accepted: 12/14/2023] [Indexed: 01/12/2024]
Abstract
The currently predominant approach to transcriptomic and epigenomic single-cell analysis depends on a rigid perspective constrained by reduced dimensions and algorithmically derived and annotated clusters. Here, we developed Seqtometry (sequencing-to-measurement), a single-cell analytical strategy based on biologically relevant dimensions enabled by advanced scoring with multiple gene sets (signatures) for examination of gene expression and accessibility across various organ systems. By utilizing information only in the form of specific signatures, Seqtometry bypasses unsupervised clustering and individual annotations of clusters. Instead, Seqtometry combines qualitative and quantitative cell-type identification with specific characterization of diverse biological processes under experimental or disease conditions. Comprehensive analysis by Seqtometry of various immune cells as well as other cells from different organs and disease-induced states, including multiple myeloma and Alzheimer's disease, surpasses corresponding cluster-based analytical output. We propose Seqtometry as a single-cell sequencing analysis approach applicable for both basic and clinical research.
Collapse
Affiliation(s)
- Robert Kousnetsov
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, St. Louis, MO, USA
| | - Jessica Bourque
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, St. Louis, MO, USA
| | - Alexey Surnov
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, St. Louis, MO, USA
| | - Ian Fallahee
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, St. Louis, MO, USA
| | - Daniel Hawiger
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
6
|
Bult CJ, Sternberg PW. The alliance of genome resources: transforming comparative genomics. Mamm Genome 2023; 34:531-544. [PMID: 37666946 PMCID: PMC10628019 DOI: 10.1007/s00335-023-10015-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 08/11/2023] [Indexed: 09/06/2023]
Abstract
Comparing genomic and biological characteristics across multiple species is essential to using model systems to investigate the molecular and cellular mechanisms underlying human biology and disease and to translate mechanistic insights from studies in model organisms for clinical applications. Building a scalable knowledge commons platform that supports cross-species comparison of rich, expertly curated knowledge regarding gene function, phenotype, and disease associations available for model organisms and humans is the primary mission of the Alliance of Genome Resources (the Alliance). The Alliance is a consortium of seven model organism knowledgebases (mouse, rat, yeast, nematode, zebrafish, frog, fruit fly) and the Gene Ontology resource. The Alliance uses a common set of gene ortholog assertions as the basis for comparing biological annotations across the organisms represented in the Alliance. The major types of knowledge associated with genes that are represented in the Alliance database currently include gene function, phenotypic alleles and variants, human disease associations, pathways, gene expression, and both protein-protein and genetic interactions. The Alliance has enhanced the ability of researchers to easily compare biological annotations for common data types across model organisms and human through the implementation of shared programmatic access mechanisms, data-specific web pages with a unified "look and feel", and interactive user interfaces specifically designed to support comparative biology. The modular infrastructure developed by the Alliance allows the resource to serve as an extensible "knowledge commons" capable of expanding to accommodate additional model organisms.
Collapse
|
7
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, Santos GD, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Auken KV, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources Central Infrastructure Alliance of Genome Resources Consortium. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.20.567935. [PMID: 38045425 PMCID: PMC10690154 DOI: 10.1101/2023.11.20.567935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and APIs. Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
Collapse
|
8
|
Owens SM, Sifford JM, Li G, Murdock SJ, Salinas E, Manzano M, Ghosh D, Stumhofer JS, Forrest JC. Intrinsic p53 Activation Restricts Gammaherpesvirus-Driven Germinal Center B Cell Expansion during Latency Establishment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.31.563188. [PMID: 37961505 PMCID: PMC10634957 DOI: 10.1101/2023.10.31.563188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Gammaherpesviruses (GHV) are DNA tumor viruses that establish lifelong latent infections in lymphocytes. For viruses such as Epstein-Barr virus (EBV) and murine gammaherpesvirus 68 (MHV68), this is accomplished through a viral gene-expression program that promotes cellular proliferation and differentiation, especially of germinal center (GC) B cells. Intrinsic host mechanisms that control virus-driven cellular expansion are incompletely defined. Using a small-animal model of GHV pathogenesis, we demonstrate in vivo that tumor suppressor p53 is activated specifically in B cells that are latently infected by MHV68. In the absence of p53, the early expansion of MHV68 latency was greatly increased, especially in GC B cells, a cell-type whose proliferation was conversely restricted by p53. We identify the B cell-specific latency gene M2, a viral promoter of GC B cell differentiation, as a viral protein sufficient to elicit a p53-dependent anti-proliferative response caused by Src-family kinase activation. We further demonstrate that EBV-encoded latent membrane protein 1 (LMP1) similarly triggers a p53 response in primary B cells. Our data highlight a model in which GHV latency gene-expression programs that promote B cell proliferation and differentiation to facilitate viral colonization of the host trigger aberrant cellular proliferation that is controlled by p53. IMPORTANCE Gammaherpesviruses cause lifelong infections of their hosts, commonly referred to as latency, that can lead to cancer. Latency establishment benefits from the functions of viral proteins that augment and amplify B cell activation, proliferation, and differentiation signals. In uninfected cells, off-schedule cellular differentiation would typically trigger anti-proliferative responses by effector proteins known as tumor suppressors. However, tumor suppressor responses to gammaherpesvirus manipulation of cellular processes remain understudied, especially those that occur during latency establishment in a living organism. Here we identify p53, a tumor suppressor commonly mutated in cancer, as a host factor that limits virus-driven B cell proliferation and differentiation, and thus, viral colonization of a host. We demonstrate that p53 activation occurs in response to viral latency proteins that induce B cell activation. This work informs a gap in our understanding of intrinsic cellular defense mechanisms that restrict lifelong GHV infection.
Collapse
|
9
|
Engel SR, Wong ED, Nash RS, Aleksander S, Alexander M, Douglass E, Karra K, Miyasato SR, Simison M, Skrzypek MS, Weng S, Cherry JM. New data and collaborations at the Saccharomyces Genome Database: updated reference genome, alleles, and the Alliance of Genome Resources. Genetics 2022; 220:iyab224. [PMID: 34897464 PMCID: PMC9209811 DOI: 10.1093/genetics/iyab224] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 11/11/2021] [Indexed: 02/03/2023] Open
Abstract
Saccharomyces cerevisiae is used to provide fundamental understanding of eukaryotic genetics, gene product function, and cellular biological processes. Saccharomyces Genome Database (SGD) has been supporting the yeast research community since 1993, serving as its de facto hub. Over the years, SGD has maintained the genetic nomenclature, chromosome maps, and functional annotation, and developed various tools and methods for analysis and curation of a variety of emerging data types. More recently, SGD and six other model organism focused knowledgebases have come together to create the Alliance of Genome Resources to develop sustainable genome information resources that promote and support the use of various model organisms to understand the genetic and genomic bases of human biology and disease. Here we describe recent activities at SGD, including the latest reference genome annotation update, the development of a curation system for mutant alleles, and new pages addressing homology across model organisms as well as the use of yeast to study human disease.
Collapse
Affiliation(s)
- Stacia R Engel
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Edith D Wong
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Robert S Nash
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Suzi Aleksander
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Micheal Alexander
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Eric Douglass
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Kalpana Karra
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Stuart R Miyasato
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Matt Simison
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Marek S Skrzypek
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Shuai Weng
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - J Michael Cherry
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| |
Collapse
|
10
|
Abstract
Since 1992, FlyBase has provided a freely available online database of information about the model organism Drosophila melanogaster. Data in FlyBase is curated manually from research papers as well as computationally from a variety of relevant sources, to serve as an information hub that enables and accelerates research discovery. This chapter aims to give users new to the database an overview of the layout and types of data available, as well as introducing some tools with which to access the data. More experienced users will find useful information about recent improvements and descriptions to enable more efficient navigation of the database.
Collapse
Affiliation(s)
| | - Aoife Larkin
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Jim Thurmond
- Department of Biology, Indiana University, Bloomington, IN, USA
| |
Collapse
|
11
|
Kaldunski ML, Smith JR, Hayman GT, Brodie K, De Pons JL, Demos WM, Gibson AC, Hill ML, Hoffman MJ, Lamers L, Laulederkind SJF, Nalabolu HS, Thorat K, Thota J, Tutaj M, Tutaj MA, Vedi M, Wang SJ, Zacher S, Dwinell MR, Kwitek AE. The Rat Genome Database (RGD) facilitates genomic and phenotypic data integration across multiple species for biomedical research. Mamm Genome 2021; 33:66-80. [PMID: 34741192 PMCID: PMC8570235 DOI: 10.1007/s00335-021-09932-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 10/21/2021] [Indexed: 01/21/2023]
Abstract
Model organism research is essential for discovering the mechanisms of human diseases by defining biologically meaningful gene to disease relationships. The Rat Genome Database (RGD, ( https://rgd.mcw.edu )) is a cross-species knowledgebase and the premier online resource for rat genetic and physiologic data. This rich resource is enhanced by the inclusion and integration of comparative data for human and mouse, as well as other human disease models including chinchilla, dog, bonobo, pig, 13-lined ground squirrel, green monkey, and naked mole-rat. Functional information has been added to records via the assignment of annotations based on sequence similarity to human, rat, and mouse genes. RGD has also imported well-supported cross-species data from external resources. To enable use of these data, RGD has developed a robust infrastructure of standardized ontologies, data formats, and disease- and species-centric portals, complemented with a suite of innovative tools for discovery and analysis. Using examples of single-gene and polygenic human diseases, we illustrate how data from multiple species can help to identify or confirm a gene as involved in a disease and to identify model organisms that can be studied to understand the pathophysiology of a gene or pathway. The ultimate aim of this report is to demonstrate the utility of RGD not only as the core resource for the rat research community but also as a source of bioinformatic tools to support a wider audience, empowering the search for appropriate models for human afflictions.
Collapse
Affiliation(s)
- M L Kaldunski
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - J R Smith
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - G T Hayman
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - K Brodie
- Clinical and Translational Science Institute, Medical College of Wisconsin, Milwaukee, WI, USA
| | - J L De Pons
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - W M Demos
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - A C Gibson
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M L Hill
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M J Hoffman
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - L Lamers
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - S J F Laulederkind
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - H S Nalabolu
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - K Thorat
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - J Thota
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M Tutaj
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M A Tutaj
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M Vedi
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - S J Wang
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - S Zacher
- Information Services, Medical College of Wisconsin, Milwaukee, WI, USA
| | - M R Dwinell
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - A E Kwitek
- Department of Biomedical Engineering, The Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA.
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA.
| |
Collapse
|
12
|
Soares EV, Soares HMVM. Harmful effects of metal(loid) oxide nanoparticles. Appl Microbiol Biotechnol 2021; 105:1379-1394. [PMID: 33521847 PMCID: PMC7847763 DOI: 10.1007/s00253-021-11124-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 01/04/2021] [Accepted: 01/16/2021] [Indexed: 02/06/2023]
Abstract
Abstract The incorporation of nanomaterials (NMs), including metal(loid) oxide (MOx) nanoparticles (NPs), in the most diversified consumer products, has grown enormously in recent decades. Consequently, the contact between humans and these materials increased, as well as their presence in the environment. This fact has raised concerns and uncertainties about the possible risks of NMs to human health and the adverse effects on the environment. These concerns underline the need and importance of assessing its nanosecurity. The present review focuses on the main mechanisms underlying the MOx NPs toxicity, illustrated with different biological models: release of toxic ions, cellular uptake of NPs, oxidative stress, shading effect on photosynthetic microorganisms, physical restrain and damage of cell wall. Additionally, the biological models used to evaluate the potential hazardous of nanomaterials are briefly presented, with particular emphasis on the yeast Saccharomyces cerevisiae, as an alternative model in nanotoxicology. An overview containing recent scientific advances on cellular responses (toxic symptoms exhibited by yeasts) resulting from the interaction with MOx NPs (inhibition of cell proliferation, cell wall damage, alteration of function and morphology of organelles, presence of oxidative stress bio-indicators, gene expression changes, genotoxicity and cell dead) is critically presented. The elucidation of the toxic modes of action of MOx NPs in yeast cells can be very useful in providing additional clues about the impact of NPs on the physiology and metabolism of the eukaryotic cell. Current and future trends of MOx NPs toxicity, regarding their possible impacts on the environment and human health, are discussed. Key points • The potential hazardous effects of MOx NPs are critically reviewed. • An overview of the main mechanisms associated with MOx NPs toxicity is presented. • Scientific advances about yeast cell responses to MOx NPs are updated and discussed.
Collapse
Affiliation(s)
- Eduardo V Soares
- Bioengineering Laboratory-CIETI, ISEP-School of Engineering, Polytechnic Institute of Porto, rua Dr António Bernardino de Almeida, 431, 4249-015, Porto, Portugal. .,CEB-Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal.
| | - Helena M V M Soares
- REQUIMTE/LAQV, Departamento de Engenharia Química, Faculdade de Engenharia, Universidade do Porto, rua Dr Roberto Frias, s/n, 4200-465, Porto, Portugal
| |
Collapse
|
13
|
Vega Yon GG, Thomas DC, Morrison J, Mi H, Thomas PD, Marjoram P. Bayesian parameter estimation for automatic annotation of gene functions using observational data and phylogenetic trees. PLoS Comput Biol 2021; 17:e1007948. [PMID: 33600408 PMCID: PMC7924801 DOI: 10.1371/journal.pcbi.1007948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 03/02/2021] [Accepted: 12/30/2020] [Indexed: 11/29/2022] Open
Abstract
Gene function annotation is important for a variety of downstream analyses of genetic data. But experimental characterization of function remains costly and slow, making computational prediction an important endeavor. Phylogenetic approaches to prediction have been developed, but implementation of a practical Bayesian framework for parameter estimation remains an outstanding challenge. We have developed a computationally efficient model of evolution of gene annotations using phylogenies based on a Bayesian framework using Markov Chain Monte Carlo for parameter estimation. Unlike previous approaches, our method is able to estimate parameters over many different phylogenetic trees and functions. The resulting parameters agree with biological intuition, such as the increased probability of function change following gene duplication. The method performs well on leave-one-out cross-validation, and we further validated some of the predictions in the experimental scientific literature.
Collapse
Affiliation(s)
- George G. Vega Yon
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Duncan C. Thomas
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - John Morrison
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Huaiyu Mi
- Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Paul D. Thomas
- Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Paul Marjoram
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
14
|
Bastian FB, Roux J, Niknejad A, Comte A, Fonseca Costa SS, de Farias TM, Moretti S, Parmentier G, de Laval VR, Rosikiewicz M, Wollbrett J, Echchiki A, Escoriza A, Gharib WH, Gonzales-Porta M, Jarosz Y, Laurenczy B, Moret P, Person E, Roelli P, Sanjeev K, Seppey M, Robinson-Rechavi M. The Bgee suite: integrated curated expression atlas and comparative transcriptomics in animals. Nucleic Acids Res 2021; 49:D831-D847. [PMID: 33037820 PMCID: PMC7778977 DOI: 10.1093/nar/gkaa793] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 08/24/2020] [Accepted: 09/15/2020] [Indexed: 01/24/2023] Open
Abstract
Bgee is a database to retrieve and compare gene expression patterns in multiple animal species, produced by integrating multiple data types (RNA-Seq, Affymetrix, in situ hybridization, and EST data). It is based exclusively on curated healthy wild-type expression data (e.g., no gene knock-out, no treatment, no disease), to provide a comparable reference of normal gene expression. Curation includes very large datasets such as GTEx (re-annotation of samples as ‘healthy’ or not) as well as many small ones. Data are integrated and made comparable between species thanks to consistent data annotation and processing, and to calls of presence/absence of expression, along with expression scores. As a result, Bgee is capable of detecting the conditions of expression of any single gene, accommodating any data type and species. Bgee provides several tools for analyses, allowing, e.g., automated comparisons of gene expression patterns within and between species, retrieval of the prefered conditions of expression of any gene, or enrichment analyses of conditions with expression of sets of genes. Bgee release 14.1 includes 29 animal species, and is available at https://bgee.org/ and through its Bioconductor R package BgeeDB.
Collapse
Affiliation(s)
- Frederic B Bastian
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Julien Roux
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Anne Niknejad
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Aurélie Comte
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Sara S Fonseca Costa
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Tarcisio Mendes de Farias
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Sébastien Moretti
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Gilles Parmentier
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Valentine Rech de Laval
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Marta Rosikiewicz
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Julien Wollbrett
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Amina Echchiki
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Angélique Escoriza
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Walid H Gharib
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Mar Gonzales-Porta
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Yohan Jarosz
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Balazs Laurenczy
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Philippe Moret
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Emilie Person
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Patrick Roelli
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Komal Sanjeev
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Mathieu Seppey
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
15
|
Kuang D, Weile J, Li R, Ouellette TW, Barber JA, Roth FP. MaveQuest: a web resource for planning experimental tests of human variant effects. Bioinformatics 2020; 36:3938-3940. [PMID: 32251504 PMCID: PMC7320626 DOI: 10.1093/bioinformatics/btaa228] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 01/27/2020] [Accepted: 04/01/2020] [Indexed: 11/22/2022] Open
Abstract
Summary Fully realizing the promise of personalized medicine will require rapid and accurate classification of pathogenic human variation. Multiplexed assays of variant effect (MAVEs) can experimentally test nearly all possible variants in selected gene targets. Planning a MAVE study involves identifying target genes with clinical impact, and identifying scalable functional assays for that target. Here, we describe MaveQuest, a web-based resource enabling systematic variant effect mapping studies by identifying potential functional assays, disease phenotypes and clinical relevance for nearly all human protein-coding genes. Availability and implementation MaveQuest service: https://mavequest.varianteffect.org/. MaveQuest source code: https://github.com/kvnkuang/mavequest-front-end/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Da Kuang
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.,Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada
| | - Jochen Weile
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.,Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada
| | - Roujia Li
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.,Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada
| | - Tom W Ouellette
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Jarry A Barber
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Frederick P Roth
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.,Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada
| |
Collapse
|
16
|
Campos TL, Korhonen PK, Sternberg PW, Gasser RB, Young ND. Predicting gene essentiality in Caenorhabditis elegans by feature engineering and machine-learning. Comput Struct Biotechnol J 2020; 18:1093-1102. [PMID: 32489524 PMCID: PMC7251299 DOI: 10.1016/j.csbj.2020.05.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Revised: 05/01/2020] [Accepted: 05/06/2020] [Indexed: 02/08/2023] Open
Abstract
Defining genes that are essential for life has major implications for understanding critical biological processes and mechanisms. Although essential genes have been identified and characterised experimentally using functional genomic tools, it is challenging to predict with confidence such genes from molecular and phenomic data sets using computational methods. Using extensive data sets available for the model organism Caenorhabditis elegans, we constructed here a machine-learning (ML)-based workflow for the prediction of essential genes on a genome-wide scale. We identified strong predictors for such genes and showed that trained ML models consistently achieve highly-accurate classifications. Complementary analyses revealed an association between essential genes and chromosomal location. Our findings reveal that essential genes in C. elegans tend to be located in or near the centre of autosomal chromosomes; are positively correlated with low single nucleotide polymorphim (SNP) densities and epigenetic markers in promoter regions; are involved in protein and nucleotide processing; are transcribed in most cells; are enriched in reproductive tissues or are targets for small RNAs bound to the argonaut CSR-1. Based on these results, we hypothesise an interplay between epigenetic markers and small RNA pathways in the germline, with transcription-based memory; this hypothesis warrants testing. From a technical perspective, further work is needed to evaluate whether the present ML-based approach will be applicable to other metazoans (including Drosophila melanogaster) for which comprehensive data sets (i.e. genomic, transcriptomic, proteomic, variomic, epigenetic and phenomic) are available.
Collapse
Key Words
- CDS, coding sequence
- CRISPR, Clustered Regularly Interspaced Short Palindromic Repeats
- Caenorhabditis elegans
- ES, Essentiality Score
- EST, expressed sequence tag
- Essential genes
- Essentiality predictions
- GBM, Gradient Boosting Method
- GFF, general feature format
- GLM, Generalised Linear Model
- GO, gene ontology
- ML, machine-learning
- Machine-learning
- NN, Artificial Neural Network
- PPI, protein-protein interaction
- PR-AUC, Area Under the Precision-Recall Curve
- RF, Random Forest
- RNAi, RNA interference
- ROC-AUC, Area Under the Receiver Operating Characteristic Curve
- SNP, single nucleotide polymorphism
- SPLS, Sparse Partial Least Squares
- SVM, Support-Vector Machine
- TEA, Tissue Enrichment Analysis tool (WormBase)
- TSS, transcription start site
- VCF, variant call file
Collapse
Affiliation(s)
- Tulio L Campos
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia.,Instituto Aggeu Magalhães, Fundação Oswaldo Cruz (IAM-Fiocruz), Recife, Pernambuco, Brazil
| | - Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Paul W Sternberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, United States
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
17
|
Thurmond J, Goodman JL, Strelets VB, Attrill H, Gramates LS, Marygold SJ, Matthews BB, Millburn G, Antonazzo G, Trovisco V, Kaufman TC, Calvi BR. FlyBase 2.0: the next generation. Nucleic Acids Res 2020; 47:D759-D765. [PMID: 30364959 PMCID: PMC6323960 DOI: 10.1093/nar/gky1003] [Citation(s) in RCA: 489] [Impact Index Per Article: 122.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 10/09/2018] [Indexed: 01/01/2023] Open
Abstract
FlyBase (flybase.org) is a knowledge base that supports the community of researchers that use the fruit fly, Drosophila melanogaster, as a model organism. The FlyBase team curates and organizes a diverse array of genetic, molecular, genomic, and developmental information about Drosophila. At the beginning of 2018, ‘FlyBase 2.0’ was released with a significantly improved user interface and new tools. Among these important changes are a new organization of search results into interactive lists or tables (hitlists), enhanced reference lists, and new protein domain graphics. An important new data class called ‘experimental tools’ consolidates information on useful fly strains and other resources related to a specific gene, which significantly enhances the ability of the Drosophila researcher to design and carry out experiments. With the release of FlyBase 2.0, there has also been a restructuring of backend architecture and a continued development of application programming interfaces (APIs) for programmatic access to FlyBase data. In this review, we describe these major new features and functionalities of the FlyBase 2.0 site and how they support the use of Drosophila as a model organism for biological discovery and translational research.
Collapse
Affiliation(s)
- Jim Thurmond
- Department of Biology, Indiana University, Bloomington, IN 47408, USA
| | - Joshua L Goodman
- Department of Biology, Indiana University, Bloomington, IN 47408, USA
| | - Victor B Strelets
- Department of Biology, Indiana University, Bloomington, IN 47408, USA
| | - Helen Attrill
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
| | - L Sian Gramates
- The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
| | - Beverley B Matthews
- The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Gillian Millburn
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
| | - Giulia Antonazzo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
| | - Vitor Trovisco
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
| | - Thomas C Kaufman
- Department of Biology, Indiana University, Bloomington, IN 47408, USA
| | - Brian R Calvi
- Department of Biology, Indiana University, Bloomington, IN 47408, USA
| | | |
Collapse
|
18
|
Ruzicka L, Howe DG, Ramachandran S, Toro S, Van Slyke CE, Bradford YM, Eagle A, Fashena D, Frazer K, Kalita P, Mani P, Martin R, Moxon ST, Paddock H, Pich C, Schaper K, Shao X, Singer A, Westerfield M. The Zebrafish Information Network: new support for non-coding genes, richer Gene Ontology annotations and the Alliance of Genome Resources. Nucleic Acids Res 2020; 47:D867-D873. [PMID: 30407545 PMCID: PMC6323962 DOI: 10.1093/nar/gky1090] [Citation(s) in RCA: 95] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2018] [Accepted: 10/25/2018] [Indexed: 11/13/2022] Open
Abstract
The Zebrafish Information Network (ZFIN) (https://zfin.org/) is the database for the model organism, zebrafish (Danio rerio). ZFIN expertly curates, organizes and provides a wide array of zebrafish genetic and genomic data, including genes, alleles, transgenic lines, gene expression, gene function, mutant phenotypes, orthology, human disease models, nomenclature and reagents. New features at ZFIN include increased support for genomic regions and for non-coding genes, and support for more expressive Gene Ontology annotations. ZFIN has recently taken over maintenance of the zebrafish reference genome sequence as part of the Genome Reference Consortium. ZFIN is also a founding member of the Alliance of Genome Resources, a collaboration of six model organism databases (MODs) and the Gene Ontology Consortium (GO). The recently launched Alliance portal (https://alliancegenome.org) provides a unified, comparative view of MOD, GO, and human data, and facilitates foundational and translational biomedical research.
Collapse
Affiliation(s)
- Leyla Ruzicka
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Douglas G Howe
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | | | - Sabrina Toro
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ceri E Van Slyke
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Yvonne M Bradford
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Anne Eagle
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - David Fashena
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ken Frazer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Patrick Kalita
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Prita Mani
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ryan Martin
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Sierra Taylor Moxon
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Holly Paddock
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Christian Pich
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Kevin Schaper
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Xiang Shao
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Amy Singer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Monte Westerfield
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| |
Collapse
|
19
|
Bult CJ, Blake JA, Smith CL, Kadin JA, Richardson JE. Mouse Genome Database (MGD) 2019. Nucleic Acids Res 2020; 47:D801-D806. [PMID: 30407599 PMCID: PMC6323923 DOI: 10.1093/nar/gky1056] [Citation(s) in RCA: 461] [Impact Index Per Article: 115.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 10/30/2018] [Indexed: 01/19/2023] Open
Abstract
The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the community model organism genetic and genome resource for the laboratory mouse. MGD is the authoritative source for biological reference data sets related to mouse genes, gene functions, phenotypes, and mouse models of human disease. MGD is the primary outlet for official gene, allele and mouse strain nomenclature based on the guidelines set by the International Committee on Standardized Nomenclature for Mice. In this report we describe significant enhancements to MGD, including two new graphical user interfaces: (i) the Multi Genome Viewer for exploring the genomes of multiple mouse strains and (ii) the Phenotype-Gene Expression matrix which was developed in collaboration with the Gene Expression Database (GXD) and allows researchers to compare gene expression and phenotype annotations for mouse genes. Other recent improvements include enhanced efficiency of our literature curation processes and the incorporation of Transcriptional Start Site (TSS) annotations from RIKEN's FANTOM 5 initiative.
Collapse
Affiliation(s)
- Carol J Bult
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Judith A Blake
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Cynthia L Smith
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - James A Kadin
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | | | | |
Collapse
|
20
|
Hahn ME, Sadler KC. Casting a wide net: use of diverse model organisms to advance toxicology. Dis Model Mech 2020; 13:dmm043844. [PMID: 32094287 PMCID: PMC7132827 DOI: 10.1242/dmm.043844] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Toxicology - the study of how chemicals interact with biological systems - has clear relevance to human health and disease. Persistent exposure to natural and synthetic chemicals is an unavoidable part of living on our planet; yet, we understand very little about the effects of exposure to the vast majority of chemicals. While epidemiological studies can provide strong statistical inference linking chemical exposure to disease, research in model systems is essential to elucidate the mechanisms of action and to predict outcomes. Most research in toxicology utilizes a handful of mammalian models that represent a few distinct branches of the evolutionary tree. This narrow focus constrains the understanding of chemical-induced disease processes and systems that have evolved in response to exposures. We advocate for casting a wider net in environmental toxicology research to utilize diverse model systems, including zebrafish, and perform more mechanistic studies of cellular responses to chemical exposures to shift the perception of toxicology as an applied science to that of a basic science. This more-inclusive perspective will enrich the field and should remain central to research on chemical-induced disease.
Collapse
Affiliation(s)
- Mark E Hahn
- Biology Department, Woods Hole Oceanographic Institution, 45 Water Street, Woods Hole, MA 02543, USA
- Boston University Superfund Research Program, Boston, MA 02118, USA
| | | |
Collapse
|
21
|
Antonazzo G, Urbano JM, Marygold SJ, Millburn GH, Brown NH. Building a pipeline to solicit expert knowledge from the community to aid gene summary curation. Database (Oxford) 2020; 2020:baz152. [PMID: 31960022 PMCID: PMC6971343 DOI: 10.1093/database/baz152] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 10/29/2019] [Accepted: 12/12/2019] [Indexed: 11/25/2022]
Abstract
Brief summaries describing the function of each gene's product(s) are of great value to the research community, especially when interpreting genome-wide studies that reveal changes to hundreds of genes. However, manually writing such summaries, even for a single species, is a daunting task; for example, the Drosophila melanogaster genome contains almost 14 000 protein-coding genes. One solution is to use computational methods to generate summaries, but this often fails to capture the key functions or express them eloquently. Here, we describe how we solicited help from the research community to generate manually written summaries of D. melanogaster gene function. Based on the data within the FlyBase database, we developed a computational pipeline to identify researchers who have worked extensively on each gene. We e-mailed these researchers to ask them to draft a brief summary of the main function(s) of the gene's product, which we edited for consistency to produce a 'gene snapshot'. This approach yielded 1800 gene snapshot submissions within a 3-month period. We discuss the general utility of this strategy for other databases that capture data from the research literature. Database URL: https://flybase.org/.
Collapse
Affiliation(s)
- Giulia Antonazzo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Jose M Urbano
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Gillian H Millburn
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Nicholas H Brown
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| |
Collapse
|
22
|
Kishore R, Arnaboldi V, Van Slyke CE, Chan J, Nash RS, Urbano JM, Dolan ME, Engel SR, Shimoyama M, Sternberg PW, Genome Resources TAO. Automated generation of gene summaries at the Alliance of Genome Resources. Database (Oxford) 2020; 2020:baaa037. [PMID: 32559296 PMCID: PMC7304461 DOI: 10.1093/database/baaa037] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/06/2020] [Accepted: 04/29/2020] [Indexed: 12/28/2022]
Abstract
Short paragraphs that describe gene function, referred to as gene summaries, are valued by users of biological knowledgebases for the ease with which they convey key aspects of gene function. Manual curation of gene summaries, while desirable, is difficult for knowledgebases to sustain. We developed an algorithm that uses curated, structured gene data at the Alliance of Genome Resources (Alliance; www.alliancegenome.org) to automatically generate gene summaries that simulate natural language. The gene data used for this purpose include curated associations (annotations) to ontology terms from the Gene Ontology, Disease Ontology, model organism knowledgebase (MOK)-specific anatomy ontologies and Alliance orthology data. The method uses sentence templates for each data category included in the gene summary in order to build a natural language sentence from the list of terms associated with each gene. To improve readability of the summaries when numerous gene annotations are present, we developed a new algorithm that traverses ontology graphs in order to group terms by their common ancestors. The algorithm optimizes the coverage of the initial set of terms and limits the length of the final summary, using measures of information content of each ontology term as a criterion for inclusion in the summary. The automated gene summaries are generated with each Alliance release, ensuring that they reflect current data at the Alliance. Our method effectively leverages category-specific curation efforts of the Alliance member databases to create modular, structured and standardized gene summaries for seven member species of the Alliance. These automatically generated gene summaries make cross-species gene function comparisons tenable and increase discoverability of potential models of human disease. In addition to being displayed on Alliance gene pages, these summaries are also included on several MOK gene pages.
Collapse
Affiliation(s)
- Ranjana Kishore
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Valerio Arnaboldi
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Ceri E Van Slyke
- ZFIN, The Institute of Neuroscience, 222 Huestis Hall, University of Oregon, Eugene, OR 97403-1254, USA
| | - Juancarlos Chan
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Robert S Nash
- Saccharomyces Genome Database, Department of Genetics, Stanford University, 3165 Porter Drive, Palo Alto, CA 94304, USA
| | - Jose M Urbano
- FlyBase, Department of Physiology, Development and Neuroscience, 7 Downing Pl, University of Cambridge, Cambridge CB2 3DY, UK
| | - Mary E Dolan
- MGI, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Stacia R Engel
- Saccharomyces Genome Database, Department of Genetics, Stanford University, 3165 Porter Drive, Palo Alto, CA 94304, USA
| | - Mary Shimoyama
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin and Marquette University, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA
| | - Paul W Sternberg
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | | |
Collapse
|
23
|
Rotelli MD, Bolling AM, Killion AW, Weinberg AJ, Dixon MJ, Calvi BR. An RNAi Screen for Genes Required for Growth of Drosophila Wing Tissue. G3 (BETHESDA, MD.) 2019; 9:3087-3100. [PMID: 31387856 PMCID: PMC6778782 DOI: 10.1534/g3.119.400581] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 07/31/2019] [Indexed: 12/23/2022]
Abstract
Cell division and tissue growth must be coordinated with development. Defects in these processes are the basis for a number of diseases, including developmental malformations and cancer. We have conducted an unbiased RNAi screen for genes that are required for growth in the Drosophila wing, using GAL4-inducible short hairpin RNA (shRNA) fly strains made by the Drosophila RNAi Screening Center. shRNA expression down the center of the larval wing disc using dpp-GAL4, and the central region of the adult wing was then scored for tissue growth and wing hair morphology. Out of 4,753 shRNA crosses that survived to adulthood, 18 had impaired wing growth. FlyBase and the new Alliance of Genome Resources knowledgebases were used to determine the known or predicted functions of these genes and the association of their human orthologs with disease. The function of eight of the genes identified has not been previously defined in Drosophila The genes identified included those with known or predicted functions in cell cycle, chromosome segregation, morphogenesis, metabolism, steroid processing, transcription, and translation. All but one of the genes are similar to those in humans, and many are associated with disease. Knockdown of lin-52, a subunit of the Myb-MuvB transcription factor, or βNACtes6, a gene involved in protein folding and trafficking, resulted in a switch from cell proliferation to an endoreplication growth program through which wing tissue grew by an increase in cell size (hypertrophy). It is anticipated that further analysis of the genes that we have identified will reveal new mechanisms that regulate tissue growth during development.
Collapse
Affiliation(s)
- Michael D Rotelli
- Department of Biology, Indiana University, Bloomington, IN 47405 and
| | - Anna M Bolling
- Department of Biology, Indiana University, Bloomington, IN 47405 and
| | - Andrew W Killion
- Department of Biology, Indiana University, Bloomington, IN 47405 and
| | | | - Michael J Dixon
- Department of Biology, Indiana University, Bloomington, IN 47405 and
| | - Brian R Calvi
- Department of Biology, Indiana University, Bloomington, IN 47405 and
- Melvin and Bren Simon Cancer Center, Indiana University, Indianapolis, IN 46202
| |
Collapse
|
24
|
Arias-Jayo N, Abecia L, Lavín JL, Tueros I, Arranz S, Ramírez-García A, Pardo MA. Host-microbiome interactions in response to a high-saturated fat diet and fish-oil supplementation in zebrafish adult. J Funct Foods 2019. [DOI: 10.1016/j.jff.2019.103416] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
|
25
|
Nenni MJ, Fisher ME, James-Zorn C, Pells TJ, Ponferrada V, Chu S, Fortriede JD, Burns KA, Wang Y, Lotay VS, Wang DZ, Segerdell E, Chaturvedi P, Karimi K, Vize PD, Zorn AM. Xenbase: Facilitating the Use of Xenopus to Model Human Disease. Front Physiol 2019; 10:154. [PMID: 30863320 PMCID: PMC6399412 DOI: 10.3389/fphys.2019.00154] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Accepted: 02/08/2019] [Indexed: 01/02/2023] Open
Abstract
At a fundamental level most genes, signaling pathways, biological functions and organ systems are highly conserved between man and all vertebrate species. Leveraging this conservation, researchers are increasingly using the experimental advantages of the amphibian Xenopus to model human disease. The online Xenopus resource, Xenbase, enables human disease modeling by curating the Xenopus literature published in PubMed and integrating these Xenopus data with orthologous human genes, anatomy, and more recently with links to the Online Mendelian Inheritance in Man resource (OMIM) and the Human Disease Ontology (DO). Here we review how Xenbase supports disease modeling and report on a meta-analysis of the published Xenopus research providing an overview of the different types of diseases being modeled in Xenopus and the variety of experimental approaches being used. Text mining of over 50,000 Xenopus research articles imported into Xenbase from PubMed identified approximately 1,000 putative disease- modeling articles. These articles were manually assessed and annotated with disease ontologies, which were then used to classify papers based on disease type. We found that Xenopus is being used to study a diverse array of disease with three main experimental approaches: cell-free egg extracts to study fundamental aspects of cellular and molecular biology, oocytes to study ion transport and channel physiology and embryo experiments focused on congenital diseases. We integrated these data into Xenbase Disease Pages to allow easy navigation to disease information on external databases. Results of this analysis will equip Xenopus researchers with a suite of experimental approaches available to model or dissect a pathological process. Ideally clinicians and basic researchers will use this information to foster collaborations necessary to interrogate the development and treatment of human diseases.
Collapse
Affiliation(s)
- Mardi J Nenni
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Malcolm E Fisher
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Christina James-Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Troy J Pells
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Virgilio Ponferrada
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Stanley Chu
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Joshua D Fortriede
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Kevin A Burns
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Ying Wang
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Vaneet S Lotay
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Dong Zhou Wang
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Erik Segerdell
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, United States
| | - Praneet Chaturvedi
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| | - Kamran Karimi
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Peter D Vize
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Aaron M Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, United States
| |
Collapse
|
26
|
Garapati PV, Zhang J, Rey AJ, Marygold SJ. Towards comprehensive annotation of Drosophila melanogaster enzymes in FlyBase. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2019; 2019:5298334. [PMID: 30689844 PMCID: PMC6343044 DOI: 10.1093/database/bay144] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 12/18/2018] [Indexed: 11/13/2022]
Abstract
The catalytic activities of enzymes can be described using Gene Ontology (GO) terms and Enzyme Commission (EC) numbers. These annotations are available from numerous biological databases and are routinely accessed by researchers and bioinformaticians to direct their work. However, enzyme data may not be congruent between different resources, while the origin, quality and genomic coverage of these data within any one resource are often unclear. GO/EC annotations are assigned either manually by expert curators or inferred computationally, and there is potential for errors in both types of annotation. If such errors remain unchecked, false positive annotations may be propagated across multiple resources, significantly degrading the quality and usefulness of these data. Similarly, the absence of annotations (false negatives) from any one resource can lead to incorrect inferences or conclusions. We are systematically reviewing and enhancing the functional annotation of the enzymes of Drosophila melanogaster, focusing on improvements within the FlyBase (www.flybase.org) database. We have reviewed four major enzyme groups to date: oxidoreductases, lyases, isomerases and ligases. Herein, we describe our review workflow, the improvement in the quality and coverage of enzyme annotations within FlyBase and the wider impact of our work on other related databases.
Collapse
Affiliation(s)
- Phani V Garapati
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Jingyao Zhang
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Alix J Rey
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| |
Collapse
|