1
Corominas M, Marquès-Bonet T, Arnedo M, Bayés M, Belmonte J, Escrivà H, Fernández R, Gabaldón T, Garnatje T, Germain J, Niell M, Palero F, Pons J, Puigdomènech P, Arroyo V, Cuevas-Caballé C, Obiol JF, Gut I, Gut M, Hidalgo O, Izquierdo-Arànega G, Pérez-Sorribes L, Righi E, Riutort M, Vallès J, Rozas J, Alioto T, Guigó R. The Catalan initiative for the Earth BioGenome Project: contributing local data to global biodiversity genomics. NAR Genom Bioinform 2024; 6:lqae075. [PMID: 39022326 PMCID: PMC11252852 DOI: 10.1093/nargab/lqae075] [Received: 12/29/2023] [Revised: 05/10/2024] [Accepted: 06/19/2024] [Indexed: 07/20/2024] [Open access]
Abstract
The Catalan Initiative for the Earth BioGenome Project (CBP) is an EBP-affiliated project network aimed at sequencing the genomes of the more than 40 000 eukaryotic species estimated to live in the Catalan-speaking territories (Catalan Linguistic Area, CLA). These territories represent a biodiversity hotspot. While covering less than 1% of Europe, they are home to about one fourth of all known European eukaryotic species. These include a high proportion of endemic species, many of which are threatened. This trend is likely to get worse, as the effects of global change are expected to be particularly severe across the Mediterranean Basin, especially in freshwater ecosystems and mountain areas. Following the EBP model, the CBP is a networked organization that has been able to engage many scientific and non-scientific partners. In the pilot phase, the genomes of 52 species are being sequenced. As a case study in biodiversity conservation, we highlight the genome of the Balearic shearwater Puffinus mauretanicus, sequenced under the CBP umbrella.
Affiliation(s)
- Montserrat Corominas
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Biomedicina (IBUB), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
- Tomàs Marquès-Bonet
  - Institute of Evolutionary Biology (IBE, UPF-CSIC), PRBB, 08003 Barcelona, Spain
  - Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
  - Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
  - Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Barcelona, Spain
- Miquel A Arnedo
  - Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Mònica Bayés
  - Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
  - Universitat de Barcelona (UB), 08028 Barcelona, Spain
- Jordina Belmonte
  - Departament de Biologia Animal, Biologia Vegetal i Ecologia, Facultat de Biociències, Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Catalonia, Spain
  - Institut de Ciència i Tecnologia Ambientals (ICTA-UAB), Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Catalonia, Spain
- Hector Escrivà
  - Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins, BIOM, F-66650, Banyuls-sur-Mer, France
- Rosa Fernández
  - Institute of Evolutionary Biology (IBE, UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Toni Gabaldón
  - Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
  - Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
  - CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
- Teresa Garnatje
  - Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
  - Jardí Botànic Marimurtra - Fundació Carl Faust, 17300 Blanes, Catalonia, Spain
- Josep Germain
  - Institució Catalana d’Història Natural, 08001 Barcelona, Catalonia, Spain
- Manel Niell
  - Andorra Recerca + Innovació (ARI), AD600 Sant Julià de Lòria, Andorra
- Ferran Palero
  - Institut Cavanilles de Biodiversitat i Biologia Evolutiva (ICBIBE), Paterna, Valencia, Spain
- Joan Pons
  - Departament de Biodiversitat Animal i Microbiana, Institut Mediterrani d’Estudis Avançats (CSIC-UIB), 07190 Esporles, Illes Balears, Spain
- Pere Puigdomènech
  - Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
  - Centre de Recerca en Agrigenòmica, CSIC/IRTA/UAB/UB, 08193 Bellaterra, Catalonia, Spain
- Vanesa Arroyo
  - Andorra Recerca + Innovació (ARI), AD600 Sant Julià de Lòria, Andorra
- Cristian Cuevas-Caballé
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Joan Ferrer Obiol
  - Department of Environmental Science and Policy, University of Milan, Milan, Italy
- Ivo Gut
  - Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
  - Universitat de Barcelona (UB), 08028 Barcelona, Spain
- Marta Gut
  - Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
  - Universitat de Barcelona (UB), 08028 Barcelona, Spain
- Oriane Hidalgo
  - CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
  - Royal Botanic Gardens, Kew, TW9 3DS Richmond, UK
- Guillem Izquierdo-Arànega
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Laia Pérez-Sorribes
  - Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
  - Estación Biológica de Doñana, CSIC, 41092 Sevilla, Spain
- Emilio Righi
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Catalonia, Spain
- Marta Riutort
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Joan Vallès
  - Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Laboratori de Botànica (UB), Unitat Associada al CSIC, Facultat de Farmàcia i Ciències de l’Alimentació, Universitat de Barcelona, 08028 Barcelona, Catalonia, Spain
- Julio Rozas
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
  - Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Tyler Alioto
  - Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
  - Universitat de Barcelona (UB), 08028 Barcelona, Spain
- Roderic Guigó
  - Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Catalonia, Spain
  - Universitat Pompeu Fabra (UPF), 08003 Barcelona, Catalonia, Spain
2
Garello M, Piombo E, Buonsenso F, Prencipe S, Valente S, Meloni GR, Marcet-Houben M, Gabaldón T, Spadaro D. Several secondary metabolite gene clusters in the genomes of ten Penicillium spp. raise the risk of multiple mycotoxin occurrence in chestnuts. Food Microbiol 2024; 122:104532. [PMID: 38839238 DOI: 10.1016/j.fm.2024.104532] [Received: 01/06/2024] [Revised: 03/14/2024] [Accepted: 04/02/2024] [Indexed: 06/07/2024]
Abstract
Penicillium spp. produce a great variety of secondary metabolites, including several mycotoxins, on food substrates. Chestnuts represent a favorable substrate for Penicillium spp. development. In this study, the genomes of ten Penicillium species, virulent on chestnuts, were sequenced and annotated: P. bialowiezense, P. pancosmium, P. manginii, P. discolor, P. crustosum, P. palitans, P. viridicatum, P. glandicola, P. taurinense and P. terrarumae. Assembly size ranges from 27.5 to 36.8 Mb and the number of encoded genes ranges from 9,867 to 12,520. The total number of predicted biosynthetic gene clusters (BGCs) in the ten species is 551. The most represented families of BGCs are non-ribosomal peptide synthetases (191) and polyketide synthases (175), followed by terpene synthases (87). Genome-wide collections of gene phylogenies (phylomes) were reconstructed for each of the newly sequenced Penicillium species, allowing for the prediction of orthologous relationships among our species, as well as 20 other annotated Penicillium species available in the public domain. We investigated in silico the presence of BGCs for 10 secondary metabolites, including 5 mycotoxins, whose production was validated in vivo through chemical analyses. Among the clusters present in this set of species we found andrastin A and its related cluster atlantinone A, mycophenolic acid, patulin, penitrem A and the cluster responsible for the synthesis of roquefortine C/glandicoline A/glandicoline B/meleagrin. We confirmed the presence of these clusters in several of the Penicillium species making up our dataset and verified by chemical analysis that these species can synthesize the corresponding metabolites in a chestnut-based medium. Interestingly, we identified mycotoxin clusters in some species for the first time, such as the andrastin A cluster in P. flavigenum and P. taurinense, and the roquefortine C cluster in P. nalgiovense and P. taurinense.
Chestnuts proved to be an optimal substrate for species of Penicillium with different mycotoxigenic potential, opening the door to risks related to the occurrence of multiple mycotoxins in the same food matrix.
Affiliation(s)
- Marco Garello
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
  - AGROINNOVA - Interdepartmental Centre for the Innovation in the Agro-Environmental Sector, University of Torino, Largo Braccini 2, 10095, Grugliasco, TO, Italy
- Edoardo Piombo
  - Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Almas Allé 5, 75651, Uppsala, Sweden
- Fabio Buonsenso
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
  - AGROINNOVA - Interdepartmental Centre for the Innovation in the Agro-Environmental Sector, University of Torino, Largo Braccini 2, 10095, Grugliasco, TO, Italy
- Simona Prencipe
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
- Silvia Valente
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
  - AGROINNOVA - Interdepartmental Centre for the Innovation in the Agro-Environmental Sector, University of Torino, Largo Braccini 2, 10095, Grugliasco, TO, Italy
- Giovanna Roberta Meloni
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
  - AGROINNOVA - Interdepartmental Centre for the Innovation in the Agro-Environmental Sector, University of Torino, Largo Braccini 2, 10095, Grugliasco, TO, Italy
- Marina Marcet-Houben
  - Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
- Toni Gabaldón
  - Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
  - Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
  - CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
- Davide Spadaro
  - Department of Agricultural, Forest and Food Sciences (DISAFA), University of Turin, Largo Braccini 2, 10095, Grugliasco, TO, Italy
  - AGROINNOVA - Interdepartmental Centre for the Innovation in the Agro-Environmental Sector, University of Torino, Largo Braccini 2, 10095, Grugliasco, TO, Italy
3
Cosentino S, Sriswasdi S, Iwasaki W. SonicParanoid2: fast, accurate, and comprehensive orthology inference with machine learning and language models. Genome Biol 2024; 25:195. [PMID: 39054525 PMCID: PMC11270883 DOI: 10.1186/s13059-024-03298-4] [Received: 07/11/2023] [Accepted: 06/04/2024] [Indexed: 07/27/2024] [Open access]
Abstract
Accurate inference of orthologous genes constitutes a prerequisite for comparative and evolutionary genomics. SonicParanoid is one of the fastest tools for orthology inference; however, its scalability and accuracy have been hampered by time-consuming all-versus-all alignments and the existence of proteins with complex domain architectures. Here, we present a substantial update of SonicParanoid, where a gradient boosting predictor halves the execution time and a language model doubles the recall. Application to empirical large-scale and standardized benchmark datasets shows that SonicParanoid2 is much faster than comparable methods and also the most accurate. SonicParanoid2 is available at https://gitlab.com/salvo981/sonicparanoid2 and https://zenodo.org/doi/10.5281/zenodo.11371108.
Affiliation(s)
- Salvatore Cosentino
  - Department of Integrated Biosciences, Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
- Sira Sriswasdi
  - Center of Excellence in Computational Molecular Biology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Wataru Iwasaki
  - Department of Integrated Biosciences, Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
  - Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Bunkyo-ku, Japan
  - Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
  - Atmosphere and Ocean Research Institute, the University of Tokyo, Kashiwa, Japan
  - Institute for Quantitative Biosciences, the University of Tokyo, Bunkyo-ku, Japan
  - Collaborative Research Institute for Innovative Microbiology, the University of Tokyo, Bunkyo-ku, Japan
4
Rossier V, Train C, Nevers Y, Robinson-Rechavi M, Dessimoz C. Matreex: Compact and Interactive Visualization for Scalable Studies of Large Gene Families. Genome Biol Evol 2024; 16:evae100. [PMID: 38742690 PMCID: PMC11149776 DOI: 10.1093/gbe/evae100] [Received: 11/15/2023] [Revised: 04/17/2024] [Accepted: 05/03/2024] [Indexed: 05/16/2024] [Open access]
Abstract
Studying gene family evolution strongly benefits from insightful visualizations. However, the ever-growing number of sequenced genomes is leading to increasingly larger gene families, which challenges existing gene tree visualizations. Indeed, most of them present users with a dilemma: display complete but intractable gene trees, or collapse subtrees, thereby hiding their children's information. Here, we introduce Matreex, a new dynamic tool to scale up the visualization of gene families. Matreex's key idea is to use "phylogenetic" profiles, which are dense representations of gene repertoires, to minimize the information loss when collapsing subtrees. We illustrate Matreex's usefulness with three biological applications. First, we demonstrate on the MutS family the power of combining gene trees and phylogenetic profiles to delve into precise evolutionary analyses of large multicopy gene families. Second, by displaying 22 intraflagellar transport gene families across 622 species cumulating 5,500 representatives, we show how Matreex can be used to automate large-scale analyses of gene presence-absence. Notably, we report for the first time the complete loss of intraflagellar transport in the myxozoan Thelohanellus kitauei. Finally, using the textbook example of visual opsins, we show Matreex's potential to create easily interpretable figures for teaching and outreach. Matreex is available from the Python Package Index (pip install Matreex) with the source code and documentation available at https://github.com/DessimozLab/matreex.
Affiliation(s)
- Victor Rossier
  - Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
  - SIB Swiss Institute of Bioinformatics, Comparative Genomics, Lausanne, Switzerland
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Clement Train
  - Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Yannis Nevers
  - Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
  - SIB Swiss Institute of Bioinformatics, Comparative Genomics, Lausanne, Switzerland
- Marc Robinson-Rechavi
  - SIB Swiss Institute of Bioinformatics, Comparative Genomics, Lausanne, Switzerland
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Christophe Dessimoz
  - Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
  - SIB Swiss Institute of Bioinformatics, Comparative Genomics, Lausanne, Switzerland
5
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Cherry JM, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Gramates LS, Grove CA, Hale P, Harris T, Hayman GT, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Van Auken K, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources central infrastructure. Genetics 2024; 227:iyae049. [PMID: 38552170 PMCID: PMC11075569 DOI: 10.1093/genetics/iyae049] [Received: 11/20/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/09/2024] [Open access]
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, Caenorhabditis elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and application programming interfaces (APIs). Here, we focus on developments over the last 2 years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse software. We describe our progress toward a central persistent database to support curation, the data modeling that underpins harmonization, and progress toward a state-of-the-art literature curation system with integrated artificial intelligence and machine learning (AI/ML).
Affiliation(s)
- Giulia Antonazzo
  - Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
- Valerio Arnaboldi
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Helen Attrill
  - Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
- Andrés Becerra
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Susan M Bello
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Olin Blodgett
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Carol J Bult
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Scott Cain
  - Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- Brian R Calvi
  - Department of Biology, Indiana University, Bloomington, IN 47408, USA
- Seth Carbon
  - Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA
- Juancarlos Chan
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Wen J Chen
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- J Michael Cherry
  - Department of Genetics, Stanford University, Stanford, CA 94305
- Jaehyoung Cho
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Madeline A Crosby
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
- Jeffrey L De Pons
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Stavros Diamantakis
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Mary E Dolan
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Gilberto dos Santos
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
- Sarah Dyer
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Dustin Ebert
  - Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
- Stacia R Engel
  - Department of Genetics, Stanford University, Stanford, CA 94305
- David Fashena
  - Institute of Neuroscience, University of Oregon, Eugene, OR 97403
- Malcolm Fisher
  - Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, Cincinnati, OH 45229, USA
- Saoirse Foley
  - Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15203
- Adam C Gibson
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Varun R Gollapally
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- L Sian Gramates
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
- Christian A Grove
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Paul Hale
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Todd Harris
  - Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- G Thomas Hayman
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Yanhui Hu
  - Department of Genetics, Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
- Christina James-Zorn
  - Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, Cincinnati, OH 45229, USA
- Kamran Karimi
  - Department of Biological Sciences, University of Calgary, 507 Campus Dr NW, Calgary, AB T2N 4V8, Canada
- Kalpana Karra
  - Department of Genetics, Stanford University, Stanford, CA 94305
- Ranjana Kishore
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Anne E Kwitek
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Stanley J F Laulederkind
  - Medical College of Wisconsin - Rat Genome Database, Departments of Physiology and Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Raymond Lee
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Ian Longden
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
- Manuel Luypaert
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Nicholas Markarian
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Steven J Marygold
  - Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
- Beverley Matthews
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
- Monica S McAndrews
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Gillian Millburn
  - Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK
- Stuart Miyasato
  - Department of Genetics, Stanford University, Stanford, CA 94305
- Howie Motenko
  - The Jackson Laboratory for Mammalian Genomics, Bar Harbor, ME 04609, USA
- Sierra Moxon
  - Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA
- Hans-Michael Muller
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Christopher J Mungall
  - Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA
- Anushya Muruganujan
  - Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
- Tremayne Mushayahama
  - Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
- Robert S Nash
  - Department of Genetics, Stanford University, Stanford, CA 94305
- Paulo Nuin
  - Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- Holly Paddock
  - Institute of Neuroscience, University of Oregon, Eugene, OR 97403
- Troy Pells
  - Department of Biological Sciences, University of Calgary, 507 Campus Dr NW, Calgary, AB T2N 4V8, Canada
- Norbert Perrimon
  - Department of Genetics, Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
- Christian Pich
  - Institute of Neuroscience, University of Oregon, Eugene, OR 97403
- Mark Quinton-Tulloch
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Daniela Raciti
  - Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
- Susan Russo Gelbart
  - The Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Leyla Ruzicka
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - David R Shaw
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ajay Shrivatsav
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Amy Singer
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Constance M Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Cynthia L Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Jennifer R Smith
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Tabone
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Ketaki Thorat
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jyothi Thota
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Monika Tomczuk
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Vitor Trovisco
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Marek A Tutaj
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jose-Maria Urbano
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ceri E Van Slyke
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Peter D Vize
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Shuai Weng
- Department of Genetics, Stanford University , Stanford, CA 94305
| | | | - Laurens G Wilming
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Edith D Wong
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Pinglei Zhou
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Aaron Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Mark Zytkovicz
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
6
Hendi NN, Nemer G. In silico characterization of the novel SDR42E1 as a potential vitamin D modulator. J Steroid Biochem Mol Biol 2024; 238:106447. [PMID: 38160768 DOI: 10.1016/j.jsbmb.2023.106447] [Received: 09/24/2023] [Revised: 12/15/2023] [Accepted: 12/15/2023] [Indexed: 01/03/2024]
Abstract
The short-chain dehydrogenase/reductase (SDR) superfamily encompasses enzymes that play essential roles in the metabolism of steroid hormones and lipids. Despite an enigmatic function, recent genetic studies have linked the novel SDR 42 extended-1 (SDR42E1) gene to 25-hydroxyvitamin D levels. This study investigated the potential SDR42E1 functions and interactions with vitamin D using bioinformatics and molecular docking studies. Phylogenetic analysis unveiled that the nucleotide sequences of human SDR42E1 exhibit high evolutionary conservation across nematodes and fruit flies. Molecular docking analysis identified strong binding affinities between SDR42E1 and its orthologs with vitamin D3 and essential precursors, 8-dehydrocholesterol, followed by 7-dehydrocholesterol and 25-hydroxyvitamin D. The hydrophobic interactions observed between the protein residues and vitamin D compounds supported the predicted transmembrane localization of SDR42E1. Our investigation provides valuable insights into the potential role of SDR42E1 in skin vitamin D biosynthesis throughout species. This provides the foundation for future research and development of targeted therapies for vitamin D deficiency and related health conditions.
Affiliation(s)
- Nagham Nafiz Hendi
- Division of Genomics and Translational Biomedicine, College of Health and Life Sciences, Hamad Bin Khalifa University, P.O. Box 34110, Doha, Qatar
- Georges Nemer
- Division of Genomics and Translational Biomedicine, College of Health and Life Sciences, Hamad Bin Khalifa University, P.O. Box 34110, Doha, Qatar; Department of Biochemistry and Molecular Genetics, American University of Beirut, P.O. Box 110236, Beirut, Lebanon.
7
Marcet-Houben M, Cruz F, Gómez-Garrido J, Alioto TS, Nunez-Rodriguez JC, Mesanza N, Gut M, Iturritxa E, Gabaldon T. Genomics of the expanding pine pathogen Lecanosticta acicola reveals patterns of ongoing genetic admixture. mSystems 2024; 9:e0092823. [PMID: 38364101 PMCID: PMC10949461 DOI: 10.1128/msystems.00928-23] [Received: 09/07/2023] [Accepted: 01/09/2024] [Indexed: 02/18/2024] Open
Abstract
Lecanosticta acicola is the causal agent for brown spot needle blight that affects pine trees across the northern hemisphere. Based on marker genes and microsatellite data, two distinct lineages have been identified that were introduced into Europe on two separate occasions. Despite their overall distinct geographic distribution, they have been found to coexist in regions of northern Spain and France. Here, we present the first genome-wide study of Lecanosticta acicola, including assembly of the reference genome and a population genomics analysis of 70 natural isolates from northern Spain. We show that most of the isolates belong to the southern lineage but show signs of introgression with northern lineage isolates, indicating mating between the two lineages. We also identify phenotypic differences between the two lineages based on the activity profiles of 20 enzymes, with introgressed strains being more phenotypically similar to members of the southern lineage. In conclusion, we show undergoing genetic admixture between the two main lineages of L. acicola in a region of recent expansion. IMPORTANCE Lecanosticta acicola is a fungal pathogen causing severe defoliation, growth reduction, and even death in more than 70 conifer species. Despite the increasing incidence of this species, little is known about its population dynamics. Two divergent lineages have been described that have now been found together in regions of France and Spain, but it is unknown how these mixed populations evolve. Here we present the first reference genome for this important plant pathogenic fungi and use it to study the population genomics of 70 isolates from an affected forest in the north of Spain. We find signs of introgression between the two main lineages, indicating that active mating is occurring in this region which could propitiate the appearance of novel traits in this species. We also study the phenotypic differences across this population based on enzymatic activities on 20 compounds.
Affiliation(s)
- Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas (CIBERINFEC), Barcelona, Spain
- Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Jéssica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Tyler S. Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Juan Carlos Nunez-Rodriguez
- Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Nebai Mesanza
- Instituto Vasco de Investigación y Desarrollo Agrario (BRTA), Arkaute, Araba, Spain
- Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Eugenia Iturritxa
- Instituto Vasco de Investigación y Desarrollo Agrario (BRTA), Arkaute, Araba, Spain
- Toni Gabaldon
- Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas (CIBERINFEC), Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
8
Grupp B, Denkhaus L, Gerhardt S, Vögele M, Johnsson N, Gronemeyer T. The structure of a tetrameric septin complex reveals a hydrophobic element essential for NC-interface integrity. Commun Biol 2024; 7:48. [PMID: 38184752 PMCID: PMC10771490 DOI: 10.1038/s42003-023-05734-w] [Received: 07/12/2023] [Accepted: 12/20/2023] [Indexed: 01/08/2024] Open
Abstract
The septins of the yeast Saccharomyces cerevisiae assemble into hetero-octameric rods by alternating interactions between neighboring G-domains or N- and C-termini, respectively. These rods polymerize end to end into apolar filaments, forming a ring beneath the prospective new bud that expands during the cell cycle into an hourglass structure. The hourglass finally splits during cytokinesis into a double ring. Understanding these transitions as well as the plasticity of the higher order assemblies requires a detailed knowledge of the underlying structures. Here we present the first X-ray crystal structure of a tetrameric Shs1-Cdc12-Cdc3-Cdc10 complex at a resolution of 3.2 Å. Close inspection of the NC-interfaces of this and other septin structures reveals a conserved contact motif that is essential for NC-interface integrity of yeast and human septins in vivo and in vitro. Using the tetrameric structure in combination with AlphaFold-Multimer allowed us to propose a model of the octameric septin rod.
Affiliation(s)
- Benjamin Grupp
- Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Lukas Denkhaus
- Institute of Biochemistry, Albert-Ludwigs University, Freiburg, Germany
- Stefan Gerhardt
- Institute of Biochemistry, Albert-Ludwigs University, Freiburg, Germany
- Matthis Vögele
- Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Nils Johnsson
- Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Thomas Gronemeyer
- Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany.
9
Allio R, Delsuc F, Belkhir K, Douzery EJP, Ranwez V, Scornavacca C. OrthoMaM v12: a database of curated single-copy ortholog alignments and trees to study mammalian evolutionary genomics. Nucleic Acids Res 2024; 52:D529-D535. [PMID: 37843103 PMCID: PMC10767847 DOI: 10.1093/nar/gkad834] [Received: 08/14/2023] [Revised: 09/19/2023] [Accepted: 09/26/2023] [Indexed: 10/17/2023] Open
Abstract
To date, the databases built to gather information on gene orthology do not provide end-users with descriptors of the molecular evolution information and phylogenetic pattern of these orthologues. In this context, we developed OrthoMaM, a database of ORTHOlogous MAmmalian Markers describing the evolutionary dynamics of coding sequences in mammalian genomes. OrthoMaM version 12 includes 15,868 alignments of orthologous coding sequences (CDS) from the 190 complete mammalian genomes currently available. All annotations and 1-to-1 orthology assignments are based on NCBI. Orthologous CDS can be mined for potential informative markers at the different taxonomic levels of the mammalian tree. To this end, several evolutionary descriptors of DNA sequences are provided for querying purposes (e.g. base composition and relative substitution rate). The graphical web interface allows the user to easily browse and sort the results of combined queries. The corresponding multiple sequence alignments and ML trees, inferred using state-of-the art approaches, are available for download both at the nucleotide and amino acid levels. OrthoMaM v12 can be used by researchers interested either in reconstructing the phylogenetic relationships of mammalian taxa or in understanding the evolutionary dynamics of coding sequences in their genomes. OrthoMaM is available for browsing, querying and complete or filtered download at https://orthomam.mbb.cnrs.fr/.
Affiliation(s)
- Rémi Allio
- CBGP, INRAE, CIRAD, IRD, Institut Agro, Univ. Montpellier, Montpellier, 34988, France
- ISEM, Univ. Montpellier, CNRS, IRD, Montpellier, 34095, France
- Frédéric Delsuc
- ISEM, Univ. Montpellier, CNRS, IRD, Montpellier, 34095, France
- Khalid Belkhir
- ISEM, Univ. Montpellier, CNRS, IRD, Montpellier, 34095, France
- Vincent Ranwez
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, 34398, France
10
Müller J, Furlan M, Settele D, Grupp B, Johnsson N. Transient septin sumoylation steers a Fir1-Skt5 protein complex between the split septin ring. J Cell Biol 2024; 223:e202301027. [PMID: 37938157 PMCID: PMC10631487 DOI: 10.1083/jcb.202301027] [Received: 01/06/2023] [Revised: 10/05/2023] [Accepted: 10/17/2023] [Indexed: 11/09/2023] Open
Abstract
Ubiquitylation and phosphorylation control composition and architecture of the cell separation machinery in yeast and other eukaryotes. The significance of septin sumoylation on cell separation remained an enigma. Septins form an hourglass structure at the bud neck of yeast cells that transforms into a split septin double ring during mitosis. We discovered that sumoylated septins recruit the cytokinesis checkpoint protein Fir1 to the peripheral side of the septin hourglass just before its transformation into the double-ring configuration. As this transition occurs, Fir1 is released from the septins and seamlessly relocates between the split septin rings through synchronized binding to the scaffold Spa2. Fir1 binds and carries the membrane-bound Skt5 on its route to the division plane where the Fir1-Skt5 complex serves as receptor for chitin synthase III.
Affiliation(s)
- Judith Müller
- Department of Biology, Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Monique Furlan
- Department of Biology, Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- David Settele
- Department of Biology, Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Benjamin Grupp
- Department of Biology, Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
- Nils Johnsson
- Department of Biology, Institute of Molecular Genetics and Cell Biology, Ulm University, Ulm, Germany
11
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, Santos GD, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Auken KV, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources Central Infrastructure Alliance of Genome Resources Consortium. bioRxiv 2023:2023.11.20.567935. [PMID: 38045425 PMCID: PMC10690154 DOI: 10.1101/2023.11.20.567935] [Indexed: 12/05/2023]
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and APIs. Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
12
Campero-Basaldua C, González J, García JA, Ramírez E, Hernández H, Aguirre B, Torres-Ramírez N, Márquez D, Sánchez NS, Gómez-Hernández N, Torres-Machorro AL, Riego-Ruiz L, Scazzocchio C, González A. Neo-functionalization in Saccharomyces cerevisiae: a novel Nrg1-Rtg3 chimeric transcriptional modulator is essential to maintain mitochondrial DNA integrity. R Soc Open Sci 2023; 10:231209. [PMID: 37920568 PMCID: PMC10618058 DOI: 10.1098/rsos.231209] [Received: 08/25/2023] [Accepted: 10/11/2023] [Indexed: 11/04/2023]
Abstract
In Saccharomyces cerevisiae, the transcriptional repressor Nrg1 (Negative Regulator of Glucose-repressed genes) and the β-Zip transcription factor Rtg3 (ReTroGrade regulation) mediate glucose repression and signalling from the mitochondria to the nucleus, respectively. Here, we show a novel function of these two proteins, in which alanine promotes the formation of a chimeric Nrg1/Rtg3 regulator that represses the ALT2 gene (encoding an alanine transaminase paralog of unknown function). An NRG1/NRG2 paralogous pair, resulting from a post-wide genome small-scale duplication event, is present in the Saccharomyces genus. Neo-functionalization of only one paralog resulted in the ability of Nrg1 to interact with Rtg3. Both nrg1Δ and rtg3Δ single mutant strains were unable to use ethanol and showed a typical petite (small) phenotype on glucose. Neither of the wild-type genes complemented the petite phenotype, suggesting irreversible mitochondrial DNA damage in these mutants. Neither nrg1Δ nor rtg3Δ mutant strains expressed genes encoded by any of the five polycistronic units transcribed from mitochondrial DNA in S. cerevisiae. This, and the direct measurement of the mitochondrial DNA gene complement, confirmed that irreversible damage of the mitochondrial DNA occurred in both mutant strains, which is consistent with the essential role of the chimeric Nrg1/Rtg3 regulator in mitochondrial DNA maintenance.
Affiliation(s)
- Carlos Campero-Basaldua
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- James González
- Laboratorio de Biología Molecular y Genómica, Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Janeth Alejandra García
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Edgar Ramírez
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Hugo Hernández
- Departamento de Biología, Facultad de Química, UNAM, México City, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Beatriz Aguirre
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Nayeli Torres-Ramírez
- Laboratorio de Microscopía Electrónica, Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Dariel Márquez
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Norma Silvia Sánchez
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
- Nicolás Gómez-Hernández
- División de Biología Molecular, Instituto Potosino de Investigación Científica y Tecnológica (IPICYT), San Luis Potosí, SLP, México
- Ana Lilia Torres-Machorro
- Laboratorio de Biología Celular, Departamento de Investigación en Fibrosis Pulmonar, Instituto Nacional de Enfermedades Respiratorias ‘Ismael Cosío Villegas’, Tlalpan, Mexico
- Lina Riego-Ruiz
- División de Biología Molecular, Instituto Potosino de Investigación Científica y Tecnológica (IPICYT), San Luis Potosí, SLP, México
- Claudio Scazzocchio
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
- Alicia González
- Departamento de Bioquímica y Biología Estructural, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Ciudad de Mexico, México
13
Del Olmo V, Mixão V, Fotedar R, Saus E, Al Malki A, Księżopolska E, Nunez-Rodriguez JC, Boekhout T, Gabaldón T. Origin of fungal hybrids with pathogenic potential from warm seawater environments. Nat Commun 2023; 14:6919. [PMID: 37903766 PMCID: PMC10616089 DOI: 10.1038/s41467-023-42679-4] [Received: 05/30/2023] [Accepted: 10/17/2023] [Indexed: 11/01/2023] Open
Abstract
Hybridisation is a common event in yeasts often leading to genomic variability and adaptation. The yeast Candida orthopsilosis is a human-associated opportunistic pathogen belonging to the Candida parapsilosis species complex. Most C. orthopsilosis clinical isolates are hybrids resulting from at least four independent crosses between two parental lineages, of which only one has been identified. The rare presence or total absence of parentals amongst clinical isolates is hypothesised to be a consequence of a reduced pathogenicity with respect to their hybrids. Here, we sequence and analyse the genomes of environmental C. orthopsilosis strains isolated from warm marine ecosystems. We find that a majority of environmental isolates are hybrids, phylogenetically closely related to hybrid clinical isolates. Furthermore, we identify the missing parental lineage, thus providing a more complete overview of the genomic evolution of this species. Additionally, we discover phenotypic differences between the two parental lineages, as well as between parents and hybrids, under conditions relevant for pathogenesis. Our results suggest a marine origin of C. orthopsilosis hybrids, with intrinsic pathogenic potential, and pave the way to identify pre-existing environmental adaptations that rendered hybrids more prone than parental lineages to colonise and infect the mammalian host.
Affiliation(s)
- Valentina Del Olmo
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Verónica Mixão
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Bioinformatics Unit, Infectious Diseases Department, National Institute of Health Dr. Ricardo Jorge, Av. Padre Cruz, 1649-016, Lisbon, Portugal
- Rashmi Fotedar
- Department of Genetic Engineering, Biotechnology Centre, Ministry of Municipality and Environment, P.O Box 20022, Doha, Qatar
- Ester Saus
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Amina Al Malki
- Department of Genetic Engineering, Biotechnology Centre, Ministry of Municipality and Environment, P.O Box 20022, Doha, Qatar
- Ewa Księżopolska
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Juan Carlos Nunez-Rodriguez
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Teun Boekhout
- College of Science, King Saud University, Riyadh, Saudi Arabia
- Toni Gabaldón
- Life Sciences Department. Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain.
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain.
- ICREA, Pg. Lluis Companys 23, Barcelona, 08010, Spain.
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain.
14
Kearney SK, Berger A, Baker E. Aon: a service to augment Alliance Genome Resource data with additional species. BMC Res Notes 2023; 16:297. [PMID: 37891644 PMCID: PMC10604687 DOI: 10.1186/s13104-023-06577-8] [Received: 02/14/2023] [Accepted: 10/16/2023] [Indexed: 10/29/2023] Open
Abstract
OBJECTIVE Cross-species comparative genomics requires access to accurate homology data across the entire range of annotated genes. The Alliance of Genome Resources (AGR) provides an open-source and comprehensive database of homology data calculated using a wide array of algorithms at differing stringencies to elucidate orthologous relationships. However, the current AGR application program interface (API) is limited to five homology endpoints for nine species. While AGR provides a robust resource for several canonical species, its utility can be greatly enhanced by increased filtering and data processing options and incorporating additional species. RESULTS Here, we describe a novel API tool, AON, that expands access to the AGR orthology resource by creating a data structure that supports 50 additional endpoints. More importantly, it provides users with a framework for adding bespoke endpoints, custom species, and additional orthology data. We demonstrate AON's functionality by incorporating the service into the GeneWeaver ecosystem for supporting cross-species data analysis.
Affiliation(s)
- Sophie K Kearney
- Department of Computer Science, Baylor University, One Bear Place Box 97356, Waco, 76798, USA
- Erich Baker
- Department of Computer Science, Baylor University, One Bear Place Box 97356, Waco, 76798, USA.
15
Mixão V, Nunez-Rodriguez JC, Del Olmo V, Ksiezopolska E, Saus E, Boekhout T, Gacser A, Gabaldón T. Evolution of loss of heterozygosity patterns in hybrid genomes of Candida yeast pathogens. BMC Biol 2023; 21:105. [PMID: 37170256 PMCID: PMC10173528 DOI: 10.1186/s12915-023-01608-z] [Received: 11/04/2022] [Accepted: 04/27/2023] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND Hybrids are chimeric organisms with highly plastic heterozygous genomes that may confer unique traits enabling the adaptation to new environments. However, most evolutionary theory frameworks predict that the high levels of genetic heterozygosity present in hybrids from divergent parents are likely to result in numerous deleterious epistatic interactions. Under this scenario, selection is expected to favor recombination events resulting in loss of heterozygosity (LOH) affecting genes involved in such negative interactions. Nevertheless, it is so far unknown whether this phenomenon actually drives genomic evolution in natural populations of hybrids. To determine the balance between selection and drift in the evolution of LOH patterns in natural yeast hybrids, we analyzed the genomic sequences from fifty-five hybrid strains of the pathogenic yeasts Candida orthopsilosis and Candida metapsilosis, which derived from at least six distinct natural hybridization events. RESULTS We found that, although LOH patterns in independent hybrid clades share some level of convergence that would not be expected from random occurrence, there is an apparent lack of strong functional selection. Moreover, while mitosis is associated with a limited number of inter-homeologous chromosome recombinations in these genomes, induced DNA breaks seem to increase the LOH rate. We also found that LOH does not accumulate linearly with time in these hybrids. Furthermore, some C. orthopsilosis hybrids present LOH patterns compatible with footprints of meiotic recombination. These meiotic-like patterns are at odds with a lack of evidence of sexual recombination and with our inability to experimentally induce sporulation in these hybrids. CONCLUSIONS Our results suggest that genetic drift is the prevailing force shaping LOH patterns in these hybrid genomes. Moreover, the observed LOH patterns suggest that these are likely not the result of continuous accumulation of sporadic events (as expected by mitotic repair of rare chromosomal breaks) but rather of acute episodes involving many LOH events in a short period of time.
Affiliation(s)
- Verónica Mixão
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Present address: Genomics and Bioinformatics Unit, Infectious Diseases Department, National Institute of Health Dr. Ricardo Jorge, Av. Padre Cruz, 1649-016, Lisbon, Portugal
| | - Juan Carlos Nunez-Rodriguez
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Valentina Del Olmo
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Ewa Ksiezopolska
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Ester Saus
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Teun Boekhout
- Westerdijk Fungal Biodiversity Institute, Utrecht, The Netherlands
- Institute of Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, The Netherlands
| | - Attila Gacser
- Department of Microbiology, University of Szeged, Szeged, Hungary
- MTA-SZTE "Lendület" Mycobiome Research Group, University of Szeged, Szeged, Hungary
| | - Toni Gabaldón
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034, Barcelona, Spain.
- Mechanisms of Disease Program, Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain.
- ICREA, Pg. Lluis Companys 23, 08010, Barcelona, Spain.
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain.
| |
|
16
|
Marlétaz F, de la Calle-Mustienes E, Acemel RD, Paliou C, Naranjo S, Martínez-García PM, Cases I, Sleight VA, Hirschberger C, Marcet-Houben M, Navon D, Andrescavage A, Skvortsova K, Duckett PE, González-Rajal Á, Bogdanovic O, Gibcus JH, Yang L, Gallardo-Fuentes L, Sospedra I, Lopez-Rios J, Darbellay F, Visel A, Dekker J, Shubin N, Gabaldón T, Nakamura T, Tena JJ, Lupiáñez DG, Rokhsar DS, Gómez-Skarmeta JL. The little skate genome and the evolutionary emergence of wing-like fins. Nature 2023; 616:495-503. [PMID: 37046085 PMCID: PMC10115646 DOI: 10.1038/s41586-023-05868-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2022] [Accepted: 02/21/2023] [Indexed: 04/14/2023]
Abstract
Skates are cartilaginous fish whose body plan features enlarged wing-like pectoral fins, enabling them to thrive in benthic environments [1,2]. However, the molecular underpinnings of this unique trait remain unclear. Here we investigate the origin of this phenotypic innovation by developing the little skate Leucoraja erinacea as a genomically enabled model. Analysis of a high-quality chromosome-scale genome sequence for the little skate shows that it preserves many ancestral jawed vertebrate features compared with other sequenced genomes, including numerous ancient microchromosomes. Combining genome comparisons with extensive regulatory datasets in developing fins (including gene expression, chromatin occupancy and three-dimensional conformation), we find skate-specific genomic rearrangements that alter the three-dimensional regulatory landscape of genes that are involved in the planar cell polarity pathway. Functional inhibition of planar cell polarity signalling resulted in a reduction in anterior fin size, confirming that this pathway is a major contributor to batoid fin morphology. We also identified a fin-specific enhancer that interacts with several hoxa genes, consistent with the redeployment of hox gene expression in anterior pectoral fins, and confirmed its potential to activate transcription in the anterior fin using zebrafish reporter assays. Our findings underscore the central role of genome reorganization and regulatory variation in the evolution of phenotypes, shedding light on the molecular origin of an enigmatic trait.
Affiliation(s)
- Ferdinand Marlétaz
- Centre for Life's Origin and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK.
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan.
| | - Elisa de la Calle-Mustienes
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Rafael D Acemel
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
- Epigenetics and Sex Development Group, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Christina Paliou
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Silvia Naranjo
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Pedro Manuel Martínez-García
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Ildefonso Cases
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Victoria A Sleight
- Department of Zoology, University of Cambridge, Cambridge, UK
- School of Biological Sciences, University of Aberdeen, Aberdeen, UK
| | | | - Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Dina Navon
- Department of Genetics, Rutgers the State University of New Jersey, Piscataway, NJ, USA
| | - Ali Andrescavage
- Department of Genetics, Rutgers the State University of New Jersey, Piscataway, NJ, USA
| | - Ksenia Skvortsova
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Faculty of Medicine, St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Paul Edward Duckett
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | - Álvaro González-Rajal
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Faculty of Medicine, St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Ozren Bogdanovic
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Johan H Gibcus
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Liyan Yang
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Lourdes Gallardo-Fuentes
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Ismael Sospedra
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Javier Lopez-Rios
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| | - Fabrice Darbellay
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Genetic Medicine and Development, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Axel Visel
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
- School of Natural Sciences, University of California, Merced, CA, USA
| | - Job Dekker
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Neil Shubin
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL, USA
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
- CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
| | - Tetsuya Nakamura
- Department of Genetics, Rutgers the State University of New Jersey, Piscataway, NJ, USA.
| | - Juan J Tena
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain.
| | - Darío G Lupiáñez
- Epigenetics and Sex Development Group, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany.
| | - Daniel S Rokhsar
- Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan.
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA.
- Chan-Zuckerberg Biohub, San Francisco, CA, USA.
| | - José Luis Gómez-Skarmeta
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
| |
|
17
|
Persson E, Sonnhammer ELL. InParanoiDB 9: Ortholog Groups for Protein Domains and Full-Length Proteins. J Mol Biol 2023:168001. [PMID: 36764355 DOI: 10.1016/j.jmb.2023.168001] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 11/30/2022] [Revised: 01/20/2023] [Accepted: 02/01/2023] [Indexed: 02/11/2023]
Abstract
Prediction of orthologs is an important bioinformatics pursuit that is frequently used for inferring protein function and evolutionary analyses. The InParanoid database is a well known resource of ortholog predictions between a wide variety of organisms. Although orthologs have historically been inferred at the level of full-length protein sequences, many proteins consist of several independent protein domains that may be orthologous to domains in other proteins in a way that differs from the full-length protein case. To be able to capture all types of orthologous relations, conventional full-length protein orthologs can be complemented with orthologs inferred at the domain level. We here present InParanoiDB 9, covering 640 species and providing orthologs for both protein domains and full-length proteins. InParanoiDB 9 was built using the faster InParanoid-DIAMOND algorithm for orthology analysis, as well as Domainoid and Pfam to infer orthologous domains. InParanoiDB 9 is based on proteomes from 447 eukaryotes, 158 bacteria and 35 archaea, and includes over one billion predicted ortholog groups. A new website has been built for the database, providing multiple search options as well as visualization of groups of orthologs and orthologous domains. This release constitutes a major upgrade of the InParanoid database in terms of the number of species as well as the new capability to operate on the domain level. InParanoiDB 9 is available at https://inparanoidb.sbc.su.se/.
Affiliation(s)
- Emma Persson
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121 Solna, Sweden.
| | - Erik L L Sonnhammer
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121 Solna, Sweden.
| |
|
18
|
Rodi M, Gross C, Sandri TL, Berner L, Marcet-Houben M, Kocak E, Pogoda M, Casadei N, Köhler C, Kreidenweiss A, Agnandji ST, Gabaldón T, Ossowski S, Held J. Whole genome analysis of two sympatric human Mansonella: Mansonella perstans and Mansonella sp "DEUX". Front Cell Infect Microbiol 2023; 13:1159814. [PMID: 37124042 PMCID: PMC10145164 DOI: 10.3389/fcimb.2023.1159814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/06/2023] [Accepted: 03/15/2023] [Indexed: 05/02/2023] Open
Abstract
Introduction: Mansonella species are filarial parasites that infect humans worldwide. Although these infections are common, knowledge of the pathology and diversity of the causative species is limited. Furthermore, the lack of sequencing data for Mansonella species shows that research on them is neglected. Apart from Mansonella perstans, a potential new species called Mansonella sp "DEUX" has been identified in Gabon, where it occurs at high frequency. We aimed to determine whether Mansonella sp "DEUX" is a genotype of M. perstans or whether these are two sympatric species.
Methods: We screened individuals in the area of Fougamou, Gabon, for Mansonella mono-infections and generated de novo assemblies from the respective samples. For evolutionary analysis, a phylogenetic tree was reconstructed, and the differences and divergence times are presented. In addition, mitogenomes were generated and phylogenies based on 12S rDNA and cox1 were created.
Results: We successfully generated whole genomes for M. perstans and Mansonella sp "DEUX". Phylogenetic analysis based on annotated protein sequences supports the hypothesis of two distinct species. The evolutionary analysis suggested that M. perstans and Mansonella sp "DEUX" separated around 778,000 years ago. Analysis based on mitochondrial marker genes supports our hypothesis of two sympatric human Mansonella species.
Discussion: The results presented indicate that Mansonella sp "DEUX" is a new Mansonella species. These findings reflect the neglect of this research topic. The availability of whole genome data will allow further investigations of these species.
Affiliation(s)
- Miriam Rodi
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
| | - Caspar Gross
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| | - Thaisa Lucas Sandri
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
- Laboratory of Molecular Immunopathology, Department of Clinical Pathology, Federal University of Paraná, Curitiba, Brazil
| | - Lilith Berner
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
| | - Marina Marcet-Houben
- Life Science Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Mechanisms and Defense, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Ersoy Kocak
- NGS Competence Center Tübingen (NCCT), Tübingen, Germany
| | - Michaela Pogoda
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
- NGS Competence Center Tübingen (NCCT), Tübingen, Germany
| | - Nicolas Casadei
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
- NGS Competence Center Tübingen (NCCT), Tübingen, Germany
| | - Carsten Köhler
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
- German Center for Infection Research (DZIF), partner site Tübingen, Tübingen, Germany
| | - Andrea Kreidenweiss
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
- German Center for Infection Research (DZIF), partner site Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné (CERMEL), Lambaréné, Gabon
| | - Selidji Todagbe Agnandji
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné (CERMEL), Lambaréné, Gabon
| | - Toni Gabaldón
- Life Science Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Mechanisms and Defense, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas (CIBERINFEC), Barcelona, Spain
| | - Stephan Ossowski
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| | - Jana Held
- Institute of Tropical Medicine, University of Tübingen, Tübingen, Germany
- German Center for Infection Research (DZIF), partner site Tübingen, Tübingen, Germany
- Centre de Recherches Médicales de Lambaréné (CERMEL), Lambaréné, Gabon
- *Correspondence: Jana Held,
| |
|
19
|
Hernández-Plaza A, Szklarczyk D, Botas J, Cantalapiedra C, Giner-Lamia J, Mende DR, Kirsch R, Rattei T, Letunic I, Jensen L, Bork P, von Mering C, Huerta-Cepas J. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Res 2022; 51:D389-D394. [PMID: 36399505 PMCID: PMC9825578 DOI: 10.1093/nar/gkac1022] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Received: 09/15/2022] [Revised: 10/17/2022] [Accepted: 10/24/2022] [Indexed: 11/19/2022] Open
Abstract
The eggNOG (evolutionary gene genealogy Non-supervised Orthologous Groups) database is a bioinformatics resource providing orthology data and comprehensive functional information for organisms from all domains of life. Here, we present a major update of the database and website (version 6.0), which increases the number of covered organisms to 12 535 reference species, expands functional annotations, and implements new functionality. In total, eggNOG 6.0 provides a hierarchy of over 17M orthologous groups (OGs) computed at 1601 taxonomic levels, spanning 10 756 bacterial, 457 archaeal and 1322 eukaryotic organisms. OGs have been thoroughly annotated using recent knowledge from functional databases, including KEGG, Gene Ontology, UniProtKB, BiGG, CAZy, CARD, PFAM and SMART. eggNOG also offers phylogenetic trees for all OGs, maximising utility and versatility for end users while allowing researchers to investigate the evolutionary history of speciation and duplication events as well as the phylogenetic distribution of functional terms within each OG. Furthermore, the eggNOG 6.0 website contains new functionality to mine orthology and functional data with ease, including the possibility of generating phylogenetic profiles for multiple OGs across species or identifying single-copy OGs at custom taxonomic levels. eggNOG 6.0 is available at http://eggnog6.embl.de.
Affiliation(s)
- Ana Hernández-Plaza
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
| | - Damian Szklarczyk
- Department of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland; SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Jorge Botas
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
| | - Carlos P Cantalapiedra
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
| | - Joaquín Giner-Lamia
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain; Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid (UPM), Madrid 28040, Spain
| | - Daniel R Mende
- Department of Medical Microbiology, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Rebecca Kirsch
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Thomas Rattei
- University of Vienna, Centre for Microbiology and Environmental Systems Science, Djerassiplatz 1, 1030 Vienna, Austria
| | - Ivica Letunic
- Biobyte solutions GmbH, Bothestr. 142, 69117 Heidelberg, Germany
| | - Lars J Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Peer Bork
- Correspondence may also be addressed to Peer Bork. Tel: +49 62213878526;
| | - Christian von Mering
- Correspondence may also be addressed to Christian von Mering. Tel: +41 446353147;
| | | |
|
20
|
Foley S, Vlasova A, Marcet-Houben M, Gabaldón T, Hinman VF. Evolutionary analyses of genes in Echinodermata offer insights towards the origin of metazoan phyla. Genomics 2022; 114:110431. [PMID: 35835427 PMCID: PMC9552553 DOI: 10.1016/j.ygeno.2022.110431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2021] [Revised: 05/10/2022] [Accepted: 07/06/2022] [Indexed: 11/24/2022]
Abstract
Despite recent studies discussing the evolutionary impacts of gene duplications and losses among metazoans, the genomic basis for the evolution of phyla remains enigmatic. Here, we employ phylogenomic approaches to search for orthologous genes without known functions among echinoderms, and subsequently use them to guide the identification of their homologs across other metazoans. Our final set of 14 genes was obtained via a suite of homology prediction tools, gene expression data, gene ontology, and generation of the Strongylocentrotus purpuratus phylome. The gene set was subjected to selection pressure analyses, which indicated that these genes are highly conserved and under negative selection. Their presence across broad taxonomic depths suggests that genes required to form a phylum are ancestral to that phylum. Therefore, rather than de novo gene genesis, we posit that evolutionary forces such as selection on existing genomic elements over large timescales may drive divergence and contribute to the emergence of phyla.
Affiliation(s)
- Saoirse Foley
- Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA; Echinobase #6-46, Mellon Institute, 4400 Fifth Ave, Pittsburgh, PA 15213, USA.
| | - Anna Vlasova
- Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
| | - Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain; Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Veronica F Hinman
- Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA; Echinobase #6-46, Mellon Institute, 4400 Fifth Ave, Pittsburgh, PA 15213, USA
| |
|
21
|
Liedtke HC, Cruz F, Gómez-Garrido J, Fuentes Palacios D, Marcet-Houben M, Gut M, Alioto T, Gabaldón T, Gomez-Mestre I. Chromosome-level assembly, annotation and phylome of Pelobates cultripes, the western spadefoot toad. DNA Res 2022; 29:6588074. [PMID: 35583263 PMCID: PMC9164646 DOI: 10.1093/dnares/dsac013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/10/2022] [Accepted: 05/12/2022] [Indexed: 11/14/2022] Open
Abstract
Genomic resources for amphibians are still hugely under-represented in vertebrate genomic research, despite amphibians being a group of major interest for ecology, evolution and conservation. Amphibians constitute a highly threatened group of vertebrates, present a vast diversity in reproductive modes, are extremely diverse in morphology, occupy most ecoregions of the world, and present the widest range in genome sizes of any major group of vertebrates. We combined Illumina, Nanopore and Hi-C sequencing technologies to assemble a chromosome-level genome sequence for an anuran with a moderate genome size (assembly span 3.09 Gb): Pelobates cultripes, the western spadefoot toad. The genome has an N50 length of 330 Mb with 98.6% of the total sequence length assembled into 14 super scaffolds, and 87.7% complete BUSCO genes. We use published transcriptomic data to provide annotations, identifying 32,684 protein-coding genes. We also reconstruct the P. cultripes phylome and identify 2,527 gene expansions. We contribute the first draft of the genome of the western spadefoot toad, P. cultripes. This species represents a relatively basal lineage in the anuran tree with an interesting ecology and a high degree of developmental plasticity, and thus is an important resource for amphibian genomic research.
Affiliation(s)
- Hans Christoph Liedtke
- Ecology, Evolution and Development Group, Department of Wetland Ecology, Estación Biológica de Doñana (CSIC), 41092 Sevilla, Spain
| | - Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Jèssica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Diego Fuentes Palacios
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Tyler Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Ivan Gomez-Mestre
- Ecology, Evolution and Development Group, Department of Wetland Ecology, Estación Biológica de Doñana (CSIC), 41092 Sevilla, Spain
| |
|
22
|
Nevers Y, Jones TEM, Jyothi D, Yates B, Ferret M, Portell-Silva L, Codo L, Cosentino S, Marcet-Houben M, Vlasova A, Poidevin L, Kress A, Hickman M, Persson E, Piližota I, Guijarro-Clarke C, Iwasaki W, Lecompte O, Sonnhammer E, Roos DS, Gabaldón T, Thybert D, Thomas PD, Hu Y, Emms DM, Bruford E, Capella-Gutierrez S, Martin MJ, Dessimoz C, Altenhoff A. The Quest for Orthologs orthology benchmark service in 2022. Nucleic Acids Res 2022; 50:W623-W632. [PMID: 35552456 PMCID: PMC9252809 DOI: 10.1093/nar/gkac330] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 02/23/2022] [Revised: 04/07/2022] [Accepted: 04/30/2022] [Indexed: 11/15/2022] Open
Abstract
The Orthology Benchmark Service (https://orthology.benchmarkservice.org) is the gold standard for orthology inference evaluation, supported and maintained by the Quest for Orthologs consortium. It is an essential resource to compare existing and new methods of orthology inference (the bedrock for many comparative genomics and phylogenetic analyses) over a standard dataset and through common procedures. The Quest for Orthologs Consortium is dedicated to keeping the resource up to date, through regular updates of the Reference Proteomes and increasingly accessible data through the OpenEBench platform. For this update, we have added a new benchmark based on curated orthology assertions from the Vertebrate Gene Nomenclature Committee, and provided an example meta-analysis of the public predictions present on the platform.
Affiliation(s)
- Yannis Nevers
- To whom correspondence should be addressed. Tel: +41 21 692 5449;
- Tamsin E M Jones
- HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Dushyanth Jyothi
- Protein Function development, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Bethan Yates
- HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Meritxell Ferret
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
- Laura Portell-Silva
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
- Laia Codo
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
- Salvatore Cosentino
- Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan
- Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
- Anna Vlasova
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
- Laetitia Poidevin
- Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France; BiGEst-ICube Platform, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
- Arnaud Kress
- Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France; BiGEst-ICube Platform, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
- Mark Hickman
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
- Emma Persson
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
- Ivana Piližota
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Cristina Guijarro-Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Wataru Iwasaki
- Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan; Department of Integrated Biosciences, Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
- Odile Lecompte
- Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
- Erik Sonnhammer
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
- David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
- Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain; Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain; Centro de Investigaciones Biomédicas en Red de Enfermedades Infecciosas, Barcelona, Spain
- David Thybert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90032, USA
- Yanhui Hu
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Harvard University, Boston, MA 02115, USA
- David M Emms
- Department of Plant Sciences, University of Oxford, Oxford OX1 3RB, UK
- Elspeth Bruford
- HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK; Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
- Maria J Martin
- Protein Function development, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Christophe Dessimoz
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland; Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland; Department of Computer Science, University College London, London, UK; Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
- Adrian Altenhoff
- Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland; Computer Science Department, ETH Zurich, Zurich, Switzerland
23
Deng Z, Botas J, Cantalapiedra CP, Hernández-Plaza A, Burguet-Castell J, Huerta-Cepas J. PhyloCloud: an online platform for making sense of phylogenomic data. Nucleic Acids Res 2022; 50:W577-W582. [PMID: 35544233 PMCID: PMC9252743 DOI: 10.1093/nar/gkac324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/22/2022] [Revised: 04/18/2022] [Accepted: 05/03/2022] [Indexed: 11/14/2022] Open
Abstract
Phylogenomics data have grown exponentially over the last decades. It is currently common for genome-wide projects to generate hundreds or even thousands of phylogenetic trees and multiple sequence alignments, which may also be very large in size. However, the analysis and interpretation of such data still depends on custom bioinformatic and visualisation workflows that are largely unattainable for non-expert users. Here, we present PhyloCloud, an online platform aimed at hosting, indexing and exploring large phylogenetic tree collections, providing also seamless access to common analyses and operations, such as node annotation, searching, topology editing, automatic tree rooting, orthology detection and more. In addition, PhyloCloud provides quick access to tools that allow users to build their own phylogenies using fast predefined workflows, graphically compare tree topologies, or query taxonomic databases such as NCBI or GTDB. Finally, PhyloCloud offers a novel tree visualisation system based on ETE Toolkit v4.0, which can be used to explore very large trees and enhance them with custom annotations and multiple sequence alignments. The platform allows for sharing tree collections and specific tree views via private links, or making them fully public, serving also as a repository of phylogenomic data. PhyloCloud is available at https://phylocloud.cgmlab.org.
Affiliation(s)
- Ziqi Deng
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
- Jorge Botas
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
- Carlos P Cantalapiedra
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
- Ana Hernández-Plaza
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
- Jordi Burguet-Castell
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
- Jaime Huerta-Cepas
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) and Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), 28223 Madrid, Spain
24
Mixão V, del Olmo V, Hegedűsová E, Saus E, Pryszcz L, Cillingová A, Nosek J, Gabaldón T. Genome analysis of five recently described species of the CUG-Ser clade uncovers Candida theae as a new hybrid lineage with pathogenic potential in the Candida parapsilosis species complex. DNA Res 2022; 29:6570588. [PMID: 35438177 PMCID: PMC9046093 DOI: 10.1093/dnares/dsac010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/26/2022] [Indexed: 01/27/2023] Open
Abstract
Candida parapsilosis species complex comprises three important pathogenic species: Candida parapsilosis sensu stricto, Candida orthopsilosis and Candida metapsilosis. The majority of C. orthopsilosis and all C. metapsilosis isolates sequenced thus far are hybrids, and most of the parental lineages remain unidentified. This led to the hypothesis that hybrids with pathogenic potential were formed by the hybridization of non-pathogenic lineages that thrive in the environment. In a search for the missing hybrid parentals, and aiming to get a better understanding of the evolution of the species complex, we sequenced, assembled and analysed the genome of five close relatives isolated from the environment: Candida jiufengensis, Candida pseudojiufengensis, Candida oxycetoniae, Candida margitis and Candida theae. We found that the linear conformation of mitochondrial genomes in Candida species emerged multiple times independently. Furthermore, our analyses discarded the possible involvement of these species in the mentioned hybridizations, but identified C. theae as an additional hybrid in the species complex. Importantly, C. theae was recently associated with a case of infection, and we also uncovered the hybrid nature of this clinical isolate. Altogether, our results reinforce the hypothesis that hybridization is widespread among Candida species, and potentially contributes to the emergence of lineages with opportunistic pathogenic behaviour.
Affiliation(s)
- Verónica Mixão
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
- Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Valentina del Olmo
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
- Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Eva Hegedűsová
- Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Ester Saus
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
- Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Leszek Pryszcz
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Andrea Cillingová
- Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Jozef Nosek
- Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Toni Gabaldón
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
- Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- ICREA, Barcelona 08010, Spain
- Centro de Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain
25
Wafula EK, Zhang H, Von Kuster G, Leebens-Mack JH, Honaas LA, dePamphilis CW. PlantTribes2: Tools for comparative gene family analysis in plant genomics. Front Plant Sci 2022; 13:1011199. [PMID: 36798801 PMCID: PMC9928214 DOI: 10.3389/fpls.2022.1011199] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 08/03/2022] [Accepted: 12/02/2022] [Indexed: 05/12/2023]
Abstract
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. 
We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
Affiliation(s)
- Eric K Wafula
- Department of Biology, The Pennsylvania State University, University Park, PA, United States
- Huiting Zhang
- Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
- Department of Horticulture, Washington State University, Pullman, WA, United States
- Gregory Von Kuster
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
- Loren A Honaas
- Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
- Claude W dePamphilis
- Department of Biology, The Pennsylvania State University, University Park, PA, United States
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States