1
Cosio T, Pica F, Fontana C, Pistoia ES, Favaro M, Valsecchi I, Zarabian N, Campione E, Botterel F, Gaziano R. Stephanoascus ciferrii Complex: The Current State of Infections and Drug Resistance in Humans. J Fungi (Basel) 2024; 10:294. [PMID: 38667965 PMCID: PMC11050938 DOI: 10.3390/jof10040294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/07/2024] [Revised: 03/12/2024] [Accepted: 04/15/2024] [Indexed: 04/28/2024] Open
Abstract
In recent years, the incidence of fungal infections in humans has increased dramatically, accompanied by an expansion in the number of species implicated as etiological agents, especially environmental fungi never involved before in human infection. Among fungal pathogens, Candida species are the most common opportunistic fungi that can cause local and systemic infections, especially in immunocompromised individuals. Candida albicans (C. albicans) is the most common causative agent of mucosal and healthcare-associated systemic infections. However, during recent decades, there has been a worrying increase in the number of emerging multi-drug-resistant non-albicans Candida (NAC) species, i.e., C. glabrata, C. parapsilosis, C. tropicalis, C. krusei, C. auris, and C. ciferrii. In particular, Candida ciferrii, also known as Stephanoascus ciferrii or Trichomonascus ciferrii, is a heterothallic ascomycete yeast-like fungus that has received attention in recent decades as a cause of local and systemic fungal diseases. Today, the new definition of the S. ciferrii complex, which consists of S. ciferrii, Candida allociferrii, and Candida mucifera, was proposed after sequencing the 18S rRNA gene. Currently, the S. ciferrii complex is mostly associated with non-severe ear and eye infections, although a few cases of severe candidemia have been reported in immunocompromised individuals. Low susceptibility to currently available antifungal drugs is a rising concern, especially in NAC species. In this regard, a high rate of resistance to azoles and more recently also to echinocandins has emerged in the S. ciferrii complex. This review focuses on epidemiological, biological, and clinical aspects of the S. ciferrii complex, including its pathogenicity and drug resistance.
Affiliation(s)
- Terenzio Cosio
  - Department of Experimental Medicine, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
  - Dermatology Unit, Department of Systems Medicine, Tor Vergata University Hospital, 00133 Rome, Italy
- Francesca Pica
  - Department of Experimental Medicine, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
- Carla Fontana
  - Laboratory of Microbiology and BioBank, National Institute for Infectious Diseases “Lazzaro Spallanzani” I.R.C.C.S., 00149 Rome, Italy
- Enrico Salvatore Pistoia
  - Department of Experimental Medicine, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
- Marco Favaro
  - Department of Experimental Medicine, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
- Isabel Valsecchi
  - DYNAMYC 7380, Faculté de Santé, Université Paris-Est Créteil (UPEC), 94010 Créteil, France
- Nikkia Zarabian
  - School of Medicine and Health Sciences, George Washington University, 2300 I St NW, Washington, DC 20052, USA
- Elena Campione
  - Dermatology Unit, Department of Systems Medicine, Tor Vergata University Hospital, 00133 Rome, Italy
- Françoise Botterel
  - DYNAMYC 7380, Faculté de Santé, Université Paris-Est Créteil (UPEC), 94010 Créteil, France
- Roberta Gaziano
  - Department of Experimental Medicine, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
2
Lopes JML, Nascimento LSDQ, Souza VC, de Matos EM, Fortini EA, Grazul RM, Santos MO, Soltis DE, Soltis PS, Otoni WC, Viccini LF. Water stress modulates terpene biosynthesis and morphophysiology at different ploidal levels in Lippia alba (Mill.) N. E. Brown (Verbenaceae). Protoplasma 2024; 261:227-243. [PMID: 37665420 DOI: 10.1007/s00709-023-01890-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2023] [Accepted: 08/18/2023] [Indexed: 09/05/2023]
Abstract
Monoterpenes are the main component in essential oils of Lippia alba. In this species, the chemical composition of essential oils varies with genome size: citral (geraniol and neral) is dominant in diploids and tetraploids, and linalool in triploids. Because environmental stress impacts various metabolic pathways, we hypothesized that stress responses in L. alba could alter the relationship between genome size and essential oil composition. Water stress affects the flowering, production, and reproduction of plants. Here, we evaluated the effect of water stress on morphophysiology, essential oil production, and the expression of genes related to monoterpene synthesis in diploid, triploid, and tetraploid accessions of L. alba cultivated in vitro for 40 days. First, using transcriptome data, we performed de novo gene assembly and identified orthologous genes using phylogenetic and clustering-based approaches. The expression of candidate genes related to terpene biosynthesis was estimated by real-time quantitative PCR. Next, we assessed the expression of these genes under water stress conditions, whereby 1% PEG-4000 was added to MS medium. Water stress modulated L. alba morphophysiology at all ploidal levels. Gene expression and essential oil production were affected in triploid accessions. Polyploid accessions showed greater growth and metabolic tolerance under stress compared to diploids. These results confirm the complex regulation of metabolic pathways such as the production of essential oils in polyploid genomes. In addition, they highlight aspects of genotype and environment interactions, which may be important for the conservation of tropical biodiversity.
Affiliation(s)
- Juliana Mainenti Leal Lopes
  - Department of Biology, Institute of Biological Science, Federal University of Juiz de Fora, Juiz de Fora, Minas Gerais, 36036-900, Brazil
  - School of Life Science and Environment, Department of Genetic and Biotechnology, University of Trás-Os-Montes and Alto Douro, 5001-801, Vila Real, Portugal
  - BioISI - Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1649-004, Lisbon, Portugal
- Vinicius Carius Souza
  - Department of Biology, Institute of Biological Science, Federal University of Juiz de Fora, Juiz de Fora, Minas Gerais, 36036-900, Brazil
- Elyabe Monteiro de Matos
  - Department of Biology, Institute of Biological Science, Federal University of Juiz de Fora, Juiz de Fora, Minas Gerais, 36036-900, Brazil
- Evandro Alexandre Fortini
  - Laboratory of Plant Tissue Culture (LCTII), Department of Plant Biology/BIOAGRO, Universidade Federal de Viçosa, Av. P.H. Rolfs S/N, Campus Universitário, Viçosa, MG, 36570-000, Brazil
- Marcelo Oliveira Santos
  - Department of Biology, Institute of Biological Science, Federal University of Juiz de Fora, Juiz de Fora, Minas Gerais, 36036-900, Brazil
- Douglas E Soltis
  - Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
  - Department of Biology, University of Florida, Gainesville, FL, 32611, USA
- Pamela S Soltis
  - Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- Wagner Campos Otoni
  - Laboratory of Plant Tissue Culture (LCTII), Department of Plant Biology/BIOAGRO, Universidade Federal de Viçosa, Av. P.H. Rolfs S/N, Campus Universitário, Viçosa, MG, 36570-000, Brazil
- Lyderson Facio Viccini
  - Department of Biology, Institute of Biological Science, Federal University of Juiz de Fora, Juiz de Fora, Minas Gerais, 36036-900, Brazil
3
Pezzini FF, Ferrari G, Forrest LL, Hart ML, Nishii K, Kidner CA. Target capture and genome skimming for plant diversity studies. Applications in Plant Sciences 2023; 11:e11537. [PMID: 37601316 PMCID: PMC10439825 DOI: 10.1002/aps3.11537] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 11/17/2022] [Revised: 06/16/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023]
Abstract
Recent technological advances in long-read high-throughput sequencing and assembly methods have facilitated the generation of annotated chromosome-scale whole-genome sequence data for evolutionary studies; however, generating such data can still be difficult for many plant species. For example, obtaining high-molecular-weight DNA is typically impossible for samples in historical herbarium collections, which often have degraded DNA. The need to fast-freeze newly collected living samples to conserve high-quality DNA can be complicated when plants are only found in remote areas. Therefore, short-read reduced-genome representations, such as target capture and genome skimming, remain important for evolutionary studies. Here, we review the pros and cons of each technique for non-model plant taxa. We provide guidance related to logistics, budget, the genomic resources previously available for the target clade, and the nature of the study. Furthermore, we assess the available bioinformatic analyses, detailing best practices and pitfalls, and suggest pathways to combine newly generated data with legacy data. Finally, we explore the possible downstream analyses allowed by the type of data generated using each technique. We provide a practical guide to help researchers make the best-informed choice regarding reduced genome representation for evolutionary studies of non-model plants in cases where whole-genome sequencing remains impractical.
Affiliation(s)
- Giada Ferrari
  - Royal Botanic Garden Edinburgh, Edinburgh, United Kingdom
- Kanae Nishii
  - Royal Botanic Garden Edinburgh, Edinburgh, United Kingdom
- Catherine A Kidner
  - Royal Botanic Garden Edinburgh, Edinburgh, United Kingdom
  - School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
4
Gungabeesoon J, Gort-Freitas NA, Kiss M, Bolli E, Messemaker M, Siwicki M, Hicham M, Bill R, Koch P, Cianciaruso C, Duval F, Pfirschke C, Mazzola M, Peters S, Homicsko K, Garris C, Weissleder R, Klein AM, Pittet MJ. A neutrophil response linked to tumor control in immunotherapy. Cell 2023; 186:1448-1464.e20. [PMID: 37001504 PMCID: PMC10132778 DOI: 10.1016/j.cell.2023.02.032] [Citation(s) in RCA: 78] [Impact Index Per Article: 78.0] [Received: 09/03/2022] [Revised: 01/10/2023] [Accepted: 02/24/2023] [Indexed: 04/01/2023]
Abstract
Neutrophils accumulate in solid tumors, and their abundance correlates with poor prognosis. Neutrophils are not homogeneous, however, and could play different roles in cancer therapy. Here, we investigate the role of neutrophils in immunotherapy, leading to tumor control. We show that successful therapies acutely expanded tumor neutrophil numbers. This expansion could be attributed to a Sell^hi state rather than to other neutrophils that accelerate tumor progression. Therapy-elicited neutrophils acquired an interferon gene signature, also seen in human patients, and appeared essential for successful therapy, as loss of the interferon-responsive transcription factor IRF1 in neutrophils led to failure of immunotherapy. The neutrophil response depended on key components of anti-tumor immunity, including BATF3-dependent DCs, IL-12, and IFNγ. In addition, we found that a therapy-elicited systemic neutrophil response positively correlated with disease outcome in lung cancer patients. Thus, we establish a crucial role of a neutrophil state in mediating effective cancer therapy.
Affiliation(s)
- Jeremy Gungabeesoon
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA
- Máté Kiss
  - Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Evangelia Bolli
  - Department of Systems Biology, Harvard Medical School, Boston, MA, USA; Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Marius Messemaker
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA; Division of Molecular Oncology and Immunology, Netherlands Cancer Institute, Amsterdam, the Netherlands
- Marie Siwicki
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA
- Mehdi Hicham
  - Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Ruben Bill
  - Department of Systems Biology, Harvard Medical School, Boston, MA, USA; Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Peter Koch
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA
- Chiara Cianciaruso
  - Department of Systems Biology, Harvard Medical School, Boston, MA, USA; Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Florent Duval
  - Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland
- Christina Pfirschke
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA
- Michael Mazzola
  - Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Solange Peters
  - Service of Medical Oncology, Department of Oncology, CHUV, Lausanne, Switzerland; Department of Oncology, University of Lausanne, Lausanne, Switzerland
- Krisztian Homicsko
  - AGORA Cancer Research Center, Lausanne, Switzerland; Ludwig Institute for Cancer Research, Lausanne, Switzerland; Department of Oncology, CHUV, Lausanne, Switzerland; Swiss Cancer Center Leman, Lausanne, Switzerland
- Christopher Garris
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA
- Ralph Weissleder
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA; Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Allon M Klein
  - Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Mikael J Pittet
  - Center for Systems Biology, Massachusetts General Hospital Research Institute and Harvard Medical School, Boston, MA, USA; Department of Pathology and Immunology, University of Geneva, Geneva, Switzerland; AGORA Cancer Research Center, Lausanne, Switzerland; Ludwig Institute for Cancer Research, Lausanne, Switzerland; Swiss Cancer Center Leman, Lausanne, Switzerland
5
Julca I, Mutwil-Anderwald D, Manoj V, Khan Z, Lai SK, Yang LK, Beh IT, Dziekan J, Lim YP, Lim SK, Low YW, Lam YI, Tjia S, Mu Y, Tan QW, Nuc P, Choo LM, Khew G, Shining L, Kam A, Tam JP, Bozdech Z, Schmidt M, Usadel B, Kanagasundaram Y, Alseekh S, Fernie A, Li HY, Mutwil M. Genomic, transcriptomic, and metabolomic analysis of Oldenlandia corymbosa reveals the biosynthesis and mode of action of anti-cancer metabolites. Journal of Integrative Plant Biology 2023. [PMID: 36807520 DOI: 10.1111/jipb.13469] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 07/13/2022] [Accepted: 02/18/2023] [Indexed: 06/18/2023]
Abstract
Plants accumulate a vast array of secondary metabolites, which constitute a natural resource for pharmaceuticals. Oldenlandia corymbosa belongs to the Rubiaceae family, and has been used in traditional medicine to treat different diseases, including cancer. However, the active metabolites of the plant, their biosynthetic pathway and mode of action in cancer are unknown. To fill these gaps, we exposed this plant to eight different stress conditions and combined different omics data capturing gene expression, metabolic profiles, and anti-cancer activity. Our results show that O. corymbosa extracts are active against breast cancer cell lines and that ursolic acid is responsible for this activity. Moreover, we assembled a high-quality genome and uncovered two genes involved in the biosynthesis of ursolic acid. Finally, we also revealed that ursolic acid causes mitotic catastrophe in cancer cells and identified three high-confidence protein binding targets by Cellular Thermal Shift Assay (CETSA) and reverse docking. Altogether, these results constitute a valuable resource to further characterize the biosynthesis of active metabolites in the Oldenlandia group, while the mode of action of ursolic acid will allow us to further develop this valuable compound.
Affiliation(s)
- Irene Julca
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Vaishnervi Manoj
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Zahra Khan
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Soak Kuan Lai
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Lay K Yang
  - Shared Analytics, Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A*STAR), Singapore, 138671, Singapore
- Ing T Beh
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Jerzy Dziekan
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Yoon P Lim
  - Department of Biochemistry, National University of Singapore, Singapore, 117596, Singapore
- Shen K Lim
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
  - Department of Biochemistry, National University of Singapore, Singapore, 117596, Singapore
- Yee W Low
  - Singapore Botanic Gardens, Singapore, 259569, Singapore
- Yuen I Lam
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Seth Tjia
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Yuguang Mu
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Qiao W Tan
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Przemyslaw Nuc
  - Department of Gene Expression, Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Poznan, 61-614, Poland
- Le M Choo
  - Singapore Botanic Gardens, Singapore, 259569, Singapore
- Gillian Khew
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
  - Singapore Botanic Gardens, Singapore, 259569, Singapore
- Loo Shining
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Antony Kam
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- James P Tam
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Zbynek Bozdech
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Bjoern Usadel
  - IBG-4 Bioinformatics, Forschungszentrum Jülich, Jülich, 52428, Germany
- Yoganathan Kanagasundaram
  - Shared Analytics, Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A*STAR), Singapore, 138671, Singapore
- Saleh Alseekh
  - Max-Planck-Institut für Molekulare Pflanzenphysiologie, Potsdam-Golm, 14476, Germany
  - Center of Plant Systems Biology and Biotechnology, Plovdiv, 4000, Bulgaria
- Alisdair Fernie
  - Max-Planck-Institut für Molekulare Pflanzenphysiologie, Potsdam-Golm, 14476, Germany
  - Center of Plant Systems Biology and Biotechnology, Plovdiv, 4000, Bulgaria
- Hoi Y Li
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
- Marek Mutwil
  - School of Biological Sciences, Nanyang Technological University, Singapore, 639798, Singapore
6
Moreyra NN, Almeida FC, Allan C, Frankel N, Matzkin LM, Hasson E. Phylogenomics provides insights into the evolution of cactophily and host plant shifts in Drosophila. Mol Phylogenet Evol 2023; 178:107653. [PMID: 36404461 DOI: 10.1016/j.ympev.2022.107653] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 06/16/2022] [Revised: 09/30/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2022]
Abstract
Cactophilic species of the Drosophila buzzatii cluster (repleta group) comprise an excellent model group to investigate genomic changes underlying adaptation to extreme climate conditions and host plants. In particular, these species form a tractable system to study the transition from chemically simpler breeding sites (like prickly pears of the genus Opuntia) to chemically more complex hosts (columnar cacti). Here, we report four highly contiguous genome assemblies of three species of the buzzatii cluster. Based on this genomic data and inferred phylogenetic relationships, we identified candidate taxonomically restricted genes (TRGs) likely involved in the evolution of cactophily and cactus host specialization. Functional enrichment analyses of TRGs within the buzzatii cluster identified genes involved in detoxification, water preservation, immune system response, anatomical structure development, and morphogenesis. In contrast, processes that regulate responses to stress, as well as the metabolism of nitrogen compounds, transport, and secretion were found in the set of species that are columnar cacti dwellers. These findings are in line with the hypothesis that those genomic changes brought about key mechanisms underlying the adaptation of the buzzatii cluster species to arid regions in South America.
Affiliation(s)
- Nicolás Nahuel Moreyra
  - Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina
- Francisca Cunha Almeida
  - Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina
- Carson Allan
  - Department of Entomology, University of Arizona, Tucson, AZ 85719, USA
- Nicolás Frankel
  - Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina
- Esteban Hasson
  - Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina
7
Barlow LD, Maciejowski W, More K, Terry K, Vargová R, Záhonová K, Dacks JB. Comparative Genomics for Evolutionary Cell Biology Using AMOEBAE: Understanding the Golgi and Beyond. Methods Mol Biol 2022; 2557:431-452. [PMID: 36512230 DOI: 10.1007/978-1-0716-2639-9_26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/15/2022]
Abstract
Taking an evolutionary approach to cell biology can yield important new information about how the cell works and how it evolved to do so. This is true of the Golgi apparatus, as it is of all systems within the cell. Comparative genomics is one of the crucial first steps to this line of research, but comes with technical challenges that must be overcome for rigor and robustness. We here introduce AMOEBAE, a workflow for mid-range scale comparative genomic analyses. It allows for customization of parameters, queries, and taxonomic sampling of genomic and transcriptomics data. This protocol article covers the rationale for an evolutionary approach to cell biological study (i.e., when would AMOEBAE be useful), how to use AMOEBAE, and discussion of limitations. It also provides an example dataset, which demonstrates that the Golgi protein AP4 Epsilon is present as the sole retained subunit of the AP4 complex in basidiomycete fungi. AMOEBAE can facilitate comparative genomic studies by balancing reproducibility and speed with user-input and interpretation. It is hoped that AMOEBAE or similar tools will encourage cell biologists to incorporate an evolutionary context into their research.
Affiliation(s)
- Lael D Barlow
  - Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
  - Division of Biological Chemistry and Drug Discovery, School of Life Sciences, University of Dundee, Dundee, UK
- William Maciejowski
  - Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
- Kiran More
  - Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
- Kara Terry
  - Division of Infectious Diseases, Department of Medicine, University of Alberta, Edmonton, AB, Canada
- Romana Vargová
  - Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
- Kristína Záhonová
  - Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Vestec, Czechia
  - Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czechia
- Joel B Dacks
  - Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
  - Division of Infectious Diseases, Department of Medicine, University of Alberta, Edmonton, AB, Canada
  - Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Vestec, Czechia
  - Centre for Life's Origin and Evolution, Department of Genetics, Evolution and Environment, University College of London, London, UK
8
BuscoPhylo: a webserver for Busco-based phylogenomic analysis for non-specialists. Sci Rep 2022; 12:17352. [PMID: 36253435 PMCID: PMC9576783 DOI: 10.1038/s41598-022-22461-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 08/25/2022] [Accepted: 10/14/2022] [Indexed: 01/10/2023] Open
Abstract
Here we present the BuscoPhylo tool that enables both students and established scientists to easily perform Busco-based phylogenomic analysis starting from a set of genome sequences. BuscoPhylo is an efficient and user-friendly web server freely accessible at https://buscophylo.inra.org.ma/. The source code, along with documentation, is freely available under an MIT license at https://github.com/alaesahbou/BuscoPhylo.
9
Cenci A, Concepción-Hernández M, Guignon V, Angenon G, Rouard M. Genome-Wide Classification and Phylogenetic Analyses of the GDSL-Type Esterase/Lipase (GELP) Family in Flowering Plants. Int J Mol Sci 2022; 23:12114. [PMID: 36292971 PMCID: PMC9602515 DOI: 10.3390/ijms232012114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2022] [Revised: 10/05/2022] [Accepted: 10/07/2022] [Indexed: 11/16/2022] Open
Abstract
GDSL-type esterase/lipase (GELP) enzymes have key functions in plants, such as developmental processes, anther and pollen development, and responses to biotic and abiotic stresses. Genes that encode GELP belong to a complex and large gene family, ranging from tens to more than hundreds of members per plant species. To facilitate functional transfer between them, we conducted a genome-wide classification of GELP in 46 plant species. First, we applied an iterative phylogenetic method using a selected set of representative angiosperm genomes (three monocots and five dicots) and identified 10 main clusters, subdivided into 44 orthogroups (OGs). An expert curation for gene structures, orthogroup composition, and functional annotation was made based on a literature review. Then, using the HMM profiles as seeds, we expanded the classification to 46 plant species. Our results revealed the variable evolutionary dynamics between OGs in which some expanded, mostly through tandem duplications, while others were maintained as single copies. Among these, dicot-specific clusters and specific amplifications in monocots and wheat were characterized. This approach, by combining manual curation and automatic identification, was effective in characterizing a large gene family, allowing the establishment of a classification framework for gene function transfer and a better understanding of the evolutionary history of GELP.
Affiliation(s)
- Alberto Cenci
  - Bioversity International, Parc Scientifique Agropolis II, 34397 Montpellier, France
  - Correspondence: (A.C.); (M.R.)
- Mairenys Concepción-Hernández
  - Instituto de Biotecnología de las Plantas, Universidad Central “Marta Abreu” de Las Villas (UCLV), Carretera a Camajuaní km 5.5, Santa Clara C.P. 54830, Villa Clara, Cuba
  - Research Group Plant Genetics, Vrije Universiteit Brussel (VUB), Pleinlaan 2, 1050 Brussels, Belgium
- Valentin Guignon
  - Bioversity International, Parc Scientifique Agropolis II, 34397 Montpellier, France
- Geert Angenon
  - Research Group Plant Genetics, Vrije Universiteit Brussel (VUB), Pleinlaan 2, 1050 Brussels, Belgium
- Mathieu Rouard
  - Bioversity International, Parc Scientifique Agropolis II, 34397 Montpellier, France
  - Correspondence: (A.C.); (M.R.)
10
Leveraging orthology within maize and Arabidopsis QTL to identify genes affecting natural variation in gravitropism. Proc Natl Acad Sci U S A 2022; 119:e2212199119. [PMID: 36161933 PMCID: PMC9546580 DOI: 10.1073/pnas.2212199119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022] Open
Abstract
Plants typically orient their organs with respect to the Earth's gravity field by a dynamic process called gravitropism. To discover conserved genetic elements affecting seedling root gravitropism, we measured the process in a set of Zea mays (maize) recombinant inbred lines with machine vision and compared the results with those obtained in a similar study of Arabidopsis thaliana. Each of the several quantitative trait loci that we mapped in both species spanned many hundreds of genes, too many to test individually for causality. We reasoned that orthologous genes may be responsible for natural variation in monocot and dicot root gravitropism. If so, pairs of orthologous genes affecting gravitropism may be present within the maize and Arabidopsis QTL intervals. A reciprocal comparison of sequences within the QTL intervals identified seven pairs of such one-to-one orthologs. Analysis of knockout mutants demonstrated a role in gravitropism for four of the seven: CCT2 functions in phosphatidylcholine biosynthesis, ATG5 functions in membrane remodeling during autophagy, UGP2 produces the substrate for cellulose and callose polymer extension, and FAMA is a transcription factor. Automated phenotyping enabled this discovery of four naturally varying components of a conserved process (gravitropism) by making it feasible to conduct the same large-scale experiment in two species.
|
11
|
Foley S, Vlasova A, Marcet-Houben M, Gabaldón T, Hinman VF. Evolutionary analyses of genes in Echinodermata offer insights towards the origin of metazoan phyla. Genomics 2022; 114:110431. [PMID: 35835427 PMCID: PMC9552553 DOI: 10.1016/j.ygeno.2022.110431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2021] [Revised: 05/10/2022] [Accepted: 07/06/2022] [Indexed: 11/24/2022]
Abstract
Despite recent studies discussing the evolutionary impacts of gene duplications and losses among metazoans, the genomic basis for the evolution of phyla remains enigmatic. Here, we employ phylogenomic approaches to search for orthologous genes without known functions among echinoderms, and subsequently use them to guide the identification of their homologs across other metazoans. Our final set of 14 genes was obtained via a suite of homology prediction tools, gene expression data, gene ontology, and generating the Strongylocentrotus purpuratus phylome. The gene set was subjected to selection pressure analyses, which indicated that they are highly conserved and under negative selection. Their presence across broad taxonomic depths suggests that genes required to form a phylum are ancestral to that phylum. Therefore, rather than de novo gene genesis, we posit that evolutionary forces such as selection on existing genomic elements over large timescales may drive divergence and contribute to the emergence of phyla.
Affiliation(s)
- Saoirse Foley
  - Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA; Echinobase #6-46, Mellon Institute, 4400 Fifth Ave, Pittsburgh, PA 15213, USA
- Anna Vlasova
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
- Marina Marcet-Houben
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
- Toni Gabaldón
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona, 29, 08034 Barcelona, Spain; Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain; Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
- Veronica F Hinman
  - Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA; Echinobase #6-46, Mellon Institute, 4400 Fifth Ave, Pittsburgh, PA 15213, USA
|
12
|
Cabezas-Bratesco D, Mcgee FA, Colenso CK, Zavala K, Granata D, Carnevale V, Opazo JC, Brauchi SE. Sequence and structural conservation reveal fingerprint residues in TRP channels. eLife 2022; 11:73645. [PMID: 35686986 PMCID: PMC9242649 DOI: 10.7554/elife.73645] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 09/06/2021] [Accepted: 05/19/2022] [Indexed: 11/13/2022] Open
Abstract
Transient receptor potential (TRP) proteins are a large family of cation-selective channels, surpassed in variety only by voltage-gated potassium channels. Detailed molecular mechanisms governing how membrane voltage, ligand binding, or temperature can induce conformational changes promoting the open state in TRP channels are still a matter of debate. Aiming to unveil distinctive structural features common to the transmembrane domains within the TRP family, we performed phylogenetic reconstruction, sequence statistics, and structural analysis over a large set of TRP channel genes. Here, we report an exceptionally conserved set of residues. This fingerprint is composed of twelve residues localized at equivalent three-dimensional positions in TRP channels from the different subtypes. Moreover, these amino acids are arranged in three groups, connected by a set of aromatics located at the core of the transmembrane structure. We hypothesize that differences in the connectivity between these different groups of residues harbor the apparent differences in coupling strategies used by TRP subgroups.
Affiliation(s)
- Francisco A Mcgee
  - Department of Biology, Temple University, Philadelphia, United States
- Charlotte K Colenso
  - School of Cellular and Molecular Medicine, University of Bristol, Bristol, United Kingdom
- Kattina Zavala
  - Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
- Daniele Granata
  - Department of Biology, Temple University, Philadelphia, United States
- Juan C Opazo
  - Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
|
13
|
Cillingová A, Tóth R, Mojáková A, Zeman I, Vrzoňová R, Siváková B, Baráth P, Neboháčová M, Klepcová Z, Brázdovič F, Lichancová H, Hodorová V, Brejová B, Vinař T, Mutalová S, Vozáriková V, Mutti G, Tomáška Ľ, Gácser A, Gabaldón T, Nosek J. Transcriptome and proteome profiling reveals complex adaptations of Candida parapsilosis cells assimilating hydroxyaromatic carbon sources. PLoS Genet 2022; 18:e1009815. [PMID: 35255079 PMCID: PMC8929692 DOI: 10.1371/journal.pgen.1009815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/09/2021] [Revised: 03/17/2022] [Accepted: 02/22/2022] [Indexed: 11/19/2022] Open
Abstract
Many fungal species utilize hydroxyderivatives of benzene and benzoic acid as carbon sources. The yeast Candida parapsilosis metabolizes these compounds via the 3-oxoadipate and gentisate pathways, whose components are encoded by two metabolic gene clusters. In this study, we determine the chromosome level assembly of the C. parapsilosis strain CLIB214 and use it for transcriptomic and proteomic investigation of cells cultivated on hydroxyaromatic substrates. We demonstrate that the genes coding for enzymes and plasma membrane transporters involved in the 3-oxoadipate and gentisate pathways are highly upregulated and their expression is controlled in a substrate-specific manner. However, regulatory proteins involved in this process are not known. Using the knockout mutants, we show that putative transcriptional factors encoded by the genes OTF1 and GTF1 located within these gene clusters function as transcriptional activators of the 3-oxoadipate and gentisate pathway, respectively. We also show that the activation of both pathways is accompanied by upregulation of genes for the enzymes involved in β-oxidation of fatty acids, glyoxylate cycle, amino acid metabolism, and peroxisome biogenesis. Transcriptome and proteome profiles of the cells grown on 4-hydroxybenzoate and 3-hydroxybenzoate, which are metabolized via the 3-oxoadipate and gentisate pathway, respectively, reflect their different connection to central metabolism. Yet we find that the expression profiles differ also in the cells assimilating 4-hydroxybenzoate and hydroquinone, which are both metabolized in the same pathway. This finding is consistent with the phenotype of the Otf1p-lacking mutant, which exhibits impaired growth on hydroxybenzoates, but still utilizes hydroxybenzenes, thus indicating that additional, yet unidentified transcription factor could be involved in the 3-oxoadipate pathway regulation. 
Moreover, we propose that bicarbonate ions resulting from decarboxylation of hydroxybenzoates also contribute to differences in the cell responses to hydroxybenzoates and hydroxybenzenes. Finally, our phylogenetic analysis highlights evolutionary paths leading to metabolic adaptations of yeast cells assimilating hydroxyaromatic substrates.
Affiliation(s)
- Andrea Cillingová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Renáta Tóth
  - HCEMM-USZ Department of Microbiology, University of Szeged, Szeged, Hungary
  - MTA-SZTE Lendület Mycobiome Research Group, University of Szeged, Szeged, Hungary
- Anna Mojáková
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Igor Zeman
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Romana Vrzoňová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Barbara Siváková
  - Institute of Chemistry, Slovak Academy of Sciences, Bratislava, Slovakia
- Peter Baráth
  - Institute of Chemistry, Slovak Academy of Sciences, Bratislava, Slovakia
- Martina Neboháčová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Zuzana Klepcová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Filip Brázdovič
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Hana Lichancová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Viktória Hodorová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Broňa Brejová
  - Department of Computer Science, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Bratislava, Slovakia
- Tomáš Vinař
  - Department of Applied Informatics, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Bratislava, Slovakia
- Sofia Mutalová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Veronika Vozáriková
  - Department of Genetics, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Giacomo Mutti
  - Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Ľubomír Tomáška
  - Department of Genetics, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Atilla Gácser
  - HCEMM-USZ Department of Microbiology, University of Szeged, Szeged, Hungary
  - MTA-SZTE Lendület Mycobiome Research Group, University of Szeged, Szeged, Hungary
- Toni Gabaldón
  - Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
  - Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
  - Centro de Investigación Biomédica En Red de Enfermedades Infecciosas (CIBERINFEC), Barcelona, Spain
- Jozef Nosek
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
|
14
|
Mixão V, del Olmo V, Hegedűsová E, Saus E, Pryszcz L, Cillingová A, Nosek J, Gabaldón T. Genome analysis of five recently described species of the CUG-Ser clade uncovers Candida theae as a new hybrid lineage with pathogenic potential in the Candida parapsilosis species complex. DNA Res 2022; 29:6570588. [PMID: 35438177 PMCID: PMC9046093 DOI: 10.1093/dnares/dsac010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/26/2022] [Indexed: 01/27/2023] Open
Abstract
Candida parapsilosis species complex comprises three important pathogenic species: Candida parapsilosis sensu stricto, Candida orthopsilosis and Candida metapsilosis. The majority of C. orthopsilosis and all C. metapsilosis isolates sequenced thus far are hybrids, and most of the parental lineages remain unidentified. This led to the hypothesis that hybrids with pathogenic potential were formed by the hybridization of non-pathogenic lineages that thrive in the environment. In a search for the missing hybrid parentals, and aiming to get a better understanding of the evolution of the species complex, we sequenced, assembled and analysed the genome of five close relatives isolated from the environment: Candida jiufengensis, Candida pseudojiufengensis, Candida oxycetoniae, Candida margitis and Candida theae. We found that the linear conformation of mitochondrial genomes in Candida species emerged multiple times independently. Furthermore, our analyses discarded the possible involvement of these species in the mentioned hybridizations, but identified C. theae as an additional hybrid in the species complex. Importantly, C. theae was recently associated with a case of infection, and we also uncovered the hybrid nature of this clinical isolate. Altogether, our results reinforce the hypothesis that hybridization is widespread among Candida species, and potentially contributes to the emergence of lineages with opportunistic pathogenic behaviour.
Affiliation(s)
- Verónica Mixão
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Valentina del Olmo
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Eva Hegedűsová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Ester Saus
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Leszek Pryszcz
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Andrea Cillingová
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Jozef Nosek
  - Department of Biochemistry, Faculty of Natural Sciences, Comenius University in Bratislava, 842 15 Bratislava, Slovak Republic
- Toni Gabaldón
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
  - ICREA, Barcelona 08010, Spain
  - Centro de Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain
|
15
|
Wafula EK, Zhang H, Von Kuster G, Leebens-Mack JH, Honaas LA, dePamphilis CW. PlantTribes2: Tools for comparative gene family analysis in plant genomics. Front Plant Sci 2022; 13:1011199. [PMID: 36798801 PMCID: PMC9928214 DOI: 10.3389/fpls.2022.1011199] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 08/03/2022] [Accepted: 12/02/2022] [Indexed: 05/12/2023]
Abstract
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. 
We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
Affiliation(s)
- Eric K Wafula
  - Department of Biology, The Pennsylvania State University, University Park, PA, United States
- Huiting Zhang
  - Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
  - Department of Horticulture, Washington State University, Pullman, WA, United States
- Gregory Von Kuster
  - Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
- Loren A Honaas
  - Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
- Claude W dePamphilis
  - Department of Biology, The Pennsylvania State University, University Park, PA, United States
  - Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
|
16
|
Fuentes D, Molina M, Chorostecki U, Capella-Gutiérrez S, Marcet-Houben M, Gabaldón T. PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies. Nucleic Acids Res 2021; 50:D1062-D1068. [PMID: 34718760 PMCID: PMC8728271 DOI: 10.1093/nar/gkab966] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 09/20/2021] [Revised: 10/02/2021] [Accepted: 10/05/2021] [Indexed: 12/20/2022] Open
Abstract
PhylomeDB is a unique knowledge base providing public access to minable and browsable catalogues of pre-computed genome-wide collections of annotated sequences, alignments and phylogenies (i.e. phylomes) of homologous genes, as well as to their corresponding phylogeny-based orthology and paralogy relationships. In addition, PhylomeDB trees and alignments can be downloaded for further processing to detect and date gene duplication events, infer past events of inter-species hybridization and horizontal gene transfer, as well as to uncover footprints of selection, introgression, gene conversion, or other relevant evolutionary processes in the genes and organisms of interest. Here, we describe the latest evolution of PhylomeDB (version 5). This new version includes a newly implemented web interface and several new functionalities such as optimized searching procedures, the possibility to create user-defined phylome collections, and a fully redesigned data structure. This release also represents a significant core data expansion, with the database providing access to 534 phylomes, comprising over 8 million trees, and homology relationships for genes in over 6000 species. This makes PhylomeDB the largest and most comprehensive public repository of gene phylogenies. PhylomeDB is available at http://www.phylomedb.org.
Affiliation(s)
- Diego Fuentes
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona 29, 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Manuel Molina
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona 29, 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Uciel Chorostecki
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona 29, 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Marina Marcet-Houben
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona 29, 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Toni Gabaldón
  - Barcelona Supercomputing Centre (BSC-CNS), Jordi Girona 29, 08034 Barcelona, Spain
  - Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
  - Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
|
17
|
Huang LC, Taujale R, Gravel N, Venkat A, Yeung W, Byrne DP, Eyers PA, Kannan N. KinOrtho: a method for mapping human kinase orthologs across the tree of life and illuminating understudied kinases. BMC Bioinformatics 2021; 22:446. [PMID: 34537014 PMCID: PMC8449880 DOI: 10.1186/s12859-021-04358-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 04/10/2021] [Accepted: 09/06/2021] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND: Protein kinases are among the largest druggable family of signaling proteins, involved in various human diseases, including cancers and neurodegenerative disorders. Despite their clinical relevance, nearly 30% of the 545 human protein kinases remain highly understudied. Comparative genomics is a powerful approach for predicting and investigating the functions of understudied kinases. However, an incomplete knowledge of kinase orthologs across fully sequenced kinomes severely limits the application of comparative genomics approaches for illuminating understudied kinases. Here, we introduce KinOrtho, a query- and graph-based orthology inference method that combines full-length and domain-based approaches to map one-to-one kinase orthologs across 17 thousand species.
RESULTS: Using multiple metrics, we show that KinOrtho performed better than existing methods in identifying kinase orthologs across evolutionarily divergent species and eliminated potential false positives by flagging sequences without a proper kinase domain for further evaluation. We demonstrate the advantage of using domain-based approaches for identifying domain fusion events, highlighting a case between an understudied serine/threonine kinase TAOK1 and a metabolic kinase PIK3C2A with high co-expression in human cells. We also identify evolutionary fission events involving the understudied OBSCN kinase domains, further highlighting the value of domain-based orthology inference approaches. Using KinOrtho-defined orthologs, Gene Ontology annotations, and machine learning, we propose putative biological functions of several understudied kinases, including the role of TP53RK in cell cycle checkpoint(s), the involvement of TSSK3 and TSSK6 in acrosomal vesicle localization, and potential functions for the ULK4 pseudokinase in neuronal development.
CONCLUSIONS: In sum, KinOrtho presents a novel query-based tool to identify one-to-one orthologous relationships across thousands of proteomes that can be applied to any protein family of interest. We exploit KinOrtho here to identify kinase orthologs and show that its well-curated kinome ortholog set can serve as a valuable resource for illuminating understudied kinases, and the KinOrtho framework can be extended to any protein-family of interest.
Affiliation(s)
- Liang-Chin Huang
  - Institute of Bioinformatics, University of Georgia, 120 Green St., Athens, GA 30602 USA
- Rahil Taujale
  - Institute of Bioinformatics, University of Georgia, 120 Green St., Athens, GA 30602 USA
- Nathan Gravel
  - PREP@UGA, University of Georgia, 500 D.W. Brooks Drive, Athens, GA 30602 USA
- Aarya Venkat
  - Department of Biochemistry and Molecular Biology, University of Georgia, 120 Green St., Athens, GA 30602 USA
- Wayland Yeung
  - Institute of Bioinformatics, University of Georgia, 120 Green St., Athens, GA 30602 USA
- Dominic P. Byrne
  - Department of Biochemistry and Systems Biology, University of Liverpool, Crown St, Liverpool, UK
- Patrick A. Eyers
  - Department of Biochemistry and Systems Biology, University of Liverpool, Crown St, Liverpool, UK
- Natarajan Kannan
  - Institute of Bioinformatics, University of Georgia, 120 Green St., Athens, GA 30602 USA
  - Department of Biochemistry and Molecular Biology, University of Georgia, 120 Green St., Athens, GA 30602 USA
|
18
|
Mixão V, Hegedűsová E, Saus E, Pryszcz LP, Cillingová A, Nosek J, Gabaldón T. Genome analysis of Candida subhashii reveals its hybrid nature and dual mitochondrial genome conformations. DNA Res 2021; 28:6299387. [PMID: 34129020 PMCID: PMC8311171 DOI: 10.1093/dnares/dsab006] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 03/15/2021] [Accepted: 06/14/2021] [Indexed: 01/14/2023] Open
Abstract
Candida subhashii belongs to the CUG-Ser clade, a group of phylogenetically closely related yeast species that includes some human opportunistic pathogens, such as Candida albicans. Despite being present in the environment, C. subhashii was initially described as the causative agent of a case of peritonitis. Considering the relevance of whole-genome sequencing and analysis for our understanding of genome evolution and pathogenicity, we sequenced, assembled and annotated the genome of C. subhashii type strain. Our results show that C. subhashii presents a highly heterozygous genome and other signatures that point to a hybrid ancestry. The presence of functional pathways for assimilation of hydroxyaromatic compounds goes in line with the affiliation of this yeast with soil microbial communities involved in lignin decomposition. Furthermore, we observed that different clones of this strain may present circular or linear mitochondrial DNA. Re-sequencing and comparison of strains with differential mitochondrial genome topology revealed five candidate genes potentially associated with this conformational change: MSK1, SSZ1, ALG5, MRPL9 and OYE32.
Affiliation(s)
- Verónica Mixão
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Eva Hegedűsová
  - Faculty of Natural Sciences, Department of Biochemistry, Comenius University in Bratislava, Ilkovičova 6, 842 15 Bratislava, Slovak Republic
- Ester Saus
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Leszek P Pryszcz
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Andrea Cillingová
  - Faculty of Natural Sciences, Department of Biochemistry, Comenius University in Bratislava, Ilkovičova 6, 842 15 Bratislava, Slovak Republic
- Jozef Nosek
  - Faculty of Natural Sciences, Department of Biochemistry, Comenius University in Bratislava, Ilkovičova 6, 842 15 Bratislava, Slovak Republic
- Toni Gabaldón
  - Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona, 29, 08034 Barcelona, Spain
  - Mechanisms of Disease Department, Institute for Research in Biomedicine (IRB), Barcelona, Spain
  - ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain
|
19
|
Independent duplications of the Golgi phosphoprotein 3 oncogene in birds. Sci Rep 2021; 11:12483. [PMID: 34127736 PMCID: PMC8203631 DOI: 10.1038/s41598-021-91909-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 03/03/2021] [Accepted: 06/02/2021] [Indexed: 02/05/2023] Open
Abstract
Golgi phosphoprotein 3 (GOLPH3) was the first reported oncoprotein of the Golgi apparatus. It was identified as an evolutionarily conserved protein upon its discovery about 20 years ago, but its function remains puzzling in normal and cancer cells. The GOLPH3 gene is part of a group of genes that also includes the GOLPH3L gene. Because cancer has deep roots in multicellular evolution, studying the evolution of the GOLPH3 gene family in non-model species represents an opportunity to identify new model systems that could help better understand the biology behind this group of genes. The main goal of this study is to explore the evolution of the GOLPH3 gene family in birds as a starting point to understand the evolutionary history of this oncoprotein. We identified a repertoire of three GOLPH3 genes in birds. We found duplicated copies of the GOLPH3 gene in all main groups of birds other than paleognaths, and a single copy of the GOLPH3L gene. We suggest there were at least three independent origins for GOLPH3 duplicates. Amino acid divergence estimates show that most of the variation is located in the N-terminal region of the protein. Our transcript abundance estimations show that one paralog is highly and ubiquitously expressed, and the others were variable. Our results are an example of the significance of understanding the evolution of the GOLPH3 gene family, especially for unraveling its structural and functional attributes.
|
20
|
Comparative Genomics Used to Predict Virulence Factors and Metabolic Genes among Monilinia Species. J Fungi (Basel) 2021; 7:jof7060464. [PMID: 34201288 PMCID: PMC8228255 DOI: 10.3390/jof7060464] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/09/2021] [Revised: 05/31/2021] [Accepted: 06/01/2021] [Indexed: 02/07/2023] Open
Abstract
Brown rot, caused by Monilinia spp., is among the most important diseases in stone fruits, and some pome fruits (mainly apples). This disease is responsible for significant yield losses, particularly in stone fruits, when weather conditions favorable for disease development appear. To achieve future sustainable strategies to control brown rot on fruit, one potential approach will be to characterize genomic variation among Monilinia spp. to define, among others, the capacity to infect fruit in this genus. In the present work, we performed genomic and phylogenomic comparisons of five Monilinia species and inferred differences in numbers of secreted proteins, including CAZy proteins and other proteins important for virulence. Duplications specific to Monilinia were sparse and, overall, more genes have been lost than gained. Among Monilinia spp., low variability in the CAZome was observed. Interestingly, we identified several secondary metabolism clusters based on similarity to known clusters, and among them was a cluster with homology to pyriculol that could be responsible for the synthesis of chloromonilicin. Furthermore, we compared sequences of all strains available from NCBI of these species to assess their MAT loci and heterokaryon compatibility systems. Our comparative analyses provide the basis for future studies into understanding how these genomic differences underlie common or differential abilities to interact with the host plant.
|
21
|
Derelle R, Philippe H, Colbourne JK. Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment. Mol Biol Evol 2021; 37:3389-3396. [PMID: 32602888 DOI: 10.1093/molbev/msaa159] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Indexed: 12/17/2022] Open
Abstract
Orthology assignment is a key step of comparative genomic studies, for which many bioinformatic tools have been developed. However, all gene clustering pipelines are based on the analysis of protein distances, which are subject to many artifacts. In this article, we introduce Broccoli, a user-friendly pipeline designed to infer, with high precision, orthologous groups, and pairs of proteins using a phylogeny-based approach. Briefly, Broccoli performs ultrafast phylogenetic analyses on most proteins and builds a network of orthologous relationships. Orthologous groups are then identified from the network using a parameter-free machine learning algorithm. Broccoli is also able to detect chimeric proteins resulting from gene-fusion events and to assign these proteins to the corresponding orthologous groups. Tested on two benchmark data sets, Broccoli outperforms current orthology pipelines. In addition, Broccoli is scalable, with runtimes similar to those of recent distance-based pipelines. Given its high level of performance and efficiency, this new pipeline represents a suitable choice for comparative genomic studies. Broccoli is freely available at https://github.com/rderelle/Broccoli.
Affiliation(s)
- Romain Derelle
- School of Biosciences, University of Birmingham, Birmingham, United Kingdom
| | - Hervé Philippe
- Station d'Ecologie Théorique et Expérimentale, UMR CNRS 5321, Moulis, France.,Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Montréal, QC, Canada
| | - John K Colbourne
- School of Biosciences, University of Birmingham, Birmingham, United Kingdom
| |
|
22
|
Kandziora M, Sklenář P, Kolář F, Schmickl R. How to Tackle Phylogenetic Discordance in Recent and Rapidly Radiating Groups? Developing a Workflow Using Loricaria (Asteraceae) as an Example. Front Plant Sci 2021; 12:765719. [PMID: 35069621 PMCID: PMC8777076 DOI: 10.3389/fpls.2021.765719] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 08/27/2021] [Accepted: 11/22/2021] [Indexed: 05/17/2023]
Abstract
A major challenge in phylogenetics and -genomics is to resolve young rapidly radiating groups. The fast succession of species increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistics to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted creating opportunities for secondary contact and hybridization.
Affiliation(s)
- Martha Kandziora
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- *Correspondence: Martha Kandziora,
| | - Petr Sklenář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Filip Kolář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| | - Roswitha Schmickl
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| |
|
23
|
Chorostecki U, Molina M, Pryszcz LP, Gabaldón T. MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life. Nucleic Acids Res 2020; 48:W553-W557. [PMID: 32343307 PMCID: PMC7319458 DOI: 10.1093/nar/gkaa282] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 02/28/2020] [Revised: 04/01/2020] [Accepted: 04/25/2020] [Indexed: 12/23/2022] Open
Abstract
Inferring homology relationships across genes in different species is a central task in comparative genomics. Therefore, a large number of resources and methods have been developed over the years. Some public databases include phylogenetic trees of homologous gene families which can be used to further differentiate homology relationships into orthology and paralogy. MetaPhOrs is a web server that integrates phylogenetic information from different sources to provide orthology and paralogy relationships based on a common phylogeny-based predictive algorithm and associated with a consistency-based confidence score. Here we describe the latest version of the web server which includes major new implementations and provides orthology and paralogy relationships derived from ∼8.2 million gene family trees from 13 different source repositories across ∼4000 species with sequenced genomes. MetaPhOrs server is freely available, without registration, at http://orthology.phylomedb.org/.
Affiliation(s)
- Uciel Chorostecki
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain.,Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Manuel Molina
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain.,Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Leszek P Pryszcz
- Centre for Genomic Regulation, 08003 Barcelona, Spain.,International Institute of Molecular and Cell Biology, 4 Ks. Trojdena Street, 02-109 Warsaw, Poland
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain.,Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain.,ICREA, 08010 Barcelona, Spain
| |
|
24
|
Gerdol M, Moreira R, Cruz F, Gómez-Garrido J, Vlasova A, Rosani U, Venier P, Naranjo-Ortiz MA, Murgarella M, Greco S, Balseiro P, Corvelo A, Frias L, Gut M, Gabaldón T, Pallavicini A, Canchaya C, Novoa B, Alioto TS, Posada D, Figueras A. Massive gene presence-absence variation shapes an open pan-genome in the Mediterranean mussel. Genome Biol 2020; 21:275. [PMID: 33168033 PMCID: PMC7653742 DOI: 10.1186/s13059-020-02180-3] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Received: 09/04/2019] [Accepted: 10/15/2020] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND The Mediterranean mussel Mytilus galloprovincialis is an ecologically and economically relevant edible marine bivalve, highly invasive and resilient to biotic and abiotic stressors causing recurrent massive mortalities in other bivalves. Although these traits have been recently linked with the maintenance of a high genetic variation within natural populations, the factors underlying the evolutionary success of this species remain unclear. RESULTS Here, after the assembly of a 1.28-Gb reference genome and the resequencing of 14 individuals from two independent populations, we reveal a complex pan-genomic architecture in M. galloprovincialis, with a core set of 45,000 genes plus a strikingly high number of dispensable genes (20,000) subject to presence-absence variation, which may be entirely missing in several individuals. We show that dispensable genes are associated with hemizygous genomic regions affected by structural variants, which overall account for nearly 580 Mb of DNA sequence not included in the reference genome assembly. As such, this is the first study to report the widespread occurrence of gene presence-absence variation at a whole-genome scale in the animal kingdom. CONCLUSIONS Dispensable genes usually belong to young and recently expanded gene families enriched in survival functions, which might be the key to explain the resilience and invasiveness of this species. This unique pan-genome architecture is characterized by dispensable genes in accessory genomic regions that exceed by orders of magnitude those observed in other metazoans, including humans, and closely mirror the open pan-genomes found in prokaryotes and in a few non-metazoan eukaryotes.
Affiliation(s)
- Marco Gerdol
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
| | - Rebeca Moreira
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| | - Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Jessica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Anna Vlasova
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - Umberto Rosani
- Department of Biology, Università degli Studi di Padova, Via Ugo Bassi 58/B, 35131 Padova, Italy
| | - Paola Venier
- Department of Biology, Università degli Studi di Padova, Via Ugo Bassi 58/B, 35131 Padova, Italy
| | - Miguel A. Naranjo-Ortiz
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Maria Murgarella
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| | - Samuele Greco
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
| | - Pablo Balseiro
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
- Norce Norwegian Research Centre AS, Bergen, Norway
| | - André Corvelo
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- New York Genome Center, New York, NY 10013 USA
| | - Leonor Frias
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Toni Gabaldón
- CRG - Centre for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
- Current address: Barcelona Supercomputing Centre (BSC-CNS) and Institute for Research in Biomedicine (IRB), 08034 Barcelona, Spain
| | - Alberto Pallavicini
- Department of Life Sciences, Università degli Studi di Trieste, Via Licio Giorgieri 5, 34127 Trieste, Italy
- Anton Dohrn Zoological Station, 80121 Villa Comunale, Naples, Italy
| | - Carlos Canchaya
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
- Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute, 36310 Vigo, Spain
| | - Beatriz Novoa
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| | - Tyler S. Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
- Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute, 36310 Vigo, Spain
| | - Antonio Figueras
- Instituto de Investigaciones Marinas (IIM - CSIC), Eduardo Cabello, 6, 36208 Vigo, Spain
| |
|
25
|
Owen CL, Stern DB, Hilton SK, Crandall KA. Hemiptera phylogenomic resources: Tree‐based orthology prediction and conserved exon identification. Mol Ecol Resour 2020; 20:1346-1360. [DOI: 10.1111/1755-0998.13180] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 10/09/2018] [Revised: 04/02/2020] [Accepted: 04/27/2020] [Indexed: 12/21/2022]
Affiliation(s)
- Christopher L. Owen
- Computational Biology Institute George Washington University Washington DC USA
- Systematic Entomology Laboratory USDA‐ARS Beltsville MD USA
| | - David B. Stern
- Computational Biology Institute George Washington University Washington DC USA
- Department of Integrative Biology University of Wisconsin ‐ Madison Madison WI USA
| | - Sarah K. Hilton
- Computational Biology Institute George Washington University Washington DC USA
- Department of Genome Sciences University of Washington Washington DC USA
| | - Keith A. Crandall
- Computational Biology Institute George Washington University Washington DC USA
| |
|
26
|
Brand JN, Wiberg RAW, Pjeta R, Bertemes P, Beisel C, Ladurner P, Schärer L. RNA-Seq of three free-living flatworm species suggests rapid evolution of reproduction-related genes. BMC Genomics 2020; 21:462. [PMID: 32631219 PMCID: PMC7336406 DOI: 10.1186/s12864-020-06862-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 12/23/2019] [Accepted: 06/22/2020] [Indexed: 01/03/2023] Open
Abstract
Background The genus Macrostomum consists of small free-living flatworms and contains Macrostomum lignano, which has been used in investigations of ageing, stem cell biology, bioadhesion, karyology, and sexual selection in hermaphrodites. Two types of mating behaviour occur within this genus. Some species, including M. lignano, mate via reciprocal copulation, where, in a single mating, both partners insert their male copulatory organ into the female storage organ and simultaneously donate and receive sperm. Other species mate via hypodermic insemination, where worms use a needle-like copulatory organ to inject sperm into the tissue of the partner. These contrasting mating behaviours are associated with striking differences in sperm and copulatory organ morphology. Here we expand the genomic resources within the genus to representatives of both behaviour types and investigate whether genes vary in their rate of evolution depending on their putative function. Results We present de novo assembled transcriptomes of three Macrostomum species, namely M. hystrix, a close relative of M. lignano that mates via hypodermic insemination, M. spirale, a more distantly related species that mates via reciprocal copulation, and finally M. pusillum, which represents a clade that is only distantly related to the other three species and also mates via hypodermic insemination. We infer 23,764 sets of homologous genes and annotate them using experimental evidence from M. lignano. Across the genus, we identify 521 gene families with conserved patterns of differential expression between juvenile vs. adult worms and 185 gene families with a putative expression in the testes that are restricted to the two reciprocally mating species. 
Further, we show that homologs of putative reproduction-related genes have a higher protein divergence across the four species than genes lacking such annotations and that they are more difficult to identify across the four species, indicating that these genes evolve more rapidly, while genes involved in neoblast function are more conserved. Conclusions This study improves the genus Macrostomum as a model system, by providing resources for the targeted investigation of gene function in a broad range of species. And we, for the first time, show that reproduction-related genes evolve at an accelerated rate in flatworms.
Affiliation(s)
- Jeremias N Brand
- Department of Environmental Sciences, Zoological Institute, University of Basel, Vesalgasse 1, 4051, Basel, Switzerland.
| | - R Axel W Wiberg
- Department of Environmental Sciences, Zoological Institute, University of Basel, Vesalgasse 1, 4051, Basel, Switzerland
| | - Robert Pjeta
- Institute of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Innsbruck, Austria
| | - Philip Bertemes
- Institute of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Innsbruck, Austria
| | - Christian Beisel
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Peter Ladurner
- Institute of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Innsbruck, Austria
| | - Lukas Schärer
- Department of Environmental Sciences, Zoological Institute, University of Basel, Vesalgasse 1, 4051, Basel, Switzerland
| |
|
27
|
Draft genome of the European medicinal leech Hirudo medicinalis (Annelida, Clitellata, Hirudiniformes) with emphasis on anticoagulants. Sci Rep 2020; 10:9885. [PMID: 32555498 PMCID: PMC7303139 DOI: 10.1038/s41598-020-66749-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Received: 02/18/2020] [Accepted: 04/28/2020] [Indexed: 02/07/2023] Open
Abstract
The European medicinal leech has been used for medicinal purposes for millennia, and continues to be used today in modern hospital settings. Its utility is granted by the extremely potent anticoagulation factors that the leech secretes into the incision wound during feeding and, although a handful of studies have targeted certain anticoagulants, the full range of anticoagulation factors expressed by this species remains unknown. Here, we present the first draft genome of the European medicinal leech, Hirudo medicinalis, and estimate that we have sequenced between 79–94% of the full genome. Leveraging these data, we searched for anticoagulation factors across the genome of H. medicinalis. Following orthology determination through a series of BLAST searches, as well as phylogenetic analyses, we estimate that fully 15 different known anticoagulation factors are utilized by the species, and that 17 other proteins that have been linked to antihemostasis are also present in the genome. We underscore the utility of the draft genome for comparative studies of leeches and discuss our results in an evolutionary context.
|
28
|
Nagy LG, Merényi Z, Hegedüs B, Bálint B. Novel phylogenetic methods are needed for understanding gene function in the era of mega-scale genome sequencing. Nucleic Acids Res 2020; 48:2209-2219. [PMID: 31943056 PMCID: PMC7049691 DOI: 10.1093/nar/gkz1241] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 10/23/2019] [Revised: 12/15/2019] [Accepted: 12/31/2019] [Indexed: 12/21/2022] Open
Abstract
Ongoing large-scale genome sequencing projects are forecasting a data deluge that will almost certainly overwhelm current analytical capabilities of evolutionary genomics. In contrast to population genomics, there are no standardized methods in evolutionary genomics for extracting evolutionary and functional (e.g. gene-trait association) signal from genomic data. Here, we examine how current practices of multi-species comparative genomics perform in this aspect and point out that many genomic datasets are under-utilized due to the lack of powerful methodologies. As a result, many current analyses emphasize gene families for which some functional data is already available, resulting in a growing gap between functionally well-characterized genes/organisms and the universe of unknowns. This leaves unknown genes on the 'dark side' of genomes, a problem that will not be mitigated by sequencing more and more genomes, unless we develop tools to infer functional hypotheses for unknown genes in a systematic manner. We provide an inventory of recently developed methods capable of predicting gene-gene and gene-trait associations based on comparative data, then argue that realizing the full potential of whole genome datasets requires the integration of phylogenetic comparative methods into genomics, a rich but underutilized toolbox for looking into the past.
Affiliation(s)
- László G Nagy
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre, Temesvari krt 62. Szeged 6726, Hungary
| | - Zsolt Merényi
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre, Temesvari krt 62. Szeged 6726, Hungary
| | - Botond Hegedüs
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre, Temesvari krt 62. Szeged 6726, Hungary
| | - Balázs Bálint
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre, Temesvari krt 62. Szeged 6726, Hungary
| |
|
29
|
Larson DA, Walker JF, Vargas OM, Smith SA. A consensus phylogenomic approach highlights paleopolyploid and rapid radiation in the history of Ericales. Am J Bot 2020; 107:773-789. [PMID: 32350864 DOI: 10.1002/ajb2.1469] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 12/08/2019] [Accepted: 02/12/2020] [Indexed: 05/27/2023]
Abstract
PREMISE Large genomic data sets offer the promise of resolving historically recalcitrant species relationships. However, different methodologies can yield conflicting results, especially when clades have experienced ancient, rapid diversification. Here, we analyzed the ancient radiation of Ericales and explored sources of uncertainty related to species tree inference, conflicting gene tree signal, and the inferred placement of gene and genome duplications. METHODS We used a hierarchical clustering approach, with tree-based homology and orthology detection, to generate six filtered phylogenomic matrices consisting of data from 97 transcriptomes and genomes. Support for species relationships was inferred from multiple lines of evidence including shared gene duplications, gene tree conflict, gene-wise edge-based analyses, concatenation, and coalescent-based methods, and is summarized in a consensus framework. RESULTS Our consensus approach supported a topology largely concordant with previous studies, but suggests that the data are not capable of resolving several ancient relationships because of lack of informative characters, sensitivity to methodology, and extensive gene tree conflict correlated with paleopolyploidy. We found evidence of a whole-genome duplication before the radiation of all or most ericalean families, and demonstrate that tree topology and heterogeneous evolutionary rates affect the inferred placement of genome duplications. CONCLUSIONS We provide several hypotheses regarding the history of Ericales, and confidently resolve most nodes, but demonstrate that a series of ancient divergences are unresolvable with these data. Whether paleopolyploidy is a major source of the observed phylogenetic conflict warrants further investigation.
Affiliation(s)
- Drew A Larson
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Joseph F Walker
- Sainsbury Laboratory (SLCU), University of Cambridge, Cambridge, CB2 1LR, UK
| | - Oscar M Vargas
- Department of Ecology & Evolutionary Biology, University of California, Santa Cruz, CA, 95060, USA
| | - Stephen A Smith
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| |
|
30
|
Fernández R, Gabaldón T. Gene gain and loss across the metazoan tree of life. Nat Ecol Evol 2020; 4:524-533. [PMID: 31988444 PMCID: PMC7124887 DOI: 10.1038/s41559-019-1069-x] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Received: 04/29/2019] [Accepted: 11/21/2019] [Indexed: 12/22/2022]
Abstract
Although recent research has revealed high genomic complexity in the earliest-splitting animals and their ancestors, the macroevolutionary trends orchestrating gene repertoire evolution throughout the animal phyla remain poorly understood. We used a phylogenomic approach to interrogate genome evolution across all animal phyla. Our analysis uncovered a bimodal distribution of recruitment of orthologous genes, with most genes gained very 'early' (that is, at deep nodes) or very 'late', representing lineage-specific acquisitions. The emergence of animals was characterized by high values of gene birth and duplications. Deuterostomes, ecdysozoans and Xenacoelomorpha were characterized by no gene gain but rampant differential gene loss. Genes considered as animal hallmarks, such as Notch/Delta, were convergently duplicated in all phyla and at different evolutionary depths. Genes duplicated in all nodes from Metazoa to phylum-specific levels were enriched in functions related to the neural system, suggesting that this system has been continuously and independently reshaped throughout evolution across animals. Our results indicate that animal genomes evolved by unparalleled gene duplication followed by differential gene loss, and provide an atlas of gene repertoire evolution throughout the animal tree of life to navigate how, when and how often each gene in each genome was gained, duplicated or lost.
Affiliation(s)
- Rosa Fernández
- Bioinformatics and Genomics Unit, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- Barcelona Supercomputing Centre (BSC-CNS) and Institute for Research in Biomedicine (IRB), Barcelona, Spain
| | - Toni Gabaldón
- Bioinformatics and Genomics Unit, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
- Barcelona Supercomputing Centre (BSC-CNS) and Institute for Research in Biomedicine (IRB), Barcelona, Spain.
| |
|
31
|
Julca I, Marcet-Houben M, Cruz F, Vargas-Chavez C, Johnston JS, Gómez-Garrido J, Frias L, Corvelo A, Loska D, Cámara F, Gut M, Alioto T, Latorre A, Gabaldón T. Phylogenomics Identifies an Ancestral Burst of Gene Duplications Predating the Diversification of Aphidomorpha. Mol Biol Evol 2020; 37:730-756. [PMID: 31702774 PMCID: PMC7038657 DOI: 10.1093/molbev/msz261] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Indexed: 12/13/2022] Open
Abstract
Aphids (Aphidoidea) are a diverse group of hemipteran insects that feed on plant phloem sap. A common finding in studies of aphid genomes is the presence of a large number of duplicated genes. However, when these duplications occurred remains unclear, partly due to the high relatedness of sequenced species. To better understand the origin of aphid duplications we sequenced and assembled the genome of Cinara cedri, an early branching lineage (Lachninae) of the Aphididae family. We performed a phylogenomic comparison of this genome with 20 other sequenced genomes, including the available genomes of five other aphids, along with the transcriptomes of two species belonging to Adelgidae (a closely related clade to the aphids) and Coccoidea. We found that gene duplication has been pervasive throughout the evolution of aphids, including many parallel waves of recent, species-specific duplications. Most notably, we identified a consistent set of very ancestral duplications, originating from a large-scale gene duplication predating the diversification of Aphidomorpha (comprising aphids, phylloxerids, and adelgids). Genes duplicated in this ancestral wave are enriched in functions related to traits shared by Aphidomorpha, such as association with endosymbionts, and adaptation to plant defenses and phloem-sap-based diet. The ancestral nature of this duplication wave (106-227 Ma) and the lack of sufficiently conserved synteny make it difficult to conclude whether it originated from a whole-genome duplication event or, alternatively, from a burst of large-scale segmental duplications. Genome sequencing of other aphid species belonging to different Aphidomorpha and related lineages may clarify these findings.
Affiliation(s)
- Irene Julca
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Marina Marcet-Houben
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Fernando Cruz
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Carlos Vargas-Chavez
  - Institute for Integrative Systems Biology (I2SysBio), University of Valencia and CSIC, Valencia, Spain
- Jèssica Gómez-Garrido
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Leonor Frias
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- André Corvelo
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
  - New York Genome Center, New York, NY
- Damian Loska
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Francisco Cámara
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Marta Gut
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), Department of Experimental and Health Sciences, Barcelona, Spain
- Tyler Alioto
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), Department of Experimental and Health Sciences, Barcelona, Spain
- Amparo Latorre
  - Institute for Integrative Systems Biology (I2SysBio), University of Valencia and CSIC, Valencia, Spain
  - Joint Unit in Genomics and Health, Foundation for the Promotion of Sanitary and Biomedical Research (FISABIO) and University of Valencia, Valencia, Spain
- Toni Gabaldón
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), Department of Experimental and Health Sciences, Barcelona, Spain
  - Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
32
Prasanna AN, Gerber D, Kijpornyongpan T, Aime MC, Doyle VP, Nagy LG. Model Choice, Missing Data, and Taxon Sampling Impact Phylogenomic Inference of Deep Basidiomycota Relationships. Syst Biol 2020; 69:17-37. [PMID: 31062852 DOI: 10.1093/sysbio/syz029] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Received: 10/19/2017] [Revised: 04/21/2019] [Accepted: 04/26/2019] [Indexed: 11/12/2022]
Abstract
Resolving deep divergences in the tree of life is challenging even for analyses of genome-scale phylogenetic data sets. Relationships between Basidiomycota subphyla, the rusts and allies (Pucciniomycotina), smuts and allies (Ustilaginomycotina), and mushroom-forming fungi and allies (Agaricomycotina) were found particularly recalcitrant both to traditional multigene and genome-scale phylogenetics. Here, we address basal Basidiomycota relationships using concatenated and gene tree-based analyses of various phylogenomic data sets to examine the contribution of several potential sources of bias. We evaluate the contribution of biological causes (hard polytomy, incomplete lineage sorting) versus unmodeled evolutionary processes and factors that exacerbate their effects (e.g., fast-evolving sites and long-branch taxa) to inferences of basal Basidiomycota relationships. Bayesian Markov Chain Monte Carlo and likelihood mapping analyses reject the hard polytomy with confidence. In concatenated analyses, fast-evolving sites and oversimplified models of amino acid substitution favored the grouping of smuts with mushroom-forming fungi, often leading to maximal bootstrap support in both concatenation and coalescent analyses. On the contrary, the most conserved data subsets grouped rusts and allies with mushroom-forming fungi, although this relationship proved labile, sensitive to model choice, to different data subsets and to missing data. Excluding putative long-branch taxa, genes with high proportions of missing data and/or with strong signal failed to reveal a consistent trend toward one or the other topology, suggesting that additional sources of conflict are at play. 
While concatenated analyses yielded strong but conflicting support, individual gene trees mostly provided poor support for any resolution of rusts, smuts, and mushroom-forming fungi, suggesting that the true Basidiomycota tree might be in a part of tree space that is difficult to access using both concatenation and gene tree-based approaches. Inference-based assessments of absolute model fit strongly reject best-fit models for the vast majority of genes, indicating a poor fit of even the most commonly used models. While this is consistent with previous assessments of site-homogenous models of amino acid evolution, this does not appear to be the sole source of confounding signal. Our analyses suggest that topologies uniting smuts with mushroom-forming fungi can arise as a result of inappropriate modeling of amino acid sites that might be prone to systematic bias. We speculate that improved models of sequence evolution could shed more light on basal splits in the Basidiomycota, which, for now, remain unresolved despite the use of whole genome data.
Affiliation(s)
- Arun N Prasanna
  - Synthetic and Systems Biology Unit, Institute of Biochemistry, BRC-HAS, Szeged 6726, Hungary
- Daniel Gerber
  - Synthetic and Systems Biology Unit, Institute of Biochemistry, BRC-HAS, Szeged 6726, Hungary
  - Institute of Archaeology, Research Centre for the Humanities, Hungarian Academy of Sciences, Budapest 1097, Hungary
- M Catherine Aime
  - Department of Botany and Plant Pathology, Purdue University, West Lafayette, IN 47907, USA
- Vinson P Doyle
  - Department of Plant Pathology and Crop Physiology, Louisiana State University AgCenter, Baton Rouge, LA 70803, USA
- Laszlo G Nagy
  - Synthetic and Systems Biology Unit, Institute of Biochemistry, BRC-HAS, Szeged 6726, Hungary
33
Alioto T, Alexiou KG, Bardil A, Barteri F, Castanera R, Cruz F, Dhingra A, Duval H, Fernández i Martí Á, Frias L, Galán B, García JL, Howad W, Gómez‐Garrido J, Gut M, Julca I, Morata J, Puigdomènech P, Ribeca P, Rubio Cabetas MJ, Vlasova A, Wirthensohn M, Garcia‐Mas J, Gabaldón T, Casacuberta JM, Arús P. Transposons played a major role in the diversification between the closely related almond and peach genomes: results from the almond genome sequence. The Plant Journal: For Cell and Molecular Biology 2020; 101:455-472. [PMID: 31529539 PMCID: PMC7004133 DOI: 10.1111/tpj.14538] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Received: 06/11/2019] [Revised: 08/29/2019] [Accepted: 09/02/2019] [Indexed: 05/19/2023]
Abstract
We sequenced the genome of the highly heterozygous almond Prunus dulcis cv. Texas combining short- and long-read sequencing. We obtained a genome assembly totaling 227.6 Mb of the estimated almond genome size of 238 Mb, of which 91% is anchored to eight pseudomolecules corresponding to its haploid chromosome complement, and annotated 27 969 protein-coding genes and 6747 non-coding transcripts. By phylogenomic comparison with the genomes of 16 additional close and distant species we estimated that almond and peach (Prunus persica) diverged around 5.88 million years ago. These two genomes are highly syntenic and show a high degree of sequence conservation (20 nucleotide substitutions per kb). However, they also exhibit a high number of presence/absence variants, many attributable to the movement of transposable elements (TEs). Transposable elements have generated an important number of presence/absence variants between almond and peach, and we show that the recent history of TE movement seems markedly different between them. Transposable elements may also be at the origin of important phenotypic differences between both species, and in particular for the sweet kernel phenotype, a key agronomic and domestication character for almond. Here we show that in sweet almond cultivars, highly methylated TE insertions surround a gene involved in the biosynthesis of amygdalin, whose reduced expression has been correlated with the sweet almond phenotype. Altogether, our results suggest a key role of TEs in the recent history and diversification of almond and its close relative peach.
Affiliation(s)
- Tyler Alioto
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
- Konstantinos G. Alexiou
  - IRTA, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Amélie Bardil
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Fabio Barteri
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Raúl Castanera
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Fernando Cruz
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
- Amit Dhingra
  - Department of Horticulture, Washington State University, 99164-6414 Pullman, WA, USA
- Henri Duval
  - INRA, UR1052, Unité de Génétique et Amélioration des Fruits et Légumes (GAFL), Domaine St. Maurice CS 60094, 84143 Montfavet Cedex, France
- Ángel Fernández i Martí
  - Department of Environmental Science Policy and Management, University of California, Berkeley, 94720, CA, USA
  - Innovative Genomics Institute (IGI), 94720 Berkeley, CA, USA
- Leonor Frias
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
- Beatriz Galán
  - Department of Environmental Biology, Center for Biological Research (CIB‐CSIC), Spanish National Research Council (CSIC), Ramiro de Maeztu 9, 28040 Madrid, Spain
- José L. García
  - Department of Environmental Biology, Center for Biological Research (CIB‐CSIC), Spanish National Research Council (CSIC), Ramiro de Maeztu 9, 28040 Madrid, Spain
- Werner Howad
  - IRTA, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Jèssica Gómez‐Garrido
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
- Marta Gut
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
- Irene Julca
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
  - Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Dr Aiguader, 88, 08003 Barcelona, Spain
- Jordi Morata
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Pere Puigdomènech
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Paolo Ribeca
  - CNAG‐CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
  - The Pirbright Institute, Woking, Surrey, GU24 0NF, UK
- María J. Rubio Cabetas
  - Centro de Investigación y Tecnología Agroalimentaria de Aragón (CITA), Unidad de Hortofruticultura, Gobierno de Aragón, Avda. Montañana 930, 50059 Zaragoza, Spain
  - Instituto Agroalimentario de Aragón – IA2 (CITA‐Universidad de Zaragoza), Calle Miguel Servet 177, 50013 Zaragoza, Spain
- Anna Vlasova
  - Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Dr Aiguader, 88, 08003 Barcelona, Spain
- Michelle Wirthensohn
  - University of Adelaide, Waite Research Institute, School of Agriculture, Food and Wine, PMB 1, Glen Osmond, SA 5064, Australia
- Jordi Garcia‐Mas
  - IRTA, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Toni Gabaldón
  - Universitat Pompeu Fabra (UPF), 08005 Barcelona, Spain
  - Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Dr Aiguader, 88, 08003 Barcelona, Spain
  - Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg Lluís Companys 23, 08010 Barcelona, Spain
- Josep M. Casacuberta
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
- Pere Arús
  - IRTA, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
  - Centre for Research in Agricultural Genomics (CRAG), CSIC‐IRTA‐UAB‐UB, Campus UAB, Edifici CRAG, Cerdanyola del Vallès (Bellaterra), 08193 Barcelona, Spain
34
Genome Assemblies of Two Rare Opportunistic Yeast Pathogens: Diutina rugosa (syn. Candida rugosa) and Trichomonascus ciferrii (syn. Candida ciferrii). G3: Genes, Genomes, Genetics 2019; 9:3921-3927. [PMID: 31575637 PMCID: PMC6893180 DOI: 10.1534/g3.119.400762] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 12/31/2022]
Abstract
Infections caused by opportunistic yeast pathogens have increased in recent years. These infections can be caused by a large number of diverse yeast species of varying incidence, and with distinct clinically relevant phenotypic traits, such as different susceptibility profiles to antifungal drugs, which challenge diagnosis and treatment. Diutina rugosa (syn. Candida rugosa) and Trichomonascus ciferrii (syn. Candida ciferrii) are two rare opportunistic yeast pathogens whose low incidence (< 1%) limits available clinical experience. Furthermore, these yeasts have elevated Minimum Inhibitory Concentration (MIC) levels to at least one class of antifungal agents. This makes it more difficult to manage their infections, and thus they are associated with high rates of mortality and clinical failure. With the aim of improving our knowledge of these opportunistic pathogens, we assembled and annotated their genomes. A phylogenomics approach revealed that genes specifically duplicated in each of the two species are often involved in transmembrane transport activities. These genomes and the reconstructed complete catalog of gene phylogenies and homology relationships constitute useful resources for future studies on these pathogens.
35
Vizán-Rico HI, Mayer C, Petersen M, McKenna DD, Zhou X, Gómez-Zurita J. Patterns and Constraints in the Evolution of Sperm Individualization Genes in Insects, with an Emphasis on Beetles. Genes (Basel) 2019; 10:E776. [PMID: 31590243 PMCID: PMC6826512 DOI: 10.3390/genes10100776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/24/2019] [Revised: 09/20/2019] [Accepted: 10/01/2019] [Indexed: 11/17/2022]
Abstract
Gene expression profiles can change dramatically between sexes and sex bias may contribute specific macroevolutionary dynamics for sex-biased genes. However, these dynamics are poorly understood at large evolutionary scales due to the paucity of studies that have assessed orthology and functional homology for sex-biased genes and the pleiotropic effects possibly constraining their evolutionary potential. Here, we explore the correlation of sex-biased expression with macroevolutionary processes that are associated with sex-biased genes, including duplications and accelerated evolutionary rates. Specifically, we examined these traits in a group of 44 genes that orchestrate sperm individualization during spermatogenesis, with both unbiased and sex-biased expression. We studied these genes in the broad evolutionary framework of the Insecta, with a particular focus on beetles (order Coleoptera). We studied data mined from 119 insect genomes, including 6 beetle models, and from 19 additional beetle transcriptomes. For the subset of physically and/or genetically interacting proteins, we also analyzed how their network structure may condition the mode of gene evolution. The collection of genes was highly heterogeneous in duplication status, evolutionary rates, and rate stability, but there was statistical evidence for sex bias correlated with faster evolutionary rates, consistent with theoretical predictions. Faster rates were also correlated with clocklike (insect amino acids) and non-clocklike (beetle nucleotides) substitution patterns in these genes. Statistical associations (higher rates for central nodes) or lack thereof (centrality of duplicated genes) were in contrast to some current evolutionary hypotheses, highlighting the need for more research on these topics.
Affiliation(s)
- Helena I. Vizán-Rico
  - Animal Biodiversity and Evolution, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
- Christoph Mayer
  - Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113 Bonn, Germany
- Malte Petersen
  - Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113 Bonn, Germany
- Duane D. McKenna
  - Center for Biodiversity Research, Department of Biological Sciences, University of Memphis, Memphis, TN 38152, USA
- Xin Zhou
  - Department of Entomology, College of Plant Protection, China Agricultural University, Beijing 100193, China
- Jesús Gómez-Zurita
  - Animal Biodiversity and Evolution, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
36
Hu X, Friedberg I. SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier. Gigascience 2019; 8:giz118. [PMID: 31648300 PMCID: PMC6812468 DOI: 10.1093/gigascience/giz118] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 02/13/2019] [Revised: 06/07/2019] [Accepted: 09/05/2019] [Indexed: 11/13/2022]
Abstract
BACKGROUND: Gene homology type classification is required for many types of genome analyses, including comparative genomics, phylogenetics, and protein function annotation. Consequently, a large variety of tools have been developed to perform homology classification across genomes of different species. However, when applied to large genomic data sets, these tools require high memory and CPU usage, typically available only in computational clusters. FINDINGS: Here we present a new graph-based orthology analysis tool, SwiftOrtho, which is optimized for speed and memory usage when applied to large-scale data. SwiftOrtho uses long k-mers to speed up homology search, while using a reduced amino acid alphabet and spaced seeds to compensate for the loss of sensitivity due to long k-mers. In addition, it uses an affinity propagation algorithm to reduce the memory usage when clustering large-scale orthology relationships into orthologous groups. In our tests, SwiftOrtho was the only tool that completed orthology analysis of proteins from 1,760 bacterial genomes on a computer with only 4 GB RAM. Using various standard orthology data sets, we also show that SwiftOrtho has a high accuracy. CONCLUSIONS: SwiftOrtho enables the accurate comparative genomic analyses of thousands of genomes using low-memory computers. SwiftOrtho is available at https://github.com/Rinoahu/SwiftOrtho.
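As a rough illustration of the seeding idea described in this abstract, the following sketch shows how a reduced amino acid alphabet combined with a spaced seed lets two diverged peptides still produce an identical seed. It is not SwiftOrtho's code: the six-class grouping in `GROUPS`, the mask `1101011`, and all function names are assumptions chosen for the example.

```python
# Hypothetical grouping of the 20 amino acids into 6 classes (an assumption,
# not the alphabet used by SwiftOrtho).
GROUPS = ["AGPST", "C", "DENQ", "FWY", "HKR", "ILMV"]
REDUCE = {aa: str(i) for i, group in enumerate(GROUPS) for aa in group}


def reduce_seq(seq):
    """Map each residue to its class label; unknown residues become 'X'."""
    return "".join(REDUCE.get(aa, "X") for aa in seq.upper())


def spaced_seeds(seq, mask="1101011"):
    """Yield (start, seed) for every window of len(mask) in the reduced sequence.

    Positions where the mask has '1' are kept; '0' positions are wildcards
    and are dropped from the seed.
    """
    reduced = reduce_seq(seq)
    keep = [i for i, bit in enumerate(mask) if bit == "1"]
    for start in range(len(reduced) - len(mask) + 1):
        window = reduced[start:start + len(mask)]
        yield start, "".join(window[i] for i in keep)


def shared_seeds(a, b, mask="1101011"):
    """Return the set of seeds that two protein sequences have in common."""
    seeds_a = {seed for _, seed in spaced_seeds(a, mask)}
    seeds_b = {seed for _, seed in spaced_seeds(b, mask)}
    return seeds_a & seeds_b
```

With these assumed settings, `shared_seeds("ACDEFGH", "ACLEFGH")` is non-empty even though the peptides differ at the third residue, because that position falls on a wildcard of the mask. Substitutions within one class (for example D to E) are absorbed by the reduced alphabet in the same way.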
Affiliation(s)
- Xiao Hu
  - Department of Veterinary Microbiology and Preventive Medicine, 2118 Veterinary Medicine, College of Veterinary Medicine, Iowa State University, Ames, IA, 50011, USA
- Iddo Friedberg
  - Department of Veterinary Microbiology and Preventive Medicine, 2118 Veterinary Medicine, College of Veterinary Medicine, Iowa State University, Ames, IA, 50011, USA
37
Correia K, Yu SM, Mahadevan R. AYbRAH: a curated ortholog database for yeasts and fungi spanning 600 million years of evolution. Database: The Journal of Biological Databases and Curation 2019; 2019:5403499. [PMID: 30893420 PMCID: PMC6425859 DOI: 10.1093/database/baz022] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 10/30/2018] [Revised: 01/17/2019] [Accepted: 01/28/2019] [Indexed: 12/14/2022]
Abstract
Budding yeasts inhabit a range of environments by exploiting various metabolic traits. The genetic bases for these traits are mostly unknown, preventing their addition or removal in a chassis organism for metabolic engineering. Insight into the evolution of orthologs, paralogs and xenologs in the yeast pan-genome can help bridge these genotypes; however, existing phylogenomic databases do not span diverse yeasts, and sometimes cannot distinguish between these homologs. To help understand the molecular evolution of these traits in yeasts, we created Analyzing Yeasts by Reconstructing Ancestry of Homologs (AYbRAH), an open-source database of predicted and manually curated ortholog groups for 33 diverse fungi and yeasts in Dikarya, spanning 600 million years of evolution. OrthoMCL and OrthoDB were used to cluster protein sequence into ortholog and homolog groups, respectively; MAFFT and PhyML reconstructed the phylogeny of all homolog groups. Ortholog assignments for enzymes and small metabolite transporters were compared to their phylogenetic reconstruction, and curated to resolve any discrepancies. Information on homolog and ortholog groups can be viewed in the AYbRAH web portal (https://lmse.github.io/aybrah/), including functional annotations, predictions for mitochondrial localization and transmembrane domains, literature references and phylogenetic reconstructions. Ortholog assignments in AYbRAH were compared to HOGENOM, KEGG Orthology, OMA, eggNOG and PANTHER. PANTHER and OMA had the most congruent ortholog groups with AYbRAH, while the other phylogenomic databases had greater amounts of under-clustering, over-clustering or no ortholog annotations for proteins. Future plans are discussed for AYbRAH, and recommendations are made for other research communities seeking to create curated ortholog databases.
Affiliation(s)
- Kevin Correia
  - Department of Chemical Engineering and Applied Chemistry, University of Toronto, College Street, Toronto, ON, Canada
- Shi M Yu
  - Department of Chemical Engineering and Applied Chemistry, University of Toronto, College Street, Toronto, ON, Canada
- Radhakrishnan Mahadevan
  - Department of Chemical Engineering and Applied Chemistry, University of Toronto, College Street, Toronto, ON, Canada
  - Institute of Biomaterials and Biomedical Engineering, University of Toronto, College Street, Toronto, ON, Canada
38
Siu-Ting K, Torres-Sánchez M, San Mauro D, Wilcockson D, Wilkinson M, Pisani D, O'Connell MJ, Creevey CJ. Inadvertent Paralog Inclusion Drives Artifactual Topologies and Timetree Estimates in Phylogenomics. Mol Biol Evol 2019; 36:1344-1356. [PMID: 30903171 PMCID: PMC6526904 DOI: 10.1093/molbev/msz067] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Indexed: 11/13/2022]
Abstract
Increasingly, large phylogenomic data sets include transcriptomic data from nonmodel organisms. This not only has allowed controversial and unexplored evolutionary relationships in the tree of life to be addressed but also increases the risk of inadvertent inclusion of paralogs in the analysis. Although this may be expected to result in decreased phylogenetic support, it is not clear if it could also drive highly supported artifactual relationships. Many groups, including the hyperdiverse Lissamphibia, are especially susceptible to these issues due to ancient gene duplication events and small numbers of sequenced genomes and because transcriptomes are increasingly applied to resolve historically conflicting taxonomic hypotheses. We tested the potential impact of paralog inclusion on the topologies and timetree estimates of the Lissamphibia using published and de novo sequencing data including 18 amphibian species, from which 2,656 single-copy gene families were identified. A novel paralog filtering approach resulted in four differently curated data sets, which were used for phylogenetic reconstructions using Bayesian inference, maximum likelihood, and quartet-based supertrees. We found that paralogs drive strongly supported conflicting hypotheses within the Lissamphibia (Batrachia and Procera) and older divergence time estimates even within groups where no variation in topology was observed. All investigated methods, except Bayesian inference with the CAT-GTR model, were found to be sensitive to paralogs, but with filtering convergence to the same answer (Batrachia) was observed. This is the first large-scale study to address the impact of orthology selection using transcriptomic data and emphasizes the importance of quality over quantity particularly for understanding relationships of poorly sampled taxa.
Affiliation(s)
- Karen Siu-Ting
  - Institute for Global Food Security, School of Biological Sciences, Queen's University Belfast, Belfast, United Kingdom
  - School of Biotechnology, Dublin City University, Glasnevin, Dublin, Ireland
  - Dpto. de Herpetología, Museo de Historia Natural, Universidad Nacional Mayor de San Marcos, Lima, Perú
- María Torres-Sánchez
  - Department of Biodiversity, Ecology, and Evolution, Complutense University of Madrid, Madrid, Spain
  - Department of Neuroscience, Spinal Cord and Brain Injury Research Center and Ambystoma Genetic Stock Center, University of Kentucky, Lexington, KY
- Diego San Mauro
  - Department of Biodiversity, Ecology, and Evolution, Complutense University of Madrid, Madrid, Spain
- David Wilcockson
  - Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
- Mark Wilkinson
  - Department of Life Sciences, Natural History Museum, London, United Kingdom
- Davide Pisani
  - Life Sciences Building, University of Bristol, Bristol, United Kingdom
- Mary J O'Connell
  - School of Biology, Faculty of Biological Sciences, University of Leeds, Leeds, United Kingdom
  - School of Life Sciences, University of Nottingham, University Park, United Kingdom
- Christopher J Creevey
  - Institute for Global Food Security, School of Biological Sciences, Queen's University Belfast, Belfast, United Kingdom
39
Inoue J, Satoh N. ORTHOSCOPE: An Automatic Web Tool for Phylogenetically Inferring Bilaterian Orthogroups with User-Selected Taxa. Mol Biol Evol 2019; 36:621-631. [PMID: 30517749 PMCID: PMC6389317 DOI: 10.1093/molbev/msy226] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Indexed: 02/06/2023]
Abstract
Identification of orthologous or paralogous relationships of coding genes is fundamental to all aspects of comparative genomics. For accurate identification of orthologs among deeply diversified bilaterian lineages, precise estimation of gene trees is indispensable, given the complicated histories of genes over millions of years. By estimating gene trees, orthologs can be identified as members of an orthogroup, a set of genes descended from a single gene in the last common ancestor of all the species being considered. In addition to comparisons with a given species tree, purposeful taxonomic sampling increases the accuracy of gene tree estimation and orthogroup identification. Although some major phylogenetic relationships of bilaterians are gradually being unraveled, the scattering of published genomic data among separate web databases is becoming a significant hindrance to identification of orthogroups with appropriate taxonomic sampling. By integrating more than 250 metazoan gene models predicted in genome projects, we developed a web tool called ORTHOSCOPE to identify orthogroups of specific protein-coding genes within major bilaterian lineages. ORTHOSCOPE allows users to employ several sequences of a specific molecule and broadly accepted nodes included in a user-specified species tree as queries and to evaluate the reliability of estimated orthogroups based on topologies and node support values of estimated gene trees. A test analysis using data from 36 bilaterians was accomplished within 140 s. ORTHOSCOPE results can be used to evaluate orthologs identified by other stand-alone programs using genome-scale data. ORTHOSCOPE is freely available at https://www.orthoscope.jp or https://github.com/jun-inoue/orthoscope (last accessed December 28, 2018).
Affiliation(s)
- Jun Inoue
  - Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Noriyuki Satoh
  - Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
40
Pett W, Adamski M, Adamska M, Francis WR, Eitel M, Pisani D, Wörheide G. The Role of Homology and Orthology in the Phylogenomic Analysis of Metazoan Gene Content. Mol Biol Evol 2019; 36:643-649. [PMID: 30690573 DOI: 10.1093/molbev/msz013] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Indexed: 11/12/2022]
Abstract
Resolving the relationships of animals (Metazoa) is crucial to our understanding of the origin of key traits such as muscles, guts, and nerves. However, a broadly accepted metazoan consensus phylogeny has yet to emerge. In part, this is because the genomes of deeply diverging and fast-evolving lineages may undergo significant gene turnover, reducing the number of orthologs shared with related phyla. This can limit the usefulness of traditional phylogenetic methods that rely on alignments of orthologous sequences. Phylogenetic analysis of gene content has the potential to circumvent this orthology requirement, with binary presence/absence of homologous gene families representing a source of phylogenetically informative characters. Applying binary substitution models to the gene content of 26 complete animal genomes, we demonstrate that patterns of gene conservation differ markedly depending on whether gene families are defined by orthology or homology, that is, whether paralogs are excluded or included. We conclude that the placement of some deeply diverging lineages may exceed the limit of resolution afforded by the current methods based on comparisons of orthologous protein sequences, and novel approaches are required to fully capture the evolutionary signal from genes within genomes.
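The binary gene-content characters mentioned in this abstract can be pictured with a small sketch. It only shows how a presence/absence matrix might be assembled from family assignments; the species labels, family identifiers, and the function name `presence_absence` are all hypothetical, and no substitution model is fitted here.

```python
def presence_absence(families, species):
    """Build a binary gene-content matrix.

    families: dict mapping a family id to the species that hold at least one
              member of that family (toy input, an assumption of this sketch).
    species:  list of species labels, one matrix row per species.
    Returns the sorted family ids (column order) and a dict of 0/1 rows.
    """
    fam_ids = sorted(families)
    members = {fam: set(families[fam]) for fam in fam_ids}
    matrix = {
        sp: [1 if sp in members[fam] else 0 for fam in fam_ids]
        for sp in species
    }
    return fam_ids, matrix


# Toy data: three hypothetical species and three hypothetical gene families.
toy_families = {
    "fam_A": ["sp1", "sp2", "sp3"],
    "fam_B": ["sp1", "sp2"],
    "fam_C": ["sp3"],
}
columns, rows = presence_absence(toy_families, ["sp1", "sp2", "sp3"])
```

Whether families are delimited by orthology (paralogs excluded) or by homology (paralogs included) changes which species end up in each family, and therefore changes the columns of such a matrix; that choice is the contrast the abstract draws.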
Affiliation(s)
- Walker Pett
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA
| | - Marcin Adamski
- Computational Biology and Bioinformatics Unit, Research School of Biology, The Australian National University, Canberra, Australia
| | - Maja Adamska
- Computational Biology and Bioinformatics Unit, Research School of Biology, The Australian National University, Canberra, Australia
| | - Warren R Francis
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Michael Eitel
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Davide Pisani
- School of Earth Sciences, University of Bristol, Bristol, United Kingdom.,School of Biological Sciences, University of Bristol, Bristol, United Kingdom
| | - Gert Wörheide
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany.,SNSB-Bayerische Staatssammlung für Paläontologie und Geologie, München, Germany
| |
|
41
|
Opazo JC, Kuraku S, Zavala K, Toloza-Villalobos J, Hoffmann FG. Evolution of nodal and nodal-related genes and the putative composition of the heterodimers that trigger the nodal pathway in vertebrates. Evol Dev 2019; 21:205-217. [PMID: 31210006 DOI: 10.1111/ede.12292] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 11/30/2018] [Revised: 04/03/2019] [Accepted: 05/13/2019] [Indexed: 02/06/2023]
Abstract
Nodal is a signaling molecule that belongs to the transforming growth factor-β superfamily and plays key roles during the early stages of development of animals. In vertebrates, Nodal forms a heterodimer with a GDF1/3 protein to activate the Nodal pathway. Vertebrates have a paralog of nodal in their genomes labeled Nodal-related, but the evolutionary history of these genes is a matter of debate, mainly because of the presence of a variable number of genes in the vertebrate genomes sequenced so far. Thus, the goal of this study was to investigate the evolutionary history of the Nodal and Nodal-related genes with an emphasis on tracking changes in the number of genes among vertebrates. Our results show the presence of two gene lineages (Nodal and Nodal-related) that can be traced back to the ancestor of jawed vertebrates. These lineages have undergone processes of differential retention and lineage-specific expansions. Our results imply that Nodal and Nodal-related duplicated at the latest in the ancestor of gnathostomes, and they still retain a significant level of functional redundancy. By comparing the evolution of the Nodal/Nodal-related with GDF1/3 gene family, it is possible to infer that there are several types of heterodimers that can trigger the Nodal pathway among vertebrates.
Affiliation(s)
- Juan C Opazo
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Shigehiro Kuraku
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research (BDR), Kobe, Japan
| | - Kattina Zavala
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Jessica Toloza-Villalobos
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Federico G Hoffmann
- Department of Biochemistry, Molecular Biology, Entomology, and Plant Pathology, Mississippi State University, Starkville, Mississippi.,Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, Mississippi
| |
|
42
|
Mixão V, Hansen AP, Saus E, Boekhout T, Lass-Florl C, Gabaldón T. Whole-Genome Sequencing of the Opportunistic Yeast Pathogen Candida inconspicua Uncovers Its Hybrid Origin. Front Genet 2019; 10:383. [PMID: 31105748 PMCID: PMC6494940 DOI: 10.3389/fgene.2019.00383] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Received: 02/12/2019] [Accepted: 04/09/2019] [Indexed: 12/02/2022] Open
Abstract
Fungal infections such as those caused by Candida species are increasingly common complications in immunocompromised patients. The list of causative agents of candidiasis is growing and comprises a set of emerging species whose relative global incidence is rare but recurrent. This is the case of Candida inconspicua, whose prevalence has increased 10-fold over recent years. To gain novel insights into the emergence of this opportunistic pathogen and its genetic diversity, we performed whole genome sequencing of the type strain (CBS180), and of 10 other clinical isolates. Our results revealed high levels of genetic heterozygosity structured in non-homogeneous patterns, which are indicative of a hybrid genome shaped by events of loss of heterozygosity (LOH). All analyzed strains were hybrids and could be clustered into two distinct clades. We found large variability across strains in terms of ploidy, patterns of LOH, and mitochondrial genome heterogeneity that suggest potential admixture between hybrids. Altogether, our results identify a new hybrid species with virulence potential toward humans and underscore the potential role of hybridization in the emergence of novel pathogenic lineages.
Affiliation(s)
- Verónica Mixão
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain.,Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Antonio Perez Hansen
- Division of Hygiene and Medical Microbiology, Innsbruck Medical University, Innsbruck, Austria
| | - Ester Saus
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain.,Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Teun Boekhout
- Westerdijk Fungal Biodiversity Institute, Utrecht, Netherlands.,Institute of Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, Netherlands
| | - Cornelia Lass-Florl
- Division of Hygiene and Medical Microbiology, Innsbruck Medical University, Innsbruck, Austria
| | - Toni Gabaldón
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain.,Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| |
|
43
|
Laumer CE. Inferring Ancient Relationships with Genomic Data: A Commentary on Current Practices. Integr Comp Biol 2019; 58:623-639. [PMID: 29982611 DOI: 10.1093/icb/icy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Indexed: 01/10/2023] Open
Abstract
Contemporary phylogeneticists enjoy an embarrassment of riches, not only in the volumes of data now available, but also in the diversity of bioinformatic tools for handling these data. Here, I discuss a subset of these tools I consider well-suited to the task of inferring ancient relationships with coding sequence data in particular, encompassing data generation, orthology assignment, alignment and gene tree inference, supermatrix construction, and analysis under the best-fitting models applicable to large-scale datasets. Throughout, I compare and critique methods, considering both their theoretical principles and the details of their implementation, and offering practical tips on usage where appropriate. I also entertain different motivations for analyzing what are almost always originally DNA sequence data as codons, amino acids, and higher-order recodings. Although presented in a linear order, I see value in using the diversity of tools available to us to assess the sensitivity of clades of biological interest to different gene and taxon sets and analytical modes, which can be an indication of the presence of systematic error, of which a few forms remain poorly controlled by even the best available inference methods.
Affiliation(s)
- Christopher E Laumer
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, EMBL-EBI South Building, Hinxton CB10 1SD, UK
| |
|
44
|
Olsson S, Pinosio S, González-Martínez SC, Abascal F, Mayol M, Grivet D, Vendramin GG. De novo assembly of English yew (Taxus baccata) transcriptome and its applications for intra- and inter-specific analyses. Plant Mol Biol 2018; 97:337-345. [PMID: 29850988 DOI: 10.1007/s11103-018-0742-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 03/07/2018] [Accepted: 05/25/2018] [Indexed: 06/08/2023]
Abstract
We provide novel genomic resources for Taxus baccata in the form of a reference transcriptome, SSR and SNP markers, and orthologous single-copy genes, useful for phylogenomic and population genomic applications. English yew (T. baccata) is the only European representative of the Taxaceae family, a conifer group that originated in the Jurassic period. The wide extent of environmental heterogeneity within the species' range, together with its long presence in Europe, makes English yew an ideal species to investigate adaptive evolution in conifers. To enlarge the genomic resources available for this species, we used Illumina short read sequencing followed by de novo assembly to build the transcriptome of English yew. In addition to a fully annotated transcriptome as well as large sets of new potential SSR and SNP markers for T. baccata, we provide a data set of orthologous single-copy genes across three Taxus species using Picea sitchensis as outgroup, and discuss ortholog uses and limitations for phylogenomic and population genomic applications.
Affiliation(s)
- Sanna Olsson
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA-CIFOR, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Sara Pinosio
- Istituto di Genomica Applicata (IGA), Via J. Linussio, 51, 33100, Udine, Italy
- Division of Florence, Institute of Biosciences and Bioresources, National Research Council, 50019, Sesto Fiorentino, FI, Italy
| | - Santiago C González-Martínez
- UMR BIOGECO, INRA, University of Bordeaux, Cestas, France
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain
- CREAF, E08193 Bellaterra (Cerdanyola del Vallès), Catalonia, Spain
| | - Federico Abascal
- Human Genetics Department, Sandhu Group, Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Maria Mayol
- CREAF, E08193 Bellaterra (Cerdanyola del Vallès), Catalonia, Spain
| | - Delphine Grivet
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA-CIFOR, Carretera de la Coruña km 7.5, 28040, Madrid, Spain.
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain.
| | - Giovanni G Vendramin
- Division of Florence, Institute of Biosciences and Bioresources, National Research Council, 50019, Sesto Fiorentino, FI, Italy
| |
|
45
|
Catalán A, Macias-Muñoz A, Briscoe AD. Evolution of Sex-Biased Gene Expression and Dosage Compensation in the Eye and Brain of Heliconius Butterflies. Mol Biol Evol 2018; 35:2120-2134. [DOI: 10.1093/molbev/msy111] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Indexed: 12/13/2022] Open
Affiliation(s)
- Ana Catalán
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA
- Section of Evolutionary Biology, Department of Biology II, Ludwig Maximilians Universität, Planegg-Martinsried, Germany
| | - Aide Macias-Muñoz
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA
| | - Adriana D Briscoe
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA
| |
|
46
|
Walker JF, Yang Y, Feng T, Timoneda A, Mikenas J, Hutchison V, Edwards C, Wang N, Ahluwalia S, Olivieri J, Walker-Hale N, Majure LC, Puente R, Kadereit G, Lauterbach M, Eggli U, Flores-Olvera H, Ochoterena H, Brockington SF, Moore MJ, Smith SA. From cacti to carnivores: Improved phylotranscriptomic sampling and hierarchical homology inference provide further insight into the evolution of Caryophyllales. Am J Bot 2018; 105:446-462. [PMID: 29738076 DOI: 10.1002/ajb2.1069] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Received: 10/13/2017] [Accepted: 01/04/2018] [Indexed: 05/27/2023]
Abstract
PREMISE OF THE STUDY: The Caryophyllales contain ~12,500 species and are known for their cosmopolitan distribution, convergence of trait evolution, and extreme adaptations. Some relationships within the Caryophyllales, like those of many large plant clades, remain unclear, and phylogenetic studies often recover alternative hypotheses. We explore the utility of broad and dense transcriptome sampling across the order for resolving evolutionary relationships in Caryophyllales. METHODS: We generated 84 transcriptomes and combined these with 224 publicly available transcriptomes to perform a phylogenomic analysis of Caryophyllales. To overcome the computational challenge of ortholog detection in such a large data set, we developed an approach for clustering gene families that allowed us to analyze >300 transcriptomes and genomes. We then inferred the species relationships using multiple methods and performed gene-tree conflict analyses. KEY RESULTS: Our phylogenetic analyses resolved many clades with strong support, but also showed significant gene-tree discordance. This discordance is not only a common feature of phylogenomic studies, but also represents an opportunity to understand processes that have structured phylogenies. We also found taxon sampling influences species-tree inference, highlighting the importance of more focused studies with additional taxon sampling. CONCLUSIONS: Transcriptomes are useful both for species-tree inference and for uncovering evolutionary complexity within lineages. Through analyses of gene-tree conflict and multiple methods of species-tree inference, we demonstrate that phylogenomic data can provide unparalleled insight into the evolutionary history of Caryophyllales. We also discuss a method for overcoming computational challenges associated with homolog clustering in large data sets.
Affiliation(s)
- Joseph F Walker
- Department of Ecology & Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI, 48109-1048, USA
| | - Ya Yang
- Department of Plant and Microbial Biology, University of Minnesota-Twin Cities, 1445 Gortner Avenue, St. Paul, MN, 55108, USA
| | - Tao Feng
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Alfonso Timoneda
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Jessica Mikenas
- Department of Biology, Oberlin College, Science Center K111, 119 Woodland Street, Oberlin, OH, 44074-1097, USA
| | - Vera Hutchison
- Department of Biology, Oberlin College, Science Center K111, 119 Woodland Street, Oberlin, OH, 44074-1097, USA
| | - Caroline Edwards
- Department of Biology, Oberlin College, Science Center K111, 119 Woodland Street, Oberlin, OH, 44074-1097, USA
| | - Ning Wang
- Department of Ecology & Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI, 48109-1048, USA
| | - Sonia Ahluwalia
- Department of Ecology & Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI, 48109-1048, USA
| | - Julia Olivieri
- Department of Biology, Oberlin College, Science Center K111, 119 Woodland Street, Oberlin, OH, 44074-1097, USA
- Institute of Computational and Mathematical Engineering (ICME), Stanford University, 475 Via Ortega, Suite B060, Stanford, CA, 94305-4042, USA
| | - Nathanael Walker-Hale
- School of Biological Sciences, Victoria University of Wellington, Kelburn Parade, Kelburn, Wellington, 6012, New Zealand
| | - Lucas C Majure
- Department of Research, Conservation and Collections, Desert Botanical Garden, 1201 N. Galvin Pkwy, Phoenix, AZ, 85008, USA
| | - Raúl Puente
- Department of Research, Conservation and Collections, Desert Botanical Garden, 1201 N. Galvin Pkwy, Phoenix, AZ, 85008, USA
| | - Gudrun Kadereit
- Institut für Molekulare Physiologie, Johannes Gutenberg-Universität Mainz, D-55099, Mainz, Germany
- Institut für Molekulare und Organismische Evolutionsbiologie, Johannes Gutenberg-Universität Mainz, D-55099, Mainz, Germany
| | - Maximilian Lauterbach
- Institut für Molekulare Physiologie, Johannes Gutenberg-Universität Mainz, D-55099, Mainz, Germany
- Institut für Molekulare und Organismische Evolutionsbiologie, Johannes Gutenberg-Universität Mainz, D-55099, Mainz, Germany
| | - Urs Eggli
- Sukkulenten-Sammlung Zürich / Grün Stadt Zürich, Mythenquai 88, CH-8002, Zürich, Switzerland
| | - Hilda Flores-Olvera
- Departamento de Botánica, Universidad Nacional Autónoma de México, Apartado, Postal 70-367, 04510, Mexico City, Mexico
| | - Helga Ochoterena
- Departamento de Botánica, Universidad Nacional Autónoma de México, Apartado, Postal 70-367, 04510, Mexico City, Mexico
| | | | - Michael J Moore
- Department of Biology, Oberlin College, Science Center K111, 119 Woodland Street, Oberlin, OH, 44074-1097, USA
| | - Stephen A Smith
- Department of Ecology & Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI, 48109-1048, USA
| |
|
47
|
Mallo D, Posada D. Multilocus inference of species trees and DNA barcoding. Philos Trans R Soc Lond B Biol Sci 2017; 371:rstb.2015.0335. [PMID: 27481787 PMCID: PMC4971187 DOI: 10.1098/rstb.2015.0335] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Accepted: 04/10/2016] [Indexed: 11/30/2022] Open
Abstract
The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss, and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree—gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or nonexistent. This article is part of the themed issue ‘From DNA barcodes to biomes’.
Affiliation(s)
- Diego Mallo
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain
| |
|
48
|
Jahangiri-Tazehkand S, Wong L, Eslahchi C. OrthoGNC: A Software for Accurate Identification of Orthologs Based on Gene Neighborhood Conservation. Genomics Proteomics Bioinformatics 2017; 15:361-370. [PMID: 29133277 PMCID: PMC5828658 DOI: 10.1016/j.gpb.2017.07.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 04/22/2017] [Revised: 07/17/2017] [Accepted: 07/28/2017] [Indexed: 11/17/2022]
Abstract
Orthology relations can be used to transfer annotations from one gene (or protein) to another. Hence, detecting orthology relations has become an important task in the post-genomic era. Various genomic events, such as duplication and horizontal gene transfer, can cause erroneous assignment of orthology relations. In closely related species, gene neighborhood information can be used to resolve many ambiguities in orthology inference. Here we present OrthoGNC, a software tool for accurately predicting pairwise orthology relations based on gene neighborhood conservation. Analyses on simulated and real data reveal the high accuracy of OrthoGNC. In addition to orthology detection, OrthoGNC can be employed to investigate the conservation of genomic context among potential orthologs detected by other methods. OrthoGNC is freely available online at http://bs.ipm.ir/softwares/orthognc and http://tinyurl.com/orthoGNC.
Affiliation(s)
| | - Limsoon Wong
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Changiz Eslahchi
- Department of Computer Science, Shahid Beheshti University, Tehran 1983969411, Iran.
| |
|
49
|
Bravo-Ruiz G, Sassi AH, Marcet-Houben M, Di Pietro A, Gargouri A, Gabaldon T, Roncero MIG. Regulatory Mechanisms of a Highly Pectinolytic Mutant of Penicillium occitanis and Functional Analysis of a Candidate Gene in the Plant Pathogen Fusarium oxysporum. Front Microbiol 2017; 8:1627. [PMID: 28951729 PMCID: PMC5599776 DOI: 10.3389/fmicb.2017.01627] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 05/23/2017] [Accepted: 08/10/2017] [Indexed: 11/24/2022] Open
Abstract
Penicillium occitanis is a model system for enzymatic regulation. A mutant strain exhibiting constitutive overproduction of different pectinolytic enzymes under both inducing (pectin) and repressing (glucose) conditions was previously isolated after chemical mutagenesis. In order to identify the molecular basis of this regulatory mechanism, the genomes of the wild type and the derived mutant strain were sequenced and compared, providing the first reference genome for this species. We used a phylogenomic approach to compare P. occitanis with other pectinolytic fungi and to trace expansions of gene families involved in carbohydrate degradation. Genome comparison between wild type and mutant identified seven mutations associated with predicted proteins. The most likely candidate was a mutation in a highly conserved serine residue of a conserved fungal protein containing a GAL4-like Zn2Cys6 binuclear cluster DNA-binding domain and a fungus-specific transcription factor regulatory middle homology region. To functionally characterize the role of this candidate gene, the mutation was recapitulated in the predicted orthologue in Fusarium oxysporum, a vascular wilt pathogen which secretes a wide array of plant cell wall degrading enzymes, including polygalacturonases, pectate lyases, xylanases and proteases, all of which contribute to infection. However, neither the null mutant nor a mutant carrying the analogous point mutation exhibited a deregulation of pectinolytic enzymes. The availability, annotation and phylogenomic analysis of the P. occitanis genome sequence represent an important resource for understanding the evolution and biology of this species, and set the basis for the discovery of new genes of biotechnological interest for the degradation of complex polysaccharides.
Affiliation(s)
- Gustavo Bravo-Ruiz
- Departamento de Genetica, Universidad de Cordoba and Campus de Excelencia Agroalimentario (ceiA3), Cordoba, Spain
| | - Azza Hadj Sassi
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Marina Marcet-Houben
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Antonio Di Pietro
- Departamento de Genetica, Universidad de Cordoba and Campus de Excelencia Agroalimentario (ceiA3), Cordoba, Spain
| | - Ali Gargouri
- Laboratoire de Biotechnologie Moléculaire des Eucaryotes, Centre de Biotechnologie de Sfax, Sfax, Tunisia
| | - Toni Gabaldon
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
- Institucio Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| | - M. Isabel G. Roncero
- Departamento de Genetica, Universidad de Cordoba and Campus de Excelencia Agroalimentario (ceiA3), Cordoba, Spain
| |
|
50
|
Céspedes HA, Zavala K, Vandewege MW, Opazo JC. Evolution of the α 2-adrenoreceptors in vertebrates: ADRA2D is absent in mammals and crocodiles. Gen Comp Endocrinol 2017. [PMID: 28622977 DOI: 10.1016/j.ygcen.2017.06.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Indexed: 02/04/2023]
Abstract
Evolutionary studies of genes that have been functionally characterized and whose variation has been associated with pathological conditions represent an opportunity to understand the genetic basis of pathologies. α2-Adrenoreceptors (ADRA2) are a class of G protein-coupled receptors that regulate several physiological processes including blood pressure, platelet aggregation, insulin secretion, lipolysis, and neurotransmitter release. This gene family has been extensively studied from a molecular/physiological perspective, yet much less is known about its evolutionary history. Accordingly, the goal of this study was to investigate the evolutionary history of α2-adrenoreceptors (ADRA2) in vertebrates. Our results show that in addition to the three well-recognized α2-adrenoreceptor genes (ADRA2A, ADRA2B and ADRA2C), we recovered a clade that corresponds to the fourth member of the α2-adrenoreceptor gene family (ADRA2D). We also recovered a clade that possesses two ADRA2 sequences found in two lamprey species. Furthermore, our results show that mammals and crocodiles are characterized by possessing three α2-adrenoreceptor genes, whereas all other vertebrate groups possess the full repertoire of α2-adrenoreceptor genes. Among vertebrates, ADRA2D seems to be a dispensable gene, as it was lost independently twice during the evolutionary history of the group. Additionally, we found that most examined species possess the most common alleles described for humans; however, there are cases in which non-human mammals possess the alternative variant. Finally, transcript abundance profiles revealed that during the early evolutionary history of gnathostomes, the expression of ADRA2D in different taxonomic groups became specialized to different tissues, but in the ancestor of sarcopterygians this specialization would have been lost.
Affiliation(s)
- Héctor A Céspedes
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Kattina Zavala
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Michael W Vandewege
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
| | - Juan C Opazo
- Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile; David Rockefeller Center For Latin American Studies, Harvard University, Cambridge, MA 02138, USA.
| |
|