1
|
Barrios-Núñez I, Martínez-Redondo G, Medina-Burgos P, Cases I, Fernández R, Rojas A. Decoding functional proteome information in model organisms using protein language models. NAR Genom Bioinform 2024; 6:lqae078. [PMID: 38962255 PMCID: PMC11217674 DOI: 10.1093/nargab/lqae078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 05/31/2024] [Accepted: 06/26/2024] [Indexed: 07/05/2024] Open
Abstract
Protein language models have been tested and proved to be reliable when used on curated datasets but have not yet been applied to full proteomes. Accordingly, we tested how two different machine learning-based methods performed when decoding functional information from the proteomes of selected model organisms. We found that protein language models are more precise and informative than deep learning methods for all the species tested and across the three gene ontologies studied, and that they better recover functional information from transcriptomic experiments. The results obtained indicate that these language models are likely to be suitable for large-scale annotation and downstream analyses, and we recommend a guide for their use.
Collapse
Affiliation(s)
- Israel Barrios-Núñez
- Computational Biology and Bioinformatics Group, Andalusian Center for Developmental Biology (CABD-CSIC), 41013 Sevilla, Spain
| | | | - Patricia Medina-Burgos
- Computational Biology and Bioinformatics Group, Andalusian Center for Developmental Biology (CABD-CSIC), 41013 Sevilla, Spain
| | - Ildefonso Cases
- Bioinformatics Unit, Andalusian Center for Developmental Biology (CABD-CSIC), 41013 Sevilla, Spain
| | - Rosa Fernández
- Metazoa Phylogenomics Lab, Institute of Evolutionary Biology (CSIC-UPF), 08003 Barcelona, Spain
| | - Ana M Rojas
- Computational Biology and Bioinformatics Group, Andalusian Center for Developmental Biology (CABD-CSIC), 41013 Sevilla, Spain
| |
Collapse
|
2
|
Lin RC, Ferreira BT, Yuan YW. The molecular basis of phenotypic evolution: beyond the usual suspects. Trends Genet 2024; 40:668-680. [PMID: 38704304 PMCID: PMC11303103 DOI: 10.1016/j.tig.2024.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 04/15/2024] [Accepted: 04/17/2024] [Indexed: 05/06/2024]
Abstract
It has been well documented that mutations in coding DNA or cis-regulatory elements underlie natural phenotypic variation in many organisms. However, the development of sophisticated functional tools in recent years in a wide range of traditionally non-model systems have revealed many 'unusual suspects' in the molecular bases of phenotypic evolution, including upstream open reading frames (uORFs), cryptic splice sites, and small RNAs. Furthermore, large-scale genome sequencing, especially long-read sequencing, has identified a cornucopia of structural variation underlying phenotypic divergence and elucidated the composition of supergenes that control complex multi-trait polymorphisms. In this review article we highlight recent studies that demonstrate this great diversity of molecular mechanisms producing adaptive genetic variation and the panoply of evolutionary paths leading to the 'grandeur of life'.
Collapse
Affiliation(s)
- Rong-Chien Lin
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
| | - Bianca T Ferreira
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
| | - Yao-Wu Yuan
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA.
| |
Collapse
|
3
|
Gouda MNR, D S, Gaikwad K, Prabha R, Kumar A, Subramanian S. Elucidation of ejaculatory bulb proteins in Bemisia tabaci Asia-1 and Asia II-1 and confirmation of their mating transfer via RNAi. Mol Biol Rep 2024; 51:861. [PMID: 39068620 DOI: 10.1007/s11033-024-09816-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 07/22/2024] [Indexed: 07/30/2024]
Abstract
BACKGROUND Bemisia tabaci, a significant agricultural pest in Asia, contains distinct genetic groups, Asia-1 and Asia II-1. Understanding its reproductive biology, particularly the role of ejaculatory bulb proteins (EBPs) in mating, is crucial. However, EBPs in B. tabaci were not well characterised until this study. METHODS AND RESULTS The EBPs have been characterised in the Asia-1 and Asia II-1 genetic groups of the whitefly B. tabaci, prevalent in Asia. The transcriptomic analysis yielded over 40,000,000 and 30,000,000 annotated transcripts, respectively, from Asia II-1 and Asia-1. Differential gene expression revealed the presence of 270 upregulated and 198 downregulated genes, with significant differences between these two genetic groups. Orphan genes (1992 numbers) were identified in both genetic groups. We report, for the first time, full-length sequences of EBP genes from B. tabaci. The 10 EBPs each deduced in B. tabaci Asia-1 and Asia II-1 are structurally akin to chemosensory proteins having four conserved cysteine residues. Additionally, we did domain analysis, protein structure prediction, mapping of these EBPs in the chromosomes of B. tabaci, and phylogenetic analysis to track their evolutionary lineage. We have specifically demonstrated the transfer of EBPs from males to females during mating using qPCR and further validated the transfer of EBPs through RNAi. Specifically, we targeted the highly expressed EBPs (EBP-3, 7, and 8 in BtAsia1; EBP-8, 9, and 10 in BtAsia II-1) through feeding bioassays of dsRNAs. Tracking by qPCR revealed that the females, when mated with dsRNA-treated males, did not show expression of the specific EBP, suggesting that the silencing of these genes in males hinders the transfer of EBP to females during mating. CONCLUSION Our findings provide novel insights into the genomic contours of EBPs in B. tabaci and underscore the potential of RNAi-based strategies for pest management by disrupting the reproductive processes.
Collapse
Affiliation(s)
- M N Rudra Gouda
- Division of Entomology, Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Sagar D
- Division of Genomic Resources, ICAR-National Bureau of Agricultural Insect Resources, Bengaluru, 275103, India
| | - Kishor Gaikwad
- Division of Molecular Biology and Biotechnology, ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India
| | - Ratna Prabha
- Agricultural Knowledge Management Unit (AKMU), ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Amrender Kumar
- Agricultural Knowledge Management Unit (AKMU), ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | | |
Collapse
|
4
|
Vakirlis N, Kupczok A. Large-scale investigation of species-specific orphan genes in the human gut microbiome elucidates their evolutionary origins. Genome Res 2024; 34:888-903. [PMID: 38977308 PMCID: PMC11293555 DOI: 10.1101/gr.278977.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 06/12/2024] [Indexed: 07/10/2024]
Abstract
Species-specific genes, also known as orphans, are ubiquitous across life's domains. In prokaryotes, species-specific orphan genes (SSOGs) are mostly thought to originate in external elements such as viruses followed by horizontal gene transfer, whereas the scenario of native origination, through rapid divergence or de novo, is mostly dismissed. However, quantitative evidence supporting either scenario is lacking. Here, we systematically analyzed genomes from 4644 human gut microbiome species and identified more than 600,000 unique SSOGs, representing an average of 2.6% of a given species' pangenome. These sequences are mostly rare within each species yet show signs of purifying selection. Overall, SSOGs use optimal codons less frequently, and their proteins are more disordered than those of conserved genes (i.e., non-SSOGs). Importantly, across species, the GC content of SSOGs closely matches that of conserved ones. In contrast, the ∼5% of SSOGs that share similarity to known viral sequences have distinct characteristics, including lower GC content. Thus, SSOGs with similarity to viruses differ from the remaining SSOGs, contrasting an external origination scenario for most of them. By examining the orthologous genomic region in closely related species, we show that a small subset of SSOGs likely evolved natively de novo and find that these genes also differ in their properties from the remaining SSOGs. Our results challenge the notion that external elements are the dominant source of prokaryotic genetic novelty and will enable future studies into the biological role and relevance of species-specific genes in the human gut.
Collapse
Affiliation(s)
- Nikolaos Vakirlis
- Institute For Fundamental Biomedical Research, B.S.R.C. "Alexander Fleming," Vari 166 72, Greece;
- Institute for General Microbiology, Kiel University, 24118 Kiel, Germany
| | - Anne Kupczok
- Bioinformatics Group, Wageningen University, 6700 PB Wageningen, The Netherlands
| |
Collapse
|
5
|
Skalon EK, Starunov VV, Slyusarev GS. RNA-seq analysis of parasitism by Intoshia linei (Orthonectida) reveals protein effectors of defence, communication, feeding and growth. JOURNAL OF EXPERIMENTAL ZOOLOGY. PART B, MOLECULAR AND DEVELOPMENTAL EVOLUTION 2024; 342:398-405. [PMID: 38369898 DOI: 10.1002/jez.b.23247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 01/22/2024] [Accepted: 01/30/2024] [Indexed: 02/20/2024]
Abstract
Orthonectida is a group of multicellular endoparasites of a wide range of marine invertebrates. Their parasitic stage is a multinuclear shapeless plasmodium infiltrating host tissues. The development of the following worm-like sexual generation takes place within the cytoplasm of the plasmodium. The existence of the plasmodial stage and the development of a sexual stage within the plasmodium are unique features to Bilateria. However, the molecular mechanisms that maintain this peculiar organism, and hence enable parasitism in orthonectids, are unknown. Here, we present the first-ever RNA-seq analysis of the plasmodium, aimed at the identification and characterization of the plasmodium-specific protein-coding genes and corresponding hypothetical proteins that distinguish the parasitic plasmodium stage from the sexual stage of the orthonectid Intoshia linei Giard, 1877, parasite of nemertean Lineus ruber Müller, 1774. We discovered 119 plasmodium-specific proteins, 82 of which have inferred functions based on known domains. Thirty-five of the detected proteins are orphans, at least part of which may reflect the unique evolutionary adaptations of orthonectids to parasitism. Some of the identified proteins are known effector molecules of other endoparasites suggesting convergence. Our data indicate that the plasmodium-specific proteins might be involved in the plasmodium defense against the host, host-parasite communication, feeding and nutrient uptake, growth within the host, and support of the sexual stage development. These molecular processes in orthonectids have not been described before, and the particular protein effectors remained unknown until now.
Collapse
Affiliation(s)
- Elizaveta K Skalon
- Department of Invertebrate Zoology, Faculty of Biology, St Petersburg University, St. Petersburg, Russia
- Zoological Institute, Russian Academy of Sciences, St. Petersburg, Russia
| | - Viktor V Starunov
- Department of Invertebrate Zoology, Faculty of Biology, St Petersburg University, St. Petersburg, Russia
- Zoological Institute, Russian Academy of Sciences, St. Petersburg, Russia
| | - George S Slyusarev
- Department of Invertebrate Zoology, Faculty of Biology, St Petersburg University, St. Petersburg, Russia
| |
Collapse
|
6
|
Klimovich A, Bosch TCG. Novel technologies uncover novel 'anti'-microbial peptides in Hydra shaping the species-specific microbiome. Philos Trans R Soc Lond B Biol Sci 2024; 379:20230058. [PMID: 38497265 PMCID: PMC10945409 DOI: 10.1098/rstb.2023.0058] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 11/16/2023] [Indexed: 03/19/2024] Open
Abstract
The freshwater polyp Hydra uses an elaborate innate immune machinery to maintain its specific microbiome. Major components of this toolkit are conserved Toll-like receptor (TLR)-mediated immune pathways and species-specific antimicrobial peptides (AMPs). Our study harnesses advanced technologies, such as high-throughput sequencing and machine learning, to uncover a high complexity of the Hydra's AMPs repertoire. Functional analysis reveals that these AMPs are specific against diverse members of the Hydra microbiome and expressed in a spatially controlled pattern. Notably, in the outer epithelial layer, AMPs are produced mainly in the neurons. The neuron-derived AMPs are secreted directly into the glycocalyx, the habitat for symbiotic bacteria, and display high selectivity and spatial restriction of expression. In the endodermal layer, in contrast, endodermal epithelial cells produce an abundance of different AMPs including members of the arminin and hydramacin families, while gland cells secrete kazal-type protease inhibitors. Since the endodermal layer lines the gastric cavity devoid of symbiotic bacteria, we assume that endodermally secreted AMPs protect the gastric cavity from intruding pathogens. In conclusion, Hydra employs a complex set of AMPs expressed in distinct tissue layers and cell types to combat pathogens and to maintain a stable spatially organized microbiome. This article is part of the theme issue 'Sculpting the microbiome: how host factors determine and respond to microbial colonization'.
Collapse
Affiliation(s)
- Alexander Klimovich
- Zoological Institute, Christian-Albrechts University of Kiel, Am Botanischen Garten 1-9, Kiel 24118, Germany
| | - Thomas C. G. Bosch
- Zoological Institute, Christian-Albrechts University of Kiel, Am Botanischen Garten 1-9, Kiel 24118, Germany
| |
Collapse
|
7
|
uz-Zaman MH, D’Alton S, Barrick JE, Ochman H. Promoter recruitment drives the emergence of proto-genes in a long-term evolution experiment with Escherichia coli. PLoS Biol 2024; 22:e3002418. [PMID: 38713714 PMCID: PMC11101190 DOI: 10.1371/journal.pbio.3002418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 05/17/2024] [Accepted: 04/18/2024] [Indexed: 05/09/2024] Open
Abstract
The phenomenon of de novo gene birth-the emergence of genes from non-genic sequences-has received considerable attention due to the widespread occurrence of genes that are unique to particular species or genomes. Most instances of de novo gene birth have been recognized through comparative analyses of genome sequences in eukaryotes, despite the abundance of novel, lineage-specific genes in bacteria and the relative ease with which bacteria can be studied in an experimental context. Here, we explore the genetic record of the Escherichia coli long-term evolution experiment (LTEE) for changes indicative of "proto-genic" phases of new gene birth in which non-genic sequences evolve stable transcription and/or translation. Over the time span of the LTEE, non-genic regions are frequently transcribed, translated and differentially expressed, with levels of transcription across low-expressed regions increasing in later generations of the experiment. Proto-genes formed downstream of new mutations result either from insertion element activity or chromosomal translocations that fused preexisting regulatory sequences to regions that were not expressed in the LTEE ancestor. Additionally, we identified instances of proto-gene emergence in which a previously unexpressed sequence was transcribed after formation of an upstream promoter, although such cases were rare compared to those caused by recruitment of preexisting promoters. Tracing the origin of the causative mutations, we discovered that most occurred early in the history of the LTEE, often within the first 20,000 generations, and became fixed soon after emergence. Our findings show that proto-genes emerge frequently within evolving populations, can persist stably, and can serve as potential substrates for new gene formation.
Collapse
Affiliation(s)
- Md. Hassan uz-Zaman
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, United States of America
| | - Simon D’Alton
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, United States of America
| | - Jeffrey E. Barrick
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, United States of America
| | - Howard Ochman
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
8
|
Domazet-Lošo M, Široki T, Šimičević K, Domazet-Lošo T. Macroevolutionary dynamics of gene family gain and loss along multicellular eukaryotic lineages. Nat Commun 2024; 15:2663. [PMID: 38531970 DOI: 10.1038/s41467-024-47017-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 03/11/2024] [Indexed: 03/28/2024] Open
Abstract
The gain and loss of genes fluctuate over evolutionary time in major eukaryotic clades. However, the full profile of these macroevolutionary trajectories is still missing. To give a more inclusive view on the changes in genome complexity across the tree of life, here we recovered the evolutionary dynamics of gene family gain and loss ranging from the ancestor of cellular organisms to 352 eukaryotic species. We show that in all considered lineages the gene family content follows a common evolutionary pattern, where the number of gene families reaches the highest value at a major evolutionary and ecological transition, and then gradually decreases towards extant organisms. This supports theoretical predictions and suggests that the genome complexity is often decoupled from commonly perceived organismal complexity. We conclude that simplification by gene family loss is a dominant force in Phanerozoic genomes of various lineages, probably underpinned by intense ecological specializations and functional outsourcing.
Collapse
Affiliation(s)
- Mirjana Domazet-Lošo
- Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000, Zagreb, Croatia.
| | - Tin Široki
- Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000, Zagreb, Croatia
| | - Korina Šimičević
- Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000, Zagreb, Croatia
| | - Tomislav Domazet-Lošo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia.
- School of Medicine, Catholic University of Croatia, Ilica 242, HR-10000, Zagreb, Croatia.
| |
Collapse
|
9
|
Darragh K, Ramírez SR. The transcriptomic signature of adaptations associated with perfume collection in orchid bees. J Evol Biol 2024; 37:141-151. [PMID: 38271116 DOI: 10.1093/jeb/voad012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 09/11/2023] [Accepted: 11/23/2023] [Indexed: 01/27/2024]
Abstract
Secondary sexual traits can convey information on mate quality with the signal honesty maintained by the costly nature of trait expression. Mating signals are also often underpinned by physiological, morphological, and behavioural adaptations, which may require the evolution of novelty, but the genetic basis in many cases is unknown. In orchid bees, males acquire chemical compounds from the environment that act as pheromone-like bouquets (perfumes) during courtship displays. This process could be costly, potentially due to the cognitive demands of learning and the physiological demands of collecting a mix of extrinsic chemical compounds that may require detoxification. Furthermore, a novel trait, a specialized perfume pouch in the hind leg, is required for compound storage. We studied gene expression in the brain, hind leg, and Malpighian tubules-a tissue involved in detoxification-to investigate changes in gene expression following perfume collection. We detected upregulation of genes enriched in functions related to transcription, odorant binding, and receptor activity in the Malpighian tubules. On the other hand, we did not find any evidence for learning processes following perfume collection, or gene expression changes in the hind leg, perhaps due to constitutive expression, or the age of the sampled bees. We did identify high expression of chemosensory proteins in the hind legs, which we suggest could play a role in perfume collection or storage, with further functional studies necessary to determine their binding properties and potential physiological importance. Los rasgos sexuales secundarios pueden servir como indicadores de calidad de la pareja, y en algunos casos la honestidad de la señal se mantiene por el costo de expresar el rasgo. A menudo las señales sexuales están respaldadas por adaptaciones fisiológicas, morfológicas y de comportamiento por lo tanto pueden requerir la evolución de nuevos rasgos, pero en muchos casos se desconoce la base genética. En las abejas de las orquídeas, los machos recolectan compuestos químicos del medio ambiente, los cuales actúan como feromonas (perfumes) durante el despliegue de cortejo. Este proceso podría ser costoso, posiblemente debido a las demandas cognitivas del aprendizaje y las demandas fisiológicas de recolectar una mezcla de compuestos químicos extrínsecos que pueden requerir desintoxicación. Además, se requiere la evolución de un contenedor para almacenar perfumes en la pata trasera. Para investigar los cambios en la expresión génica después de la recolección de perfume, estudiamos la expresión génica en el cerebro, la pata trasera y los túbulos de Malpighi (tejido involucrado en la desintoxicación). Encontramos varios genes regulados positivamente en los túbulos de Malpighi después de la recolección que están enriquecidos en factores de transcripción, proteínas de fijación de olores, y proteínas con actividad de receptor. Por otro lado, no encontramos ninguna evidencia de procesos de aprendizaje posteriores a la recolección de perfumes, o cambios en la expresión génica en la pata trasera, esto quizás debido a la expresión constitutiva o la edad de las abejas muestreadas. Además, identificamos una alta expresión de proteínas quimio-sensoriales en las patas traseras, que podría desempeñar un papel en la recolección o almacenamiento de perfumes. Más estudios funcionales son necesarios para determinar las propiedades de fijación de las proteínas y su potencial importancia fisiológica.
Collapse
Affiliation(s)
- Kathy Darragh
- Department of Evolution and Ecology, University of California, Davis, CA, United States
| | - Santiago R Ramírez
- Department of Evolution and Ecology, University of California, Davis, CA, United States
| |
Collapse
|
10
|
Goodheart JA, Rio RA, Taraporevala NF, Fiorenza RA, Barnes SR, Morrill K, Jacob MAC, Whitesel C, Masterson P, Batzel GO, Johnston HT, Ramirez MD, Katz PS, Lyons DC. A chromosome-level genome for the nudibranch gastropod Berghia stephanieae helps parse clade-specific gene expression in novel and conserved phenotypes. BMC Biol 2024; 22:9. [PMID: 38233809 PMCID: PMC10795318 DOI: 10.1186/s12915-024-01814-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/03/2024] [Indexed: 01/19/2024] Open
Abstract
BACKGROUND How novel phenotypes originate from conserved genes, processes, and tissues remains a major question in biology. Research that sets out to answer this question often focuses on the conserved genes and processes involved, an approach that explicitly excludes the impact of genetic elements that may be classified as clade-specific, even though many of these genes are known to be important for many novel, or clade-restricted, phenotypes. This is especially true for understudied phyla such as mollusks, where limited genomic and functional biology resources for members of this phylum have long hindered assessments of genetic homology and function. To address this gap, we constructed a chromosome-level genome for the gastropod Berghia stephanieae (Valdés, 2005) to investigate the expression of clade-specific genes across both novel and conserved tissue types in this species. RESULTS The final assembled and filtered Berghia genome is comparable to other high-quality mollusk genomes in terms of size (1.05 Gb) and number of predicted genes (24,960 genes) and is highly contiguous. The proportion of upregulated, clade-specific genes varied across tissues, but with no clear trend between the proportion of clade-specific genes and the novelty of the tissue. However, more complex tissue like the brain had the highest total number of upregulated, clade-specific genes, though the ratio of upregulated clade-specific genes to the total number of upregulated genes was low. CONCLUSIONS Our results, when combined with previous research on the impact of novel genes on phenotypic evolution, highlight the fact that the complexity of the novel tissue or behavior, the type of novelty, and the developmental timing of evolutionary modifications will all influence how novel and conserved genes interact to generate diversity.
Collapse
Affiliation(s)
- Jessica A Goodheart
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY, USA.
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA.
| | - Robin A Rio
- Bioengineering Department, Stanford University, Stanford, CA, USA
| | - Neville F Taraporevala
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Department of Wildland Resources, Utah State University, Logan, UT, USA
| | - Rose A Fiorenza
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Seth R Barnes
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Kevin Morrill
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Mark Allan C Jacob
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Carl Whitesel
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Park Masterson
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Grant O Batzel
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Hereroa T Johnston
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - M Desmond Ramirez
- Department of Biology, University of Massachusetts Amherst, Amherst, MA, USA
- Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | - Paul S Katz
- Department of Biology, University of Massachusetts Amherst, Amherst, MA, USA
| | - Deirdre C Lyons
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
11
|
Cheatle Jarvela AM, Wexler JR. Advances in genome sequencing reveal changes in gene content that contribute to arthropod macroevolution. Dev Genes Evol 2023; 233:59-76. [PMID: 37982820 DOI: 10.1007/s00427-023-00712-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 11/05/2023] [Indexed: 11/21/2023]
Abstract
Current sequencing technology allows for the relatively affordable generation of highly contiguous genomes. Technological advances have made it possible for researchers to investigate the consequences of diverse sorts of genomic variants, such as gene gain and loss. With the extraordinary number of high-quality genomes now available, we take stock of how these genomic variants impact phenotypic evolution. We take care to point out that the identification of genomic variants of interest is only the first step in understanding their impact. Painstaking lab or fieldwork is still required to establish causal relationships between genomic variants and phenotypic evolution. We focus mostly on arthropod research, as this phylum has an impressive degree of phenotypic diversity and is also the subject of much evolutionary genetics research. This article is intended to both highlight recent advances in the field and also to be a primer for learning about evolutionary genetics and genomics.
Collapse
Affiliation(s)
- Alys M Cheatle Jarvela
- Department of Entomology, University of Maryland, College Park, MD, USA.
- HHMI Janelia Research Campus, Ashburn, VA, USA.
| | - Judith R Wexler
- Department of Ecology, Evolution, and Behavior, The Hebrew University in Jerusalem, Jerusalem, Israel.
| |
Collapse
|
12
|
Fakhar AZ, Liu J, Pajerowska-Mukhtar KM, Mukhtar MS. The ORFans' tale: new insights in plant biology. TRENDS IN PLANT SCIENCE 2023; 28:1379-1390. [PMID: 37453923 DOI: 10.1016/j.tplants.2023.06.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 05/17/2023] [Accepted: 06/19/2023] [Indexed: 07/18/2023]
Abstract
Orphan genes (OGs) are protein-coding genes without a significant sequence similarity in closely related species. Despite their functional importance, very little is known about the underlying molecular mechanisms by which OGs participate in diverse biological processes. Here, we discuss the evolutionary mechanisms of OGs' emergence with relevance to species-specific adaptations. We also provide a mechanistic view of the involvement of OGs in multiple processes, including growth, development, reproduction, and carbon-metabolism-mediated immunity. We highlight the interconnection between OGs and the sucrose nonfermenting 1 (SNF1)-related protein kinases (SnRKs)-target of rapamycin (TOR) signaling axis for phytohormone signaling, nutrient metabolism, and stress responses. Finally, we propose a high-throughput pipeline for OGs' interspecies and intraspecies gene transfer through a transgenic approach for future biotechnological advances.
Collapse
Affiliation(s)
- Ali Zeeshan Fakhar
- Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA
| | - Jinbao Liu
- Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA
| | | | - M Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA.
| |
Collapse
|
13
|
Uz-Zaman MH, D'Alton S, Barrick JE, Ochman H. Promoter capture drives the emergence of proto-genes in Escherichia coli. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.15.567300. [PMID: 38013999 PMCID: PMC10680751 DOI: 10.1101/2023.11.15.567300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
The phenomenon of de novo gene birth-the emergence of genes from non-genic sequences-has received considerable attention due to the widespread occurrence of genes that are unique to particular species or genomes. Most instances of de novo gene birth have been recognized through comparative analyses of genome sequences in eukaryotes, despite the abundance of novel, lineage-specific genes in bacteria and the relative ease with which bacteria can be studied in an experimental context. Here, we explore the genetic record of the Escherichia coli Long-Term Evolution Experiment (LTEE) for changes indicative of "proto-genic" phases of new gene birth in which non-genic sequences evolve stable transcription and/or translation. Over the time-span of the LTEE, non-genic regions are frequently transcribed, translated and differentially expressed, thereby serving as raw material for new gene emergence. Most proto-genes result either from insertion element activity or chromosomal translocations that fused pre-existing regulatory sequences to regions that were not expressed in the LTEE ancestor. Additionally, we identified instances of proto-gene emergence in which a previously unexpressed sequence was transcribed after formation of an upstream promoter. Tracing the origin of the causative mutations, we discovered that most occurred early in the history of the LTEE, often within the first 20,000 generations, and became fixed soon after emergence. Our findings show that proto-genes emerge frequently within evolving populations, persist stably, and can serve as potential substrates for new gene formation.
Collapse
|
14
|
Goodheart JA, Rio RA, Taraporevala NF, Fiorenza RA, Barnes SR, Morrill K, Jacob MAC, Whitesel C, Masterson P, Batzel GO, Johnston HT, Ramirez MD, Katz PS, Lyons DC. A chromosome-level genome for the nudibranch gastropod Berghia stephanieae helps parse clade-specific gene expression in novel and conserved phenotypes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.04.552006. [PMID: 38014205 PMCID: PMC10680569 DOI: 10.1101/2023.08.04.552006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
How novel phenotypes originate from conserved genes, processes, and tissues remains a major question in biology. Research that sets out to answer this question often focuses on the conserved genes and processes involved, an approach that explicitly excludes the impact of genetic elements that may be classified as clade-specific, even though many of these genes are known to be important for many novel, or clade-restricted, phenotypes. This is especially true for understudied phyla such as mollusks, where limited genomic and functional biology resources for members of this phylum has long hindered assessments of genetic homology and function. To address this gap, we constructed a chromosome-level genome for the gastropod Berghia stephanieae (Valdés, 2005) to investigate the expression of clade-specific genes across both novel and conserved tissue types in this species. The final assembled and filtered Berghia genome is comparable to other high quality mollusk genomes in terms of size (1.05 Gb) and number of predicted genes (24,960 genes), and is highly contiguous. The proportion of upregulated, clade-specific genes varied across tissues, but with no clear trend between the proportion of clade-specific genes and the novelty of the tissue. However, more complex tissue like the brain had the highest total number of upregulated, clade-specific genes, though the ratio of upregulated clade-specific genes to the total number of upregulated genes was low. Our results, when combined with previous research on the impact of novel genes on phenotypic evolution, highlight the fact that the complexity of the novel tissue or behavior, the type of novelty, and the developmental timing of evolutionary modifications will all influence how novel and conserved genes interact to generate diversity.
Collapse
Affiliation(s)
- Jessica A. Goodheart
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY USA
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Robin A. Rio
- Bioengineering Department, Stanford University, Stanford, CA, USA
| | - Neville F. Taraporevala
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Department of Wildland Resources, Utah State University, Logan, UT, USA
| | - Rose A. Fiorenza
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Seth R. Barnes
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Kevin Morrill
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Mark Allan C. Jacob
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Carl Whitesel
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Park Masterson
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Grant O. Batzel
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Hereroa T. Johnston
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - M. Desmond Ramirez
- Department of Biology, University of Massachusetts Amherst, Amherst, MA, USA
| | - Paul S. Katz
- Department of Biology, University of Massachusetts Amherst, Amherst, MA, USA
| | - Deirdre C. Lyons
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
15
|
Gunasekera RS, Raja KKB, Hewapathirana S, Tundrea E, Gunasekera V, Galbadage T, Nelson PA. ORFanID: A web-based search engine for the discovery and identification of orphan and taxonomically restricted genes. PLoS One 2023; 18:e0291260. [PMID: 37879070 PMCID: PMC10599687 DOI: 10.1371/journal.pone.0291260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 08/24/2023] [Indexed: 10/27/2023] Open
Abstract
With the numerous genomes sequenced today, it has been revealed that a noteworthy percentage of genes in a given taxon of organisms in the phylogenetic tree of life do not have orthologous sequences in other taxa. These sequences are commonly referred to as "orphans" or "ORFans" if found as single occurrences in a single species or as "taxonomically restricted genes" (TRGs) when found at higher taxonomic levels. Quantitative and collective studies of these genes are necessary for understanding their biological origins. However, the current software for identifying orphan genes is limited in its functionality, database search range, and very complex algorithmically. Thus, researchers studying orphan genes must harvest their data from many disparate sources. ORFanID is a graphical web-based search engine that facilitates the efficient identification of both orphan genes and TRGs at all taxonomic levels, from DNA or amino acid sequences in the NCBI database cluster and other large bioinformatics repositories. The software allows users to identify genes that are unique to any taxonomic rank, from species to domain, using NCBI systematic classifiers. It provides control over NCBI database search parameters, and the results are presented in a spreadsheet as well as a graphical display. The tables in the software are sortable, and results can be filtered using the fuzzy search functionality. The visual presentation can be expanded and collapsed by the taxonomic tree to its various branches. Example results from searches on five species and gene expression data from specific orphan genes are provided in the Supplementary Information.
Collapse
Affiliation(s)
- Richard S. Gunasekera
- Department of Chemistry, Physics and Engineering, School of Science, Technology & Health, Biola University, La Mirada, CA, United States of America
| | - Komal K. B. Raja
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, United States of America
| | - Suresh Hewapathirana
- European Bioinformatics Institute, Welcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Emanuel Tundrea
- Griffiths School of Management and IT, Emanuel University of Oradea, Oradea, Romania
| | - Vinodh Gunasekera
- Bioinformatics, Chesalon USA, Inc., Houston, TX, United States of America
| | - Thushara Galbadage
- Department of Kinesiology and Public Health, School of Science, Technology & Health, Biola University, La Mirada, CA, United States of America
| | - Paul A. Nelson
- Biola University, La Mirada, CA, United States of America
| |
Collapse
|
16
|
Isdaner AJ, Levis NA, Pfennig DW. Comparative transcriptomics reveals that a novel form of phenotypic plasticity evolved via lineage-specific changes in gene expression. Ecol Evol 2023; 13:e10646. [PMID: 37869437 PMCID: PMC10589077 DOI: 10.1002/ece3.10646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 10/09/2023] [Indexed: 10/24/2023] Open
Abstract
Novel forms of phenotypic plasticity may evolve by lineage-specific changes or by co-opting mechanisms from more general forms of plasticity. Here, we evaluated whether a novel resource polyphenism in New World spadefoot toads (genus Spea) evolved by co-opting mechanisms from an ancestral form of plasticity common in anurans-accelerating larval development rate in response to pond drying. We compared overlap in differentially expressed genes between alternative trophic morphs constituting the polyphenism in Spea versus those found between tadpoles of Old World spadefoot toads (genus Pelobates) when experiencing different pond-drying regimes. Specifically, we (1) generated a de novo transcriptome and conducted differential gene expression analysis in Spea multiplicata, (2) utilized existing gene expression data and a recently published transcriptome for Pelobates cultripes when exposed to different drying regimes, and (3) identified unique and overlapping differentially expressed transcripts. We found thousands of differentially expressed genes between S. multiplicata morphs that were involved in major developmental reorganization, but the vast majority of these were not differentially expressed in P. cultripes. Thus, S. multiplicata's novel polyphenism appears to have arisen primarily through lineage-specific changes in gene expression and not by co-opting existing patterns of gene expression involved in pond-drying plasticity. Therefore, although ancestral stress responses might jump-start evolutionary innovation, substantial lineage-specific modification might be needed to refine these responses into more complex forms of plasticity.
Collapse
Affiliation(s)
- Andrew J. Isdaner
- Department of BiologyUniversity of North CarolinaChapel HillNorth CarolinaUSA
| | - Nicholas A. Levis
- Department of BiologyUniversity of North CarolinaChapel HillNorth CarolinaUSA
- Department of BiologyIndiana UniversityBloomingtonIndianaUSA
| | - David W. Pfennig
- Department of BiologyUniversity of North CarolinaChapel HillNorth CarolinaUSA
| |
Collapse
|
17
|
Zehentner B, Scherer S, Neuhaus K. Non-canonical transcriptional start sites in E. coli O157:H7 EDL933 are regulated and appear in surprisingly high numbers. BMC Microbiol 2023; 23:243. [PMID: 37653502 PMCID: PMC10469882 DOI: 10.1186/s12866-023-02988-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 08/21/2023] [Indexed: 09/02/2023] Open
Abstract
Analysis of genome wide transcription start sites (TSSs) revealed an unexpected complexity since not only canonical TSS of annotated genes are recognized by RNA polymerase. Non-canonical TSS were detected antisense to, or within, annotated genes as well new intergenic (orphan) TSS, not associated with known genes. Previously, it was hypothesized that many such signals represent noise or pervasive transcription, not associated with a biological function. Here, a modified Cappable-seq protocol allows determining the primary transcriptome of the enterohemorrhagic E. coli O157:H7 EDL933 (EHEC). We used four different growth media, both in exponential and stationary growth phase, replicated each thrice. This yielded 19,975 EHEC canonical and non-canonical TSS, which reproducibly occurring in three biological replicates. This questions the hypothesis of experimental noise or pervasive transcription. Accordingly, conserved promoter motifs were found upstream indicating proper TSSs. More than 50% of 5,567 canonical and between 32% and 47% of 10,355 non-canonical TSS were differentially expressed in different media and growth phases, providing evidence for a potential biological function also of non-canonical TSS. Thus, reproducible and environmentally regulated expression suggests that a substantial number of the non-canonical TSSs may be of unknown function rather than being the result of noise or pervasive transcription.
Collapse
Affiliation(s)
- Barbara Zehentner
- Chair for Microbial Ecology, TUM School of Life Sciences, Department of Molecular Life Sciences, Technical University of Munich, Freising, Germany
| | - Siegfried Scherer
- Chair for Microbial Ecology, TUM School of Life Sciences, Department of Molecular Life Sciences, Technical University of Munich, Freising, Germany
- ZIEL - Institute for Food & Health, Technical University of Munich, Freising, Germany
| | - Klaus Neuhaus
- ZIEL - Institute for Food & Health, Technical University of Munich, Freising, Germany.
- Core Facility Microbiome, ZIEL - Institute for Food & Health, Technical University of Munich, Freising, Germany.
| |
Collapse
|
18
|
Zielezinski A, Dobrychlop W, Karlowski WM. TRGdb: a universal resource for the exploration of taxonomically restricted genes in bacteria. Database (Oxford) 2023; 2023:baad058. [PMID: 37555549 PMCID: PMC10410690 DOI: 10.1093/database/baad058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 06/30/2023] [Accepted: 07/31/2023] [Indexed: 08/10/2023]
Abstract
The TRGdb database is a resource dedicated to taxonomically restricted genes (TRGs) in bacteria. It provides a comprehensive collection of genes that are specific to different genera and species, according to the latest release of bacterial taxonomy. The user interface allows for easy browsing and searching as well as sequence similarity exploration. The website also provides information on each TRG protein sequence, including its level of disorder, complexity and tendency to aggregate. TRGdb is a valuable resource for gaining a deeper understanding of the TRG-associated, unique features, and characteristics of bacterial organisms. Database URL www.combio.pl/trgdb.
Collapse
Affiliation(s)
- Andrzej Zielezinski
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, Poznan 61-614, Poland
| | - Wojciech Dobrychlop
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, Poznan 61-614, Poland
| | - Wojciech M Karlowski
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, Poznan 61-614, Poland
| |
Collapse
|
19
|
Wu L, Lambert JD. Clade-specific genes and the evolutionary origin of novelty; new tools in the toolkit. Semin Cell Dev Biol 2023; 145:52-59. [PMID: 35659164 DOI: 10.1016/j.semcdb.2022.05.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 04/27/2022] [Accepted: 05/25/2022] [Indexed: 10/18/2022]
Abstract
Clade-specific (a.k.a. lineage-specific) genes are very common and found at all taxonomic levels and in all clades examined. They can arise by duplication of previously existing genes, which can involve partial truncations or combinations with other protein domains or regulatory sequences. They can also evolve de novo from non-coding sequences, leading to potentially truly novel protein domains. Finally, since clade-specific genes are generally defined by lack of sequence homology with other proteins, they can also arise by sequence evolution that is rapid enough that previous sequence homology can no longer be detected. In such cases, where the rapid evolution is followed by constraint, we consider them to be ontologically non-novel but likely novel at a functional level. In general, clade-specific genes have received less attention from biologists but there are increasing numbers of fascinating examples of their roles in important traits. Here we review some selected recent examples, and argue that attention to clade-specific genes is an important corrective to the focus on the conserved developmental regulatory toolkit that has been the habit of evo-devo as a field. Finally, we discuss questions that arise about the evolution of clade-specific genes, and how these might be addressed by future studies. We highlight the hypothesis that clade-specific genes are more likely to be involved in synapomorphies that arose in the stem group where they appeared, compared to other genes.
Collapse
Affiliation(s)
- Longjun Wu
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| | - J David Lambert
- Department of Biology, University of Rochester, Rochester, NY 14627, USA.
| |
Collapse
|
20
|
Zhao Y, Huang S, Zhang Y, Tan C, Feng H. Role of Brassica orphan gene BrLFM on leafy head formation in Chinese cabbage (Brassica rapa). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:170. [PMID: 37420138 DOI: 10.1007/s00122-023-04411-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 06/22/2023] [Indexed: 07/09/2023]
Abstract
Brassica orphan gene BrFLM, identified by two allelic mutants, was involved in leafy head formation in Chinese cabbage. Leafy head formation is a unique agronomic trait of Chinese cabbage that determines its yield and quality. In our previous study, an EMS mutagenesis Chinese cabbage mutant library was constructed using the heading Chinese cabbage double haploid (DH) line FT as the wild-type. Here, we screened two extremely similar leafy head deficiency mutants lfm-1 and lfm-2 with geotropic growth leaves from the library to investigate the gene(s) related to leafy head formation. Reciprocal crossing results showed that these two mutants were allelic. We utilized lfm-1 to identify the mutant gene(s). Genetic analysis showed that the mutated trait was controlled by a single nuclear gene Brlfm. Mutmap analysis showed that Brlfm was located on chromosome A05, and BraA05g012440.3C or BraA05g021450.3C were the candidate gene. Kompetitive allele-specific PCR analysis eliminated BraA05g012440.3C from the candidates. Sanger sequencing identified an SNP from G to A at the 271st nucleotide on BraA05g021450.3C. The sequencing of lfm-2 detected another non-synonymous SNP (G to A) located at the 266st nucleotide on BraA05g021450.3C, which verified its function on leafy head formation. We blasted BraA05g021450.3C on database and found that it belongs to a Brassica orphan gene encoding an unknown 13.74 kDa protein, named BrLFM. Subcellular localization showed that BrLFM was located in the nucleus. These findings reveal that BrLFM is involved in leafy head formation in Chinese cabbage.
Collapse
Affiliation(s)
- Yonghui Zhao
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Shengnan Huang
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Yun Zhang
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Chong Tan
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Hui Feng
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China.
| |
Collapse
|
21
|
Athanasouli M, Akduman N, Röseler W, Theam P, Rödelsperger C. Thousands of Pristionchus pacificus orphan genes were integrated into developmental networks that respond to diverse environmental microbiota. PLoS Genet 2023; 19:e1010832. [PMID: 37399201 DOI: 10.1371/journal.pgen.1010832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 06/15/2023] [Indexed: 07/05/2023] Open
Abstract
Adaptation of organisms to environmental change may be facilitated by the creation of new genes. New genes without homologs in other lineages are known as taxonomically-restricted orphan genes and may result from divergence or de novo formation. Previously, we have extensively characterized the evolution and origin of such orphan genes in the nematode model organism Pristionchus pacificus. Here, we employ large-scale transcriptomics to establish potential functional associations and to measure the degree of transcriptional plasticity among orphan genes. Specifically, we analyzed 24 RNA-seq samples from adult P. pacificus worms raised on 24 different monoxenic bacterial cultures. Based on coexpression analysis, we identified 28 large modules that harbor 3,727 diplogastrid-specific orphan genes and that respond dynamically to different bacteria. These coexpression modules have distinct regulatory architecture and also exhibit differential expression patterns across development suggesting a link between bacterial response networks and development. Phylostratigraphy revealed a considerably high number of family- and even species-specific orphan genes in certain coexpression modules. This suggests that new genes are not attached randomly to existing cellular networks and that integration can happen very fast. Integrative analysis of protein domains, gene expression and ortholog data facilitated the assignments of biological labels for 22 coexpression modules with one of the largest, fast-evolving module being associated with spermatogenesis. In summary, this work presents the first functional annotation for thousands of P. pacificus orphan genes and reveals insights into their integration into environmentally responsive gene networks.
Collapse
Affiliation(s)
- Marina Athanasouli
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology, Tübingen, Germany
| | - Nermin Akduman
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology, Tübingen, Germany
| | - Waltraud Röseler
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology, Tübingen, Germany
| | - Penghieng Theam
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology, Tübingen, Germany
| | - Christian Rödelsperger
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology, Tübingen, Germany
| |
Collapse
|
22
|
Fakhar AZ, Liu J, Pajerowska-Mukhtar KM, Mukhtar MS. The Lost and Found: Unraveling the Functions of Orphan Genes. J Dev Biol 2023; 11:27. [PMID: 37367481 PMCID: PMC10299390 DOI: 10.3390/jdb11020027] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 05/19/2023] [Accepted: 05/26/2023] [Indexed: 06/28/2023] Open
Abstract
Orphan Genes (OGs) are a mysterious class of genes that have recently gained significant attention. Despite lacking a clear evolutionary history, they are found in nearly all living organisms, from bacteria to humans, and they play important roles in diverse biological processes. The discovery of OGs was first made through comparative genomics followed by the identification of unique genes across different species. OGs tend to be more prevalent in species with larger genomes, such as plants and animals, and their evolutionary origins remain unclear but potentially arise from gene duplication, horizontal gene transfer (HGT), or de novo origination. Although their precise function is not well understood, OGs have been implicated in crucial biological processes such as development, metabolism, and stress responses. To better understand their significance, researchers are using a variety of approaches, including transcriptomics, functional genomics, and molecular biology. This review offers a comprehensive overview of the current knowledge of OGs in all domains of life, highlighting the possible role of dark transcriptomics in their evolution. More research is needed to fully comprehend the role of OGs in biology and their impact on various biological processes.
Collapse
Affiliation(s)
| | | | | | - M. Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA
| |
Collapse
|
23
|
Zhang L, Park JJ, Dong MB, Arsala D, Xia S, Chen J, Sosa D, Atlas JE, Long M, Chen S. Human gene age dating reveals an early and rapid evolutionary construction of the adaptive immune system. Genome Biol Evol 2023; 15:evad081. [PMID: 37170918 PMCID: PMC10210621 DOI: 10.1093/gbe/evad081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 04/24/2023] [Accepted: 05/02/2023] [Indexed: 05/13/2023] Open
Abstract
T cells are a type of white blood cell that play a critical role in the immune response against foreign pathogens through a process called T Cell Adaptive Immunity (TCAI). However, the evolution of the genes and nucleotide sequences involved in TCAI is not well understood. To investigate this, we performed comparative studies of gene annotations and genome assemblies of 28 vertebrate species and identified sets of human genes that are involved in TCAI, carcinogenesis, and ageing. We found that these gene sets share interaction pathways which may have contributed to the evolution of longevity in the vertebrate lineage leading to humans. Our human gene age dating analyses revealed that there was rapid origination of genes with TCAI-related functions prior to the Cretaceous eutherian radiation and these new genes mainly encode negative regulators. We identified no new TCAI-related genes after the divergence of placental mammals, but we did detect an extensive number of amino acid substitutions under strong positive selection in recently evolved human immunity genes suggesting they are co-evolving with adaptive immunity. More specifically, we observed that antigen processing and presentation and checkpoint genes are significantly enriched among new genes evolving under positive selection. These observations reveal an evolutionary process of T Cell Adaptive Immunity that were associated with rapid gene duplication in the early stages of vertebrates and subsequent sequence changes in TCAI-related genes. These processes together suggest an early genetic construction of the vertebrate immune system and subsequent molecular adaptation to diverse antigens.
Collapse
Affiliation(s)
- Li Zhang
- System Biology Institute, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
- Center for Cancer Systems Biology, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Yale M.D.-Ph.D. Program, New Haven, Connecticut, USA
| | - Jonathan J Park
- System Biology Institute, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
- Center for Cancer Systems Biology, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Yale M.D.-Ph.D. Program, New Haven, Connecticut, USA
| | - Matthew B Dong
- System Biology Institute, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
- Center for Cancer Systems Biology, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Yale M.D.-Ph.D. Program, New Haven, Connecticut, USA
- Immunobiology Program, The Anlyan Center, New Haven, Connecticut, USA
- Department of Immunobiology, The Anlyan Center, New Haven, Connecticut, USA
| | - Deanna Arsala
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Shengqian Xia
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Jianhai Chen
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Dylan Sosa
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Jared E Atlas
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, USA
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Sidi Chen
- System Biology Institute, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
- Center for Cancer Systems Biology, Integrated Science & Technology Center, West Haven, Connecticut, USA
- Yale M.D.-Ph.D. Program, New Haven, Connecticut, USA
- Immunobiology Program, The Anlyan Center, New Haven, Connecticut, USA
- Yale Comprehensive Cancer Center, New Haven, Connecticut, USA
- Yale Stem Cell Center, Yale University School of Medicine, New Haven, Connecticut, USA
| |
Collapse
|
24
|
Titus-McQuillan JE, Nanni AV, McIntyre LM, Rogers RL. Estimating transcriptome complexities across eukaryotes. BMC Genomics 2023; 24:254. [PMID: 37170194 PMCID: PMC10173493 DOI: 10.1186/s12864-023-09326-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 04/20/2023] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND Genomic complexity is a growing field of evolution, with case studies for comparative evolutionary analyses in model and emerging non-model systems. Understanding complexity and the functional components of the genome is an untapped wealth of knowledge ripe for exploration. With the "remarkable lack of correspondence" between genome size and complexity, there needs to be a way to quantify complexity across organisms. In this study, we use a set of complexity metrics that allow for evaluating changes in complexity using TranD. RESULTS We ascertain if complexity is increasing or decreasing across transcriptomes and at what structural level, as complexity varies. In this study, we define three metrics - TpG, EpT, and EpG- to quantify the transcriptome's complexity that encapsulates the dynamics of alternative splicing. Here we compare complexity metrics across 1) whole genome annotations, 2) a filtered subset of orthologs, and 3) novel genes to elucidate the impacts of orthologs and novel genes in transcript model analysis. Effective Exon Number (EEN) issued to compare the distribution of exon sizes within transcripts against random expectations of uniform exon placement. EEN accounts for differences in exon size, which is important because novel gene differences in complexity for orthologs and whole-transcriptome analyses are biased towards low-complexity genes with few exons and few alternative transcripts. CONCLUSIONS With our metric analyses, we are able to quantify changes in complexity across diverse lineages with greater precision and accuracy than previous cross-species comparisons under ortholog conditioning. These analyses represent a step toward whole-transcriptome analysis in the emerging field of non-model evolutionary genomics, with key insights for evolutionary inference of complexity changes on deep timescales across the tree of life. We suggest a means to quantify biases generated in ortholog calling and correct complexity analysis for lineage-specific effects. With these metrics, we directly assay the quantitative properties of newly formed lineage-specific genes as they lower complexity.
Collapse
Affiliation(s)
- James E Titus-McQuillan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| | - Adalena V Nanni
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Lauren M McIntyre
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Rebekah L Rogers
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| |
Collapse
|
25
|
Liu J, Yuan R, Shao W, Wang J, Silman I, Sussman JL. Do "Newly Born" orphan proteins resemble "Never Born" proteins? A study using three deep learning algorithms. Proteins 2023. [PMID: 37092778 DOI: 10.1002/prot.26496] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 02/26/2023] [Accepted: 04/01/2023] [Indexed: 04/25/2023]
Abstract
"Newly Born" proteins, devoid of detectable homology to any other proteins, known as orphan proteins, occur in a single species or within a taxonomically restricted gene family. They are generated by the expression of novel open reading frames, and appear throughout evolution. We were curious if three recently developed programs for predicting protein structures, namely, AlphaFold2, RoseTTAFold, and ESMFold, might be of value for comparison of such "Newly Born" proteins to random polypeptides with amino acid content similar to that of native proteins, which have been called "Never Born" proteins. The programs were used to compare the structures of two sets of "Never Born" proteins that had been expressed-Group 1, which had been shown experimentally to possess substantial secondary structure, and Group 3, which had been shown to be intrinsically disordered. Overall, although the models generated were scored as being of low quality, they nevertheless revealed some general principles. Specifically, all four members of Group 1 were predicted to be compact by all three algorithms, in agreement with the experimental data, whereas the members of Group 3 were predicted to be very extended, as would be expected for intrinsically disordered proteins, again consistent with the experimental data. These predicted differences were shown to be statistically significant by comparing their accessible surface areas. The three programs were then used to predict the structures of three orphan proteins whose crystal structures had been solved, two of which display novel folds. Surprisingly, only for the protein which did not have a novel fold, and was taxonomically restricted, rather than being a true orphan, did all three algorithms predict very similar, high-quality structures, closely resembling the crystal structure. Finally, they were used to predict the structures of seven orphan proteins with well-identified biological functions, whose 3D structures are not known. Two proteins, which were predicted to be disordered based on their sequences, are predicted by all three structure algorithms to be extended structures. The other five were predicted to be compact structures with only two exceptions in the case of AlphaFold2. All three prediction algorithms make remarkably similar and high-quality predictions for one large protein, HCO_11565, from a nematode. It is conjectured that this is due to many homologs in the taxonomically restricted family of which it is a member, and to the fact that the Dali server revealed several nonrelated proteins with similar folds. An animated Interactive 3D Complement (I3DC) is available in Proteopedia at http://proteopedia.org/w/Journal:Proteins:3.
Collapse
Affiliation(s)
- Jing Liu
- Department of Biotechnology and Food Engineering, Guangdong Technion-Israel Institute of Technology, Shantou, China
- Faculty of Biotechnology and Food Engineering, Technion-Israel Institute of Technology, Haifa, Israel
| | - Rongqing Yuan
- Department of Chemistry, Tsinghua University, Beijing, China
| | - Wei Shao
- School of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Jitong Wang
- Department of Chemistry, Tsinghua University, Beijing, China
| | - Israel Silman
- Department of Brain Sciences, The Weizmann Institute of Science, Rehovot, Israel
| | - Joel L Sussman
- Department of Chemical and Structural Biology, The Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
26
|
Wang L, Tonsager AJ, Zheng W, Wang Y, Stessman D, Fang W, Stenback KE, Campbell A, Tanvir R, Zhang J, Cothron S, Wan D, Meng Y, Spalding MH, Nikolau BJ, Li L. Single-cell genetic models to evaluate orphan gene function: The case of QQS regulating carbon and nitrogen allocation. FRONTIERS IN PLANT SCIENCE 2023; 14:1126139. [PMID: 37051080 PMCID: PMC10084940 DOI: 10.3389/fpls.2023.1126139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Accepted: 03/13/2023] [Indexed: 06/19/2023]
Abstract
We demonstrate two synthetic single-cell systems that can be used to better understand how the acquisition of an orphan gene can affect complex phenotypes. The Arabidopsis orphan gene, Qua-Quine Starch (QQS) has been identified as a regulator of carbon (C) and nitrogen (N) partitioning across multiple plant species. QQS modulates this important biotechnological trait by replacing NF-YB (Nuclear Factor Y, subunit B) in its interaction with NF-YC. In this study, we expand on these prior findings by developing Chlamydomonas reinhardtii and Saccharomyces cerevisiae strains, to refactor the functional interactions between QQS and NF-Y subunits to affect modulations in C and N allocation. Expression of QQS in C. reinhardtii modulates C (i.e., starch) and N (i.e., protein) allocation by affecting interactions between NF-YC and NF-YB subunits. Studies in S. cerevisiae revealed similar functional interactions between QQS and the NF-YC homolog (HAP5), modulating C (i.e., glycogen) and N (i.e., protein) allocation. However, in S. cerevisiae both the NF-YA (HAP2) and NF-YB (HAP3) homologs appear to have redundant functions to enable QQS and HAP5 to affect C and N allocation. The genetically tractable systems that developed herein exhibit the plasticity to modulate highly complex phenotypes.
Collapse
Affiliation(s)
- Lei Wang
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| | - Andrew J. Tonsager
- Roy J. Carver Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State University, Ames, IA, United States
- Engineering Research Center for Biorenewable Chemicals, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
| | - Wenguang Zheng
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Yingjun Wang
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Dan Stessman
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Wei Fang
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Kenna E. Stenback
- Roy J. Carver Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State University, Ames, IA, United States
- Engineering Research Center for Biorenewable Chemicals, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
| | - Alexis Campbell
- Roy J. Carver Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State University, Ames, IA, United States
- Engineering Research Center for Biorenewable Chemicals, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
| | - Rezwan Tanvir
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| | - Jinjiang Zhang
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
- Mississippi School for Mathematics and Science, Columbus, MS, United States
| | - Samuel Cothron
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| | - Dongli Wan
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot, China
| | - Yan Meng
- Department of Agriculture, Alcorn State University, Lorman, MS, United States
| | - Martin H. Spalding
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Basil J. Nikolau
- Roy J. Carver Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State University, Ames, IA, United States
- Engineering Research Center for Biorenewable Chemicals, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
| | - Ling Li
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| |
Collapse
|
27
|
Evolution and implications of de novo genes in humans. Nat Ecol Evol 2023:10.1038/s41559-023-02014-y. [PMID: 36928843 DOI: 10.1038/s41559-023-02014-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 02/06/2023] [Indexed: 03/18/2023]
Abstract
Genes and translated open reading frames (ORFs) that emerged de novo from previously non-coding sequences provide species with opportunities for adaptation. When aberrantly activated, some human-specific de novo genes and ORFs have disease-promoting properties-for instance, driving tumour growth. Thousands of putative de novo coding sequences have been described in humans, but we still do not know what fraction of those ORFs has readily acquired a function. Here, we discuss the challenges and controversies surrounding the detection, mechanisms of origin, annotation, validation and characterization of de novo genes and ORFs. Through manual curation of literature and databases, we provide a thorough table with most de novo genes reported for humans to date. We re-evaluate each locus by tracing the enabling mutations and list proposed disease associations, protein characteristics and supporting evidence for translation and protein detection. This work will support future explorations of de novo genes and ORFs in humans.
Collapse
|
28
|
Čorak N, Anniko S, Daschkin-Steinborn C, Krey V, Koska S, Futo M, Široki T, Woichansky I, Opašić L, Kifer D, Tušar A, Maxeiner HG, Domazet-Lošo M, Nicolaus C, Domazet-Lošo T. Pleomorphic Variants of Borreliella (syn. Borrelia) burgdorferi Express Evolutionary Distinct Transcriptomes. Int J Mol Sci 2023; 24:ijms24065594. [PMID: 36982667 PMCID: PMC10057712 DOI: 10.3390/ijms24065594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Revised: 03/07/2023] [Accepted: 03/11/2023] [Indexed: 03/17/2023] Open
Abstract
Borreliella (syn. Borrelia) burgdorferi is a spirochete bacterium that causes tick-borne Lyme disease. Along its lifecycle B. burgdorferi develops several pleomorphic forms with unclear biological and medical relevance. Surprisingly, these morphotypes have never been compared at the global transcriptome level. To fill this void, we grew B. burgdorferi spirochete, round body, bleb, and biofilm-dominated cultures and recovered their transcriptomes by RNAseq profiling. We found that round bodies share similar expression profiles with spirochetes, despite their morphological differences. This sharply contrasts to blebs and biofilms that showed unique transcriptomes, profoundly distinct from spirochetes and round bodies. To better characterize differentially expressed genes in non-spirochete morphotypes, we performed functional, positional, and evolutionary enrichment analyses. Our results suggest that spirochete to round body transition relies on the delicate regulation of a relatively small number of highly conserved genes, which are located on the main chromosome and involved in translation. In contrast, spirochete to bleb or biofilm transition includes substantial reshaping of transcription profiles towards plasmids-residing and evolutionary young genes, which originated in the ancestor of Borreliaceae. Despite their abundance the function of these Borreliaceae-specific genes is largely unknown. However, many known Lyme disease virulence genes implicated in immune evasion and tissue adhesion originated in this evolutionary period. Taken together, these regularities point to the possibility that bleb and biofilm morphotypes might be important in the dissemination and persistence of B. burgdorferi inside the mammalian host. On the other hand, they prioritize the large pool of unstudied Borreliaceae-specific genes for functional characterization because this subset likely contains undiscovered Lyme disease pathogenesis genes.
Collapse
Affiliation(s)
- Nina Čorak
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
| | - Sirli Anniko
- BCA-Research, BCA-Clinic Betriebs GmbH & Co. KG, D-86159 Augsburg, Germany
- Institute of Cancer Therapeutics, Faculty of Life Sciences, University of Bradford, Bradford BD7 1DP, UK
| | | | - Viktoria Krey
- BCA-Research, BCA-Clinic Betriebs GmbH & Co. KG, D-86159 Augsburg, Germany
- Physics of Synthetic Biological Systems-E14, Physics Department and ZNN, Technische Universität München, D-85748 Garching, Germany
| | - Sara Koska
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
| | - Momir Futo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
- Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000 Zagreb, Croatia
- School of Medicine, Catholic University of Croatia, Ilica 242, HR-10000 Zagreb, Croatia
| | - Tin Široki
- Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000 Zagreb, Croatia
| | | | - Luka Opašić
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
| | - Domagoj Kifer
- Faculty of Pharmacy and Biochemistry, University of Zagreb, A. Kovačića 1, HR-10000 Zagreb, Croatia
| | - Anja Tušar
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
| | - Horst-Günter Maxeiner
- BCA-Research, BCA-Clinic Betriebs GmbH & Co. KG, D-86159 Augsburg, Germany
- Comlamed, Friedrich-Bergius Ring 15, D-97076 Würzburg, Germany
| | - Mirjana Domazet-Lošo
- Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, HR-10000 Zagreb, Croatia
| | - Carsten Nicolaus
- BCA-Research, BCA-Clinic Betriebs GmbH & Co. KG, D-86159 Augsburg, Germany
| | - Tomislav Domazet-Lošo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia
- School of Medicine, Catholic University of Croatia, Ilica 242, HR-10000 Zagreb, Croatia
- Correspondence:
| |
Collapse
|
29
|
Poretti M, Praz CR, Sotiropoulos AG, Wicker T. A survey of lineage-specific genes in Triticeae reveals de novo gene evolution from genomic raw material. PLANT DIRECT 2023; 7:e484. [PMID: 36937792 PMCID: PMC10020141 DOI: 10.1002/pld3.484] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 01/26/2023] [Accepted: 01/27/2023] [Indexed: 06/18/2023]
Abstract
Diploid plant genomes typically contain ~35,000 genes, almost all belonging to highly conserved gene families. Only a small fraction are lineage-specific, which are found in only one or few closely related species. Little is known about how genes arise de novo in plant genomes and how often this occurs; however, they are believed to be important for plants diversification and adaptation. We developed a pipeline to identify lineage-specific genes in Triticeae, using newly available genome assemblies of wheat, barley, and rye. Applying a set of stringent criteria, we identified 5942 candidate Triticeae-specific genes (TSGs), of which 2337 were validated as protein-coding genes in wheat. Differential gene expression analyses revealed that stress-induced wheat TSGs are strongly enriched in putative secreted proteins. Some were previously described to be involved in Triticeae non-host resistance and cold response. Additionally, we show that 1079 TSGs have sequence homology to transposable elements (TEs), ~68% of them deriving from regulatory non-coding regions of Gypsy retrotransposons. Most importantly, we demonstrate that these TSGs are enriched in transmembrane domains and are among the most highly expressed wheat genes overall. To summarize, we conclude that de novo gene formation is relatively rare and that Triticeae probably possess ~779 lineage-specific genes per haploid genome. TSGs, which respond to pathogen and environmental stresses, may be interesting candidates for future targeted resistance breeding in Triticeae. Finally, we propose that non-coding regions of TEs might provide important genetic raw material for the functional innovation of TM domains and the evolution of novel secreted proteins.
Collapse
Affiliation(s)
- Manuel Poretti
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| | - Coraline R. Praz
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
- Centro de Biotecnología y Genómica de PlantasUniversidad Politécnica de Madrid (UPM)–Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA)MadridSpain
| | | | - Thomas Wicker
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
| |
Collapse
|
30
|
Abstract
Developmental processes in animals are influenced by colonization and/or signaling from microbial symbionts. Here, we show that bacteria from the environment are linked to development of a symbiotic organ that houses a bacterial consortium in female Hawaiian bobtail squid, Euprymna scolopes. In addition to the well-characterized light organ association with the bioluminescent bacterium Vibrio fischeri, female E. scolopes house a simple bacterial community in a reproductive organ, the accessory nidamental gland (ANG). In order to understand the influences of bacteria on ANG development, squid were raised in the laboratory under conditions where exposure to environmental microorganisms was experimentally manipulated. Under conditions where hosts were exposed to depleted environmental bacteria, ANGs were completely absent or stunted, a result independent of the presence of the light organ symbiont V. fischeri. When squid were raised in the laboratory with substrate from the host's natural environment containing the native microbiota, normal ANG development was observed, and the bacterial communities were similar to wild-caught animals. Analysis of the bacterial communities from ANGs and substrates of wild-caught and laboratory-raised animals suggests that certain bacterial groups, namely, the Verrucomicrobia, are linked to ANG development. The ANG community composition was also experimentally manipulated. Squid raised with natural substrate supplemented with a specific ANG bacterial strain, Leisingera sp. JC1, had high proportions of this strain in the ANG, suggesting that once ANG development is initiated, specific strains can be introduced and subsequently colonize the organ. Overall, these data suggest that environmental bacteria are required for development of the ANG in E. scolopes. IMPORTANCE Microbiota have profound effects on animal and plant development. Hosts raised axenically or without symbionts often suffer negative outcomes resulting in developmental defects or reduced organ function. Using defined experimental conditions, we demonstrate that environmental bacteria are required for the formation of a female-specific symbiotic organ in the Hawaiian bobtail squid, Euprymna scolopes. Although nascent tissues from this organ that are involved with bacterial recruitment formed initially, the mature organ failed to develop and was absent or severely reduced in sexually mature animals that were not exposed to microbiota from the host's natural environment. This is the first example of complete organ development relying on exposure to symbiotic bacteria in an animal host. This study broadens the use of E. scolopes as a model organism for studying the influence of beneficial bacteria on animal development.
Collapse
|
31
|
Late Embryogenesis Abundant Proteins Contribute to the Resistance of Toxoplasma gondii Oocysts against Environmental Stresses. mBio 2023; 14:e0286822. [PMID: 36809045 PMCID: PMC10128015 DOI: 10.1128/mbio.02868-22] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023] Open
Abstract
Toxoplasma gondii oocysts, which are shed in large quantities in the feces from infected felines, are very stable in the environment, resistant to most inactivation procedures, and highly infectious. The oocyst wall provides an important physical barrier for sporozoites contained inside oocysts, protecting them from many chemical and physical stressors, including most inactivation procedures. Furthermore, sporozoites can withstand large temperature changes, even freeze-thawing, as well as desiccation, high salinity, and other environmental insults; however, the genetic basis for this environmental resistance is unknown. Here, we show that a cluster of four genes encoding Late Embryogenesis Abundant (LEA)-related proteins are required to provide Toxoplasma sporozoites resistance to environmental stresses. Toxoplasma LEA-like genes (TgLEAs) exhibit the characteristic features of intrinsically disordered proteins, explaining some of their properties. Our in vitro biochemical experiments using recombinant TgLEA proteins show that they have cryoprotective effects on the oocyst-resident lactate dehydrogenase enzyme and that induced expression in E. coli of two of them leads to better survival after cold stress. Oocysts from a strain in which the four LEA genes were knocked out en bloc were significantly more susceptible to high salinity, freezing, and desiccation compared to wild-type oocysts. We discuss the evolutionary acquisition of LEA-like genes in Toxoplasma and other oocyst-producing apicomplexan parasites of the Sarcocystidae family and discuss how this has likely contributed to the ability of sporozoites within oocysts to survive outside the host for extended periods. Collectively, our data provide a first molecular detailed view on a mechanism that contributes to the remarkable resilience of oocysts against environmental stresses. IMPORTANCE Toxoplasma gondii oocysts are highly infectious and may survive in the environment for years. Their resistance against disinfectants and irradiation has been attributed to the oocyst and sporocyst walls by acting as physical and permeability barriers. However, the genetic basis for their resistance against stressors like changes in temperature, salinity, or humidity, is unknown. We show that a cluster of four genes encoding Toxoplasma Late Embryogenesis Abundant (TgLEA)-related proteins are important for this resistance to environmental stresses. TgLEAs have features of intrinsically disordered proteins, explaining some of their properties. Recombinant TgLEA proteins show cryoprotective effects on the parasite's lactate dehydrogenase, an abundant enzyme in oocysts, and expression in E. coli of two TgLEAs has a beneficial effect on growth after cold stress. Moreover, oocysts from a strain lacking all four TgLEA genes were more susceptible to high salinity, freezing, and desiccation compared to wild-type oocysts, highlighting the importance of the four TgLEAs for oocyst resilience.
Collapse
|
32
|
Liang M, Chen W, LaFountain AM, Liu Y, Peng F, Xia R, Bradshaw H, Yuan YW. Taxon-specific, phased siRNAs underlie a speciation locus in monkeyflowers. Science 2023; 379:576-582. [PMID: 36758083 PMCID: PMC10601778 DOI: 10.1126/science.adf1323] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 12/06/2022] [Indexed: 02/11/2023]
Abstract
Taxon-specific small RNA loci are widespread in eukaryotic genomes, yet their role in lineage-specific adaptation, phenotypic diversification, and speciation is poorly understood. Here, we report that a speciation locus in monkeyflowers (Mimulus), YELLOW UPPER (YUP), contains an inverted repeat region that produces small interfering RNAs (siRNAs) in a phased pattern. Although the inverted repeat is derived from a partial duplication of a protein-coding gene that is not involved in flower pigmentation, one of the siRNAs targets and represses a master regulator of floral carotenoid pigmentation. YUP emerged with two protein-coding genes that control other aspects of flower coloration as a "superlocus" in a subclade of Mimulus and has contributed to subsequent phenotypic diversification and pollinator-mediated speciation in the descendant species.
Collapse
Affiliation(s)
- Mei Liang
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut 06269
| | - Wenjie Chen
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut 06269
- Qinghai Provincial Key Laboratory of Crop Molecular Breeding and Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, Qinghai, China
| | - Amy M. LaFountain
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut 06269
| | - Yuanlong Liu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Foen Peng
- Department of Biology, University of Washington, Seattle, WA 98195
- Department of Biology, Haverford College, Haverford, Pennsylvania 19041
| | - Rui Xia
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - H.D. Bradshaw
- Department of Biology, University of Washington, Seattle, WA 98195
| | - Yao-Wu Yuan
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut 06269
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut 06269
| |
Collapse
|
33
|
A chromosome-scale epigenetic map of the Hydra genome reveals conserved regulators of cell state. Genome Res 2023; 33:283-298. [PMID: 36639202 PMCID: PMC10069465 DOI: 10.1101/gr.277040.122] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 01/10/2023] [Indexed: 01/15/2023]
Abstract
The epithelial and interstitial stem cells of the freshwater polyp Hydra are the best-characterized stem cell systems in any cnidarian, providing valuable insight into cell type evolution and the origin of stemness in animals. However, little is known about the transcriptional regulatory mechanisms that determine how these stem cells are maintained and how they give rise to their diverse differentiated progeny. To address such questions, a thorough understanding of transcriptional regulation in Hydra is needed. To this end, we generated extensive new resources for characterizing transcriptional regulation in Hydra, including new genome assemblies for Hydra oligactis and the AEP strain of Hydra vulgaris, an updated whole-animal single-cell RNA-seq atlas, and genome-wide maps of chromatin interactions, chromatin accessibility, sequence conservation, and histone modifications. These data revealed the existence of large kilobase-scale chromatin interaction domains in the Hydra genome that contain transcriptionally coregulated genes. We also uncovered the transcriptomic profiles of two previously molecularly uncharacterized cell types: isorhiza-type nematocytes and somatic gonad ectoderm. Finally, we identified novel candidate regulators of cell type-specific transcription, several of which have likely been conserved at least since the divergence of Hydra and the jellyfish Clytia hemisphaerica more than 400 million years ago.
Collapse
|
34
|
Bisio H, Legendre M, Giry C, Philippe N, Alempic JM, Jeudy S, Abergel C. Evolution of giant pandoravirus revealed by CRISPR/Cas9. Nat Commun 2023; 14:428. [PMID: 36702819 PMCID: PMC9879987 DOI: 10.1038/s41467-023-36145-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 01/17/2023] [Indexed: 01/27/2023] Open
Abstract
Giant viruses (GVs) are a hotspot of unresolved controversies since their discovery, including the definition of "Virus" and their origin. While increasing knowledge of genome diversity has accumulated, GV functional genomics was largely neglected. Here, we describe an experimental framework to genetically modify nuclear GVs and their host Acanthamoeba castellanii using CRISPR/Cas9, shedding light on the evolution from small icosahedral viruses to amphora-shaped GVs. Ablation of the icosahedral major capsid protein in the phylogenetically-related mollivirus highlights a transition in virion shape and size. We additionally demonstrate the existence of a reduced core essential genome in pandoravirus, reminiscent of their proposed smaller ancestors. This proposed genetic expansion led to increased genome robustness, indicating selective pressures for adaptation to uncertain environments. Overall, we introduce new tools for manipulation of the unexplored genome of nuclear GVs and provide experimental evidence suggesting that viral gigantism has aroused as an emerging trait.
Collapse
Affiliation(s)
- Hugo Bisio
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France.
| | - Matthieu Legendre
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France
| | - Claire Giry
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France
| | - Nadege Philippe
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France
| | - Jean-Marie Alempic
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France
| | - Sandra Jeudy
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France
| | - Chantal Abergel
- Aix-Marseille University, Centre National de la Recherche Scientifique, Information Genomique & Structurale, Unite Mixte de Recherche 7256 (Institut de Microbiologie de la Mediterranee, FR3479, IM2B), 13288, Marseille, Cedex 9, France.
| |
Collapse
|
35
|
Moreyra NN, Almeida FC, Allan C, Frankel N, Matzkin LM, Hasson E. Phylogenomics provides insights into the evolution of cactophily and host plant shifts in Drosophila. Mol Phylogenet Evol 2023; 178:107653. [PMID: 36404461 DOI: 10.1016/j.ympev.2022.107653] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 09/30/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2022]
Abstract
Cactophilic species of the Drosophila buzzatii cluster (repleta group) comprise an excellent model group to investigate genomic changes underlying adaptation to extreme climate conditions and host plants. In particular, these species form a tractable system to study the transition from chemically simpler breeding sites (like prickly pears of the genus Opuntia) to chemically more complex hosts (columnar cacti). Here, we report four highly contiguous genome assemblies of three species of the buzzatii cluster. Based on this genomic data and inferred phylogenetic relationships, we identified candidate taxonomically restricted genes (TRGs) likely involved in the evolution of cactophily and cactus host specialization. Functional enrichment analyses of TRGs within the buzzatii cluster identified genes involved in detoxification, water preservation, immune system response, anatomical structure development, and morphogenesis. In contrast, processes that regulate responses to stress, as well as the metabolism of nitrogen compounds, transport, and secretion were found in the set of species that are columnar cacti dwellers. These findings are in line with the hypothesis that those genomic changes brought about key mechanisms underlying the adaptation of the buzzatii cluster species to arid regions in South America.
Collapse
Affiliation(s)
- Nicolás Nahuel Moreyra
- Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina.
| | - Francisca Cunha Almeida
- Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina.
| | - Carson Allan
- Department of Entomology, University of Arizona, Tucson, AZ 85719, USA.
| | - Nicolás Frankel
- Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina.
| | | | - Esteban Hasson
- Departamento de Ecología, Genética y Evolución (EGE), Facultad de Ciencias Exactas y Naturales (FCEyN), Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina; Instituto de Ecología, Genética y Evolución de Buenos Aires (IEGEBA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad Autónoma de Buenos Aires C1428EGA, Argentina.
| |
Collapse
|
36
|
Salis P, Peyran C, Morage T, de Bernard S, Nourikyan J, Coupé S, Bunet R, Planes S. RNA-Seq comparative study reveals molecular effectors linked to the resistance of Pinna nobilis to Haplosporidium pinnae parasite. Sci Rep 2022; 12:21229. [PMID: 36482098 PMCID: PMC9731998 DOI: 10.1038/s41598-022-25555-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 11/30/2022] [Indexed: 12/13/2022] Open
Abstract
With the intensification of maritime traffic, recently emerged infectious diseases have become major drivers in the decline and extinction of species. Since 2016, mass mortality events have decimated the endemic Mediterranean Sea bivalve Pinna nobilis, affecting ca. 100% of individuals. These events have largely been driven by Haplosporidium pinnae's infection, an invasive species which was likely introduced by shipping. While monitoring wild populations of P. nobilis, we observed individuals that survived such a mass mortality event during the summer of 2018 (France). We considered these individuals resistant, as they did not show any symptoms of the disease, while the rest of the population in the area was devastated. Furthermore, the parasite was not detected when we conducted a PCR amplification of a species-specific fragment of the small subunit ribosomal DNA. In parallel, the transcriptomic analysis showed evidence of some parasite RNA indicating that the resistant individuals had been exposed to the parasite without proliferating. To understand the underlying mechanisms of resistance in these individuals, we compared their gene expression with that of susceptible individuals. We performed de novo transcriptome assembly and annotated the expressed genes. A comparison of the transcriptomes in resistant and susceptible individuals highlighted a gene expression signature of the resistant phenotype. We found significant differential expressions of genes involved in immunity and cell architecture. This data provides the first insights into how individuals escape the pathogenicity associated with infection.
Collapse
Affiliation(s)
- Pauline Salis
- PSL Research University: EPHE-UPVD-CNRS, UAR 3278 CRIOBE, 66860 Perpignan, France
| | - Claire Peyran
- PSL Research University: EPHE-UPVD-CNRS, UAR 3278 CRIOBE, 66860 Perpignan, France
| | - Titouan Morage
- PSL Research University: EPHE-UPVD-CNRS, UAR 3278 CRIOBE, 66860 Perpignan, France
| | | | | | - Stéphane Coupé
- grid.12611.350000000088437055CNRS/INSU, IRD, MIO UM 110, Mediterranean Institute of Oceanography, University of Toulon, 83130 La Garde, France
| | - Robert Bunet
- Institut Océanographique Paul Ricard, Ile des Embiez, 83140 Six-Fours-Les-Plages, France
| | - Serge Planes
- PSL Research University: EPHE-UPVD-CNRS, UAR 3278 CRIOBE, 66860 Perpignan, France ,grid.452595.aLaboratoire d’Excellence “CORAIL”, Perpignan, France
| |
Collapse
|
37
|
Sruthi KB, Menon A, P A, Vasudevan Soniya E. Pervasive translation of small open reading frames in plant long non-coding RNAs. FRONTIERS IN PLANT SCIENCE 2022; 13:975938. [PMID: 36352887 PMCID: PMC9638090 DOI: 10.3389/fpls.2022.975938] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 09/29/2022] [Indexed: 06/16/2023]
Abstract
Long non-coding RNAs (lncRNAs) are primarily recognized as non-coding transcripts longer than 200 nucleotides with low coding potential and are present in both eukaryotes and prokaryotes. Recent findings reveal that lncRNAs can code for micropeptides in various species. Micropeptides are generated from small open reading frames (smORFs) and have been discovered frequently in short mRNAs and non-coding RNAs, such as lncRNAs, circular RNAs, and pri-miRNAs. The most accepted definition of a smORF is an ORF containing fewer than 100 codons, and ribosome profiling and mass spectrometry are the most prevalent experimental techniques used to identify them. Although the majority of micropeptides perform critical roles throughout plant developmental processes and stress conditions, only a handful of their functions have been verified to date. Even though more research is being directed toward identifying micropeptides, there is still a dearth of information regarding these peptides in plants. This review outlines the lncRNA-encoded peptides, the evolutionary roles of such peptides in plants, and the techniques used to identify them. It also describes the functions of the pri-miRNA and circRNA-encoded peptides that have been identified in plants.
Collapse
|
38
|
Iwasaki-Yokozawa S, Nanjo R, Akiyama-Oda Y, Oda H. Lineage-specific, fast-evolving GATA-like gene regulates zygotic gene activation to promote endoderm specification and pattern formation in the Theridiidae spider. BMC Biol 2022; 20:223. [PMID: 36203191 PMCID: PMC9535882 DOI: 10.1186/s12915-022-01421-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 09/27/2022] [Indexed: 11/23/2022] Open
Abstract
Background
The process of early development varies across the species-rich phylum Arthropoda. Owing to the limited research strategies for dissecting lineage-specific processes of development in arthropods, little is known about the variations in early arthropod development at molecular resolution. The Theridiidae spider, Parasteatoda tepidariorum, has its genome sequenced and could potentially contribute to dissecting early embryonic processes. Results We present genome-wide identification of candidate genes that exhibit locally restricted expression in germ disc forming stage embryos of P. tepidariorum, based on comparative transcriptomes of isolated cells from different regions of the embryo. A subsequent pilot screen by parental RNA interference identifies three genes required for body axis formation. One of them is a GATA-like gene that has been fast evolving after duplication and divergence from a canonical GATA family gene. This gene is designated fuchi nashi (fuchi) after its knockdown phenotypes, where the cell movement toward the formation of a germ disc was reversed. fuchi expression occurs in cells outside a forming germ disc and persists in the endoderm. Transcriptome and chromatin accessibility analyses of fuchi pRNAi embryos suggest that early fuchi activity regulates chromatin state and zygotic gene activation to promote endoderm specification and pattern formation. We also show that there are many uncharacterized genes regulated by fuchi. Conclusions Our genome-based research using an arthropod phylogenetically distant from Drosophila identifies a lineage-specific, fast-evolving gene with key developmental roles in one of the earliest, genome-wide regulatory events, and allows for molecular exploration of the developmental variations in early arthropod embryos. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01421-0.
Collapse
Affiliation(s)
- Sawa Iwasaki-Yokozawa
- Laboratory of Evolutionary Cell and Developmental Biology, JT Biohistory Research Hall, Takatsuki, Osaka, 569-1125, Japan
| | - Ryota Nanjo
- Laboratory of Evolutionary Cell and Developmental Biology, JT Biohistory Research Hall, Takatsuki, Osaka, 569-1125, Japan.,Department of Biological Sciences, Graduate School of Science, Osaka University, Toyonaka, Japan
| | - Yasuko Akiyama-Oda
- Laboratory of Evolutionary Cell and Developmental Biology, JT Biohistory Research Hall, Takatsuki, Osaka, 569-1125, Japan.,PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama, 332-0012, Japan.,Department of Microbiology and Infection Control, Faculty of Medicine, Osaka Medical and Pharmaceutical University, Takatsuki, Japan
| | - Hiroki Oda
- Laboratory of Evolutionary Cell and Developmental Biology, JT Biohistory Research Hall, Takatsuki, Osaka, 569-1125, Japan. .,Department of Biological Sciences, Graduate School of Science, Osaka University, Toyonaka, Japan.
| |
Collapse
|
39
|
Moutinho AF, Eyre-Walker A, Dutheil JY. Strong evidence for the adaptive walk model of gene evolution in Drosophila and Arabidopsis. PLoS Biol 2022; 20:e3001775. [PMID: 36099311 PMCID: PMC9470001 DOI: 10.1371/journal.pbio.3001775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 08/01/2022] [Indexed: 11/19/2022] Open
Abstract
Understanding the dynamics of species adaptation to their environments has long been a central focus of the study of evolution. Theories of adaptation propose that populations evolve by “walking” in a fitness landscape. This “adaptive walk” is characterised by a pattern of diminishing returns, where populations further away from their fitness optimum take larger steps than those closer to their optimal conditions. Hence, we expect young genes to evolve faster and experience mutations with stronger fitness effects than older genes because they are further away from their fitness optimum. Testing this hypothesis, however, constitutes an arduous task. Young genes are small, encode proteins with a higher degree of intrinsic disorder, are expressed at lower levels, and are involved in species-specific adaptations. Since all these factors lead to increased protein evolutionary rates, they could be masking the effect of gene age. While controlling for these factors, we used population genomic data sets of Arabidopsis and Drosophila and estimated the rate of adaptive substitutions across genes from different phylostrata. We found that a gene’s evolutionary age significantly impacts the molecular rate of adaptation. Moreover, we observed that substitutions in young genes tend to have larger physicochemical effects. Our study, therefore, provides strong evidence that molecular evolution follows an adaptive walk model across a large evolutionary timescale. This study uses population genomic datasets from Arabidopsis and Drosophila to show that young genes adapt faster and are subject to mutations of larger fitness effects, providing strong evidence that molecular evolution follows an adaptive walk model across a large evolutionary timescale.
Collapse
Affiliation(s)
- Ana Filipa Moutinho
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- * E-mail:
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Julien Y. Dutheil
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Unité Mixte de Recherche 5554 Institut des Sciences de l’Evolution, CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| |
Collapse
|
40
|
Batzel GO, Moreno BK, Lopez LS, Nguyen CK, Livingston BT, Joester D, Lyons DC. Proteomic and Transcriptomic Analyses in the Slipper Snail Crepidula
fornicata Uncover Shell Matrix Genes Expressed During Adult and Larval Biomineralization. Integr Org Biol 2022; 4:obac023. [PMID: 35968217 PMCID: PMC9365450 DOI: 10.1093/iob/obac023] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 05/18/2022] [Indexed: 11/12/2022] Open
Abstract
The gastropod shell is a composite composed of minerals and shell matrix proteins (SMPs). SMPs have been identified by proteomics in many molluscs, but few have been studied in detail. Open questions include (1) what gene regulatory networks regulate SMP expression, (2) what roles individual SMPs play in biomineralization, and (3) how the complement of SMPs changes over development. These questions are best addressed in a species in which gene perturbation studies are available; one such species is the slipper snail, Crepidula fornicata. Here, SEM and pXRD analysis demonstrated that the adult shell of C. fornicata exhibits crossed lamellar microstructure and is composed of aragonite. Using high-throughput proteomics we identified 185 SMPs occluded within the adult shell. Over half of the proteins in the shell proteome have known biomineralization domains, while at least 10% have no homologs in public databases. Differential gene expression analysis identified 20 SMP genes that are up-regulated in the shell-producing mantle tissue. Over half of these 20 SMPs are expressed during development with two, CfSMP1 and CfSMP2, expressed exclusively in the shell gland. Together, the description of the shell microstructure and a list of SMPs now sets the stage for studying the consequences of SMP gene knockdowns in molluscs.
Collapse
Affiliation(s)
- G O Batzel
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography , UCSD, La Jolla, CA 92037, USA
| | - B K Moreno
- Department of Materials Science and Engineering, Northwestern University , Evanston, IL 60208, USA
| | - L S Lopez
- Department of Biological Sciences, California State University , Long Beach, CA 90802, USA
| | - C K Nguyen
- Department of Biological Sciences, California State University , Long Beach, CA 90802, USA
| | - B T Livingston
- Department of Biological Sciences, California State University , Long Beach, CA 90802, USA
| | - D Joester
- Department of Materials Science and Engineering, Northwestern University , Evanston, IL 60208, USA
| | - D C Lyons
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography , UCSD, La Jolla, CA 92037, USA
| |
Collapse
|
41
|
Abstract
"De novo" genes evolve from previously non-genic DNA. This strikes many of us as remarkable, because it seems extraordinarily unlikely that random sequence would produce a functional gene. How is this possible? In this two-part review, I first summarize what is known about the origins and molecular functions of the small number of de novo genes for which such information is available. I then speculate on what these examples may tell us about how de novo genes manage to emerge despite what seem like enormous opposing odds.
Collapse
Affiliation(s)
- Caroline M Weisman
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
42
|
Jiang M, Li X, Dong X, Zu Y, Zhan Z, Piao Z, Lang H. Research Advances and Prospects of Orphan Genes in Plants. FRONTIERS IN PLANT SCIENCE 2022; 13:947129. [PMID: 35874010 PMCID: PMC9305701 DOI: 10.3389/fpls.2022.947129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 06/23/2022] [Indexed: 06/15/2023]
Abstract
Orphan genes (OGs) are defined as genes having no sequence similarity with genes present in other lineages. OGs have been regarded to play a key role in the development of lineage-specific adaptations and can also serve as a constant source of evolutionary novelty. These genes have often been found related to various stress responses, species-specific traits, special expression regulation, and also participate in primary substance metabolism. The advancement in sequencing tools and genome analysis methods has made the identification and characterization of OGs comparatively easier. In the study of OG functions in plants, significant progress has been made. We review recent advances in the fast evolving characteristics, expression modulation, and functional analysis of OGs with a focus on their role in plant biology. We also emphasize current challenges, adoptable strategies and discuss possible future directions of functional study of OGs.
Collapse
Affiliation(s)
- Mingliang Jiang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| | - Xiaonan Li
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Xiangshu Dong
- School of Agriculture, Yunnan University, Kunming, China
| | - Ye Zu
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Zongxiang Zhan
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Zhongyun Piao
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Hong Lang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| |
Collapse
|
43
|
Cardoso-Silva CB, Aono AH, Mancini MC, Sforça DA, da Silva CC, Pinto LR, Adams KL, de Souza AP. Taxonomically Restricted Genes Are Associated With Responses to Biotic and Abiotic Stresses in Sugarcane ( Saccharum spp.). FRONTIERS IN PLANT SCIENCE 2022; 13:923069. [PMID: 35845637 PMCID: PMC9280035 DOI: 10.3389/fpls.2022.923069] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 06/13/2022] [Indexed: 06/15/2023]
Abstract
Orphan genes (OGs) are protein-coding genes that are restricted to particular clades or species and lack homology with genes from other organisms, making their biological functions difficult to predict. OGs can rapidly originate and become functional; consequently, they may support rapid adaptation to environmental changes. Extensive spread of mobile elements and whole-genome duplication occurred in the Saccharum group, which may have contributed to the origin and diversification of OGs in the sugarcane genome. Here, we identified and characterized OGs in sugarcane, examined their expression profiles across tissues and genotypes, and investigated their regulation under varying conditions. We identified 319 OGs in the Saccharum spontaneum genome without detected homology to protein-coding genes in green plants, except those belonging to Saccharinae. Transcriptomic analysis revealed 288 sugarcane OGs with detectable expression levels in at least one tissue or genotype. We observed similar expression patterns of OGs in sugarcane genotypes originating from the closest geographical locations. We also observed tissue-specific expression of some OGs, possibly indicating a complex regulatory process for maintaining diverse functional activity of these genes across sugarcane tissues and genotypes. Sixty-six OGs were differentially expressed under stress conditions, especially cold and osmotic stresses. Gene co-expression network and functional enrichment analyses suggested that sugarcane OGs are involved in several biological mechanisms, including stimulus response and defence mechanisms. These findings provide a valuable genomic resource for sugarcane researchers, especially those interested in selecting stress-responsive genes.
Collapse
Affiliation(s)
- Cláudio Benício Cardoso-Silva
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Alexandre Hild Aono
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
| | - Melina Cristina Mancini
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
| | - Danilo Augusto Sforça
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
| | - Carla Cristina da Silva
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
- Agronomy Department, Federal University of Viçosa (UFV), Viçosa, Brazil
| | - Luciana Rossini Pinto
- Sugarcane Research Advanced Centre, Agronomic Institute of Campinas (IAC/APTA), Ribeirão Preto, Brazil
| | - Keith L. Adams
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Anete Pereira de Souza
- Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
- Institute of Biology, University of Campinas (UNICAMP), Campinas, Brazil
| |
Collapse
|
44
|
Ma D, Ding Q, Guo Z, Xu C, Liang P, Zhao Z, Song S, Zheng HL. The genome of a mangrove plant, Avicennia marina, provides insights into adaptation to coastal intertidal habitats. PLANTA 2022; 256:6. [PMID: 35678934 DOI: 10.1007/s00425-022-03916-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 05/17/2022] [Indexed: 05/26/2023]
Abstract
Whole-genome duplication, gene family and lineage-specific genes analysis based on high-quality genome reveal the adaptation mechanisms of Avicennia marina to coastal intertidal habitats. Mangrove plants grow in a complex habitat of coastal intertidal zones with high salinity, hypoxia, etc. Therefore, it is an interesting question how mangroves adapt to the unique intertidal environment. Here, we present a chromosome-level genome of the Avicennia marina, a typical true mangrove with a size of 480.43 Mb, contig N50 of 11.33 Mb and 30,956 annotated protein-coding genes. We identified 621 Avicennia-specific genes that are mainly related to flavonoid and lignin biosynthesis, auxin homeostasis and response to abiotic stimulus. We found that A. marina underwent a novel specific whole-genome duplication, which is in line with a brief era of global warming that occurred during the paleocene-eocene maximum. Comparative genomic and transcriptomic analyses outline the distinct evolution and sophisticated regulations of A. marina adaptation to the intertidal environments, including expansion of photosynthesis and oxidative phosphorylation gene families, unique genes and pathways for antibacterial, detoxifying antioxidant and reactive oxygen species scavenging. In addition, we also analyzed salt gland secretion-related genes, and those involved in the red bark-related flavonoid biosynthesis, while significant expansions of key genes such as NHX, 4CL, CHS and CHI. High-quality genomes in future investigations will facilitate the understand of evolution of mangrove and improve breeding.
Collapse
Affiliation(s)
- Dongna Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Qiansu Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Zejun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Pingping Liang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Zhizhu Zhao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Shiwei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China.
| |
Collapse
|
45
|
Weisman CM, Murray AW, Eddy SR. Mixing genome annotation methods in a comparative analysis inflates the apparent number of lineage-specific genes. Curr Biol 2022; 32:2632-2639.e2. [PMID: 35588743 DOI: 10.1016/j.cub.2022.04.085] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 03/17/2022] [Accepted: 04/21/2022] [Indexed: 12/16/2022]
Abstract
Comparisons of genomes of different species are used to identify lineage-specific genes, those genes that appear unique to one species or clade. Lineage-specific genes are often thought to represent genetic novelty that underlies unique adaptations. Identification of these genes depends not only on genome sequences, but also on inferred gene annotations. Comparative analyses typically use available genomes that have been annotated using different methods, increasing the risk that orthologous DNA sequences may be erroneously annotated as a gene in one species but not another, appearing lineage specific as a result. To evaluate the impact of such "annotation heterogeneity," we identified four clades of species with sequenced genomes with more than one publicly available gene annotation, allowing us to compare the number of lineage-specific genes inferred when differing annotation methods are used to those resulting when annotation method is uniform across the clade. In these case studies, annotation heterogeneity increases the apparent number of lineage-specific genes by up to 15-fold, suggesting that annotation heterogeneity is a substantial source of potential artifact.
Collapse
Affiliation(s)
- Caroline M Weisman
- Lewis-Sigler Institute for Integrative Genomics, Carl Icahn Laboratory, Princeton University, South Drive, Princeton, NJ 08540, USA.
| | - Andrew W Murray
- Department of Molecular & Cellular Biology, Harvard University, Divinity Avenue, Cambridge, MA 02138, USA
| | - Sean R Eddy
- Department of Molecular & Cellular Biology, Harvard University, Divinity Avenue, Cambridge, MA 02138, USA; Howard Hughes Medical Institute, Jones Bridge Road, Chevy Chase, MD 20815, USA; John A. Paulson School of Engineering and Applied Sciences, Harvard University, Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
46
|
Zumajo-Cardona C, Ambrose BA. Fleshy or dry: transcriptome analyses reveal the genetic mechanisms underlying bract development in Ephedra. EvoDevo 2022; 13:10. [PMID: 35477429 PMCID: PMC9047513 DOI: 10.1186/s13227-022-00195-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 04/04/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gnetales have a key phylogenetic position in the evolution of seed plants. Among the Gnetales, there is an extraordinary morphological diversity of seeds, the genus Ephedra, in particular, exhibits fleshy, coriaceous or winged (dry) seeds. Despite this striking diversity, its underlying genetic mechanisms remain poorly understood due to the limited studies in gymnosperms. Expanding the genomic and developmental data from gymnosperms contributes to a better understanding of seed evolution and development. RESULTS We performed transcriptome analyses on different plant tissues of two Ephedra species with different seed morphologies. Anatomical observations in early developing ovules, show that differences in the seed morphologies are established early in their development. The transcriptomic analyses in dry-seeded Ephedra californica and fleshy-seeded Ephedra antisyphilitica, allowed us to identify the major differences between the differentially expressed genes in these species. We detected several genes known to be involved in fruit ripening as upregulated in the fleshy seed of Ephedra antisyphilitica. CONCLUSIONS This study allowed us to determine the differentially expressed genes involved in seed development of two Ephedra species. Furthermore, the results of this study of seeds with the enigmatic morphology in Ephedra californica and Ephedra antisyphilitica, allowed us to corroborate the hypothesis which suggest that the extra envelopes covering the seeds of Gnetales are not genetically similar to integument. Our results highlight the importance of carrying out studies on less explored species such as gymnosperms, to gain a better understanding of the evolutionary history of plants.
Collapse
Affiliation(s)
- Cecilia Zumajo-Cardona
- New York Botanical Garden, Bronx, NY, USA.,The Graduate Center, City University of New York, New York, NY, USA
| | - Barbara A Ambrose
- New York Botanical Garden, Bronx, NY, USA. .,The Graduate Center, City University of New York, New York, NY, USA.
| |
Collapse
|
47
|
Moon H, Jeong AR, Kwon OK, Park CJ. Oryza-Specific Orphan Protein Triggers Enhanced Resistance to Xanthomonas oryzae pv. oryzae in Rice. FRONTIERS IN PLANT SCIENCE 2022; 13:859375. [PMID: 35360326 PMCID: PMC8961030 DOI: 10.3389/fpls.2022.859375] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Accepted: 02/17/2022] [Indexed: 05/27/2023]
Abstract
All genomes carry lineage-specific orphan genes lacking homology in their closely related species. Identification and functional study of the orphan genes is fundamentally important for understanding lineage-specific adaptations including acquirement of resistance to pathogens. However, most orphan genes are of unknown function due to the difficulties in studying them using helpful comparative genomics. Here, we present a defense-related Oryza-specific orphan gene, Xio1, specifically induced by the bacterial pathogen Xanthomonas oryzae pv. oryzae (Xoo) in an immune receptor XA21-dependent manner. Salicylic acid (SA) and ethephon (ET) also induced its expression, but methyl jasmonic acid (MeJA) reduced its basal expression. C-terminal green fluorescent protein (GFP) tagged Xio1 (Xio1-GFP) was visualized in the nucleus and the cytosol after polyethylene glycol (PEG)-mediated transformation in rice protoplasts and Agrobacterium-mediated infiltration in tobacco leaves. Transgenic rice plants overexpressing Xio1-GFP showed significantly enhanced resistance to Xoo with reduced lesion lengths and bacterial growth, in company with constitutive expression of defense-related genes. However, all of the transgenic plants displayed severe growth retardation and premature death. Reactive oxygen species (ROS) was significantly produced in rice protoplasts constitutively expressing Xio1-GFP. Overexpression of Xio1-GFP in non-Oryza plant species, Arabidopsis thaliana, failed to induce growth retardation and enhanced resistance to Pseudomonas syringae pv. tomato (Pst) DC3000. Our results suggest that the defense-related orphan gene Xio1 plays an important role in distinctive mechanisms evolved within the Oryza and provides a new source of Oryza-specific genes for crop-breeding programs.
Collapse
Affiliation(s)
- Hyeran Moon
- Department of Molecular Biology, Sejong University, Seoul, South Korea
| | - A-Ram Jeong
- Department of Molecular Biology, Sejong University, Seoul, South Korea
| | - Oh-Kyu Kwon
- Department of Molecular Biology, Sejong University, Seoul, South Korea
| | - Chang-Jin Park
- Department of Molecular Biology, Sejong University, Seoul, South Korea
- Department of Bioresources Engineering, Sejong University, Seoul, South Korea
| |
Collapse
|
48
|
Nagar N, Ben Tal N, Pupko T. EvoRator: Prediction of residue-level evolutionary rates from protein structures using machine learning. J Mol Biol 2022; 434:167538. [DOI: 10.1016/j.jmb.2022.167538] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/07/2022] [Accepted: 03/07/2022] [Indexed: 10/18/2022]
|
49
|
Sollitto M, Kenny NJ, Greco S, Tucci CF, Calcino AD, Gerdol M. Detecting Structural Variants and Associated Gene Presence-Absence Variation Phenomena in the Genomes of Marine Organisms. Methods Mol Biol 2022; 2498:53-76. [PMID: 35727540 DOI: 10.1007/978-1-0716-2313-8_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
As complete genomes become easier to attain, even from previously difficult-to-sequence species, and as genomic resequencing becomes more routine, it is becoming obvious that genomic structural variation is more widespread than originally thought and plays an important role in maintaining genetic variation in populations. Structural variants (SVs) and associated gene presence-absence variation (PAV) can be important players in local adaptation, allowing the maintenance of genetic variation and taking part in other evolutionarily relevant phenomena. While recent studies have highlighted the importance of structural variation in Mollusca, the prevalence of this phenomenon in the broader context of marine organisms remains to be fully investigated.Here, we describe a straightforward and broadly applicable method for the identification of SVs in fully assembled diploid genomes, leveraging the same reads used for assembly. We also explain a gene PAV analysis protocol, which could be broadly applied to any species with a fully sequenced reference genome available. Although the strength of these approaches have been tested and proven in marine invertebrates, which tend to have high levels of heterozygosity, possibly due to their lifestyle traits, they are also applicable to other species across the tree of life, providing a ready means to begin investigations into this potentially widespread phenomena.
Collapse
Affiliation(s)
- Marco Sollitto
- Department of Life Sciences, Università degli Studi di Trieste, Trieste, Italy
| | - Nathan J Kenny
- Faculty of Health and Life Sciences, Oxford Brookes, Oxford, UK
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Samuele Greco
- Department of Life Sciences, Università degli Studi di Trieste, Trieste, Italy
| | | | - Andrew D Calcino
- Department of Evolutionary Biology, Integrative Zoology, University of Vienna, Vienna, Austria
| | - Marco Gerdol
- Department of Life Sciences, Università degli Studi di Trieste, Trieste, Italy.
| |
Collapse
|
50
|
Barua A, Koludarov I, Mikheyev AS. Co-option of the same ancestral gene family gave rise to mammalian and reptilian toxins. BMC Biol 2021; 19:268. [PMID: 34949191 PMCID: PMC8705180 DOI: 10.1186/s12915-021-01191-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 11/11/2021] [Indexed: 12/03/2022] Open
Abstract
Background Evolution can occur with surprising predictability when organisms face similar ecological challenges. For most traits, it is difficult to ascertain whether this occurs due to constraints imposed by the number of possible phenotypic solutions or because of parallel responses by shared genetic and regulatory architecture. Exceptionally, oral venoms are a tractable model of trait evolution, being largely composed of proteinaceous toxins that have evolved in many tetrapods, ranging from reptiles to mammals. Given the diversity of venomous lineages, they are believed to have evolved convergently, even though biochemically similar toxins occur in all taxa. Results Here, we investigate whether ancestral genes harbouring similar biochemical activity may have primed venom evolution, focusing on the origins of kallikrein-like serine proteases that form the core of most vertebrate oral venoms. Using syntenic relationships between genes flanking known toxins, we traced the origin of kallikreins to a single locus containing one or more nearby paralogous kallikrein-like clusters. Additionally, phylogenetic analysis of vertebrate serine proteases revealed that kallikrein-like toxins in mammals and reptiles are genetically distinct from non-toxin ones. Conclusions Given the shared regulatory and genetic machinery, these findings suggest that tetrapod venoms evolved by co-option of proteins that were likely already present in saliva. We term such genes ‘toxipotent’—in the case of salivary kallikreins they already had potent vasodilatory activity that was weaponized by venomous lineages. Furthermore, the ubiquitous distribution of kallikreins across vertebrates suggests that the evolution of envenomation may be more common than previously recognized, blurring the line between venomous and non-venomous animals. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-021-01191-1.
Collapse
Affiliation(s)
- Agneesh Barua
- Ecology and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan.
| | - Ivan Koludarov
- Animal Venomics Group, Justus Leibig University, Giessen, Germany
| | - Alexander S Mikheyev
- Research School of Biology, Australian National University, Canberra, ACT, Australia.
| |
Collapse
|