1
|
Yee SW, Ferrández-Peral L, Alentorn-Moron P, Fontsere C, Ceylan M, Koleske ML, Handin N, Artegoitia VM, Lara G, Chien HC, Zhou X, Dainat J, Zalevsky A, Sali A, Brand CM, Wolfreys FD, Yang J, Gestwicki JE, Capra JA, Artursson P, Newman JW, Marquès-Bonet T, Giacomini KM. Illuminating the function of the orphan transporter, SLC22A10, in humans and other primates. Nat Commun 2024; 15:4380. [PMID: 38782905 PMCID: PMC11116522 DOI: 10.1038/s41467-024-48569-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 05/06/2024] [Indexed: 05/25/2024] Open
Abstract
SLC22A10 is an orphan transporter with unknown substrates and function. The goal of this study is to elucidate its substrate specificity and functional characteristics. In contrast to orthologs from great apes, human SLC22A10, tagged with green fluorescent protein, is not expressed on the plasma membrane. Cells expressing great ape SLC22A10 orthologs exhibit significant accumulation of estradiol-17β-glucuronide, unlike those expressing human SLC22A10. Sequence alignments reveal a proline at position 220 in humans, which is a leucine in great apes. Replacing proline with leucine in SLC22A10-P220L restores plasma membrane localization and uptake function. Neanderthal and Denisovan genomes show proline at position 220, akin to modern humans, indicating functional loss during hominin evolution. Human SLC22A10 is a unitary pseudogene due to a fixed missense mutation, P220, while in great apes, its orthologs transport sex steroid conjugates. Characterizing SLC22A10 across species sheds light on its biological role, influencing organism development and steroid homeostasis.
Collapse
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Luis Ferrández-Peral
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Pol Alentorn-Moron
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Claudia Fontsere
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
- Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Øster Farimagsgade 5A, 1352, Copenhagen, Denmark
| | - Merve Ceylan
- Department of Pharmacy, Uppsala University, Uppsala, Sweden
| | - Megan L Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Niklas Handin
- Department of Pharmacy, Uppsala University, Uppsala, Sweden
| | - Virginia M Artegoitia
- United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA, 95616, USA
| | - Giovanni Lara
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Huan-Chieh Chien
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Jacques Dainat
- Joint Research Unit for Infectious Diseases and Vectors Ecology Genetics Evolution and Control (MIVEGEC), University of Montpellier, French National Center for Scientific Research (CNRS 5290), French National Research Institute for Sustainable Development (IRD 224), 911 Avenue Agropolis, BP 64501, 34394, Montpellier Cedex 5, France
| | - Arthur Zalevsky
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA
- Quantitative Biosciences Institute (QBI), University of California, San Francisco, San Francisco, CA, US
| | - Colin M Brand
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Finn D Wolfreys
- Department of Ophthalmology, University of California, San Francisco, CA, USA
| | - Jia Yang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Jason E Gestwicki
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA
- Institute for Neurodegenerative Diseases, University of California, San Francisco, CA, USA
| | - John A Capra
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Per Artursson
- Department of Pharmacy, Uppsala University, Uppsala, Sweden
- Science for Life Laboratories, Uppsala University, Uppsala, Sweden
| | - John W Newman
- United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA, 95616, USA
- Department of Nutrition, University of California, Davis, Davis, CA, 95616, USA
| | - Tomàs Marquès-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003, Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain
- CNAG, Centro Nacional de Analisis Genomico, Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193, Cerdanyola del Vallès, Barcelona, Spain
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
2
|
Kang J, Gu L, Guo B, Rong W, Xu S, Yang G, Ren W. Molecular evolution of wound healing-related genes during cetacean secondary aquatic adaptation. Integr Zool 2023. [PMID: 37897119 DOI: 10.1111/1749-4877.12781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/29/2023]
Abstract
The marine environment presents challenges for wound healing in cetaceans, despite their remarkable recovery abilities with minimal infections or complications. However, the molecular mechanism underlying this efficient wound healing remains underexplored. To better understand the molecular mechanisms behind wound healing in cetaceans, we investigated the evolutionary patterns of 37 wound healing-related genes in representative mammals. We found wound healing-related genes experience adaptive evolution in cetaceans: (1) Three extrinsic coagulation pathway-related genes-tissue factor (F3), coagulation factor VII (F7), and coagulation factor X (F10)-are subject to positive selection in cetaceans, which might promote efficient hemostasis after injury; positive selection in transforming growth factor-beta 2 (TGF-β2), transforming growth factor-beta 3 (TGF-β3), and platelet-derived growth factor D (PDGFD), which play immunological roles in wound healing, may help cetaceans enhance inflammatory response and tissue debridement. (2) Coagulation factor XII (F12) is the initiation factor in the intrinsic coagulation pathway. It had a premature stop codon mutation and was subjected to selective stress relaxation in cetaceans, suggesting that the early termination of F12 may help cetaceans avoid the risk of vascular blockage during diving. (3) Fibrinogen alpha chain (FGA) and FIII, which were detected to contain the specific amino acid substitutions in marine mammals, indicating similar evolutionary mechanisms might exist among marine mammals to maintain strong wound-healing ability. Thus, our research provides further impetus to study the evolution of the wound healing system in cetaceans and other marine mammals, extending knowledge of preventing coagulation disorder and atherosclerosis in humans.
Collapse
Affiliation(s)
- Jieqiong Kang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Long Gu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Boxiong Guo
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Wenqi Rong
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Shixia Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Guang Yang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Wenhua Ren
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| |
Collapse
|
3
|
Sato H, Sonoda N, Nakano M, Matsuyama Y, Shizume A, Arai R, Nogawa M, Taguchi G, Shimosaka M. Multi-enzyme Machinery for Chitin Degradation in the Chitinolytic Bacterium Chitiniphilus shinanonensis SAY3 T. Curr Microbiol 2023; 80:360. [PMID: 37796346 DOI: 10.1007/s00284-023-03489-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 07/19/2023] [Indexed: 10/06/2023]
Abstract
The chitinolytic bacterium, Chitiniphilus shinanonensis SAY3T was examined to characterize its chitin-degrading enzymes in view of its potential to convert biomass chitin into useful saccharides. A survey of the whole-genome sequence revealed 49 putative genes encoding polypeptides that are thought to be related to chitin degradation. Based on an analysis of the relative quantity of each transcript and an assay for chitin-degrading activity of recombinant proteins, a chitin degradation system driven by 19 chitinolytic enzymes was proposed. These include sixteen endo-type chitinases, two N-acetylglucosaminidases, and one lipopolysaccharide monooxygenase that catalyzes the oxidative cleavage of glycosidic bonds. Among the 16 chitinases, ChiL was characterized by its remarkable transglycosylation activity. Of the two N-acetylglucosaminidases (ChiI and ChiT), ChiI was the major enzyme, corresponding to > 98% of the total cellular activity. Surprisingly, a chiI-disrupted mutant was still able to grow on medium with powdered chitin or GlcNAc dimer. However, its growth rate was slightly lower compared to that of the wild-type SAY3. This multi-enzyme machinery composed of various types of chitinolytic enzymes may support SAY3 to efficiently utilize native chitin as a carbon and energy source and may play a role in developing an enzymatic process to decompose and utilize abundant chitin at the industrial scale.
Collapse
Affiliation(s)
- Hiroaki Sato
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Norie Sonoda
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Moe Nakano
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Yuka Matsuyama
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Arisa Shizume
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Ryoichi Arai
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Masahiro Nogawa
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Goro Taguchi
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan
| | - Makoto Shimosaka
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, 3-15-1 Tokida, Ueda, Nagano, 386-8567, Japan.
| |
Collapse
|
4
|
Yadav S, Kalwan G, Meena S, Gill SS, Yadava YK, Gaikwad K, Jain PK. Unravelling the due importance of pseudogenes and their resurrection in plants. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2023; 203:108062. [PMID: 37778114 DOI: 10.1016/j.plaphy.2023.108062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 09/22/2023] [Accepted: 09/26/2023] [Indexed: 10/03/2023]
Abstract
The complexities of a genome are underpinned to the vast expanses of the intergenic region, which constitutes ∼97-98% of the genome. This region is essentially composed of what is colloquially referred to as the "junk DNA" and is composed of various elements like transposons, repeats, pseudogenes, etc. The latter have long been considered as dead elements merely contributing to transcriptional noise in the genome. Many studies now describe the previously unknown regulatory functions of these genes. Recent advances in the Next-generation sequencing (NGS) technologies have allowed unprecedented access to these regions. With the availability of whole genome sequences of more than 788 different plant species in past 20 years, genome annotation has become feasible like never before. Different bioinformatic pipelines are available for the identification of pseudogenes. However, still little is known about their biological functions. The functional validation of these genes remains challenging and research in this area is still in infancy, particularly in plants. CRISPR/Cas-based genome editing could provide solutions to understand the biological roles of these genes by allowing creation of precise edits within these genes. The possibility of pseudogene reactivation or resurrection as has been demonstrated in a few studies might open new avenues of genetic manipulation to yield a desirable phenotype. This review aims at comprehensively summarizing the progress made with regards to the identification of pseudogenes and understanding their biological functions in plants.
Collapse
Affiliation(s)
- Sheel Yadav
- ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India; PG School, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India; Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi, 110012, India
| | - Gopal Kalwan
- ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India; PG School, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Shashi Meena
- PG School, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India; Division of Plant Physiology, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Sarvajeet Singh Gill
- Stress Physiology & Molecular Biology Lab, Centre for Biotechnology, Maharshi Dayanand University, Rohtak, 124 001, Haryana, India
| | - Yashwant K Yadava
- ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India
| | - Kishor Gaikwad
- ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India
| | - P K Jain
- ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India.
| |
Collapse
|
5
|
Yee SW, Ferrández-Peral L, Alentorn P, Fontsere C, Ceylan M, Koleske ML, Handin N, Artegoitia VM, Lara G, Chien HC, Zhou X, Dainat J, Zalevsky A, Sali A, Brand CM, Capra JA, Artursson P, Newman JW, Marques-Bonet T, Giacomini KM. Illuminating the Function of the Orphan Transporter, SLC22A10 in Humans and Other Primates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.08.552553. [PMID: 37609337 PMCID: PMC10441401 DOI: 10.1101/2023.08.08.552553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
SLC22A10 is classified as an orphan transporter with unknown substrates and function. Here we describe the discovery of the substrate specificity and functional characteristics of SLC22A10. The human SLC22A10 tagged with green fluorescent protein was found to be absent from the plasma membrane, in contrast to the SLC22A10 orthologs found in great apes. Estradiol-17β-glucuronide accumulated in cells expressing great ape SLC22A10 orthologs (over 4-fold, p<0.001). In contrast, human SLC22A10 displayed no uptake function. Sequence alignments revealed two amino acid differences including a proline at position 220 of the human SLC22A10 and a leucine at the same position of great ape orthologs. Site-directed mutagenesis yielding the human SLC22A10-P220L produced a protein with excellent plasma membrane localization and associated uptake function. Neanderthal and Denisovan genomes show human-like sequences at proline 220 position, corroborating that SLC22A10 were rendered nonfunctional during hominin evolution after the divergence from the pan lineage (chimpanzees and bonobos). These findings demonstrate that human SLC22A10 is a unitary pseudogene and was inactivated by a missense mutation that is fixed in humans, whereas orthologs in great apes transport sex steroid conjugates.
Collapse
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | | | - Pol Alentorn
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
| | - Claudia Fontsere
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain; Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Øster Farimagsgade 5A, 1352 Copenhagen, Denmark
| | - Merve Ceylan
- Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
| | - Megan L. Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | - Niklas Handin
- Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
| | - Virginia M. Artegoitia
- United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA
| | - Giovanni Lara
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | - Huan-Chieh Chien
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | - Jacques Dainat
- Joint Research Unit for Infectious Diseases and Vectors Ecology Genetics Evolution and Control (MIVEGEC), University of Montpellier, French National Center for Scientific Research (CNRS 5290), French National Research Institute for Sustainable Development (IRD 224), 911 Avenue Agropolis, BP 64501, 34394 Montpellier Cedex 5, France
| | - Arthur Zalevsky
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF Box 2880 600 16th St, San Francisco, CA 94143, United States; Quantitative Biosciences Institute (QBI), University of California, San Francisco, 1700 4th St, San Francisco, CA, United States
| | - Colin M. Brand
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - John A. Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Per Artursson
- Department of Pharmacy and Science for Life Laboratory, Uppsala University, P.O. Box 580, 75123, Uppsala, Sweden
| | - John W. Newman
- United States Department of Agriculture, Agricultural Research Service, Western Human Nutrition Research Center, Davis, CA 95616, USA; Department of Nutrition, University of California, Davis, Davis, CA 95616, USA; UC Davis West Coast Metabolomics Center, Davis, CA 95616, USA
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain; Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain; CNAG, Centro Nacional de Analisis Genomico, Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Kathleen M. Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
6
|
Pan C, Mao S, Xiong Z, Chen Z, Xu N. Glutamate dehydrogenase: Potential therapeutic targets for neurodegenerative disease. Eur J Pharmacol 2023; 950:175733. [PMID: 37116563 DOI: 10.1016/j.ejphar.2023.175733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Revised: 03/31/2023] [Accepted: 04/17/2023] [Indexed: 04/30/2023]
Abstract
Glutamate dehydrogenase (GDH) is a key enzyme in mammalian glutamate metabolism. It is located at the intersection of multiple metabolic pathways and participates in a variety of cellular activities. GDH activity is strictly regulated by a variety of allosteric compounds. Here, we review the unique distribution and expressions of GDH in the brain nervous system. GDH plays an essential role in the glutamate-glutamine-GABA cycle between astrocytes and neurons. The dysfunction of GDH may induce the occurrence of many neurodegenerative diseases, such as Parkinson's disease, epilepsy, Alzheimer's disease, schizophrenia, and frontotemporal dementia. GDH activators and gene therapy have been found to protect neurons and improve motor disorders in neurodegenerative diseases caused by glutamate metabolism disorders. To date, no medicine has been discovered that specifically targets neurodegenerative diseases, although several potential medicines are used clinically. Targeting GDH to treat neurodegenerative diseases is expected to provide new insights and treatment strategies.
Collapse
Affiliation(s)
- Chuqiao Pan
- College of Pharmaceutical Science, Institute of Drug Development & Chemical Biology, Zhejiang University of Technology, Huzhou, 313200, Zhejiang, People's Republic of China
| | - Shijie Mao
- College of Pharmaceutical Science, Institute of Drug Development & Chemical Biology, Zhejiang University of Technology, Huzhou, 313200, Zhejiang, People's Republic of China
| | - Zeping Xiong
- Department of Collaborative Innovation Center of Yangtze River Delta Region Green Pharmaceuticals, Zhejiang University of Technology, Huzhou, 313200, Zhejiang, People's Republic of China
| | - Zhao Chen
- College of Pharmaceutical Science, Institute of Drug Development & Chemical Biology, Zhejiang University of Technology, Huzhou, 313200, Zhejiang, People's Republic of China
| | - Ning Xu
- College of Pharmaceutical Science, Institute of Drug Development & Chemical Biology, Zhejiang University of Technology, Huzhou, 313200, Zhejiang, People's Republic of China.
| |
Collapse
|
7
|
Munk M, Villalobo E, Villalobo A, Berchtold MW. Differential expression of the three independent CaM genes coding for an identical protein: Potential relevance of distinct mRNA stability by different codon usage. Cell Calcium 2022; 107:102656. [DOI: 10.1016/j.ceca.2022.102656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 09/01/2022] [Accepted: 09/25/2022] [Indexed: 11/24/2022]
|
8
|
Abrahim M, Machado E, Alvarez-Valín F, de Miranda AB, Catanho M. Uncovering Pseudogenes and Intergenic Protein-coding Sequences in TriTryps' Genomes. Genome Biol Evol 2022; 14:6754225. [PMID: 36208292 PMCID: PMC9576210 DOI: 10.1093/gbe/evac142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/14/2022] [Accepted: 09/20/2022] [Indexed: 01/24/2023] Open
Abstract
Trypanosomatids belong to a remarkable group of unicellular, parasitic organisms of the order Kinetoplastida, an early diverging branch of the phylogenetic tree of eukaryotes, exhibiting intriguing biological characteristics affecting gene expression (intronless polycistronic transcription, trans-splicing, and RNA editing), metabolism, surface molecules, and organelles (compartmentalization of glycolysis, variation of the surface molecules, and unique mitochondrial DNA), cell biology and life cycle (phagocytic vacuoles evasion and intricate patterns of cell morphogenesis). With numerous genomic-scale data of several trypanosomatids becoming available since 2005 (genomes, transcriptomes, and proteomes), the scientific community can further investigate the mechanisms underlying these unusual features and address other unexplored phenomena possibly revealing biological aspects of the early evolution of eukaryotes. One fundamental aspect comprises the processes and mechanisms involved in the acquisition and loss of genes throughout the evolutionary history of these primitive microorganisms. Here, we present a comprehensive in silico analysis of pseudogenes in three major representatives of this group: Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi. Pseudogenes, DNA segments originating from altered genes that lost their original function, are genomic relics that can offer an essential record of the evolutionary history of functional genes, as well as clues about the dynamics and evolution of hosting genomes. Scanning these genomes with functional proteins as proxies to reveal intergenic regions with protein-coding features, relying on a customized threshold to distinguish statistically and biologically significant sequence similarities, and reassembling remnant sequences from their debris, we found thousands of pseudogenes and hundreds of open reading frames, with particular characteristics in each trypanosomatid: mutation profile, number, content, density, codon bias, average size, single- or multi-copy gene origin, number and type of mutations, putative primitive function, and transcriptional activity. These features suggest a common process of pseudogene formation, different patterns of pseudogene evolution and extant biological functions, and/or distinct genome organization undertaken by those parasites during evolution, as well as different evolutionary and/or selective pressures acting on distinct lineages.
Collapse
Affiliation(s)
- Mayla Abrahim
- Laboratório de Tecnologia Imunológica, Instituto de Tecnologia em Imunobiológicos, Vice-Diretoria de Desenvolvimento Tecnológico, Bio-Manguinhos, Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, RJ, Brazil
| | - Edson Machado
- Laboratório de Biologia Molecular Aplicada a Micobactérias, Instituto Oswaldo Cruz, Fiocruz, Brazil
| | - Fernando Alvarez-Valín
- Unidad de Genómica Evolutiva, Sección Biomatemática, Universidad de la República del Uruguay, Montevideo, Uruguay
| | | | | |
Collapse
|
9
|
Pankaj P, Anupam M, Amita P, Veerendra V, Shivani P. Pseudogenes in Juvenile Nasopharyngeal Angiofibroma: First Pilot Observation. Indian J Otolaryngol Head Neck Surg 2022; 74:1237-1241. [PMID: 36452697 PMCID: PMC9702465 DOI: 10.1007/s12070-020-02336-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 12/14/2020] [Indexed: 11/30/2022] Open
Abstract
To investigate the role of Pseudogenes (PG) in juvenile nasopharyngeal angiofibroma (JNA) that were once considered to be junk-DNA or 'genomic fossils'. Five to 10 fresh JNA samples were analyzed for molecular expressions of 5 PG/respective parent genes (VEGFR1P1/VEGFR; FGFR3P1/FGFR3; PDGFAP1/PDGFA; IL6RP1/IL6R; POU5F1B/POU5F1) and clinical details correlated. IL6R, PDGFA, VEGFR2, FGF3 and their respective PG (VEGFR1P1, PDGFAP1, IL6RP1, FGFR3P1) were highly expressed but POU5F1/POU5F1B were not. The difference in expression between IL6R & IL6RP1 was substantially larger compared with other 4 pairs. While VEGFR1P1, PDGFAP1 & POU5F1B were expressed more than their respective parent genes; IL6RP1 & FGFR3P1 showed reduced expression. No clinical significance was apparent in any parameter nor was any difference noted between recurrent and upfront cases. A definite implication of pseudogene in JNA is evident in this ever first global study but future studies are needed to validate the current findings as well as further characterize its role/profile in larger sample. This may explain extreme variability of JNA, its heterogenous etiopathogenesis, evolving patterns and molecular characterization for possible targeted therapy.
Collapse
Affiliation(s)
- Pandey Pankaj
- Department of Otorhinolaryngology and Head & Neck Surgery, King George’s Medical University, A-1/19, Sector H, Aliganj, Lucknow India
| | - Mishra Anupam
- Department of Otorhinolaryngology and Head & Neck Surgery, King George’s Medical University, A-1/19, Sector H, Aliganj, Lucknow India
| | - Pandey Amita
- Department of Obstetrics/Gynecology, King George’s Medical University, Aliganj, Lucknow India
| | - Verma Veerendra
- Department of Otorhinolaryngology and Head & Neck Surgery, King George’s Medical University, A-1/19, Sector H, Aliganj, Lucknow India
| | - Pandey Shivani
- Department of Biochemistry, King George’s Medical University, Aliganj, Lucknow India
| |
Collapse
|
10
|
Chiefari E, Arcidiacono B, Mirabelli M, Brunetti FS, Greco E, Foti DP, Brunetti A. Methods to Study Protein-Binding to Pseudogene Transcripts. Methods Mol Biol 2021; 2324:187-202. [PMID: 34165716 DOI: 10.1007/978-1-0716-1503-4_12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
One of the most commonly described biological feature of processed pseudogenes is the ability to influence the expression of their parental coding genes. As evidenced in several studies, the high sequence similarity between these RNA pairs sets up a certain level of competition for posttranscriptional regulators, including, among others, RNA-binding proteins (RBPs). RBPs may affect, positively or negatively, the stability of bound mRNAs, so that, if an overexpressed pseudogene competes with its homologous coding gene, the downstream protein synthesis would change, with potential pathological consequences. Given these premises, a rigorous and comprehensive understanding of interactions between pseudogene-parental gene RNA pairs and RBPs could provide further insights into the biological bases of complex diseases, such as cancer, cardiovascular disease, and type 2 diabetes, identifying novel predictive and/or prognostic biomarkers.Herein, we detail easily adaptable protocols of plasmid-based molecular cloning and RNA-electrophoretic mobility shift assay (EMSA) used in our laboratory for determining the interaction between a cytoplasmatic stabilizing protein (αCP1) and the pseudogene-parental gene RNA pair HMGA1-p /HMGA1. We also offer a general overview of RNA immunoprecipitation procedures and present novel bioinformatic tools for predicting RBPs binding sites on pseudogene transcripts.
Collapse
Affiliation(s)
- Eusebio Chiefari
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy
| | - Biagio Arcidiacono
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy
| | - Maria Mirabelli
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy
| | | | - Emanuela Greco
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy
| | - Daniela Patrizia Foti
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy
| | - Antonio Brunetti
- Department of Health Sciences, University of Catanzaro "Magna Græcia", Catanzaro, Italy.
| |
Collapse
|
11
|
Pan CT, Lin YS. MicroRNA retrocopies generated via L1-mediated retrotransposition in placental mammals help to reveal how their parental genes were transcribed. Sci Rep 2020; 10:20612. [PMID: 33244051 PMCID: PMC7692494 DOI: 10.1038/s41598-020-77381-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 11/10/2020] [Indexed: 02/07/2023] Open
Abstract
In mammalian genomes, most retrocopies emerged via the L1 retrotransposition machinery. The hallmarks of an L1-mediated retrocopy, i.e., the intronlessness, the presence of a 3′ poly-A tail, and the TSDs at both ends, were frequently used to identify retrotransposition events. However, most previous studies only focused on protein-coding genes as their possible parental sources and thus only a few retrocopies derived from non-coding genes were reported. Remarkably, none of them was from microRNAs. Here in this study, we found several retrocopies generated from the mir-302–367 cluster gene (MIR302CHG), and identified a novel alternatively spliced exon encoding mir-302a. The other recognized microRNA retrotransposition events are primate-specific with mir-373 and mir-498 as their parental genes. The 3′ poly-A tracts of these two retrocopy groups were directly attached to the end of the microRNA precursor homologous regions, which suggests that their parental transcripts might alternatively terminate at the end of mir-373 and mir-498. All the three parental microRNAs are highly expressed in specific tissues with elevated retrotransposon activity, such as the embryonic stem cells and the placenta. This might be the reason that our first microRNA retrocopy findings were derived from these three microRNA genes.
Collapse
Affiliation(s)
- Cheng-Tsung Pan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Yeong-Shin Lin
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan. .,Department of Biological Science and Technology, College of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan. .,Center For Intelligent Drug Systems and Smart Bio-devices (IDS2B), National Chiao Tung University, Hsinchu, 300, Taiwan.
| |
Collapse
|
12
|
Gong L, Luo H, Shi W, Yang M. Intra-individual variation and transcribed pseudogenes in the ribosomal ITS1-5.8S-ITS2 rDNA of Paraplagusia japonica (Pleuronectiformes: Cynoglossidae). Biochem Biophys Res Commun 2019; 513:726-731. [DOI: 10.1016/j.bbrc.2019.04.064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 04/09/2019] [Indexed: 10/27/2022]
|
13
|
Silar P, Dauget JM, Gautier V, Grognet P, Chablat M, Hermann-Le Denmat S, Couloux A, Wincker P, Debuchy R. A gene graveyard in the genome of the fungus Podospora comata. Mol Genet Genomics 2018; 294:177-190. [PMID: 30288581 DOI: 10.1007/s00438-018-1497-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 09/28/2018] [Indexed: 02/07/2023]
Abstract
Mechanisms involved in fine adaptation of fungi to their environment include differential gene regulation associated with single nucleotide polymorphisms and indels (including transposons), horizontal gene transfer, gene copy amplification, as well as pseudogenization and gene loss. The two Podospora genome sequences examined here emphasize the role of pseudogenization and gene loss, which have rarely been documented in fungi. Podospora comata is a species closely related to Podospora anserina, a fungus used as model in several laboratories. Comparison of the genome of P. comata with that of P. anserina, whose genome is available for over 10 years, should yield interesting data related to the modalities of genome evolution between these two closely related fungal species that thrive in the same types of biotopes, i.e., herbivore dung. Here, we present the genome sequence of the mat + isolate of the P. comata reference strain T. Comparison with the genome of the mat + isolate of P. anserina strain S confirms that P. anserina and P. comata are likely two different species that rarely interbreed in nature. Despite having a 94-99% of nucleotide identity in the syntenic regions of their genomes, the two species differ by nearly 10% of their gene contents. Comparison of the species-specific gene sets uncovered genes that could be responsible for the known physiological differences between the two species. Finally, we identified 428 and 811 pseudogenes (3.8 and 7.2% of the genes) in P. anserina and P. comata, respectively. Presence of high numbers of pseudogenes supports the notion that difference in gene contents is due to gene loss rather than horizontal gene transfers. We propose that the high frequency of pseudogenization leading to gene loss in P. anserina and P. comata accompanies specialization of these two fungi. Gene loss may be more prevalent during the evolution of other fungi than usually thought.
Collapse
Affiliation(s)
- Philippe Silar
- Univ Paris Diderot, Sorbonne Paris Cité, Laboratoire Interdisciplinaire des Energies de Demain, 75205, Paris Cedex 13, France.
| | - Jean-Marc Dauget
- Univ Paris Diderot, Sorbonne Paris Cité, Laboratoire Interdisciplinaire des Energies de Demain, 75205, Paris Cedex 13, France
| | - Valérie Gautier
- Univ Paris Diderot, Sorbonne Paris Cité, Laboratoire Interdisciplinaire des Energies de Demain, 75205, Paris Cedex 13, France
| | - Pierre Grognet
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette cedex, France
| | - Michelle Chablat
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette cedex, France
| | - Sylvie Hermann-Le Denmat
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette cedex, France.,Ecole Normale Supérieure, 75005, Paris, France
| | - Arnaud Couloux
- CEA, Genoscope, Institut de biologie François Jacob, CP 5706, Evry, France
| | - Patrick Wincker
- CEA, Genoscope, Institut de biologie François Jacob, CP 5706, Evry, France.,CNRS UMR 8030, Evry, France.,Univ. Evry, Université Paris-Saclay, Evry, France
| | - Robert Debuchy
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette cedex, France.
| |
Collapse
|
14
|
Annotation of the Corymbia terpene synthase gene family shows broad conservation but dynamic evolution of physical clusters relative to Eucalyptus. Heredity (Edinb) 2018. [PMID: 29523839 DOI: 10.1038/s41437-018-0058-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Terpenes are economically and ecologically important phytochemicals. Their synthesis is controlled by the terpene synthase (TPS) gene family, which is highly diversified throughout the plant kingdom. The plant family Myrtaceae are characterised by especially high terpene concentrations, and considerable variation in terpene profiles. Many Myrtaceae are grown commercially for terpene products including the eucalypts Corymbia and Eucalyptus. Eucalyptus grandis has the largest TPS gene family of plants currently sequenced, which is largely conserved in the closely related E. globulus. However, the TPS gene family has been well studied only in these two eucalypt species. The recent assembly of two Corymbia citriodora subsp. variegata genomes presents an opportunity to examine the conservation of this important gene family across more divergent eucalypt lineages. Manual annotation of the TPS gene family in C. citriodora subsp. variegata revealed a similar overall number, and relative subfamily representation, to that previously reported in E. grandis and E. globulus. Many of the TPS genes were in physical clusters that varied considerably between Eucalyptus and Corymbia, with several instances of translocation, expansion/contraction and loss. Notably, there was greater conservation in the subfamilies involved in primary metabolism than those involved in secondary metabolism, likely reflecting different selective constraints. The variation in cluster size within subfamilies and the broad conservation between the eucalypts in the face of this variation are discussed, highlighting the potential contribution of selection, concerted evolution and stochastic processes. These findings provide the foundation to better understand terpene evolution within the ecologically and economically important Myrtaceae.
Collapse
|
15
|
Hanada K, Tezuka A, Nozawa M, Suzuki Y, Sugano S, Nagano AJ, Ito M, Morinaga SI. Functional divergence of duplicate genes several million years after gene duplication in Arabidopsis. DNA Res 2018; 25:4898128. [PMID: 29481587 PMCID: PMC6014284 DOI: 10.1093/dnares/dsy005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Accepted: 02/02/2018] [Indexed: 12/02/2022] Open
Abstract
Lineage-specific duplicated genes likely contribute to the phenotypic divergence in closely related species. However, neither the frequency of duplication events nor the degree of selection pressures immediately after gene duplication is clear in the speciation process. Here, using Illumina DNA-sequencing reads from Arabidopsis halleri, which has multiple closely related species with high-quality genome assemblies (A. thaliana and A. lyrata), we succeeded in generating orthologous gene groups in Brassicaceae. The duplication frequency of retained genes in the Arabidopsis lineage was ∼10 times higher than the duplication frequency inferred by comparative genomics of Arabidopsis, poplar, rice and moss (Physcomitrella patens). The difference of duplication frequencies can be explained by a rapid decay of anciently duplicated genes. To examine the degree of selection pressure on genes duplicated in either the A. halleri-lyrata or the A. halleri lineage, we examined positive and purifying selection in the A. halleri-lyrata and A. halleri lineages throughout the ratios of nonsynonymous to synonymous substitution rates (KA/KS). Duplicate genes tended to have a higher proportion of positive selection compared with non-duplicated genes. Interestingly, we found that functional divergence of duplicated genes was accelerated several million years after gene duplication compared with immediately after gene duplication.
Collapse
Affiliation(s)
- Kousuke Hanada
- Department of Bioscience and Bioinformatics, Frontier Research Academy for Young Researchers, Kyusyu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan
- RIKEN Center for Sustainable Resource Science, RIKEN, Yokohama, Kanagawa 230-0045, Japan
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
| | - Ayumi Tezuka
- Department of Bioscience and Bioinformatics, Frontier Research Academy for Young Researchers, Kyusyu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan
| | - Masafumi Nozawa
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
- Department of Genetics, SOKENDAI, Mishima, Shizuoka 411-8540, Japan
- Department of Biological Sciences, Tokyo Metropolitan University, Hachioji, Tokyo 192-0397, Japan
| | - Yutaka Suzuki
- Graduate School of Frontier Science, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Sumio Sugano
- Graduate School of Frontier Science, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Atsushi J Nagano
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
- Center of Ecological Research, Kyoto University, Hirano, Otsu, Shiga 520-2113, Japan
| | - Motomi Ito
- Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
| | - Shin-Ichi Morinaga
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
- Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
- College of Bioresource Sciences, Nihon University, Fujisawa, Kanagawa 252-0880, Japan
| |
Collapse
|
16
|
Casola C, Betrán E. The Genomic Impact of Gene Retrocopies: What Have We Learned from Comparative Genomics, Population Genomics, and Transcriptomic Analyses? Genome Biol Evol 2017; 9:1351-1373. [PMID: 28605529 PMCID: PMC5470649 DOI: 10.1093/gbe/evx081] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2017] [Indexed: 02/07/2023] Open
Abstract
Gene duplication is a major driver of organismal evolution. Gene retroposition is a mechanism of gene duplication whereby a gene's transcript is used as a template to generate retroposed gene copies, or retrocopies. Intriguingly, the formation of retrocopies depends upon the enzymatic machinery encoded by retrotransposable elements, genomic parasites occurring in the majority of eukaryotes. Most retrocopies are depleted of the regulatory regions found upstream of their parental genes; therefore, they were initially considered transcriptionally incompetent gene copies, or retropseudogenes. However, examples of functional retrocopies, or retrogenes, have accumulated since the 1980s. Here, we review what we have learned about retrocopies in animals, plants and other eukaryotic organisms, with a particular emphasis on comparative and population genomic analyses complemented with transcriptomic datasets. In addition, these data have provided information about the dynamics of the different "life cycle" stages of retrocopies (i.e., polymorphic retrocopy number variants, fixed retropseudogenes and retrogenes) and have provided key insights into the retroduplication mechanisms, the patterns and evolutionary forces at work during the fixation process and the biological function of retrogenes. Functional genomic and transcriptomic data have also revealed that many retropseudogenes are transcriptionally active and a biological role has been experimentally determined for many. Finally, we have learned that not only non-long terminal repeat retroelements but also long terminal repeat retroelements play a role in the emergence of retrocopies across eukaryotes. This body of work has shown that mRNA-mediated duplication represents a widespread phenomenon that produces an array of new genes that contribute to organismal diversity and adaptation.
Collapse
Affiliation(s)
- Claudio Casola
- Department of Ecosystem Science and Management, Texas A&M University, TX
| | - Esther Betrán
- Department of Biology, University of Texas at Arlington, Arlington, TX
| |
Collapse
|
17
|
Farré D, Engel P, Angulo A. Novel Role of 3'UTR-Embedded Alu Elements as Facilitators of Processed Pseudogene Genesis and Host Gene Capture by Viral Genomes. PLoS One 2016; 11:e0169196. [PMID: 28033411 PMCID: PMC5199112 DOI: 10.1371/journal.pone.0169196] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 12/13/2016] [Indexed: 11/19/2022] Open
Abstract
Since the discovery of the high abundance of Alu elements in the human genome, the interest for the functional significance of these retrotransposons has been increasing. Primate Alu and rodent Alu-like elements are retrotransposed by a mechanism driven by the LINE1 (L1) encoded proteins, the same machinery that generates the L1 repeats, the processed pseudogenes (PPs), and other retroelements. Apart from free Alu RNAs, Alus are also transcribed and retrotranscribed as part of cellular gene transcripts, generally embedded inside 3' untranslated regions (UTRs). Despite different proposed hypotheses, the functional implication of the presence of Alus inside 3'UTRs remains elusive. In this study we hypothesized that Alu elements in 3'UTRs could be involved in the genesis of PPs. By analyzing human genome data we discovered that the existence of 3'UTR-embedded Alu elements is overrepresented in genes source of PPs. In contrast, the presence of other retrotransposable elements in 3'UTRs does not show this PP linked overrepresentation. This research was extended to mouse and rat genomes and the results accordingly reveal overrepresentation of 3'UTR-embedded B1 (Alu-like) elements in PP parent genes. Interestingly, we also demonstrated that the overrepresentation of 3'UTR-embedded Alus is particularly significant in PP parent genes with low germline gene expression level. Finally, we provide data that support the hypothesis that the L1 machinery is also the system that herpesviruses, and possibly other large DNA viruses, use to capture host genes expressed in germline or somatic cells. Altogether our results suggest a novel role for Alu or Alu-like elements inside 3'UTRs as facilitators of the genesis of PPs, particularly in lowly expressed genes. Moreover, we propose that this L1-driven mechanism, aided by the presence of 3'UTR-embedded Alus, may also be exploited by DNA viruses to incorporate host genes to their viral genomes.
Collapse
Affiliation(s)
- Domènec Farré
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
- * E-mail:
| | - Pablo Engel
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| | - Ana Angulo
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| |
Collapse
|
18
|
Xiao J, Sekhwal MK, Li P, Ragupathy R, Cloutier S, Wang X, You FM. Pseudogenes and Their Genome-Wide Prediction in Plants. Int J Mol Sci 2016; 17:E1991. [PMID: 27916797 PMCID: PMC5187791 DOI: 10.3390/ijms17121991] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2016] [Revised: 11/20/2016] [Accepted: 11/22/2016] [Indexed: 11/17/2022] Open
Abstract
Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu's pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu's pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants.
Collapse
Affiliation(s)
- Jin Xiao
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
- Department of Agronomy, Nanjing Agricultural University, Nanjing 210095, China.
| | - Manoj Kumar Sekhwal
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
- Department of Soil Science, University of Saskatchewan, Saskatoon, SK S7N 5A8, Canada.
| | - Pingchuan Li
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
| | - Raja Ragupathy
- Department of Plant Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, Canada.
| | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
| | - Xiue Wang
- Department of Agronomy, Nanjing Agricultural University, Nanjing 210095, China.
| | - Frank M You
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
| |
Collapse
|
19
|
Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706). G3-GENES GENOMES GENETICS 2016; 6:3027-3034. [PMID: 27565886 PMCID: PMC5068928 DOI: 10.1534/g3.116.032045] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues.
Collapse
|
20
|
Mechanisms of Evolutionary Innovation Point to Genetic Control Logic as the Key Difference Between Prokaryotes and Eukaryotes. J Mol Evol 2015. [PMID: 26208881 DOI: 10.1007/s00239-015-9688-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The evolution of life from the simplest, original form to complex, intelligent animal life occurred through a number of key innovations. Here we present a new tool to analyze these key innovations by proposing that the process of evolutionary innovation may follow one of three underlying processes, namely a Random Walk, a Critical Path, or a Many Paths process, and in some instances may also constitute a "Pull-up the Ladder" event. Our analysis is based on the occurrence of function in modern biology, rather than specific structure or mechanism. A function in modern biology may be classified in this way either on the basis of its evolution or the basis of its modern mechanism. Characterizing key innovations in this way helps identify the likelihood that an innovation could arise. In this paper, we describe the classification, and methods to classify functional features of modern organisms into these three classes based on the analysis of how a function is implemented in modern biology. We present the application of our categorization to the evolution of eukaryotic gene control. We use this approach to support the argument that there are few, and possibly no basic chemical differences between the functional constituents of the machinery of gene control between eukaryotes, bacteria and archaea. This suggests that the difference between eukaryotes and prokaryotes that allows the former to develop the complex genetic architecture seen in animals and plants is something other than their chemistry. We tentatively identify the difference as a difference in control logic, that prokaryotic genes are by default 'on' and eukaryotic genes are by default 'off.' The Many Paths evolutionary process suggests that, from a 'default off' starting point, the evolution of the genetic complexity of higher eukaryotes is a high probability event.
Collapse
|
21
|
RNA-Seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations. Genetics 2015; 198:59-73. [PMID: 25236449 PMCID: PMC4174954 DOI: 10.1534/genetics.114.165886] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Massively parallel RNA sequencing (RNA-seq) has yielded a wealth of new insights into transcriptional regulation. A first step in the analysis of RNA-seq data is the alignment of short sequence reads to a common reference genome or transcriptome. Genetic variants that distinguish individual genomes from the reference sequence can cause reads to be misaligned, resulting in biased estimates of transcript abundance. Fine-tuning of read alignment algorithms does not correct this problem. We have developed Seqnature software to construct individualized diploid genomes and transcriptomes for multiparent populations and have implemented a complete analysis pipeline that incorporates other existing software tools. We demonstrate in simulated and real data sets that alignment to individualized transcriptomes increases read mapping accuracy, improves estimation of transcript abundance, and enables the direct estimation of allele-specific expression. Moreover, when applied to expression QTL mapping we find that our individualized alignment strategy corrects false-positive linkage signals and unmasks hidden associations. We recommend the use of individualized diploid genomes over reference sequence alignment for all applications of high-throughput sequencing technology in genetically diverse populations.
Collapse
|
22
|
Zhao Y, Tang L, Li Z, Jin J, Luo J, Gao G. Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics. BMC Evol Biol 2015; 15:66. [PMID: 25927997 PMCID: PMC4425925 DOI: 10.1186/s12862-015-0345-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 03/31/2015] [Indexed: 11/24/2022] Open
Abstract
Background Long-established protein-coding genes may lose their coding potential during evolution (“unitary gene loss”). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes as well as the evolutionary fate of lost genes are still less-investigated and remain largely elusive. Results Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the lost genes in all or most of Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we referred as relics, were still actively transcribed, and of which 84% (37 of 44) showed significantly differential expression across different tissues. More interestingly, we found that there were totally five expressed relics may function as competitive endogenous RNA in brachypodium, rice and sorghum genome. Conclusions Based on comparative genomics and transcriptome data, we firstly compiled a comprehensive catalogue of unitary gene loss events in Poaceae species and characterized a statistically significant functional preference for these lost genes as well showed the potential of relics functioning as competitive endogenous RNAs in Poaceae genomes. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0345-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yi Zhao
- State Key Laboratory of Protein and Plant Gene Research, College of Life Science, Center for Bioinformatics, Peking University, Beijing, 100871, People's Republic of China.
| | - Liang Tang
- State Key Laboratory of Protein and Plant Gene Research, College of Life Science, Center for Bioinformatics, Peking University, Beijing, 100871, People's Republic of China. .,Current address: College of Horticulture and Landscape Architecture, Southwest University, Chongqing, 400715, People's Republic of China.
| | - Zhe Li
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, People's Republic of China.
| | - Jinpu Jin
- State Key Laboratory of Protein and Plant Gene Research, College of Life Science, Center for Bioinformatics, Peking University, Beijing, 100871, People's Republic of China.
| | - Jingchu Luo
- State Key Laboratory of Protein and Plant Gene Research, College of Life Science, Center for Bioinformatics, Peking University, Beijing, 100871, People's Republic of China.
| | - Ge Gao
- State Key Laboratory of Protein and Plant Gene Research, College of Life Science, Center for Bioinformatics, Peking University, Beijing, 100871, People's Republic of China.
| |
Collapse
|
23
|
Zuriaga MA, Mas-Coma S, Bargues MD. A nuclear ribosomal DNA pseudogene in triatomines opens a new research field of fundamental and applied implications in Chagas disease. Mem Inst Oswaldo Cruz 2015; 110:353-62. [PMID: 25760450 PMCID: PMC4489472 DOI: 10.1590/0074-02760140398] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2014] [Accepted: 02/10/2015] [Indexed: 01/13/2023] Open
Abstract
A pseudogene, designated as "ps(5.8S+ITS-2)", paralogous to the 5.8S gene and
internal transcribed spacer (ITS)-2 of the nuclear ribosomal DNA (rDNA), has been
recently found in many triatomine species distributed throughout North America,
Central America and northern South America. Among characteristics used as criteria
for pseudogene verification, secondary structures and free energy are highlighted,
showing a lower fit between minimum free energy, partition function and centroid
structures, although in given cases the fit only appeared to be slightly lower. The
unique characteristics of "ps(5.8S+ITS-2)" as a processed or retrotransposed
pseudogenic unit of the ghost type are reviewed, with emphasis on its potential
functionality compared to the functionality of genes and spacers of the normal rDNA
operon. Besides the technical problem of the risk for erroneous sequence results, the
usefulness of "ps(5.8S+ITS-2)" for specimen classification, phylogenetic analyses and
systematic/taxonomic studies should be highlighted, based on consistence and
retention index values, which in pseudogenic sequence trees were higher than in
functional sequence trees. Additionally, intraindividual, interpopulational and
interspecific differences in pseudogene amount and the fact that it is a pseudogene
in the nuclear rDNA suggests a potential relationships with fitness, behaviour and
adaptability of triatomine vectors and consequently its potential utility in Chagas
disease epidemiology and control.
Collapse
|
24
|
Welker CAD, Souza-Chies TT, Longhi-Wagner HM, Peichoto MC, McKain MR, Kellogg EA. Phylogenetic analysis of Saccharum s.l. (Poaceae; Andropogoneae), with emphasis on the circumscription of the South American species. AMERICAN JOURNAL OF BOTANY 2015; 102:248-263. [PMID: 25667078 DOI: 10.3732/ajb.1400397] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
PREMISE OF THE STUDY Polyploidy and reticulate evolution are often a complication for discovering phylogenetic relationships between genera and species. Despite the huge economic importance of sugarcane (Saccharum officinarum-Poaceae, Andropogoneae), the limits of the genus Saccharum and its species are complex and largely unresolved, involving both polyploidy and reticulate evolution. This study aimed to assess the phylogenetic relationships of Saccharum s.l., including Erianthus and Tripidium, as well as investigate the taxonomic circumscription of the South American species of the genus. METHODS Molecular cloning and sequencing of five regions of four low-copy nuclear loci were performed, including Aberrant panicle organization1 (apo1), Dwarf8 (d8), two exons of Erect panicle2 (ep2-ex7 and ep2-ex8), and Retarded palea1 (rep1). Concatenated trees were reconstructed using Maximum Parsimony, Maximum Likelihood, and Bayesian Inference analyses. KEY RESULTS The allopolyploid origin of Saccharum was demonstrated using evidence from nuclear genes. The samples of Saccharum s.l. grouped in two distinct clades, with S. arundinaceum and S. ravennae (= Tripidium, or Erianthus sect. Ripidium) apart from all other species analyzed of the genus. Saccharum angustifolium, S. asperum, and S. villosum correspond to distinct clades (different species). The plants with intermediate morphology between S. angustifolium and S. villosum presented a pattern of paralogues consistent with a hybrid origin. CONCLUSIONS Saccharum s.l. is polyphyletic and Tripidium should be recognized as a distinct genus. However, no strong evidence was found to support the segregation of Erianthus. The taxonomic circumscription of the South American species of the genus was resolved and the occurrence of natural hybrids was documented. Better understanding of the phylogenetic relationships of Saccharum and relatives may be useful for sugarcane breeders to identify potential taxa for interspecific and intergeneric crosses in the genetic improvement of sugarcane.
Collapse
Affiliation(s)
- Cassiano A D Welker
- Universidade Federal do Rio Grande do Sul, Programa de Pós-Graduação em Botânica, Av. Bento Gonçalves 9500, CEP 91501-970, Porto Alegre, RS, Brazil
| | - Tatiana T Souza-Chies
- Universidade Federal do Rio Grande do Sul, Programa de Pós-Graduação em Botânica, Av. Bento Gonçalves 9500, CEP 91501-970, Porto Alegre, RS, Brazil
| | - Hilda M Longhi-Wagner
- Universidade Federal do Rio Grande do Sul, Programa de Pós-Graduação em Botânica, Av. Bento Gonçalves 9500, CEP 91501-970, Porto Alegre, RS, Brazil
| | - Myriam Carolina Peichoto
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias (UNNE), Sargento Cabral 2131, Corrientes, 3400, Argentina
| | - Michael R McKain
- Donald Danforth Plant Science Center, 975 North Warson Road, St. Louis, Missouri 63132 USA
| | - Elizabeth A Kellogg
- Donald Danforth Plant Science Center, 975 North Warson Road, St. Louis, Missouri 63132 USA
| |
Collapse
|
25
|
Abstract
Pseudogenes are degraded fossil copies of genes. Here, we report a comparison of pseudogenes spanning three phyla, leveraging the completed annotations of the human, worm, and fly genomes, which we make available as an online resource. We find that pseudogenes are lineage specific, much more so than protein-coding genes, reflecting the different remodeling processes marking each organism's genome evolution. The majority of human pseudogenes are processed, resulting from a retrotranspositional burst at the dawn of the primate lineage. This burst can be seen in the largely uniform distribution of pseudogenes across the genome, their preservation in areas with low recombination rates, and their preponderance in highly expressed gene families. In contrast, worm and fly pseudogenes tell a story of numerous duplication events. In worm, these duplications have been preserved through selective sweeps, so we see a large number of pseudogenes associated with highly duplicated families such as chemoreceptors. However, in fly, the large effective population size and high deletion rate resulted in a depletion of the pseudogene complement. Despite large variations between these species, we also find notable similarities. Overall, we identify a broad spectrum of biochemical activity for pseudogenes, with the majority in each organism exhibiting varying degrees of partial activity. In particular, we identify a consistent amount of transcription (∼15%) across all species, suggesting a uniform degradation process. Also, we see a uniform decay of pseudogene promoter activity relative to their coding counterparts and identify a number of pseudogenes with conserved upstream sequences and activity, hinting at potential regulatory roles.
Collapse
|
26
|
Evolutionary and functional novelty of pancreatic ribonuclease: a study of Musteloidea (order Carnivora). Sci Rep 2014; 4:5070. [PMID: 24861105 PMCID: PMC5381406 DOI: 10.1038/srep05070] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Accepted: 04/30/2014] [Indexed: 11/08/2022] Open
Abstract
Pancreatic ribonuclease (RNASE1) is a digestive enzyme that has been one of the key models in studies of evolutionary innovation and functional diversification. It has been believed that the RNASE1 gene duplications are correlated with the plant-feeding adaptation of foregut-fermenting herbivores. Here, we characterized RNASE1 genes from Caniformia, which has a simple digestive system and lacks microbial digestion typical of herbivores, in an unprecedented scope based on both gene sequence and tissue expression analyses. Remarkably, the results yielded new hypotheses regarding the evolution and the function of Caniformia RNASE1 genes. Four independent gene duplication events in the families of superfamily Musteloidea, including Procyonidae, Ailuridae, Mephitidae and Mustelidae, were recovered, rejecting previous Mustelidae-specific duplication hypothesis, but supporting Musteloidea duplication hypothesis. Moreover, our analyses revealed pronounced differences among the RNASE1 gene copies regarding their selection pressures, pI values and tissue expression patterns, suggesting the differences in their physiological functions. Notably, the expression analyses detected the transcription of a RNASE1 pseudogene in several tissues, raising the possibility that pseudogenes are also a potential source during the RNase functional diversification. In sum, the present work demonstrated a far more complex and intriguing evolutionary pattern and functional diversity of mammalian ribonuclease than previously thought.
Collapse
|
27
|
Ding W, Dai J. Methods for detecting transcribed pseudogenes: PCR on regions of high sequence similarity followed by cloning and sequencing. Methods Mol Biol 2014; 1167:103-15. [PMID: 24823774 DOI: 10.1007/978-1-4939-0835-6_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
Identifying pseudogene transcription is problematic in many cases due to the high sequence similarity between pseudogenes and their parental genes. In this chapter, we detail the procedure for the detection of pseudogene transcription using the reverse transcription polymerase chain reaction (RT-PCR) method. The protocol comprises (1) extraction of total RNA, (2) first-strand cDNA synthesis from total RNA, (3) amplification of the cDNA by PCR, and (4) cloning and sequencing of the PCR products. Technical and practical guidance is provided, and the critical points during each of the steps are discussed. In particular, the importance of designing high specific PCR primers and thoroughly eliminating genomic DNA contamination from RNA preparation is emphasized.
Collapse
Affiliation(s)
- Wenyong Ding
- Department of Biochemistry, Dalian Medical University, Dalian, China
| | | |
Collapse
|
28
|
Affiliation(s)
- Alexander F. Palazzo
- University of Toronto, Department of Biochemistry, Toronto, Ontario, Canada
- * E-mail: (AP); (TG)
| | - T. Ryan Gregory
- University of Guelph, Department of Integrative Biology, Guelph, Ontario, Canada
- * E-mail: (AP); (TG)
| |
Collapse
|
29
|
Characterization of human pseudogene-derived non-coding RNAs for functional potential. PLoS One 2014; 9:e93972. [PMID: 24699680 PMCID: PMC3974860 DOI: 10.1371/journal.pone.0093972] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 03/10/2014] [Indexed: 11/19/2022] Open
Abstract
Thousands of pseudogenes exist in the human genome and many are transcribed, but their functional potential remains elusive and understudied. To explore these issues systematically, we first developed a computational pipeline to identify transcribed pseudogenes from RNA-Seq data. Applying the pipeline to datasets from 16 distinct normal human tissues identified ∼ 3,000 pseudogenes that could produce non-coding RNAs in a manner of low abundance but high tissue specificity under normal physiological conditions. Cross-tissue comparison revealed that the transcriptional profiles of pseudogenes and their parent genes showed mostly positive correlations, suggesting that pseudogene transcription could have a positive effect on the expression of their parent genes, perhaps by functioning as competing endogenous RNAs (ceRNAs), as previously suggested and demonstrated with the PTEN pseudogene, PTENP1. Our analysis of the ENCODE project data also found many transcriptionally active pseudogenes in the GM12878 and K562 cell lines; moreover, it showed that many human pseudogenes produced small RNAs (sRNAs) and some pseudogene-derived sRNAs, especially those from antisense strands, exhibited evidence of interfering with gene expression. Further integrated analysis of transcriptomics and epigenomics data, however, demonstrated that trimethylation of histone 3 at lysine 9 (H3K9me3), a posttranslational modification typically associated with gene repression and heterochromatin, was enriched at many transcribed pseudogenes in a transcription-level dependent manner in the two cell lines. The H3K9me3 enrichment was more prominent in pseudogenes that produced sRNAs at pseudogene loci and their adjacent regions, an observation further supported by the co-enrichment of SETDB1 (a H3K9 methyltransferase), suggesting that pseudogene sRNAs may have a role in regional chromatin repression. Taken together, our comprehensive and systematic characterization of pseudogene transcription uncovers a complex picture of how pseudogene ncRNAs could influence gene and pseudogene expression, at both epigenetic and post-transcriptional levels.
Collapse
|
30
|
Coman D, Altenhoff A, Zoller S, Gruissem W, Vranová E. Distinct evolutionary strategies in the GGPPS family from plants. FRONTIERS IN PLANT SCIENCE 2014; 5:230. [PMID: 24904625 PMCID: PMC4034038 DOI: 10.3389/fpls.2014.00230] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2014] [Accepted: 05/09/2014] [Indexed: 05/07/2023]
Abstract
Multiple geranylgeranyl diphosphate synthases (GGPPS) for biosynthesis of geranylgeranyl diphosphate (GGPP) exist in plants. GGPP is produced in the isoprenoid pathway and is a central precursor for various primary and specialized plant metabolites. Therefore, its biosynthesis is an essential regulatory point in the isoprenoid pathway. We selected 119 GGPPSs from 48 species representing all major plant lineages, based on stringent homology criteria. After the diversification of land plants, the number of GGPPS paralogs per species increases. Already in the moss Physcomitrella patens, GGPPS appears to be encoded by multiple paralogous genes. In gymnosperms, neofunctionalization of GGPPS may have enabled optimized biosynthesis of primary and specialized metabolites. Notably, lineage-specific expansion of GGPPS occurred in land plants. As a representative species we focused here on Arabidopsis thaliana, which retained the highest number of GGPPS paralogs (twelve) among the 48 species we considered in this study. Our results show that the A. thaliana GGPPS gene family is an example of evolution involving neo- and subfunctionalization as well as pseudogenization. We propose subfunctionalization as one of the main mechanisms allowing the maintenance of multiple GGPPS paralogs in A. thaliana genome. Accordingly, the changes in the expression patterns of the GGPPS paralogs occurring after gene duplication led to developmental and/or condition specific functional evolution.
Collapse
Affiliation(s)
- Diana Coman
- Department of Biology, ETH ZurichZurich, Switzerland
| | - Adrian Altenhoff
- Department of Computer Science, ETH ZurichZurich, Switzerland
- Swiss Institute of BioinformaticsZurich, Switzerland
| | - Stefan Zoller
- Department of Computer Science, ETH ZurichZurich, Switzerland
- Swiss Institute of BioinformaticsZurich, Switzerland
| | | | - Eva Vranová
- Department of Biology, ETH ZurichZurich, Switzerland
- Institute of Biology and Ecology, Pavol Jozef Šafárik UniversityKošice, Slovakia
- *Correspondence: Eva Vranová, Faculty of Science, Institute of Biology and Ecology, Pavol Jozef Šafárik University in Košice, Mánesova 23, Košice, 04154, Slovakia e-mail:
| |
Collapse
|
31
|
Nuclear rDNA pseudogenes in Chagas disease vectors: Evolutionary implications of a new 5.8S+ITS-2 paralogous sequence marker in triatomines of North, Central and northern South America. INFECTION GENETICS AND EVOLUTION 2014; 21:134-56. [DOI: 10.1016/j.meegid.2013.10.028] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Revised: 10/29/2013] [Accepted: 10/31/2013] [Indexed: 02/04/2023]
|
32
|
Lee HH. Mutational analysis of CYP21A2 gene and CYP21A1P pseudogene: long-range PCR on genomic DNA. Methods Mol Biol 2014; 1167:275-87. [PMID: 24823785 DOI: 10.1007/978-1-4939-0835-6_19] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
CYP21A2, the gene that codes for P450c21 (Steroid 21-hydroxylase), has a duplicated pseudogene called CYP21A1P. The gene and the pseudogene share 98 % and 96 % sequence homology in exons and in noncoding sequences, respectively, and are located 30 kb apart within the HLA class III human histocompatibility complex locus on chromosome 6p21.3. CYP21A1P is inactive due to the presence of 11 deteriorated mutations in its coding region. These mutations can be transferred to the functional CYP21A2 through intergenic recombination during meiosis or mitosis and lead to the congenital adrenal hyperplasia (CAH) resulting from 21-hydroxylase deficiency. Conversely, portions of CYP21A2 sequence can be transferred to CYP21A1P, modifying the haplotype. Here, we describe a well-established protocol that can be used to unambiguously study the mutational profile of CYP21A2 gene and CYP21A1P pseudogene. The protocol is based on long-range PCR amplification with allele-specific primers, followed by DNA sequencing of smaller fragments.
Collapse
Affiliation(s)
- Hsien-Hsiung Lee
- Department of Laboratory Medicine, China Medical University Hospital, 2 Yuh-Der Road, Taichung, 404, Taiwan,
| |
Collapse
|
33
|
Korrodi-Gregório L, Abrantes J, Muller T, Melo-Ferreira J, Marcus K, da Cruz e Silva OAB, Fardilha M, Esteves PJ. Not so pseudo: the evolutionary history of protein phosphatase 1 regulatory subunit 2 and related pseudogenes. BMC Evol Biol 2013; 13:242. [PMID: 24195737 PMCID: PMC3840573 DOI: 10.1186/1471-2148-13-242] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Accepted: 10/29/2013] [Indexed: 01/23/2023] Open
Abstract
Background Pseudogenes are traditionally considered “dead” genes, therefore lacking biological functions. This view has however been challenged during the last decade. This is the case of the Protein phosphatase 1 regulatory subunit 2 (PPP1R2) or inhibitor-2 gene family, for which several incomplete copies exist scattered throughout the genome. Results In this study, the pseudogenization process of PPP1R2 was analyzed. Ten PPP1R2-related pseudogenes (PPP1R2P1-P10), highly similar to PPP1R2, were retrieved from the human genome assembly present in the databases. The phylogenetic analysis of mammalian PPP1R2 and related pseudogenes suggested that PPP1R2P7 and PPP1R2P9 retroposons appeared before the great mammalian radiation, while the remaining pseudogenes are primate-specific and retroposed at different times during Primate evolution. Although considered inactive, four of these pseudogenes seem to be transcribed and possibly possess biological functions. Given the role of PPP1R2 in sperm motility, the presence of these proteins was assessed in human sperm, and two PPP1R2-related proteins were detected, PPP1R2P3 and PPP1R2P9. Signatures of negative and positive selection were also detected in PPP1R2P9, further suggesting a role as a functional protein. Conclusions The results show that contrary to initial observations PPP1R2-related pseudogenes are not simple bystanders of the evolutionary process but may rather be at the origin of genes with novel functions.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Pedro J Esteves
- CIBIO-UP, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, InBIO, Laboratório Associado, Campus Agrário de Vairão, Vairão, Portugal.
| |
Collapse
|
34
|
Rebordinos L, Cross I, Merlo A. High evolutionary dynamism in 5S rDNA of fish: state of the art. Cytogenet Genome Res 2013; 141:103-13. [PMID: 24080995 DOI: 10.1159/000354871] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The 5S ribosomal DNA (rDNA) consists of one transcriptional unit of about 120 base pairs, which is separated from the next unit by a non-transcribed spacer (NTS). The coding sequence and the NTS together form a repeat unit which can be found in hundreds to thousands of copies tandemly repeated in the genomes. The NTS regions seem to be subject to rapid evolution. The first general model of evolution of these multigene families was referred to as divergent evolution, based on studies using hemoglobin and myoglobin as model systems. Later studies showed that nucleotide sequences of different multigene family members are more closely related within species than between species. This observation led to a new model of multigene family evolution, termed concerted evolution. Another model of evolution, named the birth-and-death model, has been found to be more suitable to explain the long-term evolution of these multigene families. According to this model, new genes originate by successive duplications, and these new genes are either maintained for a long time or are lost, or else degenerate into pseudogenes. In this review we describe different sources of variability in the 5S rDNA genes observed in several distinct fish species. This variability is mainly referred to NTSs and includes the presence of other multigene families (mainly LINEs, SINEs, non-LTR retrotransposons, and U snRNA families). Different types of microsatellites have also been found to contribute to the increase of variability in this region. Our recent results suggest that horizontal transfer contributes to the increase of diversity in the NTSs of some species. Variability in the 5S rDNA coding region affecting the stability of the structure, but without effects on the function of the 5S rRNA, is also described. Retrotransposons seem to be responsible for the high dynamism of 5S rDNA, while microsatellites acting as recombination hot spots could stabilize a wide variety of unusual DNA structures, affecting DNA replication and enhancing or decreasing promoter activity in gene expression. The relationship between the high variability found at molecular level and the low variability found at chromosomal level is also discussed.
Collapse
Affiliation(s)
- L Rebordinos
- Area de Genética, Facultad de Ciencias del Mar y Ambientales, CEI-Mar, Universidad de Cádiz, Puerto Real, Spain
| | | | | |
Collapse
|
35
|
Abstract
Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and how they evolve to be critical components of the genetic systems that determine the biological diversity of life. Two decades of effort have shed light on the process of new gene origination and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular, and phenotypic functions.
Collapse
Affiliation(s)
- Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637;
| | | | | | | |
Collapse
|
36
|
Sen K, Ghosh TC. Pseudogenes and their composers: delving in the 'debris' of human genome. Brief Funct Genomics 2013; 12:536-47. [PMID: 23900003 DOI: 10.1093/bfgp/elt026] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Pseudogenes, the nonfunctional homologs of functional genes and thus exemplified as 'genomic fossils' provide intriguing snapshots of the evolutionary history of human genome. These defunct copies generally arise by retrotransposition or duplication followed by various genetic disablements. In this study, focusing on human pseudogenes and their functional homologues we describe their characteristic features and relevance to protein sequence evolution. We recapitulate that pseudogenes harbor disease-causing degenerative sequence variations in conjunction with the immense disease gene association of their progenitors. Furthermore, we also discuss the issue of functional resurrection and the potentiality observed in some pseudogenes to regulate their functional counterparts.
Collapse
Affiliation(s)
- Kamalika Sen
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India. Tel.: +91 33 2355 6626; Fax: +91 33 2355 3886;
| | | |
Collapse
|
37
|
Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013; 23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]
Abstract
During protein evolution, novel domain arrangements are continuously formed. Rearrangements are important for the creation of molecular biodiversity and for functional molecular changes which underlie developmental shifts in the bauplan of organisms. Here we review the mechanisms by which new arrangements arise and the potential benefits of rearrangements. We concentrate on how new domains emerge and why they rapidly spread across genomes, gaining higher copy numbers than older, more established domains. This spread is most likely a consequence of their high adaptive potential but is unlikely to make up on its own for the drastic loss of domains, which is observed across different taxa. We show that a significant portion of the recently emerged domains, especially those in multidomain families, are highly disordered and speculate about the significance of these findings for the evolvability of novel genetic material.
Collapse
Affiliation(s)
- Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Hüfferstrasse 1, D48149 Münster, Germany.
| | | |
Collapse
|
38
|
Li W, Yang W, Wang XJ. Pseudogenes: pseudo or real functional elements? J Genet Genomics 2013; 40:171-7. [PMID: 23618400 DOI: 10.1016/j.jgg.2013.03.003] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 03/04/2013] [Accepted: 03/04/2013] [Indexed: 11/24/2022]
Abstract
Pseudogenes are genomic remnants of ancient protein-coding genes which have lost their coding potentials through evolution. Although broadly existed, pseudogenes used to be considered as junk or relics of genomes which have not drawn enough attentions of biologists until recent years. With the broad applications of high-throughput experimental techniques, growing lines of evidence have strongly suggested that some pseudogenes possess special functions, including regulating parental gene expression and participating in the regulation of many biological processes. In this review, we summarize some basic features of pseudogenes and their functions in regulating development and diseases. All of these observations indicate that pseudogenes are not purely dead fossils of genomes, but warrant further exploration in their distribution, expression regulation and functions. A new nomenclature is desirable for the currently called 'pseudogenes' to better describe their functions.
Collapse
Affiliation(s)
- Wen Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | | | | |
Collapse
|
39
|
Wang L, Si W, Yao Y, Tian D, Araki H, Yang S. Genome-wide survey of pseudogenes in 80 fully re-sequenced Arabidopsis thaliana accessions. PLoS One 2012; 7:e51769. [PMID: 23272162 PMCID: PMC3521719 DOI: 10.1371/journal.pone.0051769] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2012] [Accepted: 11/07/2012] [Indexed: 11/18/2022] Open
Abstract
Pseudogenes (Ψs), including processed and non-processed Ψs, are ubiquitous genetic elements derived from originally functional genes in all studied genomes within the three kingdoms of life. However, systematic surveys of non-processed Ψs utilizing genomic information from multiple samples within a species are still rare. Here a systematic comparative analysis was conducted of Ψs within 80 fully re-sequenced Arabidopsis thaliana accessions, and 7546 genes, representing ∼28% of the genomic annotated open reading frames (ORFs), were found with disruptive mutations in at least one accession. The distribution of these Ψs on chromosomes showed a significantly negative correlation between Ψs/ORFs and their local gene densities, suggesting a higher proportion of Ψs in gene desert regions, e.g. near centromeres. On the other hand, compared with the non-Ψ loci, even the intact coding sequences (CDSs) in the Ψ loci were found to have shorter CDS length, fewer exon number and lower GC content. In addition, a significant functional bias against the null hypothesis was detected in the Ψs mainly involved in responses to environmental stimuli and biotic stress as reported, suggesting that they are likely important for adaptive evolution to rapidly changing environments by pseudogenization to accumulate successive mutations.
Collapse
Affiliation(s)
- Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Weina Si
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Yongfang Yao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Hitoshi Araki
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, Center of Ecology, Evolution and Biogeochemistry, Kastanienbaum, Switzerland
- * E-mail: (SY); (HA)
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- * E-mail: (SY); (HA)
| |
Collapse
|
40
|
Abstract
A new study in this issue of Genome Biology sheds light on why some pseudogenes persist in rodent, and other mammalian, genomes. Please see related Research article by Marques et al http://genomebiology.com/2012/13/11/R102
Collapse
|
41
|
Abstract
A new study in this issue of Genome Biology sheds light on why some pseudogenes persist in rodent, and other mammalian, genomes.
Collapse
|
42
|
Marques AC, Tan J, Lee S, Kong L, Heger A, Ponting CP. Evidence for conserved post-transcriptional roles of unitary pseudogenes and for frequent bifunctionality of mRNAs. Genome Biol 2012; 13:R102. [PMID: 23153069 PMCID: PMC3580494 DOI: 10.1186/gb-2012-13-11-r102] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Accepted: 11/15/2012] [Indexed: 01/15/2023] Open
Abstract
Background Recent reports have highlighted instances of mRNAs that, in addition to coding for protein, regulate the abundance of related transcripts by altering microRNA availability. These two mRNA roles - one mediated by RNA and the other by protein - are inter-dependent and hence cannot easily be separated. Whether the RNA-mediated role of transcripts is important, per se, or whether it is a relatively innocuous consequence of competition by different transcripts for microRNA binding remains unknown. Results Here we took advantage of 48 loci that encoded proteins in the earliest eutherian ancestor, but whose protein-coding capability has since been lost specifically during rodent evolution. Sixty-five percent of such loci, which we term 'unitary pseudogenes', have retained their expression in mouse and their transcripts exhibit conserved tissue expression profiles. The maintenance of these unitary pseudogenes' spatial expression profiles is associated with conservation of their microRNA response elements and these appear to preserve the post-transcriptional roles of their protein-coding ancestor. We used mouse Pbcas4, an exemplar of these transcribed unitary pseudogenes, to experimentally test our genome-wide predictions. We demonstrate that the role of Pbcas4 as a competitive endogenous RNA has been conserved and has outlived its ancestral gene's loss of protein-coding potential. Conclusions These results show that post-transcriptional regulation by bifunctional mRNAs can persist over long evolutionary time periods even after their protein coding ability has been lost.
Collapse
|
43
|
Pei B, Sisu C, Frankish A, Howald C, Habegger L, Mu XJ, Harte R, Balasubramanian S, Tanzer A, Diekhans M, Reymond A, Hubbard TJ, Harrow J, Gerstein MB. The GENCODE pseudogene resource. Genome Biol 2012; 13:R51. [PMID: 22951037 PMCID: PMC3491395 DOI: 10.1186/gb-2012-13-9-r51] [Citation(s) in RCA: 253] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Revised: 05/30/2012] [Accepted: 06/25/2012] [Indexed: 12/11/2022] Open
Abstract
Background Pseudogenes have long been considered as nonfunctional genomic sequences. However, recent evidence suggests that many of them might have some form of biological activity, and the possibility of functionality has increased interest in their accurate annotation and integration with functional genomics data. Results As part of the GENCODE annotation of the human genome, we present the first genome-wide pseudogene assignment for protein-coding genes, based on both large-scale manual annotation and in silico pipelines. A key aspect of this coupled approach is that it allows us to identify pseudogenes in an unbiased fashion as well as untangle complex events through manual evaluation. We integrate the pseudogene annotations with the extensive ENCODE functional genomics information. In particular, we determine the expression level, transcription-factor and RNA polymerase II binding, and chromatin marks associated with each pseudogene. Based on their distribution, we develop simple statistical models for each type of activity, which we validate with large-scale RT-PCR-Seq experiments. Finally, we compare our pseudogenes with conservation and variation data from primate alignments and the 1000 Genomes project, producing lists of pseudogenes potentially under selection. Conclusions At one extreme, some pseudogenes possess conventional characteristics of functionality; these may represent genes that have recently died. On the other hand, we find interesting patterns of partial activity, which may suggest that dead genes are being resurrected as functioning non-coding RNAs. The activity data of each pseudogene are stored in an associated resource, psiDR, which will be useful for the initial identification of potentially functional pseudogenes.
Collapse
Affiliation(s)
- Baikang Pei
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Abstract
Because they are generally noncoding and thus considered nonfunctional and unimportant, pseudogenes have long been neglected. Recent advances have established that the DNA of a pseudogene, the RNA transcribed from a pseudogene, or the protein translated from a pseudogene can have multiple, diverse functions and that these functions can affect not only their parental genes but also unrelated genes. Therefore, pseudogenes have emerged as a previously unappreciated class of sophisticated modulators of gene expression, with a multifaceted involvement in the pathogenesis of human cancer.
Collapse
Affiliation(s)
- Laura Poliseno
- Oncogenomics Unit, Core Research Laboratory, Istituto Toscano Tumori (CRL-ITT), c/o IFC-CNR Via Moruzzi 1, 56124 Pisa, Italy.
| |
Collapse
|
45
|
Abstract
Novel protein-coding genes can arise either through re-organization of pre-existing genes or de novo1,2. Processes involving re-organization of pre-existing genes, notably following gene duplication, have been extensively described1,2. In contrast, de novo gene birth remains poorly understood, mainly because translation of sequences devoid of genes, or “non-genic” sequences, is expected to produce insignificant polypeptides rather than proteins with specific biological functions1,3-6. Here, we formalize an evolutionary model according to which functional genes evolve de novo through transitory proto-genes4 generated by widespread translational activity in non-genic sequences. Testing this model at genome-scale in Saccharomyces cerevisiae, we detect translation of hundreds of short species-specific open reading frames (ORFs) located in non-genic sequences. These translation events appear to provide adaptive potential7, as suggested by their differential regulation upon stress and by signatures of retention by natural selection. In line with our model, we establish that S. cerevisiae ORFs can be placed within an evolutionary continuum ranging from non-genic sequences to genes. We identify ~1,900 candidate proto-genes among S. cerevisiae ORFs and find that de novo gene birth from such a reservoir may be more prevalent than sporadic gene duplication. Our work illustrates that evolution exploits seemingly dispensable sequences to generate adaptive functional innovation.
Collapse
|
46
|
Mboup M, Fischer I, Lainer H, Stephan W. Trans-species polymorphism and allele-specific expression in the CBF gene family of wild tomatoes. Mol Biol Evol 2012; 29:3641-52. [PMID: 22787283 DOI: 10.1093/molbev/mss176] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Abiotic stresses such as drought, extreme temperatures, and salinity have a strong impact on plant adaptation. They act as selective forces on plant physiology and morphology. These selective pressures leave characteristic footprints that can be detected at the DNA sequence level using population genetic tools. On the basis of a candidate gene approach, we investigated signatures of adaptation in two wild tomato species, Solanum peruvianum and S. chilense. These species are native to western South America and constitute a model system for studying adaptation, due to their ability to colonize diverse habitats and the available genetic resources. We have determined the selective forces acting on the C-repeat binding factor (CBF) gene family, which consists of three genes, and is known to be involved in tolerance to abiotic stresses, in particular in cold tolerance. We also analyzed the expression pattern of these genes after drought and cold stresses. We found that CBF3 evolves under very strong purifying selection, CBF2 is under balancing selection in some populations of both species (S. peruvianum/Quicacha and S. chilense/Nazca) maintaining a trans-species polymorphism, and CBF1 is a pseudogene. In contrast to previous studies of cultivated tomatoes showing that only CBF1 was cold induced, we found that all three CBF genes are cold induced in wild tomatoes. All three genes are also drought induced. CBF2 exhibits an allele-specific expression pattern associated with the trans-species polymorphism.
Collapse
Affiliation(s)
- Mamadou Mboup
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany.
| | | | | | | |
Collapse
|
47
|
Naidoo N, Pawitan Y, Soong R, Cooper DN, Ku CS. Human genetics and genomics a decade after the release of the draft sequence of the human genome. Hum Genomics 2012; 5:577-622. [PMID: 22155605 PMCID: PMC3525251 DOI: 10.1186/1479-7364-5-6-577] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.
Collapse
Affiliation(s)
- Nasheen Naidoo
- Centre for Molecular Epidemiology, Department of Epidemiology and Public Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | | | | | | | | |
Collapse
|
48
|
Abstract
Pseudogenes are ubiquitous and abundant in genomes. Pseudogenes were once called “genomic fossils” and treated as “junk DNA” several years. Nevertheless, it has been recognized that some pseudogenes play essential roles in gene regulation of their parent genes, and many pseudogenes are transcribed into RNA. Pseudogene transcripts may also form small interfering RNA or decrease cellular miRNA concentration. Thus, pseudogenes regulate tumor suppressors and oncogenes. Their essential functions draw the attention of our research group in my current work on heat shock protein 90: a chaperone of oncogenes. The paper reviews our current knowledge on pseudogenes and evaluates preliminary results of the chaperone data. Current efforts to understand pseudogenes interactions help to understand the functions of a genome.
Collapse
|
49
|
Calvete O, González J, Betrán E, Ruiz A. Segmental duplication, microinversion, and gene loss associated with a complex inversion breakpoint region in Drosophila. Mol Biol Evol 2012; 29:1875-89. [PMID: 22328714 DOI: 10.1093/molbev/mss067] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ~13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ~9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics.
Collapse
Affiliation(s)
- Oriol Calvete
- Departament de Genètica i de Microbiologia, Facultat de Biociències, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
| | | | | | | |
Collapse
|
50
|
Ayres CFJ, Müller P, Dyer N, Wilding CS, Rigden DJ, Donnelly MJ. Comparative genomics of the anopheline glutathione S-transferase epsilon cluster. PLoS One 2011; 6:e29237. [PMID: 22206003 PMCID: PMC3242777 DOI: 10.1371/journal.pone.0029237] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Accepted: 11/23/2011] [Indexed: 02/04/2023] Open
Abstract
Enzymes of the glutathione S-transferase (GST) family play critical roles in detoxification of xenobiotics across many taxa. While GSTs are ubiquitous both in animals and plants, the GST epsilon class (GSTE) is insect-specific and has been associated with resistance to chemical insecticides. While both Aedes aegypti and Anopheles gambiae GSTE clusters consist of eight members, only four putative orthologs are identifiable between the species, suggesting independent expansions of the class in each lineage. We used a primer walking approach, sequencing almost the entire cluster from three Anopheles species (An. stephensi, An. funestus (both Cellia subgenus) and An. plumbeus (Anopheles subgenus)) and compared the sequences to putative orthologs in An. gambiae (Cellia) in an attempt to trace the evolution of the cluster within the subfamily Anophelinae. Furthermore, we measured transcript levels from the identified GSTE loci by real time reverse transcription PCR to determine if all genes were similarly transcribed at different life stages. Among the species investigated, gene order and orientation were similar with three exceptions: (i) GSTE1 was absent in An. plumbeus; (ii) GSTE2 is duplicated in An. plumbeus and (iii) an additional transcriptionally active pseudogene (ψAsGSTE2) was found in An. stephensi. Further statistical analysis and protein modelling gave evidence for positive selection on codons of the catalytic site in GSTE5 albeit its origin seems to predate the introduction of chemical insecticides. Gene expression profiles revealed differences in expression pattern among genes at different life stages. With the exception of GSTE1, ψAsGSTE2 and GSTE2b, all Anopheles species studied share orthologs and hence we assume that GSTE expansion generally predates radiation into subgenera, though the presence of GSTE1 may also suggest a recent duplication event in the Old World Cellia subgenus, instead of a secondary loss. The modifications of the catalytic site within GSTE5 may represent adaptations to new habitats.
Collapse
|