1
|
Moyer DC, Larue GE, Hershberger CE, Roy SW, Padgett RA. Comprehensive database and evolutionary dynamics of U12-type introns. Nucleic Acids Res 2020; 48:7066-7078. [PMID: 32484558 PMCID: PMC7367187 DOI: 10.1093/nar/gkaa464] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/19/2020] [Accepted: 05/20/2020] [Indexed: 12/16/2022] Open
Abstract
During nuclear maturation of most eukaryotic pre-messenger RNAs and long non-coding RNAs, introns are removed through the process of RNA splicing. Different classes of introns are excised by the U2-type or the U12-type spliceosomes, large complexes of small nuclear ribonucleoprotein particles and associated proteins. We created intronIC, a program for assigning intron class to all introns in a given genome, and used it on 24 eukaryotic genomes to create the Intron Annotation and Orthology Database (IAOD). We then used the data in the IAOD to revisit several hypotheses concerning the evolution of the two classes of spliceosomal introns, finding support for the class conversion model explaining the low abundance of U12-type introns in modern genomes.
Collapse
Affiliation(s)
- Devlin C Moyer
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Graham E Larue
- Department of Molecular and Cell Biology, University of California, Merced, Merced, CA 95343, USA
| | - Courtney E Hershberger
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Scott W Roy
- Department of Biology, San Francisco State University, San Francisco, CA 94132, USA
| | - Richard A Padgett
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
2
|
Arribere JA, Kuroyanagi H, Hundley HA. mRNA Editing, Processing and Quality Control in Caenorhabditis elegans. Genetics 2020; 215:531-568. [PMID: 32632025 PMCID: PMC7337075 DOI: 10.1534/genetics.119.301807] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 05/03/2020] [Indexed: 02/06/2023] Open
Abstract
While DNA serves as the blueprint of life, the distinct functions of each cell are determined by the dynamic expression of genes from the static genome. The amount and specific sequences of RNAs expressed in a given cell involves a number of regulated processes including RNA synthesis (transcription), processing, splicing, modification, polyadenylation, stability, translation, and degradation. As errors during mRNA production can create gene products that are deleterious to the organism, quality control mechanisms exist to survey and remove errors in mRNA expression and processing. Here, we will provide an overview of mRNA processing and quality control mechanisms that occur in Caenorhabditis elegans, with a focus on those that occur on protein-coding genes after transcription initiation. In addition, we will describe the genetic and technical approaches that have allowed studies in C. elegans to reveal important mechanistic insight into these processes.
Collapse
Affiliation(s)
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan, and
| | - Heather A Hundley
- Medical Sciences Program, Indiana University School of Medicine-Bloomington, Indiana 47405
| |
Collapse
|
3
|
Abou Alezz M, Celli L, Belotti G, Lisa A, Bione S. GC-AG Introns Features in Long Non-coding and Protein-Coding Genes Suggest Their Role in Gene Expression Regulation. Front Genet 2020; 11:488. [PMID: 32499820 PMCID: PMC7242645 DOI: 10.3389/fgene.2020.00488] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 04/20/2020] [Indexed: 12/16/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are recognized as an important class of regulatory molecules involved in a variety of biological functions. However, the regulatory mechanisms of long non-coding genes expression are still poorly understood. The characterization of the genomic features of lncRNAs is crucial to get insight into their function. In this study, we exploited recent annotations by GENCODE to characterize the genomic and splicing features of long non-coding genes in comparison with protein-coding ones, both in human and mouse. Our analysis highlighted differences between the two classes of genes in terms of their gene architecture. Significant differences in the splice sites usage were observed between long non-coding and protein-coding genes (PCG). While the frequency of non-canonical GC-AG splice junctions represents about 0.8% of total splice sites in PCGs, we identified a significant enrichment of the GC-AG splice sites in long non-coding genes, both in human (3.0%) and mouse (1.9%). In addition, we found a positional bias of GC-AG splice sites being enriched in the first intron in both classes of genes. Moreover, a significant shorter length and weaker donor and acceptor sites were found comparing GC-AG introns to GT-AG introns. Genes containing at least one GC-AG intron were found conserved in many species, more prone to alternative splicing and a functional analysis pointed toward their enrichment in specific biological processes such as DNA repair. Our study shows for the first time that GC-AG introns are mainly associated with lncRNAs and are preferentially located in the first intron. Additionally, we discovered their regulatory potential indicating the existence of a new mechanism of non-coding and PCGs expression regulation.
Collapse
Affiliation(s)
- Monah Abou Alezz
- Computational Biology Unit, Institute of Molecular Genetics Luigi Luca Cavalli-Sforza, National Research Council, Pavia, Italy
| | - Ludovica Celli
- Computational Biology Unit, Institute of Molecular Genetics Luigi Luca Cavalli-Sforza, National Research Council, Pavia, Italy
| | - Giulia Belotti
- Computational Biology Unit, Institute of Molecular Genetics Luigi Luca Cavalli-Sforza, National Research Council, Pavia, Italy
| | - Antonella Lisa
- Computational Biology Unit, Institute of Molecular Genetics Luigi Luca Cavalli-Sforza, National Research Council, Pavia, Italy
| | - Silvia Bione
- Computational Biology Unit, Institute of Molecular Genetics Luigi Luca Cavalli-Sforza, National Research Council, Pavia, Italy
| |
Collapse
|
4
|
Ryskaliyeva A, Henry C, Miranda G, Faye B, Konuspayeva G, Martin P. The main WAP isoform usually found in camel milk arises from the usage of an improbable intron cryptic splice site in the precursor to mRNA in which a GC-AG intron occurs. BMC Genet 2019; 20:14. [PMID: 30696406 PMCID: PMC6350295 DOI: 10.1186/s12863-018-0704-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 11/30/2018] [Indexed: 01/10/2023] Open
Abstract
Background Whey acidic protein (WAP) is a major protein identified in the milk of several mammalian species with cysteine-rich domains known as four-disulfide cores (4-DSC). The organization of the eutherian WAP genes is highly conserved through evolution. It has been proposed that WAP could play an important role in regulating the proliferation of mammary epithelial cells. A bacteriostatic activity was also reported. Conversely to the other mammalian species expressing WAP in their milk, camel WAP contains 4 additional amino acid residues at the beginning of the second 4-DSC domain, introducing a phosphorylation site. The aim of this study was to elucidate the origin of this specificity, which possibly impacts its physiological functions. Results Using LC-ESI-MS, we identified in Camelus bactrianus from Kazakhstan a phosphorylated whey protein, exhibiting a molecular mass (12,596 Da), 32 Da higher than the original WAP (12,564 Da) and co-eluting with WAP. cDNA sequencing revealed a transition G/A, which modifies an amino acid residue of the mature protein (V12 M), accounting for the mass difference observed between WAP genetic variants. We also report the existence of two splicing variants of camel WAP precursors to mRNA, arising from an alternative usage of the canonical splice site recognized as such in the other mammalian species. However, the major camel WAP isoform results from the usage of an unlikely intron cryptic splice site, extending camel exon 3 upstream by 12-nucleotides encoding the 4 additional amino acid residues (VSSP) in which a potentially phosphorylable Serine residue occurs. Combining protein and cDNA sequences with genome data available (NCBI database), we report another feature of the camel WAP gene which displays a very rare GC-AG type intron. This result was confirmed by sequencing a genomic DNA fragment encompassing exon 3 to exon 4, suggesting for the GC donor site a compensatory effect in terms of consensus at the acceptor exon position. Conclusions Combining proteomic and molecular biology approaches we report: the characterization of a new genetic variant of camel WAP, the usage of an unlikely intron cryptic splice site, and the occurrence of an extremely rare GC-AG type of intron.
Collapse
Affiliation(s)
- Alma Ryskaliyeva
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Céline Henry
- Plateforme d'Analyse Protéomique Paris Sud-Ouest (PAPPSO), INRA, MICALIS Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Guy Miranda
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Bernard Faye
- CIRAD, UMR SELMET, 34398, Montpellier Cedex 5, France
| | - Gaukhar Konuspayeva
- Biological Technology Department, Al-Farabi Kazakh National University, Almaty, Kazakhstan
| | - Patrice Martin
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.
| |
Collapse
|
5
|
Fanelli E, Troccoli A, De Luca F. Functional Variation of Two Novel Cellulases, Pv-eng-5 and Pv-eng-8, and the Heat Shock 90 Gene, Pv-hsp-90, in Pratylenchus vulnus and Their Expression in Response to Different Temperature Stress. Int J Mol Sci 2018; 20:E107. [PMID: 30597892 PMCID: PMC6337429 DOI: 10.3390/ijms20010107] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 12/12/2018] [Accepted: 12/20/2018] [Indexed: 11/18/2022] Open
Abstract
Functional characterization of two novel endoglucanase genes, Pv-eng-5 and Pv-eng-8, of the root-lesion nematode Pratylenchus vulnus was carried out. In situ-hybridization experiments revealed that Pv-eng-8 transcript was localized in the pharyngeal glands. Silencing of Pv-eng-5 and Pv-eng-8 resulted in a significant reduction of expression level (52% and 67%, respectively). Furthermore, the silencing of Pv-eng-8 determined a reduction (41%) in nematode reproduction, suggesting that treated nematodes are much less able to process food. Surprisingly, no significant difference on reproduction rate was observed with Pv-eng-5 dsRNA nematodes, suggesting a neofunctionalization of Pv-eng-5 despite the high similarity with nematode endoglucanases. Pratylenchus species are poikilothermic organisms showing close relationships with the environmental temperature. The effects of different temperature ranges revealed that the reproductive potential of P. vulnus increased with increasing temperature from 23 °C to 28 °C, but no reproduction was observed at 33 °C. In real time, increasing temperature from 23 °C to 28 °C the heat shock gene Pv-hsp-90 was differentially expressed in adult stages, while the levels of the effector genes Pv-eng-1 and Pv-eng-8 in females showed no significant differences compared to those observed at 23 °C, only in males Pv-eng-8 level decreased (45%). The upregulation of Pv-hsp-90 in both adult stages suggests a protective mechanism in order to cope with unfavorable environmental conditions.
Collapse
Affiliation(s)
- Elena Fanelli
- Istituto per la Protezione Sostenibile delle Piante (IPSP), SS-Bari, Consiglio Nazionale delle Ricerche, (CNR), 70126 Bari, Italy.
| | - Alberto Troccoli
- Istituto per la Protezione Sostenibile delle Piante (IPSP), SS-Bari, Consiglio Nazionale delle Ricerche, (CNR), 70126 Bari, Italy.
| | - Francesca De Luca
- Istituto per la Protezione Sostenibile delle Piante (IPSP), SS-Bari, Consiglio Nazionale delle Ricerche, (CNR), 70126 Bari, Italy.
| |
Collapse
|
6
|
Chen C, Gao S, Sun Q, Tang Y, Han Y, Zhang J, Li Z. Induced splice site mutation generates alternative intron splicing in starch synthase II ( SSII) gene in rice. BIOTECHNOL BIOTEC EQ 2017. [DOI: 10.1080/13102818.2017.1370984] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Affiliation(s)
- Chao Chen
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Shan Gao
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Qing Sun
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Yuling Tang
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Yuhao Han
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Jinkun Zhang
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| | - Zhipeng Li
- The Laboratory of Vector Biology and Control, Department of Biotechnology, College of Engineering, Beijing Normal University (Zhuhai), Zhuhai, P.R. China
| |
Collapse
|
7
|
Kaisers W, Ptok J, Schwender H, Schaal H. Validation of Splicing Events in Transcriptome Sequencing Data. Int J Mol Sci 2017; 18:ijms18061110. [PMID: 28545234 PMCID: PMC5485934 DOI: 10.3390/ijms18061110] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Revised: 04/26/2017] [Accepted: 04/28/2017] [Indexed: 11/16/2022] Open
Abstract
Genomic alignments of sequenced cellular messenger RNA contain gapped alignments which are interpreted as consequence of intron removal. The resulting gap-sites, genomic locations of alignment gaps, are landmarks representing potential splice-sites. As alignment algorithms report gap-sites with a considerable false discovery rate, validations are required. We describe two quality scores, gap quality score (gqs) and weighted gap information score (wgis), developed for validation of putative splicing events: While gqs solely relies on alignment data wgis additionally considers information from the genomic sequence. FASTQ files obtained from 54 human dermal fibroblast samples were aligned against the human genome (GRCh38) using TopHat and STAR aligner. Statistical properties of gap-sites validated by gqs and wgis were evaluated by their sequence similarity to known exon-intron borders. Within the 54 samples, TopHat identifies 1,000,380 and STAR reports 6,487,577 gap-sites. Due to the lack of strand information, however, the percentage of identified GT-AG gap-sites is rather low. While gap-sites from TopHat contain ≈89% GT-AG, gap-sites from STAR only contain ≈42% GT-AG dinucleotide pairs in merged data from 54 fibroblast samples. Validation with gqs yields 156,251 gap-sites from TopHat alignments and 166,294 from STAR alignments. Validation with wgis yields 770,327 gap-sites from TopHat alignments and 1,065,596 from STAR alignments. Both alignment algorithms, TopHat and STAR, report gap-sites with considerable false discovery rate, which can drastically be reduced by validation with gqs and wgis.
Collapse
Affiliation(s)
- Wolfgang Kaisers
- Department for Anaesthesiology, University Hospital Düsseldorf, Heinrich Heine University, 40225 Düsseldorf, Germany.
- BMFZ (Biologisch-Medizinisches Forschungszentrum), Heinrich Heine University, 40225 Düsseldorf, Germany.
| | - Johannes Ptok
- Institute of Virology, University Hospital Düsseldorf, Heinrich Heine University, 40225 Düsseldorf, Germany.
| | - Holger Schwender
- BMFZ (Biologisch-Medizinisches Forschungszentrum), Heinrich Heine University, 40225 Düsseldorf, Germany.
- Mathematical Institute, Heinrich Heine University, 40225 Düsseldorf, Germany.
| | - Heiner Schaal
- BMFZ (Biologisch-Medizinisches Forschungszentrum), Heinrich Heine University, 40225 Düsseldorf, Germany.
- Institute of Virology, University Hospital Düsseldorf, Heinrich Heine University, 40225 Düsseldorf, Germany.
| |
Collapse
|
8
|
Eves-van den Akker S, Laetsch DR, Thorpe P, Lilley CJ, Danchin EGJ, Da Rocha M, Rancurel C, Holroyd NE, Cotton JA, Szitenberg A, Grenier E, Montarry J, Mimee B, Duceppe MO, Boyes I, Marvin JMC, Jones LM, Yusup HB, Lafond-Lapalme J, Esquibet M, Sabeh M, Rott M, Overmars H, Finkers-Tomczak A, Smant G, Koutsovoulos G, Blok V, Mantelin S, Cock PJA, Phillips W, Henrissat B, Urwin PE, Blaxter M, Jones JT. The genome of the yellow potato cyst nematode, Globodera rostochiensis, reveals insights into the basis of parasitism and virulence. Genome Biol 2016; 17:124. [PMID: 27286965 PMCID: PMC4901422 DOI: 10.1186/s13059-016-0985-1] [Citation(s) in RCA: 102] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2016] [Accepted: 05/12/2016] [Indexed: 11/23/2022] Open
Abstract
Background The yellow potato cyst nematode, Globodera rostochiensis, is a devastating plant pathogen of global economic importance. This biotrophic parasite secretes effectors from pharyngeal glands, some of which were acquired by horizontal gene transfer, to manipulate host processes and promote parasitism. G. rostochiensis is classified into pathotypes with different plant resistance-breaking phenotypes. Results We generate a high quality genome assembly for G. rostochiensis pathotype Ro1, identify putative effectors and horizontal gene transfer events, map gene expression through the life cycle focusing on key parasitic transitions and sequence the genomes of eight populations including four additional pathotypes to identify variation. Horizontal gene transfer contributes 3.5 % of the predicted genes, of which approximately 8.5 % are deployed as effectors. Over one-third of all effector genes are clustered in 21 putative ‘effector islands’ in the genome. We identify a dorsal gland promoter element motif (termed DOG Box) present upstream in representatives from 26 out of 28 dorsal gland effector families, and predict a putative effector superset associated with this motif. We validate gland cell expression in two novel genes by in situ hybridisation and catalogue dorsal gland promoter element-containing effectors from available cyst nematode genomes. Comparison of effector diversity between pathotypes highlights correlation with plant resistance-breaking. Conclusions These G. rostochiensis genome resources will facilitate major advances in understanding nematode plant-parasitism. Dorsal gland promoter element-containing effectors are at the front line of the evolutionary arms race between plant and parasite and the ability to predict gland cell expression a priori promises rapid advances in understanding their roles and mechanisms of action. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-0985-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Dominik R Laetsch
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Peter Thorpe
- Cell and Molecular Sciences Group, Dundee Effector Consortium, James Hutton Institute, Dundee, DD2 5DA, UK
| | | | - Etienne G J Danchin
- INRA, University Nice Sophia Antipolis, CNRS, UMR 1355-7254 Institut Sophia Agrobiotech, 06900, Sophia Antipolis, France
| | - Martine Da Rocha
- INRA, University Nice Sophia Antipolis, CNRS, UMR 1355-7254 Institut Sophia Agrobiotech, 06900, Sophia Antipolis, France
| | - Corinne Rancurel
- INRA, University Nice Sophia Antipolis, CNRS, UMR 1355-7254 Institut Sophia Agrobiotech, 06900, Sophia Antipolis, France
| | - Nancy E Holroyd
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, CB10 1SA, UK
| | - James A Cotton
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, CB10 1SA, UK
| | - Amir Szitenberg
- School of Biological, Biomedical and Environmental Sciences, University of Hull, Hull, HU6 7RX, UK
| | - Eric Grenier
- INRA, UMR1349 IGEPP (Institute for Genetics, Environment and Plant Protection), 35653, Le Rheu, France
| | - Josselin Montarry
- INRA, UMR1349 IGEPP (Institute for Genetics, Environment and Plant Protection), 35653, Le Rheu, France
| | - Benjamin Mimee
- Agriculture and Agri-food Canada, Horticulture Research and Development Centre, 430 Bboul. Gouin, St-Jean-sur-Richelieu, Quebec, J3B 3E6, Canada
| | - Marc-Olivier Duceppe
- Agriculture and Agri-food Canada, Horticulture Research and Development Centre, 430 Bboul. Gouin, St-Jean-sur-Richelieu, Quebec, J3B 3E6, Canada
| | - Ian Boyes
- Sidney Laboratory, Canadian Food Inspection Agency (CFIA), 8801 East Saanich Rd, Sidney, BC, V8L 1H3, Canada
| | | | - Laura M Jones
- Centre for Plant Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Hazijah B Yusup
- Centre for Plant Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Joël Lafond-Lapalme
- Agriculture and Agri-food Canada, Horticulture Research and Development Centre, 430 Bboul. Gouin, St-Jean-sur-Richelieu, Quebec, J3B 3E6, Canada
| | - Magali Esquibet
- INRA, UMR1349 IGEPP (Institute for Genetics, Environment and Plant Protection), 35653, Le Rheu, France
| | - Michael Sabeh
- Agriculture and Agri-food Canada, Horticulture Research and Development Centre, 430 Bboul. Gouin, St-Jean-sur-Richelieu, Quebec, J3B 3E6, Canada
| | - Michael Rott
- Sidney Laboratory, Canadian Food Inspection Agency (CFIA), 8801 East Saanich Rd, Sidney, BC, V8L 1H3, Canada
| | - Hein Overmars
- Laboratory of Nematology, Department of Plant Sciences, Wageningen University, Droevendaalsesteeg 1, 6708, PB, Wageningen, The Netherlands
| | - Anna Finkers-Tomczak
- Laboratory of Nematology, Department of Plant Sciences, Wageningen University, Droevendaalsesteeg 1, 6708, PB, Wageningen, The Netherlands
| | - Geert Smant
- Laboratory of Nematology, Department of Plant Sciences, Wageningen University, Droevendaalsesteeg 1, 6708, PB, Wageningen, The Netherlands
| | | | - Vivian Blok
- Cell and Molecular Sciences Group, Dundee Effector Consortium, James Hutton Institute, Dundee, DD2 5DA, UK
| | - Sophie Mantelin
- Cell and Molecular Sciences Group, Dundee Effector Consortium, James Hutton Institute, Dundee, DD2 5DA, UK
| | - Peter J A Cock
- Information and Computational Sciences Group, James Hutton Institute, Dundee, UK
| | - Wendy Phillips
- USDA-ARS Horticultural Crops Research Laboratory, Corvallis, OR, USA
| | - Bernard Henrissat
- CNRS UMR 7257, INRA, USC 1408, Aix-Marseille University, AFMB, 13288, Marseille, France.,Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Peter E Urwin
- Centre for Plant Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Mark Blaxter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - John T Jones
- Cell and Molecular Sciences Group, Dundee Effector Consortium, James Hutton Institute, Dundee, DD2 5DA, UK.,School of Biology, University of St Andrews, North Haugh, St Andrews, KY16 9TZ, UK
| |
Collapse
|
9
|
Kuroyanagi H, Takei S, Suzuki Y. Comprehensive analysis of mutually exclusive alternative splicing in C. elegans. WORM 2014; 3:e28459. [PMID: 25254147 PMCID: PMC4165533 DOI: 10.4161/worm.28459] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 02/28/2014] [Accepted: 03/06/2014] [Indexed: 12/03/2022]
Abstract
Mutually exclusive selection of one exon in a cluster of exons is a rare form of alternative pre-mRNA splicing, yet suggests strict regulation. However, the repertoires of regulation mechanisms for the mutually exclusive (ME) splicing in vivo are still unknown. Here, we experimentally explore putative ME exons in C. elegans to demonstrate that 29 ME exon clusters in 27 genes are actually selected in a mutually exclusive manner. Twenty-two of the clusters consist of homologous ME exons. Five clusters have too short intervening introns to be excised between the ME exons. Fidelity of ME splicing relies at least in part on nonsense-mediated mRNA decay for 14 clusters. These results thus characterize all the repertoires of ME splicing in this organism.
Collapse
Affiliation(s)
- Hidehito Kuroyanagi
- Laboratory of Gene Expression; Medical Research Institute; Tokyo Medical and Dental University; Bunkyo-ku, Tokyo, Japan
| | - Satomi Takei
- Laboratory of Gene Expression; Medical Research Institute; Tokyo Medical and Dental University; Bunkyo-ku, Tokyo, Japan
| | - Yutaka Suzuki
- Department of Medical Genome Science; University of Tokyo; Kashiwa, Chiba, Japan
| |
Collapse
|
10
|
Dibari B, Murat F, Chosson A, Gautier V, Poncet C, Lecomte P, Mercier I, Bergès H, Pont C, Blanco A, Salse J. Deciphering the genomic structure, function and evolution of carotenogenesis related phytoene synthases in grasses. BMC Genomics 2012; 13:221. [PMID: 22672222 PMCID: PMC3413518 DOI: 10.1186/1471-2164-13-221] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2012] [Accepted: 06/06/2012] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Carotenoids are isoprenoid pigments, essential for photosynthesis and photoprotection in plants. The enzyme phytoene synthase (PSY) plays an essential role in mediating condensation of two geranylgeranyl diphosphate molecules, the first committed step in carotenogenesis. PSY are nuclear enzymes encoded by a small gene family consisting of three paralogous genes (PSY1-3) that have been widely characterized in rice, maize and sorghum. RESULTS In wheat, for which yellow pigment content is extremely important for flour colour, only PSY1 has been extensively studied because of its association with QTLs reported for yellow pigment whereas PSY2 has been partially characterized. Here, we report the isolation of bread wheat PSY3 genes from a Renan BAC library using Brachypodium as a model genome for the Triticeae to develop Conserved Orthologous Set markers prior to gene cloning and sequencing. Wheat PSY3 homoeologous genes were sequenced and annotated, unravelling their novel structure associated with intron-loss events and consequent exonic fusions. A wheat PSY3 promoter region was also investigated for the presence of cis-acting elements involved in the response to abscisic acid (ABA), since carotenoids also play an important role as precursors of signalling molecules devoted to plant development and biotic/abiotic stress responses. Expression of wheat PSYs in leaves and roots was investigated during ABA treatment to confirm the up-regulation of PSY3 during abiotic stress. CONCLUSIONS We investigated the structural and functional determinisms of PSY genes in wheat. More generally, among eudicots and monocots, the PSY gene family was found to be associated with differences in gene copy numbers, allowing us to propose an evolutionary model for the entire PSY gene family in Grasses.
Collapse
Affiliation(s)
- Bianca Dibari
- INRA-UMR 1095 Génétique Diversité Ecophysiologie des Céréales (GDEC), Clermont-Ferrand, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 217] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
12
|
Short-read sequencing for genomic analysis of the brown rot fungus Fibroporia radiculosa. Appl Environ Microbiol 2012; 78:2272-81. [PMID: 22247176 DOI: 10.1128/aem.06745-11] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The feasibility of short-read sequencing for genomic analysis was demonstrated for Fibroporia radiculosa, a copper-tolerant fungus that causes brown rot decay of wood. The effect of read quality on genomic assembly was assessed by filtering Illumina GAIIx reads from a single run of a paired-end library (75-nucleotide read length and 300-bp fragment size) at three different stringency levels and then assembling each data set with Velvet. A simple approach was devised to determine which filter stringency was "best." Venn diagrams identified the regions containing reads that were used in an assembly but were of a low-enough quality to be removed by a filter. By plotting base quality histograms of reads in this region, we judged whether a filter was too stringent or not stringent enough. Our best assembly had a genome size of 33.6 Mb, an N50 of 65.8 kb for a k-mer of 51, and a maximum contig length of 347 kb. Using GeneMark, 9,262 genes were predicted. TargetP and SignalP analyses showed that among the 1,213 genes with secreted products, 986 had motifs for signal peptides and 227 had motifs for signal anchors. Blast2GO analysis provided functional annotation for 5,407 genes. We identified 29 genes with putative roles in copper tolerance and 73 genes for lignocellulose degradation. A search for homologs of these 102 genes showed that F. radiculosa exhibited more similarity to Postia placenta than Serpula lacrymans. Notable differences were found, however, and their involvements in copper tolerance and wood decay are discussed.
Collapse
|
13
|
Kim HJ, Triplett BA, Zhang HB, Lee MK, Hinchliffe DJ, Li P, Fang DD. Cloning and characterization of homeologous cellulose synthase catalytic subunit 2 genes from allotetraploid cotton (Gossypium hirsutum L.). Gene 2011; 494:181-9. [PMID: 22200568 DOI: 10.1016/j.gene.2011.12.018] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2011] [Revised: 12/07/2011] [Accepted: 12/08/2011] [Indexed: 12/25/2022]
Abstract
Cellulose synthase catalytic subunits (CesAs) are the catalytic sites within a multisubunit complex for cellulose biosynthesis in plants. CesAs have been extensively studied in diploid plants, but are not well characterized in polyploid plants. Gossypium hirsutum is an allotetraploid cotton species producing over 90% of the world's cotton fibers. Although G. hirsutum CesAs (GhCesAs) are responsible for cellulose production in cotton fiber, very limited numbers of GhCesA genes have been identified. Here, we report isolating and characterizing a pair of homeologous CesA2 genes and their full-length cDNAs from allotetraploid cotton. The GhCesA2-A(T) gene from the A-subgenome and GhCesA2-D(T) gene from the D-subgenome were screened from a G. hirsutum BAC library. These genes shared 92% sequence similarity throughout the entire sequence. The coding sequences were nearly identical, and the deduced amino acid sequences from GhCesA2-A(T) (1,039 amino acids) and GhCesA2-D(T) (1,040 amino acids) were identical except four amino acids, whereas the noncoding sequences showed divergence. Sequence analyses showed that all exons of GhCesA2-A(T) contained consensus splice donor dinucleotides, but one exon in GhCesA2-D(T) contained nonconsensus splice donor dinucleotides. Although the nonconsensus splice donor dinucleotides were previously suggested to be involved in alternative splice or pseudogenization, our results showed that a majority of GhCesA2-A(T) and GhCesA2-D(T) transcripts consisted of functional and full-length transcripts with little evidence for alternative mRNA isoforms in developing cotton fibers. Expression analyses showed that GhCesA2-A(T) and GhCesA2-D(T) shared common temporal and spatial expression patterns, and they were highly and preferentially expressed during the cellulose biosynthesis stage in developing cotton fibers. The observations of higher expression levels of both GhCesA2-A(T) and GhCesA2-D(T) in developing fibers of one near-isogenic line (NIL) with higher fiber bundle strength over the other NIL with lower fiber bundle strength suggested that the differential expression of genes associated with secondary cell wall cellulose biosynthesis in developing fiber might affect cotton fiber properties.
Collapse
Affiliation(s)
- Hee Jin Kim
- USDA-ARS, Southern Regional Research Center, Cotton Fiber Bioscience, 1100 Robert E. Lee Blvd, New Orleans, LA 70124 USA.
| | | | | | | | | | | | | |
Collapse
|
14
|
Gravato-Nobre MJ, Stroud D, O'Rourke D, Darby C, Hodgkin J. Glycosylation genes expressed in seam cells determine complex surface properties and bacterial adhesion to the cuticle of Caenorhabditis elegans. Genetics 2011; 187:141-55. [PMID: 20980242 PMCID: PMC3018313 DOI: 10.1534/genetics.110.122002] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2010] [Accepted: 10/19/2010] [Indexed: 11/18/2022] Open
Abstract
The surface of the nematode Caenorhabditis elegans is poorly understood but critical for its interactions with the environment and with pathogens. We show here that six genes (bus-2, bus-4, and bus-12, together with the previously cloned srf-3, bus-8, and bus-17) encode proteins predicted to act in surface glycosylation, thereby affecting disease susceptibility, locomotory competence, and sexual recognition. Mutations in all six genes cause resistance to the bacterial pathogen Microbacterium nematophilum, and most of these mutations also affect bacterial adhesion and biofilm formation by Yersinia species, demonstrating that both infection and biofilm formation depend on interaction with complex surface carbohydrates. A new bacterial interaction, involving locomotory inhibition by a strain of Bacillus pumilus, reveals diversity in the surface properties of these mutants. Another biological property--contact recognition of hermaphrodites by males during mating--was also found to be impaired in mutants of all six genes. An important common feature is that all are expressed most strongly in seam cells, rather than in the main hypodermal syncytium, indicating that seam cells play the major role in secreting surface coat and consequently in determining environmental interactions. To test for possible redundancies in gene action, the 15 double mutants for this set of genes were constructed and examined, but no synthetic phenotypes were observed. Comparison of the six genes shows that each has distinctive properties, suggesting that they do not act in a linear pathway.
Collapse
Affiliation(s)
- Maria J. Gravato-Nobre
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom and Department of Cell and Tissue Biology, University of California, San Francisco, California 94143
| | - Dave Stroud
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom and Department of Cell and Tissue Biology, University of California, San Francisco, California 94143
| | - Delia O'Rourke
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom and Department of Cell and Tissue Biology, University of California, San Francisco, California 94143
| | - Creg Darby
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom and Department of Cell and Tissue Biology, University of California, San Francisco, California 94143
| | - Jonathan Hodgkin
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom and Department of Cell and Tissue Biology, University of California, San Francisco, California 94143
| |
Collapse
|
15
|
|
16
|
Cloning and characterization of squalene synthase gene from Fusarium fujikuroi (Saw.) Wr. J Ind Microbiol Biotechnol 2010; 37:1171-82. [DOI: 10.1007/s10295-010-0764-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2010] [Accepted: 06/07/2010] [Indexed: 10/19/2022]
|
17
|
Genomic organization and regulation of the human orexin (hypocretin) receptor 2 gene: identification of alternative promoters. Biochem J 2010; 427:377-90. [DOI: 10.1042/bj20091755] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Orexins (hypocretins), acting via their receptors, are involved in the control of feeding behaviour, sleep, arousal and energy homoeostasis. However, regulation of the human orexin receptor 2 (hOX2R) gene remains unknown. We have identified four transcripts arising from alternative splicing from three exons. These exon 1 variants were designated exons 1A, 1B and 1C on the basis of their 5′–3′ order. RT (reverse transcription)–PCR demonstrates the differential expression in various human tissues. The alternative 5′-UTRs (untranslated regions) possessed by these isoforms have different translational efficiencies, which regulate the level of protein expression. In the present study, we have demonstrated that the hOX2R gene is regulated by two promoters and the novel transcripts are regulated by the distal promoter located upstream of exon 1A. We have demonstrated that the AP-1 (activator protein 1) motif is critical for sustaining the basal activity of distal promoter. Analysis of the proximal promoter revealed the region regulating promoter activity contained putative binding elements including those for CREB (cAMP-response-element-binding protein), GATA-2 and Oct-1. Using the chromatin immunoprecipitation assay, we demonstrated that CREB, GATA-2 and Oct-1 transcription factors bind to these critical regulatory promoter elements. Mutational studies suggested that these motifs functioned independently, but have a compound effect regulating hOX2R gene transcription. Furthermore, proximal promoter activity is enhanced by both PKA (protein kinase A) and PKC (protein kinase C) pathway activation, via binding of CREB and GATA-2 transcription factors. In conclusion, we have demonstrated that expression of hOX2R is regulated by a complex involving a proximal PKA/PKC-regulated promoter and a distal promoter regulating tissue-specific expression of alternative transcripts which in turn post-transcriptionally regulate receptor levels.
Collapse
|
18
|
Wilkerson MD, Ru Y, Brendel VP. Common introns within orthologous genes: software and application to plants. Brief Bioinform 2010; 10:631-44. [PMID: 19933210 DOI: 10.1093/bib/bbp051] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The residence of spliceosomal introns within protein-coding genes can fluctuate over time, with genes gaining, losing or conserving introns in a complex process that is not entirely understood. One approach for studying intron evolution is to compare introns with respect to position and type within closely related genes. Here, we describe new, freely available software called Common Introns Within Orthologous Genes (CIWOG), available at http://ciwog.gdcb.iastate.edu/, which detects common introns in protein-coding genes based on position and sequence conservation in the corresponding protein alignments. CIWOG provides dynamic web displays that facilitate detailed intron studies within orthologous genes. User-supplied options control how introns are clustered into sets of common introns. CIWOG also identifies special classes of introns, in particular those with GC- or U12-type donor sites, which enables analyses of these introns in relation to their counterparts in the other genes in orthologous groups. The software is demonstrated with application to a comprehensive study of eight plant transcriptomes. Three specific examples are discussed: intron class conversion from GT- to GC-donor-type introns in monocots, plant U12-type intron conservation and a global analysis of intron evolution across the eight plant species.
Collapse
|
19
|
Grohmann M, Hammer P, Walther M, Paulmann N, Büttner A, Eisenmenger W, Baghai TC, Schüle C, Rupprecht R, Bader M, Bondy B, Zill P, Priller J, Walther DJ. Alternative splicing and extensive RNA editing of human TPH2 transcripts. PLoS One 2010; 5:e8956. [PMID: 20126463 PMCID: PMC2813293 DOI: 10.1371/journal.pone.0008956] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2009] [Accepted: 01/08/2010] [Indexed: 11/19/2022] Open
Abstract
Brain serotonin (5-HT) neurotransmission plays a key role in the regulation of mood and has been implicated in a variety of neuropsychiatric conditions. Tryptophan hydroxylase (TPH) is the rate-limiting enzyme in the biosynthesis of 5-HT. Recently, we discovered a second TPH isoform (TPH2) in vertebrates, including man, which is predominantly expressed in brain, while the previously known TPH isoform (TPH1) is primarly a non-neuronal enzyme. Overwhelming evidence now points to TPH2 as a candidate gene for 5-HT-related psychiatric disorders. To assess the role of TPH2 gene variability in the etiology of psychiatric diseases we performed cDNA sequence analysis of TPH2 transcripts from human post mortem amygdala samples obtained from individuals with psychiatric disorders (drug abuse, schizophrenia, suicide) and controls. Here we show that TPH2 exists in two alternatively spliced variants in the coding region, denoted TPH2a and TPH2b. Moreover, we found evidence that the pre-mRNAs of both splice variants are dynamically RNA-edited in a mutually exclusive manner. Kinetic studies with cell lines expressing recombinant TPH2 variants revealed a higher activity of the novel TPH2B protein compared with the previously known TPH2A, whereas RNA editing was shown to inhibit the enzymatic activity of both TPH2 splice variants. Therefore, our results strongly suggest a complex fine-tuning of central nervous system 5-HT biosynthesis by TPH2 alternative splicing and RNA editing. Finally, we present molecular and large-scale linkage data evidencing that deregulated alternative splicing and RNA editing is involved in the etiology of psychiatric diseases, such as suicidal behaviour.
Collapse
Affiliation(s)
- Maik Grohmann
- Department of Human Molecular Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Biology, Chemistry, and Pharmacy, Free University Berlin, Berlin, Germany
- Neuropsychiatry and Laboratory of Molecular Psychiatry, Charité-Universitätsmedizin, Berlin, Germany
| | - Paul Hammer
- Department of Human Molecular Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Maria Walther
- Department of Human Molecular Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Nils Paulmann
- Department of Human Molecular Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Biology, Chemistry, and Pharmacy, Free University Berlin, Berlin, Germany
| | - Andreas Büttner
- Institute for Legal Medicine, Ludwig Maximilians University, Munich, Germany
| | | | - Thomas C. Baghai
- Department of Psychiatry, Ludwig Maximilians University, Munich, Germany
| | - Cornelius Schüle
- Department of Psychiatry, Ludwig Maximilians University, Munich, Germany
| | - Rainer Rupprecht
- Department of Psychiatry, Ludwig Maximilians University, Munich, Germany
| | - Michael Bader
- Laboratory of Molecular Biology of Peptide Hormones, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Brigitta Bondy
- Department of Psychiatry, Ludwig Maximilians University, Munich, Germany
| | - Peter Zill
- Department of Psychiatry, Ludwig Maximilians University, Munich, Germany
| | - Josef Priller
- Neuropsychiatry and Laboratory of Molecular Psychiatry, Charité-Universitätsmedizin, Berlin, Germany
| | - Diego J. Walther
- Department of Human Molecular Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
20
|
Filichkin SA, Priest HD, Givan SA, Shen R, Bryant DW, Fox SE, Wong WK, Mockler TC. Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res 2010. [PMID: 19858364 DOI: 10.1101/gr.093302.109.2008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least approximately 42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC(+) isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC(+) isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC(+) and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression.
Collapse
Affiliation(s)
- Sergei A Filichkin
- Department of Botany and Plant Pathology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331, USA
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Filichkin SA, Priest HD, Givan SA, Shen R, Bryant DW, Fox SE, Wong WK, Mockler TC. Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res 2009; 20:45-58. [PMID: 19858364 DOI: 10.1101/gr.093302.109] [Citation(s) in RCA: 670] [Impact Index Per Article: 44.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least approximately 42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC(+) isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC(+) isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC(+) and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression.
Collapse
Affiliation(s)
- Sergei A Filichkin
- Department of Botany and Plant Pathology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331, USA
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Alexander M, Chan KKM, Byrne AB, Selman G, Lee T, Ono J, Wong E, Puckrin R, Dixon SJ, Roy PJ. An UNC-40 pathway directs postsynaptic membrane extension in Caenorhabditis elegans. Development 2009; 136:911-22. [PMID: 19211675 DOI: 10.1242/dev.030759] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The postsynaptic membrane of the embryonic neuromuscular junction undergoes a dramatic expansion during later development to facilitate the depolarization of larger muscles. In C. elegans, the postsynaptic membrane resides at the termini of plasma membrane extensions called muscle arms. Membrane extension to the motor axons during larval development doubles the number of muscle arms, making them a tractable model to investigate both postsynaptic membrane expansion and guided membrane extension. To identify genes required for muscle arm extension, we performed a forward screen for mutants with fewer muscle arms. We isolated 23 mutations in 14 genes, including unc-40/Dcc, which encodes a transmembrane receptor that guides the migration of cells and extending axons in response to the secreted UNC-6/Netrin spatial cue. We discovered that UNC-40 is enriched at muscle arm termini and functions cell-autonomously to direct arm extension to the motor axons. Surprisingly, UNC-6 is dispensable for muscle arm extension, suggesting that UNC-40 relies on other spatial cues to direct arm extension. We provide the first evidence that the guanine-nucleotide exchange factor UNC-73/Trio, members of the WAVE actin-polymerization complex, and a homolog of the focal adhesion complex can function downstream of UNC-40 to direct membrane extension. Our work is the first to define a pathway for directed muscle membrane extension and illustrates that axon guidance components can play key roles in postsynaptic membrane expansion.
Collapse
Affiliation(s)
- Mariam Alexander
- Department of Molecular Genetics, The Terrence Donnelly Centre for Cellular and Biomolecular Research, 160 College Street, University of Toronto, Toronto, ON, M5S 3E1, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Akamine RN, Winter CE. Oscheius tipulae as an example of eEF1A gene diversity in nematodes. J Mol Evol 2008; 67:278-90. [PMID: 18696027 DOI: 10.1007/s00239-008-9147-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2008] [Revised: 04/15/2008] [Accepted: 07/14/2008] [Indexed: 10/21/2022]
Abstract
We characterized four eEF1A genes in the alternative rhabditid nematode model organism Oscheius tipulae. This is twice the copy number of eEF1A genes in C. elegans, C. briggsae, and, probably, many other free-living and parasitic nematodes. The introns show features remarkably different from those of other metazoan eEF1A genes. Most of the introns in the eEF1A genes are specific to O. tipulae and are not shared with any of the other genes described in metazoans. Most of the introns are phase 0 (inserted between two codons), and few are inserted in protosplice sites (introns inserted between the nucleotide sequence A/CAG and G/A). Two of these phase 0 introns are conserved in sequence in two or more of the four eEF1A gene copies, and are inserted in the same position in the genes. Neither of these characteristics has been detected in any of the nematode eEF1A genes characterized to date. The coding sequences were also compared with other eEF1A cDNAs from 11 different nematodes to determine the variability of these genes within the phylum Nematoda. Parsimony and distance trees yielded similar topologies, which were similar to those created using other molecular markers. The presence of more than one copy of the eEF1A gene with nearly identical coding regions makes it difficult to define the orthologous cDNAs. As shown by our data on O. tipulae, careful and extensive examination of intron positions in the eEF1A gene across the phylum is necessary to define their potential for use as valid phylogenetic markers.
Collapse
Affiliation(s)
- Rubens N Akamine
- Departmento de Parasitologia, Instituto de Ciências Biomédicas/Universidade de São Paulo, Av. Prof. Lineu Prestes 1374, 05508-900, Sao Paulo, SP, Brazil
| | | |
Collapse
|
24
|
Churbanov A, Winters-Hilt S, Koonin EV, Rogozin IB. Accumulation of GC donor splice signals in mammals. Biol Direct 2008; 3:30. [PMID: 18613975 PMCID: PMC2490688 DOI: 10.1186/1745-6150-3-30] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2008] [Accepted: 07/09/2008] [Indexed: 11/10/2022] Open
Abstract
UNLABELLED The GT dinucleotide in the first two intron positions is the most conserved element of the U2 donor splice signals. However, in a small fraction of donor sites, GT is replaced by GC. A substantial enrichment of GC in donor sites of alternatively spliced genes has been observed previously in human, nematode and Arabidopsis, suggesting that GC signals are important for regulation of alternative splicing. We used parsimony analysis to reconstruct evolution of donor splice sites and inferred 298 GT > GC conversion events compared to 40 GC > GT conversion events in primate and rodent genomes. Thus, there was substantive accumulation of GC donor splice sites during the evolution of mammals. Accumulation of GC sites might have been driven by selection for alternative splicing. REVIEWERS This article was reviewed by Jerzy Jurka and Anton Nekrutenko. For the full reviews, please go to the Reviewers' Reports section.
Collapse
Affiliation(s)
- Alexander Churbanov
- Loyola University Medical Center, 2160 S. First Ave., Maywood, IL, 60153, USA.
| | | | | | | |
Collapse
|
25
|
Ohno G, Hagiwara M, Kuroyanagi H. STAR family RNA-binding protein ASD-2 regulates developmental switching of mutually exclusive alternative splicing in vivo. Genes Dev 2008; 22:360-74. [PMID: 18230701 PMCID: PMC2216695 DOI: 10.1101/gad.1620608] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2007] [Accepted: 12/04/2007] [Indexed: 11/25/2022]
Abstract
Alternative splicing of pre-mRNAs greatly contributes to the spatiotemporal diversity of gene expression in metazoans. However, the molecular basis of developmental regulation and the precise sequence of alternative pre-mRNA processing in vivo are poorly understood. In the present study, we focus on the developmental switching of the mutually exclusive alternative splicing of the let-2 gene of Caenorhabditis elegans from the exon 9 form in embryos to the exon 10 form in adults. By visualizing the usage of the let-2 mutually exclusive exons through differential expression of green fluorescent protein (GFP) and red fluorescent protein (RFP), we isolated several switching-defective mutants and identified the alternative splicing defective-2 (asd-2) gene, encoding a novel member of the evolutionarily conserved STAR (signal transduction activators of RNA) family of RNA-binding proteins. Comparison of the amounts of partially spliced let-2 RNAs in synchronized wild-type and asd-2 mutant worms suggested that either of the introns downstream from the let-2 mutually exclusive exons is removed prior to the removal of the upstream ones, and that asd-2 promotes biased excision of intron 10 in the late larval stages. We propose that the developmental switching between alternative sequences of intron removal determines the ratio between the mature let-2 mRNA isoforms.
Collapse
Affiliation(s)
- Genta Ohno
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| | - Masatoshi Hagiwara
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| |
Collapse
|
26
|
Rubio MC, Becana M, Sato S, James EK, Tabata S, Spaink HP. Characterization of genomic clones and expression analysis of the three types of superoxide dismutases during nodule development in Lotus japonicus. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2007; 20:262-75. [PMID: 17378429 DOI: 10.1094/mpmi-20-3-0262] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Superoxide dismutases (SODs) are metalloenzymes that play a primary role in the protection against oxidative stress in plants and other organisms. We have characterized four SOD genes in Lotus japonicus and have analyzed their expression in roots and four developmental stages of nodules. The expression of cytosolic CuZnSOD, at the mRNA, protein, and enzyme activity levels, decreases with nodule age, and the protein is localized in the dividing cells and infection threads of emergent nodules and in the infected cells of young nodules. The mitochondrial MnSOD was downregulated, whereas the bacteroidal MnSOD displayed maximal protein and enzyme activity levels in older nodules. Two additional genes, encoding plastidic (FeSOD1) and cytosolic (FeSOD2) FeSOD isoforms, were identified and mapped. The genes are located in different chromosomes and show differential expression. The FeSOD1 mRNA level did not change during nodule development, whereas FeSOD2 was upregulated. The distinct expression patterns of the SOD genes may reflect different regulatory mechanisms of the enzyme activities during nodule ontogeny. In particular, at the mRNA and activity levels, the virtual loss of cytosolic CuZnSOD in mature and old nodules, concomitant with the induction of FeSOD2, suggests that the two enzymes may functionally compensate each other in the cytosol at the late stages of nodule development.
Collapse
Affiliation(s)
- Maria C Rubio
- Department of Cell Biology, Institute of Biology, Leiden University, Wassenaarseweg 64, 2333 AL Leiden, The Netherlands.
| | | | | | | | | | | |
Collapse
|
27
|
Lal A, Radhakrishnan S, Srinivas SS, Najarian K, Mays LE. Splice site detection using pruned maximum likelihood model. CONFERENCE PROCEEDINGS : ... ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL CONFERENCE 2007; 2004:2836-9. [PMID: 17270868 DOI: 10.1109/iembs.2004.1403809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
In this paper we propose a novel method for splice site prediction using the maximum likelihood model. We performed maximum likelihood over the acceptor and donor datasets, and calculated sensitivity to measure the prediction performance. Then, by aggressive pruning of less informative nucleotide sites, while maintaining the high sensitivity of the method, we improved the model's performance in terms of the computational speed. In addition, after pruning fewer nucleotide sites need to be tagged, which in turn simplifies the development of an assay. The proposed method was tested on the human splice dataset. The results indicate that the proposed method was successful at splice site prediction with optimal sensitivity.
Collapse
Affiliation(s)
- Anuradha Lal
- Coll. of Inf. Technol., North Carolina Univ., Charlotte, NC, USA
| | | | | | | | | |
Collapse
|
28
|
Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics 2006; 7:327. [PMID: 17194304 PMCID: PMC1769492 DOI: 10.1186/1471-2164-7-327] [Citation(s) in RCA: 317] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2006] [Accepted: 12/28/2006] [Indexed: 11/29/2022] Open
Abstract
Background Recently, genomic sequencing efforts were finished for Oryza sativa (cultivated rice) and Arabidopsis thaliana (Arabidopsis). Additionally, these two plant species have extensive cDNA and expressed sequence tag (EST) libraries. We employed the Program to Assemble Spliced Alignments (PASA) to identify and analyze alternatively spliced isoforms in both species. Results A comprehensive analysis of alternative splicing was performed in rice that started with >1.1 million publicly available spliced ESTs and over 30,000 full length cDNAs in conjunction with the newly enhanced PASA software. A parallel analysis was performed with Arabidopsis to compare and ascertain potential differences between monocots and dicots. Alternative splicing is a widespread phenomenon (observed in greater than 30% of the loci with transcript support) and we have described nine alternative splicing variations. While alternative splicing has the potential to create many RNA isoforms from a single locus, the majority of loci generate only two or three isoforms and transcript support indicates that these isoforms are generally not rare events. For the alternate donor (AD) and acceptor (AA) classes, the distance between the splice sites for the majority of events was found to be less than 50 basepairs (bp). In both species, the most frequent distance between AA is 3 bp, consistent with reports in mammalian systems. Conversely, the most frequent distance between AD is 4 bp in both plant species, as previously observed in mouse. Most alternative splicing variations are localized to the protein coding sequence and are predicted to significantly alter the coding sequence. Conclusion Alternative splicing is widespread in both rice and Arabidopsis and these species share many common features. Interestingly, alternative splicing may play a role beyond creating novel combinations of transcripts that expand the proteome. Many isoforms will presumably have negative consequences for protein structure and function, suggesting that their biological role involves post-transcriptional regulation of gene expression.
Collapse
|
29
|
Kabat JL, Barberan-Soler S, McKenna P, Clawson H, Farrer T, Zahler AM. Intronic alternative splicing regulators identified by comparative genomics in nematodes. PLoS Comput Biol 2006; 2:e86. [PMID: 16839192 PMCID: PMC1500816 DOI: 10.1371/journal.pcbi.0020086] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2006] [Accepted: 05/30/2006] [Indexed: 11/18/2022] Open
Abstract
Many alternative splicing events are regulated by pentameric and hexameric intronic sequences that serve as binding sites for splicing regulatory factors. We hypothesized that intronic elements that regulate alternative splicing are under selective pressure for evolutionary conservation. Using a Wobble Aware Bulk Aligner genomic alignment of Caenorhabditis elegans and Caenorhabditis briggsae, we identified 147 alternatively spliced cassette exons that exhibit short regions of high nucleotide conservation in the introns flanking the alternative exon. In vivo experiments on the alternatively spliced let-2 gene confirm that these conserved regions can be important for alternative splicing regulation. Conserved intronic element sequences were collected into a dataset and the occurrence of each pentamer and hexamer motif was counted. We compared the frequency of pentamers and hexamers in the conserved intronic elements to a dataset of all C. elegans intron sequences in order to identify short intronic motifs that are more likely to be associated with alternative splicing. High-scoring motifs were examined for upstream or downstream preferences in introns surrounding alternative exons. Many of the high-scoring nematode pentamer and hexamer motifs correspond to known mammalian splicing regulatory sequences, such as (T)GCATG, indicating that the mechanism of alternative splicing regulation is well conserved in metazoans. A comparison of the analysis of the conserved intronic elements, and analysis of the entire introns flanking these same exons, reveals that focusing on intronic conservation can increase the sensitivity of detecting putative splicing regulatory motifs. This approach also identified novel sequences whose role in splicing is under investigation and has allowed us to take a step forward in defining a catalog of splicing regulatory elements for an organism. In vivo experiments confirm that one novel high-scoring sequence from our analysis, (T)CTATC, is important for alternative splicing regulation of the unc-52 gene.
Collapse
Affiliation(s)
- Jennifer L Kabat
- Department of Molecular, Cell, and Developmental Biology and Center for Molecular Biology of RNA, University of California Santa Cruz, Santa Cruz, California, USA
| | | | | | | | | | | |
Collapse
|
30
|
Munch K, Krogh A. Automatic generation of gene finders for eukaryotic species. BMC Bioinformatics 2006; 7:263. [PMID: 16712739 PMCID: PMC1522026 DOI: 10.1186/1471-2105-7-263] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2005] [Accepted: 05/21/2006] [Indexed: 11/22/2022] Open
Abstract
Background The number of sequenced eukaryotic genomes is rapidly increasing. This means that over time it will be hard to keep supplying customised gene finders for each genome. This calls for procedures to automatically generate species-specific gene finders and to re-train them as the quantity and quality of reliable gene annotation grows. Results We present a procedure, Agene, that automatically generates a species-specific gene predictor from a set of reliable mRNA sequences and a genome. We apply a Hidden Markov model (HMM) that implements explicit length distribution modelling for all gene structure blocks using acyclic discrete phase type distributions. The state structure of the each HMM is generated dynamically from an array of sub-models to include only gene features represented in the training set. Conclusion Acyclic discrete phase type distributions are well suited to model sequence length distributions. The performance of each individual gene predictor on each individual genome is comparable to the best of the manually optimised species-specific gene finders. It is shown that species-specific gene finders are superior to gene finders trained on other species.
Collapse
Affiliation(s)
- Kasper Munch
- Bioinformatics Centre, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - Anders Krogh
- Bioinformatics Centre, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| |
Collapse
|
31
|
Rep M, Duyvesteijn RGE, Gale L, Usgaard T, Cornelissen BJC, Ma LJ, Ward TJ. The presence of GC-AG introns in Neurospora crassa and other euascomycetes determined from analyses of complete genomes: implications for automated gene prediction. Genomics 2006; 87:338-47. [PMID: 16406724 DOI: 10.1016/j.ygeno.2005.11.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2005] [Revised: 11/07/2005] [Accepted: 11/24/2005] [Indexed: 11/30/2022]
Abstract
A combination of experimental and computational approaches was employed to identify introns with noncanonical GC-AG splice sites (GC-AG introns) within euascomycete genomes. Evaluation of 2335 cDNA-confirmed introns from Neurospora crassa revealed 27 such introns (1.2%). A similar frequency (1.0%) of GC-AG introns was identified in Fusarium graminearum, in which 3 of 292 cDNA-confirmed introns contained GC-AG splice sites. Computational analyses of the N. crassa genome using a GC-AG intron consensus sequence identified an additional 20 probable GC-AG introns in this fungus. For 8 of the 47 GC-AG introns identified in N. crassa a GC donor site is also present in a homolog from Magnaporthe grisea, F. graminearum, or Aspergillus nidulans. In most cases, however, homologs in these fungi contain a GT-AG intron or no intron at the corresponding position. These findings have important implications for fungal genome annotation, as the automated annotations of euascomycete genomes incorrectly identified intron boundaries for all of the confirmed and probable GC-AG introns reported here.
Collapse
Affiliation(s)
- Martijn Rep
- Plant Pathology, Swammerdam Institute for Life Sciences, Faculty of Science, University of Amsterdam, Kruislaan 318, 1098 SM Amsterdam, The Netherlands.
| | | | | | | | | | | | | |
Collapse
|
32
|
Veit G, Kobbe B, Keene DR, Paulsson M, Koch M, Wagener R. Collagen XXVIII, a novel von Willebrand factor A domain-containing protein with many imperfections in the collagenous domain. J Biol Chem 2005; 281:3494-504. [PMID: 16330543 DOI: 10.1074/jbc.m509333200] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Here we describe a novel collagen belonging to the class of von Willebrand factor A (VWA) domain-containing proteins. This novel protein was identified by screening the EST data base and was subsequently recombinantly expressed and characterized as an authentic tissue component. The COL28A1 gene on human chromosome 7p21.3 and on mouse chromosome 6A1 encodes a novel protein that structurally resembles the beaded filament-forming collagens. The collagenous domain contains several very short interruptions arranged in a repeat pattern. As shown for other novel minor collagens, the expression of collagen XXVIII protein in mouse is very restricted. In addition to small amounts in skin and calvaria, the major signals were in dorsal root ganglia and peripheral nerves. By immunoelectron microscopy, collagen XXVIII was detected in the sciatic nerve, at the basement membrane of certain Schwann cells surrounding the nerve fibers. Even though the protein is present in the adult sciatic nerve, collagen XXVIII mRNA was only detected in sciatic nerve of newborn mice, indicating that the protein persists for an extended period after synthesis.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Animals, Newborn
- Aprotinin/chemistry
- Blotting, Northern
- Cattle
- Cloning, Molecular
- Collagen/biosynthesis
- Collagen/chemistry
- Collagen/metabolism
- DNA, Complementary/metabolism
- Electrophoresis, Polyacrylamide Gel
- Exons
- Expressed Sequence Tags
- Humans
- Immunoblotting
- Immunohistochemistry
- Ions
- Mice
- Mice, Inbred C57BL
- Microscopy, Immunoelectron
- Models, Genetic
- Molecular Sequence Data
- Neurons/metabolism
- Open Reading Frames
- Phylogeny
- Plasmids/metabolism
- Protein Structure, Tertiary
- RNA, Messenger/metabolism
- Recombinant Proteins/chemistry
- Reverse Transcriptase Polymerase Chain Reaction
- Schwann Cells/metabolism
- Sciatic Nerve/metabolism
- Sequence Homology, Amino Acid
- Tissue Distribution
- von Willebrand Factor/chemistry
Collapse
Affiliation(s)
- Guido Veit
- Center for Biochemistry, Center for Molecular Medicine, and Department of Dermatology, Medical Faculty, University of Cologne, D-50931 Cologne, Germany
| | | | | | | | | | | |
Collapse
|
33
|
Yochem J, Bell LR, Herman RK. The identities of sym-2, sym-3 and sym-4, three genes that are synthetically lethal with mec-8 in Caenorhabditis elegans. Genetics 2005; 168:1293-306. [PMID: 15579686 PMCID: PMC1448807 DOI: 10.1534/genetics.104.029827] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
On the basis of synthetic lethality, five genes in Caenorhabditis elegans are known to be redundant with the mec-8 gene, which encodes a protein that contains two copies of an RNA recognition motif (RRM) and affects alternative RNA splicing. The molecular identities of two of the redundant genes, sym-1 and sym-5, were previously reported. The remaining three genes have now been cloned, and their synthetically lethal phenotypes with mec-8 are described in more detail. Animals homozygous for mec-8 and sym-2 loss-of-function mutations die during late embryogenesis. The SYM-2 predicted protein contains three RRMs; we propose that SYM-2 and MEC-8 can substitute for each other in promoting the maturation of the transcripts of a vital gene. Animals homozygous for mutations in mec-8 and in either sym-3 or sym-4 have the same striking defect: they arrest development just prior to or just after hatching with a pharynx that appears fully formed but is not properly attached to the body cuticle. sym-3 encodes a protein of unknown function with orthologs in Drosophila and mammals. sym-4 encodes a WD-repeat protein and may also have orthologs in Drosophila and mammals. We propose that SYM-3 and SYM-4 contribute to a common developmental pathway that is redundant with a MEC-8-dependent pathway.
Collapse
Affiliation(s)
- John Yochem
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota 55455, USA.
| | | | | |
Collapse
|
34
|
Kurose K, Koyano S, Ikeda S, Tohkin M, Hasegawa R, Sawada JI. 5′ Diversity of human hepatic PXR (NR1I2) transcripts and identification of the major transcription initiation site. Mol Cell Biochem 2005; 273:79-85. [PMID: 16013442 DOI: 10.1007/s11010-005-7757-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The human pregnane X receptor (PXR) is a crucial regulator of the genes encoding several major cytochrome P450 enzymes and transporters, such as CYP3A4 and MDR1, but its own transcriptional regulation remains unclear. To elucidate the transcriptional mechanisms of human PXR gene, we first endeavored to identify the transcription initiation site of human PXR using 5'-RACE. Five types of 5'-variable transcripts (a, b, c, d, and e) with common exon 2 sequence were found, and comparison of these sequences with the genomic sequence suggested that their 5' diversity is derived from initiation by alternative promoters and alternative splicing. None of the exons found in our study contain any new in-frame coding regions. Newly identified introns IVS-a and IVS-b were found to have CT-AC splice sites that do not follow the GT-AG rule of conventional donor and acceptor splice sites. Of the five types of 5' variable transcripts identified, RT-PCR showed that type-a was the major transcript type. Four transcription initiation sites (A-D) for type-a transcript were identified by 5'-RACE using GeneRacer RACE Ready cDNA (human liver) constructed by the oligo-capping method. Putative TATA boxes were located approximately 30 bp upstream from the transcriptional start sites of the major transcript (C) and the longest minor transcript (A) expressed in the human liver. These results indicate that the initiation of transcription of human PXR is more complex than previously reported.
Collapse
Affiliation(s)
- Kouichi Kurose
- Project Team for Pharmacogenetics, National Institute of Health Sciences, Kamiyoga, Setagaya-ku, Tokyo, Japan.
| | | | | | | | | | | |
Collapse
|
35
|
Nikolaou S, Hartman D, Nisbet AJ, Presidente PJA, Gasser RB. Genomic organization and expression analysis for hcstk, a serine/threonine protein kinase gene of Haemonchus contortus, and comparison with Caenorhabditis elegans par-1. Gene 2005; 343:313-22. [PMID: 15588586 DOI: 10.1016/j.gene.2004.09.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2004] [Revised: 08/02/2004] [Accepted: 09/23/2004] [Indexed: 11/19/2022]
Abstract
The organization and expression of a putative serine/threonine kinase gene (designated hcstk), proposed to relate to a conserved eukaryotic signal transduction pathway, was characterized for the socio-economically important pathogen Haemonchus contortus (Nematoda). The entire hcstk gene is approximately 26.7 kb in size, has 26 exons and is inferred to produce multiple isoforms via alternative splicing in its N-terminal header and spacer domains. Comparison of hcstk with its Caenorhabditis elegans homologue, par-1, revealed major differences in genomic organization, exon number and inferred mRNA processing. The expression of hcstk transcripts was highest in the first- and late-fourth-stage larvae of the parasite compared with other developmental stages, somewhat distinct from par-1 in C. elegans. In spite of a substantial amino acid sequence identity in the functional domains between the predicted proteins HcSTK and PAR-1, overall, the findings suggest a unique functional role for each molecule.
Collapse
Affiliation(s)
- Sia Nikolaou
- Primary Industries Research Victoria (Animal Genetics and Genomics), 475 Mickleham Road, Attwood, Victoria 3049, Australia
| | | | | | | | | |
Collapse
|
36
|
Hammer SE, Strehl S, Hagemann S. Homologs of Drosophila P transposons were mobile in zebrafish but have been domesticated in a common ancestor of chicken and human. Mol Biol Evol 2004; 22:833-44. [PMID: 15616143 DOI: 10.1093/molbev/msi068] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A substantial fraction of vertebrate and invertebrate genomes is composed of mobile elements and their derivatives. One of the most intensively studied transposon families, the P elements of Drosophila, was thought to exist exclusively in the genomes of dipteran insects. Based on the data provided by the human genome project, in 2001 our group has identified a P element-homologous sequence in the human genome. This P element-homologous human gene, named Phsa, is 19,533 nucleotides long, comprises six exons and five introns, and encodes a protein of still unknown function with a length of 903 amino acid residues. The N-terminal THAP domain of the putative Phsa protein shows similarities to the site-specific DNA-binding domain of the Drosophila P element transposase. In the present study, FISH analysis and the screening of a human lambda genomic library revealed a single copy of Phsa located on the long arm of chromosome 4, upstream of a gene coding for the hypothetical protein DKFZp686L1814. The same gene arrangement was found for the homologous gene Pgga in the genome of chicken, thus, displaying Pgga at orthologous position on the long arm of chromosome 4. The single-copy gene status and the absence of terminal inverted repeats and target-site duplications indicate that Phsa and Pgga constitute domesticated stationary sequences. In contrast, a considerable number of P-homologous sequences with terminal inverted repeats and intact target-site duplications could be identified in zebrafish, strongly indicating that Pdre elements were mobile within the zebrafish genome. Pdre elements are the first P-like transposons identified in a vertebrate species. With respect to Phsa, gene expression studies showed that Phsa is expressed in a broad range of human tissues, suggesting that the putative Phsa protein plays a not yet understood but essential role in a specific metabolic pathway. We demonstrate that P-homologous DNA sequences occur in the genomes of 21 analyzed vertebrates but only as rudiments in the rodents. Finally, the evolutionary history of P element-homologous vertebrate sequences is discussed in the context of the "molecular domestication" hypothesis versus the "source gene hypothesis."
Collapse
Affiliation(s)
- Sabine E Hammer
- Laboratories of Genome Dynamics, Center of Anatomy and Cell Biology, Medical University of Vienna, Vienna, Austria
| | | | | |
Collapse
|
37
|
Abstract
Alternative splicing creates transcriptome diversification, possibly leading to speciation. A large fraction of the protein-coding genes of multicellular organisms are alternatively spliced, although no regulated splicing has been detected in unicellular eukaryotes such as yeasts. A comparative analysis of unicellular and multicellular eukaryotic 5' splice sites has revealed important differences - the plasticity of the 5' splice sites of multicellular eukaryotes means that these sites can be used in both constitutive and alternative splicing, and for the regulation of the inclusion/skipping ratio in alternative splicing. So, alternative splicing might have originated as a result of relaxation of the 5' splice site recognition in organisms that originally could support only constitutive splicing.
Collapse
Affiliation(s)
- Gil Ast
- Department of Human Genetics and Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel.
| |
Collapse
|
38
|
Sorek R, Lev-Maor G, Reznik M, Dagan T, Belinky F, Graur D, Ast G. Minimal conditions for exonization of intronic sequences: 5' splice site formation in alu exons. Mol Cell 2004; 14:221-31. [PMID: 15099521 DOI: 10.1016/s1097-2765(04)00181-9] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2004] [Revised: 03/17/2004] [Accepted: 03/23/2004] [Indexed: 02/05/2023]
Abstract
Alu exonization, which is an evolutionary pathway that creates primate-specific transcriptomic diversity, is a powerful tool for studying alternative-splicing regulation. Through bioinformatic analyses combined with experimental methodology, we identified the mutational changes needed to create functional 5' splice sites in Alu. We revealed a complex mechanism by which the sequence composition of the 5' splice site and its base pairing with the small nuclear RNA U1 govern alternative splicing. We show that in Alu-derived GC introns the strength of the base pairing between U1 snRNA and the 5' splice site controls the skipping/inclusion ratio of alternative splicing. Based on these findings, we identified 7810 Alus within the human genome that are prone to exonization. Mutations in these Alus may cause genetic disorders or contribute to human-specific protein diversity.
Collapse
Affiliation(s)
- Rotem Sorek
- Department of Human Genetics and Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | | | | | |
Collapse
|
39
|
Silar P, Barreau C, Debuchy R, Kicka S, Turcq B, Sainsard-Chanet A, Sellem CH, Billault A, Cattolico L, Duprat S, Weissenbach J. Characterization of the genomic organization of the region bordering the centromere of chromosome V of Podospora anserina by direct sequencing. Fungal Genet Biol 2003; 39:250-63. [PMID: 12892638 DOI: 10.1016/s1087-1845(03)00025-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.
Collapse
MESH Headings
- Amino Acid Sequence
- Centromere/chemistry
- Centromere/genetics
- Chromosomes, Artificial, Bacterial
- Chromosomes, Fungal/genetics
- Chromosomes, Fungal/ultrastructure
- DNA, Intergenic/analysis
- Gene Rearrangement
- Genes, Fungal
- Genes, rRNA
- Genome, Fungal
- Genomic Library
- Introns
- Molecular Sequence Data
- Physical Chromosome Mapping
- RNA, Transfer/genetics
- Sequence Analysis, DNA
- Sequence Homology
- Sordariales/genetics
- Synteny
Collapse
Affiliation(s)
- Philippe Silar
- Institut de Génétique et Microbiologie, UMR CNRS 8621, Bât. 400, Université de Paris Sud, 91405 Orsay Cedex, France.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|