1
Genome-Wide Identification and Expression Analyses of AnSnRK2 Gene Family under Osmotic Stress in Ammopiptanthus nanus. PLANTS 2021; 10:plants10050882. [PMID: 33925572 PMCID: PMC8145913 DOI: 10.3390/plants10050882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2021] [Revised: 04/23/2021] [Accepted: 04/24/2021] [Indexed: 11/29/2022]
Abstract
Sucrose non-fermenting-1 (SNF1)-related protein kinase 2s (SnRK2s) are plant-specific serine/threonine protein kinases that play crucial roles in the abscisic acid signaling pathway and the abiotic stress response. Ammopiptanthus nanus is a relict xerophytic shrub that is extremely tolerant of abiotic stresses. We therefore performed genome-wide identification of the AnSnRK2 genes and analyzed their expression profiles under osmotic stresses, including drought and salinity. A total of 11 AnSnRK2 genes (AnSnRK2.1-AnSnRK2.11) were identified in the A. nanus genome and were divided into three groups according to the phylogenetic tree. AnSnRK2.6 has seven introns, whereas the others have eight. All of the AnSnRK2 proteins are highly conserved at the N-terminus and share a similar motif composition. Cis-acting element analysis showed abundant hormone- and stress-related cis-elements in the promoter regions of the AnSnRK2s. Moreover, quantitative real-time PCR showed that the expression of most AnSnRK2s was induced by NaCl and PEG-6000 treatments, whereas the expression of AnSnRK2.3 and AnSnRK2.6 was inhibited, suggesting that the AnSnRK2s might play key roles in stress tolerance. The study provides insights into the function of AnSnRK2s.
2
Poverennaya IV, Roytberg MA. Spliceosomal Introns: Features, Functions, and Evolution. BIOCHEMISTRY (MOSCOW) 2021; 85:725-734. [PMID: 33040717 DOI: 10.1134/s0006297920070019] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Indexed: 11/23/2022]
Abstract
Spliceosomal introns, which have been found in most eukaryotic genes, are non-coding sequences excised from pre-mRNAs by a dedicated complex, the spliceosome, during mRNA splicing. Introns occur in both protein- and RNA-coding genes and can be found in coding and untranslated gene regions. Because intron sequences vary greatly owing to a high rate of polymorphism, the functions of introns were for a long time associated only with alternative splicing, and intron evolution was viewed not as the evolution of an individual genomic element but within the framework of the evolution of the gene's exon-intron structure. Here, we review the theories of intron origin; evolutionary events in the exon-intron structure, such as intron gain, loss, and sliding; intron functions known to date; and the mechanisms by which changes in intron features (length and phase) can affect the regulation of gene-mediated processes.
Affiliation(s)
- I V Poverennaya
  - Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991, Moscow, Russia; Institute of Mathematical Problems in Biology, Keldysh Branch of Institute of Applied Mathematics, Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia
- M A Roytberg
  - Institute of Mathematical Problems in Biology, Keldysh Branch of Institute of Applied Mathematics, Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia; Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia; Higher School of Economics, Moscow, 101000, Russia
3
Wu X, Hurst LD. Determinants of the Usage of Splice-Associated cis-Motifs Predict the Distribution of Human Pathogenic SNPs. Mol Biol Evol 2015; 33:518-29. [PMID: 26545919 PMCID: PMC4866546 DOI: 10.1093/molbev/msv251] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Received: 11/03/2015] [Accepted: 10/25/2015] [Indexed: 12/11/2022] Open
Abstract
Where in genes do pathogenic mutations tend to occur, and does this provide clues as to the possible underlying mechanisms by which single nucleotide polymorphisms (SNPs) cause disease? As splice-disrupting mutations tend to occur predominantly at exon ends, known also to be hot spots of cis-exonic splice control elements, we examine the relationship between the relative density of such exonic cis-motifs and pathogenic SNPs. In particular, we focus on the intragene distribution of exonic splicing enhancers (ESEs) and the covariance between them and disease-associated SNPs. In addition to showing that disease-causing genes tend to be genes with a high intron density, consistent with missplicing, we consider five factors established as trends in ESE usage: relative position in exons, relative position in genes, flanking intron size, splice site usage, and phase. We find that more than 76% of pathogenic SNPs are within 3–69 bp of exon ends, where ESEs generally reside, this being 13% more than expected. Overall, from the enrichment of pathogenic SNPs at exon ends, we estimate that approximately 20–45% of SNPs affect splicing. Importantly, we find that within genes pathogenic SNPs tend to occur in splicing-relevant regions with low ESE density: they occur preferentially in the terminal half of genes, in exons flanked by short introns, and at the ends of phase (0,0) exons with a 3′ non-“AGgt” splice site. We suggest the concept of the “fragile” exon, one that is home to pathogenic SNPs because its low ESE density makes it vulnerable to splice disruption.
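As a minimal illustration of the "within 3–69 bp of exon ends" criterion in this abstract, the Python sketch below tests whether a SNP position falls in that window. The coordinate convention (1-based, inclusive, with the boundary base itself counted as distance 1) and the function names are assumptions made for illustration; the abstract does not state how the authors counted distances.

```python
def distance_to_nearest_exon_end(exon_start: int, exon_end: int, pos: int) -> int:
    """Distance in bp from `pos` to the nearer exon boundary.

    Assumed convention: 1-based inclusive coordinates, and the first or
    last base of the exon is at distance 1.
    """
    if not exon_start <= pos <= exon_end:
        raise ValueError("position lies outside the exon")
    return min(pos - exon_start, exon_end - pos) + 1


def in_exon_end_window(exon_start: int, exon_end: int, pos: int,
                       lo: int = 3, hi: int = 69) -> bool:
    """True if `pos` is between `lo` and `hi` bp (inclusive) from an exon end."""
    return lo <= distance_to_nearest_exon_end(exon_start, exon_end, pos) <= hi


# Hypothetical exon spanning positions 100..300.
print(in_exon_end_window(100, 300, 102))  # third base of the exon
print(in_exon_end_window(100, 300, 200))  # middle of the exon
```

Counting the fraction of annotated SNPs for which this test is true, and comparing it with the fraction of exonic sequence covered by the window, gives the kind of observed-versus-expected comparison the abstract reports.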
Affiliation(s)
- XianMing Wu
  - Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
- Laurence D Hurst
  - Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
4
Zhou K, Salamov A, Kuo A, Aerts AL, Kong X, Grigoriev IV. Alternative splicing acting as a bridge in evolution. Stem Cell Investig 2015; 2:19. [PMID: 27358887 DOI: 10.3978/j.issn.2306-9759.2015.10.01] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 10/12/2015] [Accepted: 10/15/2015] [Indexed: 12/15/2022]
Abstract
BACKGROUND Alternative splicing (AS) regulates diverse cellular and developmental functions through alternative protein structures of different isoforms. Alternative exons dominate AS in vertebrates; however, very little is known about the extent and function of AS in lower eukaryotes. To understand the role of introns in gene evolution, we examined AS in one green algal and five fungal genomes using a novel EST-based gene-modeling algorithm (COMBEST). METHODS AS from each genome was classified with COMBEST, which maps EST sequences to genomes to build gene models. Various aspects of AS were analyzed through statistical methods. The interplay of intron 3n length, phase, coding property, and intron retention (RI) was examined with Chi-square testing. RESULTS With 3 to 834 times EST coverage, we identified up to 73% of AS in intron-containing genes and found a preponderance of RI among 11 types of AS. The number of exons, expression level, and maximum intron length correlated with the number of AS per gene (NAG), and intron-rich genes suppressed AS. Genes with AS were more ancient, and AS was conserved among fungal genomes. Among stopless introns, non-retained introns (NRI) avoided 3n length, whereas major RI preferred it. In contrast, stop-containing introns showed a uniform distribution among 3n, 3n+1, and 3n+2 lengths. We found a clue to the intron phase enigma: it was the coding function of introns involved in AS that dictates the intron phase bias. CONCLUSIONS The majority of AS is non-functional, and the extent of AS is suppressed for intron-rich genes. RI through 3n length, stop codon, and phase bias bridges the transition from functionless to functional alternative isoforms.
Affiliation(s)
- Kemin Zhou, Asaf Salamov, Alan Kuo, Andrea L Aerts, Xiangyang Kong, Igor V Grigoriev
  - 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
  - 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA
  - 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
5
Salinas Castellanos LC, Chomilier J, Hernández-Torres J. Recombination of chl-fus gene (Plastid Origin) downstream of hop: a locus of chromosomal instability. BMC Genomics 2015; 16:573. [PMID: 26238241 PMCID: PMC4522979 DOI: 10.1186/s12864-015-1780-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/21/2014] [Accepted: 07/14/2015] [Indexed: 11/26/2022] Open
Abstract
Background The co-chaperone Hop [heat shock protein (HSP) organizing protein] has been shown to act as an adaptor for protein folding and maturation, in concert with Hsp70 and Hsp90. The hop gene is of eukaryotic origin. Likewise, the chloroplast elongation factor G (cEF-G) catalyzes the translocation step in chloroplast protein synthesis. The chl-fus gene, which encodes the cEF-G protein, is of plastid origin. Both proteins, Hop and cEF-G, derived from domain duplications. It was demonstrated that the nuclear chl-fus gene is located in opposite orientation to a hop gene in Glycine max. We explored 53 available plant genomes, from Chlorophyta to higher plants, to determine whether the chl-fus gene was transferred directly downstream of the primordial hop in the proto-eukaryote host cell. Since both genes came from exon/module duplication events, we wanted to explore the involvement of introns in the early origin and the ensuing evolutionary changes in gene structure. Results We reconstructed the evolutionary history of the two convergent plant genes on the basis of their gene structure, microsynteny and microcolinearity, from 53 plant nuclear genomes. Despite a high degree (72%) of microcolinearity among vascular plants, our results demonstrate that their adjacency was a product of chromosomal rearrangements. Based on predicted exon-intron structures, we inferred the molecular events giving rise to the current form of the genes. We therefore propose a simple model of exon/module shuffling by intronic recombinations in which phase-0 introns were essential for domain duplication, and a phase-1 intron for transit peptide recruiting. Finally, we demonstrate a natural susceptibility of the intergenic region to recombine or delete, seriously threatening the integrity of the chl-fus gene for the future.
Conclusions Our results are consistent with the interpretation that the chl-fus gene was transferred from the chloroplast to a chromosome different from that of hop in the primitive photosynthetic eukaryote and that much later, before the appearance of angiosperms, it was recombined downstream of hop. Exon/module shuffling mediated by symmetric intron phases (i.e., phase-0 introns) was essential for gene evolution. The intergenic region is prone to recombine, risking the integrity of both genes.
Affiliation(s)
- Jacques Chomilier
  - IMPMC, UPMC, CNRS UMR 7590, MNHN, IRD, Paris, France and RPBS, Paris, France.
- Jorge Hernández-Torres
  - Laboratorio de Biología Molecular, Escuela de Biología, Universidad Industrial de Santander, Apartado Aéreo 678, Bucaramanga, Colombia.
6
Annala MJ, Parker BC, Zhang W, Nykter M. Fusion genes and their discovery using high throughput sequencing. Cancer Lett 2013; 340:192-200. [PMID: 23376639 PMCID: PMC3675181 DOI: 10.1016/j.canlet.2013.01.011] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Received: 09/17/2012] [Revised: 12/28/2012] [Accepted: 01/04/2013] [Indexed: 01/25/2023]
Abstract
Fusion genes are hybrid genes that combine parts of two or more original genes. They can form as a result of chromosomal rearrangements or abnormal transcription, and have been shown to act as drivers of malignant transformation and progression in many human cancers. The biological significance of fusion genes together with their specificity to cancer cells has made them into excellent targets for molecular therapy. Fusion genes are also used as diagnostic and prognostic markers to confirm cancer diagnosis and monitor response to molecular therapies. High-throughput sequencing has enabled the systematic discovery of fusion genes in a wide variety of cancer types. In this review, we describe the history of fusion genes in cancer and the ways in which fusion genes form and affect cellular function. We also describe computational methodologies for detecting fusion genes from high-throughput sequencing experiments, and the most common sources of error that lead to false discovery of fusion genes.
Affiliation(s)
- M J Annala
  - Tampere University of Technology, Tampere, Finland.
7
Convergent intron gains in hymenopteran elongation factor-1α. Mol Phylogenet Evol 2013; 67:266-76. [PMID: 23396205 DOI: 10.1016/j.ympev.2013.01.015] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Received: 09/04/2012] [Revised: 01/17/2013] [Accepted: 01/29/2013] [Indexed: 11/23/2022]
Abstract
The eukaryotic translation elongation factor-1α gene (eEF1A) has been used extensively in higher level phylogenetics of insects and other groups, despite being present in two or more copies in several taxa. Orthology assessment has relied heavily on the position of introns, but the basic assumption of low rates of intron loss and absence of convergent intron gains has not been tested thoroughly. Here, we study the evolution of eEF1A based on a broad sample of taxa in the insect order Hymenoptera. The gene is universally present in two copies - F1 and F2 - both of which apparently originated before the emergence of the order. An elevated ratio of non-synonymous versus synonymous substitutions and differences in rates of amino acid replacements between the copies suggest that they evolve independently, and phylogenetic methods clearly cluster the copies separately. The F2 copy appears to be ancient; it is orthologous with the copy known as F1 in Diptera, and is likely present in most insect orders. The hymenopteran F1 copy, which may or may not be unique to this order, apparently originated through retroposition and was originally intron free. During the evolution of the Hymenoptera, it has successively accumulated introns, at least three of which have appeared at the same position as introns in the F2 copy or in eEF1A copies in other insects. The sites of convergent intron gain are characterized by highly conserved nucleotides that strongly resemble specific intron-associated sequence motifs, so-called proto-splice sites. The significant rate of convergent intron gain renders intron-exon structure unreliable as an indicator of orthology in eEF1A, and probably also in other protein-coding genes.
8
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 224] [Impact Index Per Article: 18.7] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. 
There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Affiliation(s)
- Igor B Rogozin
  - National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg. 38A, Bethesda, MD 20894, USA
9
Shepard SS, McSweeny A, Serpen G, Fedorov A. Exploiting mid-range DNA patterns for sequence classification: binary abstraction Markov models. Nucleic Acids Res 2012; 40:4765-73. [PMID: 22344692 PMCID: PMC3367190 DOI: 10.1093/nar/gks154] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 01/30/2023] Open
Abstract
Messenger RNA sequences possess specific nucleotide patterns distinguishing them from non-coding genomic sequences. In this study, we explore the use of modified Markov models to analyze sequences up to 44 bp, far beyond the 8-bp limit of conventional Markov models, for exon/intron discrimination. To analyze nucleotide sequences of this length, their information content is first reduced by conversion into shorter binary patterns via the application of numerous abstraction schemes. After the conversion of genomic sequences to binary strings, homogeneous Markov models trained on the binary sequences are used to discriminate between exons and introns. We term this approach the Binary Abstraction Markov Model (BAMM). High-quality abstraction schemes for exon/intron discrimination are selected using optimization algorithms on supercomputers. The best MM classifiers are then combined using support vector machines into a single classifier. With this approach, over 95% classification accuracy is achieved without taking reading frame into account. With further development, the BAMM approach can be applied to sequences lacking the genetic code, such as ncRNAs and 5′-untranslated regions.
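The general idea described in this abstract (abstract nucleotides into a binary string, then compare Markov-model likelihoods) can be sketched in a few lines of Python. This is only a toy illustration, not the authors' implementation: the purine/pyrimidine scheme, the add-one smoothing, the model order and all function names are assumptions, and the scheme optimization and SVM combination steps mentioned in the abstract are omitted.

```python
import math

# Assumed abstraction scheme for illustration: purines -> 1, pyrimidines -> 0.
PURINE_SCHEME = {"A": "1", "G": "1", "C": "0", "T": "0"}


def abstract(seq, scheme=PURINE_SCHEME):
    """Convert a nucleotide sequence into a binary string."""
    return "".join(scheme[base] for base in seq.upper())


def train(binary_seqs, order):
    """Count next-symbol occurrences per context, with add-one pseudocounts."""
    counts = {}
    for s in binary_seqs:
        for i in range(order, len(s)):
            ctx = s[i - order:i]
            pair = counts.setdefault(ctx, [1, 1])  # [count of '0', count of '1']
            pair[int(s[i])] += 1
    return counts


def log_likelihood(binary_seq, counts, order):
    """Log-probability of a binary string under a trained model."""
    total = 0.0
    for i in range(order, len(binary_seq)):
        c0, c1 = counts.get(binary_seq[i - order:i], (1, 1))
        total += math.log((c1 if binary_seq[i] == "1" else c0) / (c0 + c1))
    return total


def classify(seq, exon_model, intron_model, order):
    """Label a sequence by the model that gives the higher likelihood."""
    b = abstract(seq)
    exon_ll = log_likelihood(b, exon_model, order)
    intron_ll = log_likelihood(b, intron_model, order)
    return "exon" if exon_ll >= intron_ll else "intron"


# Toy training data (hypothetical, chosen only to make the two classes differ).
ORDER = 2
exon_model = train([abstract("AGAGAGAGAGAG")], ORDER)
intron_model = train([abstract("CTCTCTCTCT")], ORDER)
print(classify("AGGAAG", exon_model, intron_model, ORDER))
print(classify("CTTCCT", exon_model, intron_model, ORDER))
```

Because each binary symbol carries less information than a nucleotide, a context of fixed memory cost can span more bases, which is the motivation the abstract gives for reaching beyond the 8-bp limit of conventional models.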
Affiliation(s)
- Samuel S Shepard
  - Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614, USA
10
Kapustin Y, Chan E, Sarkar R, Wong F, Vorechovsky I, Winston RM, Tatusova T, Dibb NJ. Cryptic splice sites and split genes. Nucleic Acids Res 2011; 39:5837-44. [PMID: 21470962 PMCID: PMC3152350 DOI: 10.1093/nar/gkr203] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Indexed: 01/27/2023] Open
Abstract
We describe a new program called cryptic splice finder (CSF) that can reliably identify cryptic splice sites (css), so providing a useful tool to help investigate splicing mutations in genetic disease. We report that many css are not entirely dormant and are often already active at low levels in normal genes prior to their enhancement in genetic disease. We also report a fascinating correlation between the positions of css and introns, whereby css within the exons of one species frequently match the exact position of introns in equivalent genes from another species. These results strongly indicate that many introns were inserted into css during evolution and they also imply that the splicing information that lies outside some introns can be independently recognized by the splicing machinery and was in place prior to intron insertion. This indicates that non-intronic splicing information had a key role in shaping the split structure of eukaryote genes.
Affiliation(s)
- Yuri Kapustin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, USA.
11
Martínez-Pérez F, Bendena WG, Chang BSW, Tobe SS. Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure. Peptides 2011; 32:509-17. [PMID: 20950662 DOI: 10.1016/j.peptides.2010.10.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 07/28/2010] [Revised: 10/06/2010] [Accepted: 10/06/2010] [Indexed: 02/07/2023]
Abstract
The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. A total of 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding Phe and AAA encoding Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favors a loop structure from Ser/Ala-Phe to Lys-Arg. Other loops and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site.
Affiliation(s)
- Francisco Martínez-Pérez
  - Department of Cell and Systems Biology, University of Toronto, 110 St. George St., Toronto, ON M5S 3G5, Canada
12
Mekouar M, Blanc-Lenfle I, Ozanne C, Da Silva C, Cruaud C, Wincker P, Gaillardin C, Neuvéglise C. Detection and analysis of alternative splicing in Yarrowia lipolytica reveal structural constraints facilitating nonsense-mediated decay of intron-retaining transcripts. Genome Biol 2010; 11:R65. [PMID: 20573210 PMCID: PMC2911113 DOI: 10.1186/gb-2010-11-6-r65] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Received: 03/11/2010] [Revised: 06/15/2010] [Accepted: 06/23/2010] [Indexed: 11/10/2022] Open
Abstract
Background Hemiascomycetous yeasts have intron-poor genomes with very few cases of alternative splicing. Most of the reported examples result from intron retention in Saccharomyces cerevisiae and some have been shown to be functionally significant. Here we used transcriptome-wide approaches to evaluate the mechanisms underlying the generation of alternative transcripts in Yarrowia lipolytica, a yeast highly divergent from S. cerevisiae. Results Experimental investigation of Y. lipolytica gene models identified several cases of alternative splicing, mostly generated by intron retention, principally affecting the first intron of the gene. The retention of introns almost invariably creates a premature termination codon, as a direct consequence of the structure of intron boundaries. An analysis of Y. lipolytica introns revealed that introns of multiples of three nucleotides in length, particularly those without stop codons, were underrepresented. In other organisms, premature termination codon-containing transcripts are targeted for degradation by the nonsense-mediated mRNA decay (NMD) machinery. In Y. lipolytica, homologs of S. cerevisiae UPF1 and UPF2 genes were identified, but not UPF3. The inactivation of Y. lipolytica UPF1 and UPF2 resulted in the accumulation of unspliced transcripts of a test set of genes. Conclusions Y. lipolytica is the hemiascomycete with the most intron-rich genome sequenced to date, and it has several unusual genes with large introns or alternative transcription start sites, or introns in the 5' UTR. Our results suggest Y. lipolytica intron structure is subject to significant constraints, leading to the under-representation of stop-free introns. Consequently, intron-containing transcripts are degraded by a functional NMD pathway.
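The reasoning in this abstract, that a retained intron disrupts the reading frame unless it is a stop-free multiple of three, can be made concrete with a small Python sketch. The function name, the three outcome labels and the example sequences are hypothetical; the check only looks at the retained intron itself and ignores downstream exons and any NMD rules.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def retention_outcome(upstream_cds: str, intron: str) -> str:
    """Classify the effect of retaining `intron` after `upstream_cds`.

    `upstream_cds` is the coding sequence from the start codon up to the
    intron. Returns "stop" if an in-frame stop codon lies within the
    retained intron, "frameshift" if the intron is stop-free in frame but
    its length is not a multiple of three, and "in-frame" otherwise.
    """
    phase = len(upstream_cds) % 3          # bases of an interrupted codon
    carry = upstream_cds[len(upstream_cds) - phase:] if phase else ""
    reading = (carry + intron).upper()     # sequence read in the original frame
    for i in range(0, len(reading) - 2, 3):
        if reading[i:i + 3] in STOP_CODONS:
            return "stop"
    return "in-frame" if len(intron) % 3 == 0 else "frameshift"


# Hypothetical sequences for illustration.
print(retention_outcome("ATGGCC", "GTAAGTTAG"))   # stop codon read in frame
print(retention_outcome("ATGGCC", "GTAAGTCAG"))   # stop-free, length 9
print(retention_outcome("ATGGCC", "GTACGTCCAG"))  # stop-free, length 10
```

Under this simplified view, only the "in-frame" class could yield a transcript that escapes a premature termination codon, which is the class the abstract reports as underrepresented.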
Affiliation(s)
- Meryem Mekouar
  - INRA UMR1319 Micalis - AgroParisTech, Biologie intégrative du métabolisme lipidique microbien, Bât. CBAI, 78850 Thiverval-Grignon, France
13
Babenko V, Ward W, Ruvinsky A. Does drive toward canonic exonic splicing sites exist in mammals? J Mol Evol 2010; 70:387-94. [PMID: 20336453 DOI: 10.1007/s00239-010-9336-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 08/27/2009] [Accepted: 03/08/2010] [Indexed: 11/30/2022]
Abstract
About 2/3 of introns are inserted between G and G/A, which has previously been explained by codon usage frequencies existing during the period of intron insertions. However, less is known about the evolution of exonic splicing sites. Exonic nucleotides that frame introns are involved in both protein coding and splicing. While a compromise between protein coding and splicing constraints is achieved differently in each intron phase, AG|G is the most common site in all phases, comprising about one quarter of all such sites. There is also a great variety of other splicing sites. Here we examine evolutionary changes in exonic nucleotides located at positions -2 -1|+1 which occurred after the beginning of the eutherian radiation, using comparisons of orthologous splicing sites from five mammalian species. AG|G accumulated fewer substitutions and was more conservative than less frequent exonic splicing sites. Such a trend could potentially increase frequencies of AG|G during mammalian evolution and cause a decline of less common sites, which had higher substitution rates. However, there is a limit to this process, determined by the dynamic equilibrium of substitution rates and the frequencies of different splicing sites. It seems that this equilibrium was already achieved at the time of the eutherian radiation, and a moderate increase in AG|G frequency was observed only in the human genome.
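The exonic splicing site notation used in this abstract (positions -2 -1|+1, e.g. AG|G) can be illustrated with a short Python sketch that extracts the site at each exon-exon junction and tabulates relative frequencies. The function names and the toy junctions are hypothetical; real input would come from annotated exon sequences.

```python
from collections import Counter


def exonic_splice_site(upstream_exon: str, downstream_exon: str) -> str:
    """Return the exonic nucleotides at -2 -1|+1 around an intron.

    The last two bases of the upstream exon and the first base of the
    downstream exon are joined with '|' marking the intron position.
    """
    up, down = upstream_exon.upper(), downstream_exon.upper()
    if len(up) < 2 or len(down) < 1:
        raise ValueError("exons too short to define the -2 -1|+1 site")
    return f"{up[-2:]}|{down[0]}"


def site_frequencies(junctions):
    """Relative frequency of each exonic splicing site over all junctions."""
    counts = Counter(exonic_splice_site(u, d) for u, d in junctions)
    total = sum(counts.values())
    return {site: n / total for site, n in counts.items()}


# Hypothetical (upstream exon, downstream exon) pairs.
junctions = [("ATGCAG", "GTT"), ("CCAAG", "GCA"), ("TTTCT", "AAA"), ("GGGAG", "ACC")]
print(site_frequencies(junctions))
```

Running the same tabulation on orthologous junctions from several species would give the per-site frequencies whose changes the abstract compares.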
Affiliation(s)
- Vladimir Babenko
  - The Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk-90, Russia
14
Abstract
FGLamide allatostatins are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders and also show myomodulatory activity. The FGLamide allatostatin (AST) gene structure in Dictyoptera is intronless within the ORF, whereas in 9 species of Diptera, the FGLamide AST ORF has one intron. To investigate the evolutionary history of AST intron structure (the intron-early versus intron-late hypotheses), all available Arthropoda FGLamide AST gene sequences from genome databases were examined with reference to intron presence and position/phase. Three types of FGLamide AST ORF organization were found: intronless in I. scapularis and P. humanus corporis; one intron in D. pulex, A. pisum, A. mellifera and five Drosophila sp.; two introns in N. vitripennis, B. mori strains, A. aegypti, A. gambiae and C. quinquefasciatus. The literature suggests that, for the majority of genes examined, most introns lie between codons (phase 0), which may reflect an ancient function of introns to separate protein modules. Of the FGLamide AST ORF introns, 60% were between the first and second base within a codon (phase 1), 28% were between the second and third nucleotides within a codon (phase 2) and 12% were phase 0. As would be required for a correct intron splicing consensus sequence, 84% of introns were in codons starting with guanine. Introns were positioned at most 9 codons from a dibasic cleavage site. Our results suggest that the introns in the analyzed species support the intron-late model.
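The phase definitions given in this abstract (phase 0 between codons, phase 1 after the first base of a codon, phase 2 after the second) reduce to a remainder calculation, sketched below in Python. The function names and the example positions are hypothetical and are not data from the study.

```python
from collections import Counter


def intron_phase(coding_bases_before_intron: int) -> int:
    """Phase of an intron given the number of ORF bases preceding it.

    0: the intron lies between two codons
    1: the intron lies between the first and second base of a codon
    2: the intron lies between the second and third base of a codon
    """
    if coding_bases_before_intron < 0:
        raise ValueError("base count must be non-negative")
    return coding_bases_before_intron % 3


def phase_distribution(positions):
    """Percentage of introns in each phase for a list of ORF positions."""
    counts = Counter(intron_phase(p) for p in positions)
    total = len(positions)
    return {phase: 100.0 * counts.get(phase, 0) / total for phase in (0, 1, 2)}


# Hypothetical intron positions, counted in ORF bases upstream of each intron.
print(phase_distribution([3, 4, 4, 5]))
```

Applying `phase_distribution` to the intron positions of a gene family gives the kind of per-phase percentages the abstract reports.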
|
15
|
Artamonova II, Gelfand MS. Comparative Genomics and Evolution of Alternative Splicing: The Pessimists' Science. Chem Rev 2007; 107:3407-30. [PMID: 17645315 DOI: 10.1021/cr068304c] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Indexed: 11/29/2022]
Affiliation(s)
- Irena I Artamonova
- Group of Bioinformatics, Vavilov Institute of General Genetics, RAS, Gubkina 3, Moscow 119991, Russia
|
16
|
Vigneault F, Lachance D, Cloutier M, Pelletier G, Levasseur C, Séguin A. Members of the plant NIMA-related kinases are involved in organ development and vascularization in poplar, Arabidopsis and rice. THE PLANT JOURNAL: FOR CELL AND MOLECULAR BIOLOGY 2007; 51:575-88. [PMID: 17886359 DOI: 10.1111/j.1365-313x.2007.03161.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Indexed: 05/07/2023]
Abstract
NIMA-related kinases (Neks) are a family of serine/threonine kinases that have been linked to cell-cycle regulation in fungi and mammals. Information regarding the function of Neks in plants is very limited. We screened the three plant species that have had their genomes sequenced in an attempt to improve our understanding of their role in plants. We retrieved seven members in Arabidopsis thaliana, nine in Populus trichocarpa and six in Oryza sativa. Phylogenetic analysis showed that plant Neks are closely related to each other and contain paralogous genes. Moreover, their chromosome distribution and their exon-intron structure revealed that the present-day plant Nek family was derived from a single representative followed by large segmental duplication events. Functional expression analyses in the three species relied on RT-qPCR in poplar and publicly available microarray data for Arabidopsis and rice. Although plant Neks are present in every organ analyzed, their expression profiles suggest their involvement in plant development processes. Furthermore, we showed that PNek1, a member of the poplar family, is expressed at sites of free auxin synthesis and is specifically involved during the vascularization process.
Affiliation(s)
- Frédéric Vigneault
- Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, 1055 du P.E.P.S., PO Box 10380, Stn. Sainte-Foy, Quebec, QC, Canada G1V 4C7
|
17
|
De Kee DW, Gopalan V, Stoltzfus A. A Sequence-Based Model Accounts Largely for the Relationship of Intron Positions to Protein Structural Features. Mol Biol Evol 2007; 24:2158-68. [PMID: 17646255 DOI: 10.1093/molbev/msm151] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Indexed: 11/13/2022] Open
Abstract
Claims of intron-structure correlations have played a major role in debates surrounding split gene origins. In the formative (as opposed to disruptive or "insertional") model of split gene origins, introns represent the scars of chimaeric gene assembly. When analyzed retrospectively, formative introns should tend to fall between modular units, if such units exist, or at least to exhibit a preference for sites favorable to chimaera formation. However, there is another possible source of preferences: under a disruptive model of split gene origins, fortuitous intron-structure correlations may arise because the gain of introns is biased with respect to flanking nucleotide sequences. To investigate the extent to which a sequence-biased intron gain model may account for the present-day distribution of introns, data on over 10,000 introns in eukaryotic protein-coding genes were integrated with structural data from a set of 1,851 nonredundant protein chains. The positions of introns with respect to secondary structures, solvent accessibility, and so-called "modules" were evaluated relative to the expectations of a null model, a disruptive model based on amino acid frequencies at splice junctions, and a formative model defined relative to these. The null model can be excluded for most structural features and is highly improbable when intron sites are grouped by reading frame phase. Phase-dependent correlations with secondary structure and side-chain surface accessibility are particularly strong. However, these phase-dependent correlations are explained largely by the sequence-based disruptive model.
Affiliation(s)
- Danny W De Kee
- Center for Advanced Research in Biotechnology, Rockville, MD, USA
|
18
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
|
19
|
Nguyen HD, Yoshihama M, Kenmochi N. Phase distribution of spliceosomal introns: implications for intron origin. BMC Evol Biol 2006; 6:69. [PMID: 16959043 PMCID: PMC1574350 DOI: 10.1186/1471-2148-6-69] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Received: 08/08/2006] [Accepted: 09/08/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The origin of spliceosomal introns is the central subject of the introns-early versus introns-late debate. The distribution of intron phases is non-uniform, with an excess of phase-0 introns. Introns-early explains this by speculating that a fraction of present-day introns were present between minigenes in the progenote and therefore must lie in phase-0. In contrast, introns-late predicts that the nonuniformity of intron phase distribution reflects the nonrandomness of intron insertions. RESULTS In this paper, we tested the two theories using analyses of intron phase distribution. We inferred the evolution of intron phase distribution from a dataset of 684 gene orthologs from seven eukaryotes using a maximum likelihood method. We also tested whether the observed intron phase distributions from 10 eukaryotes can be explained by intron insertions on a genome-wide scale. In contrast to the prediction of introns-early, the inferred evolution of intron phase distribution showed that the proportion of phase-0 introns increased over evolution. Consistent with introns-late, the observed intron phase distributions matched those predicted by an intron insertion model quite well. CONCLUSION Our results strongly support the introns-late hypothesis of the origin of spliceosomal introns.
Affiliation(s)
- Hung D Nguyen
- Frontier Science Research Center, University of Miyazaki, 5200 Kihara, Kiyotake, Miyazaki 889-1692, Japan
- Maki Yoshihama
- Frontier Science Research Center, University of Miyazaki, 5200 Kihara, Kiyotake, Miyazaki 889-1692, Japan
- Naoya Kenmochi
- Frontier Science Research Center, University of Miyazaki, 5200 Kihara, Kiyotake, Miyazaki 889-1692, Japan
|
20
|
Ruvinsky A, Ward W. A gradient in the distribution of introns in eukaryotic genes. J Mol Evol 2006; 63:136-41. [PMID: 16736103 DOI: 10.1007/s00239-005-0261-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Received: 11/03/2005] [Accepted: 02/13/2006] [Indexed: 10/24/2022]
Abstract
The majority of eukaryotic genes consist of exons and introns. Introns can be inserted either between codons (phase 0) or within codons, after the first nucleotide (phase 1) and after the second (phase 2). We report here that the frequency of phase 0 increases and phase 1 declines from the 5' region to the 3' end of genes. This trend is particularly noticeable in genomes of Homo sapiens and Arabidopsis thaliana, in which gains of novel introns in the 3' portion of genes were probably a dominant process. Similar but more moderate gradients exist in Drosophila melanogaster and Caenorhabditis elegans genomes, where the accumulation of novel introns was not a prevailing factor. There are nine types of exons, three symmetric (0,0; 1,1; 2,2) and six asymmetric (0,1; 1,0; 1,2; 2,1; 2,0; 0,2). Assuming random distribution of different types of introns along genes, one can expect the frequencies of asymmetric exons such as 0,1 and 1,0 or 1,2 and 2,1 to be approximately equal, allowing for some variation caused by randomness. The gradient in intron distribution leads to a small but consistent and statistically significant bias: phase 1 introns are more likely at the 5' ends and phase 0 introns are more likely at the 3' ends of asymmetric exons. For the same reason, the frequency of 0,0 exons increases and the frequency of 1,1 exons decreases in the 3' direction, at least in H. sapiens and A. thaliana. The number of introns per gene also affects the distribution and frequency of phase 0 and 1 introns. The gradient provides an insight into the evolution of intron-exon structures of eukaryotic genes.
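The phase and exon-type definitions in this abstract can be made concrete with a short sketch. It assumes an intron position is given as the number of coding nucleotides upstream of it, and that an exon type is written as (phase of the 5' flanking intron, phase of the 3' flanking intron); the function names are hypothetical and the ordering convention is an assumption inferred from the abstract's wording.

```python
from itertools import product


def intron_phase(coding_bases_before: int) -> int:
    """Phase of an intron from the count of coding nucleotides upstream of it.

    0: between codons; 1: after the first nucleotide of a codon;
    2: after the second nucleotide of a codon.
    """
    if coding_bases_before < 0:
        raise ValueError("coding_bases_before must be non-negative")
    return coding_bases_before % 3


def exon_types():
    """Split the nine (5' phase, 3' phase) exon types into symmetric and asymmetric."""
    pairs = list(product(range(3), repeat=2))
    symmetric = [p for p in pairs if p[0] == p[1]]
    asymmetric = [p for p in pairs if p[0] != p[1]]
    return symmetric, asymmetric


def internal_exon_types(coding_exon_lengths):
    """Exon type of each internal exon, given coding lengths of consecutive exons."""
    phases, total = [], 0
    for length in coding_exon_lengths[:-1]:  # one intron follows every exon but the last
        total += length
        phases.append(intron_phase(total))
    # Internal exon i is flanked by intron i-1 (5' side) and intron i (3' side).
    return list(zip(phases[:-1], phases[1:]))


symmetric, asymmetric = exon_types()
```

For example, a gene whose coding exons are 100, 50, 30 and 20 nucleotides long has introns of phase 1, 0 and 0, so its two internal exons are of types (1, 0) and (0, 0).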
Affiliation(s)
- A Ruvinsky
- The Institute for Genetics and Bioinformatics, University of New England, Armidale, 2351, NSW, Australia.
|
21
|
Abstract
There has been a lively debate over the evolution of eukaryote introns: at what point in the tree of life did they appear and from where, and what has been their subsequent pattern of loss and gain? A diverse range of recent research papers is relevant to this debate, and it is timely to bring them together. The absence of introns that are not self-splicing in prokaryotes and several other lines of evidence suggest an ancient eukaryotic origin for these introns, and the subsequent gain and loss of introns appears to be an ongoing process in many organisms. Some introns are now functionally important and there have been suggestions that invoke natural selection for the ancient and recent gain of introns, but it is also possible that fixation and loss of introns can occur in the absence of positive selection.
Affiliation(s)
- R Belshaw
- Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK.
|
22
|
Abstract
The origins and importance of spliceosomal introns comprise one of the longest-abiding mysteries of molecular evolution. Considerable debate remains over several aspects of the evolution of spliceosomal introns, including the timing of intron origin and proliferation, the mechanisms by which introns are lost and gained, and the forces that have shaped intron evolution. Recent important progress has been made in each of these areas. Patterns of intron-position correspondence between widely diverged eukaryotic species have provided insights into the origins of the vast differences in intron number between eukaryotic species, and studies of specific cases of intron loss and gain have led to progress in understanding the underlying molecular mechanisms and the forces that control intron evolution.
Affiliation(s)
- Scott William Roy
- Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
|
23
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that present-day organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to originate by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always show detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous if expression patterns were different at the time the introns arose. In fact, this may well be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
|