1
|
Affiliation(s)
- Alexander F. Palazzo
- University of Toronto, Department of Biochemistry, Toronto, Ontario, Canada
- * E-mail: (AP); (TG)
| | - T. Ryan Gregory
- University of Guelph, Department of Integrative Biology, Guelph, Ontario, Canada
- * E-mail: (AP); (TG)
| |
Collapse
|
2
|
Carmel L, Rogozin IB, Wolf YI, Koonin EV. Evolutionarily conserved genes preferentially accumulate introns. Genes Dev 2007; 17:1045-50. [PMID: 17495009 PMCID: PMC1899115 DOI: 10.1101/gr.5978207] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2006] [Accepted: 03/28/2007] [Indexed: 11/24/2022]
Abstract
Introns that interrupt eukaryotic protein-coding sequences are generally thought to be nonfunctional. However, for reasons still poorly understood, positions of many introns are highly conserved in evolution. Previous reconstructions of intron gain and loss events during eukaryotic evolution used a variety of simplified evolutionary models that yielded contradicting conclusions and are not suited to reveal some of the key underlying processes. We combine a comprehensive probabilistic model and an extended data set, including 391 conserved genes from 19 eukaryotes, to uncover previously unnoticed aspects of intron evolution--in particular, to assign intron gain and loss rates to individual genes. The rates of intron gain and loss in a gene show moderate positive correlation. A gene's intron gain rate shows a highly significant negative correlation with the coding-sequence evolution rate; intron loss rate also significantly, but positively, correlates with the sequence evolution rate. Correlations of the opposite signs, albeit less significant ones, are observed between intron gain and loss rates and gene expression level. It is proposed that intron evolution includes a neutral component, which is manifest in the positive correlation between the gain and loss rates and a selection-driven component as reflected in the links between intron gain and loss and sequence evolution. The increased intron gain and decreased intron loss in evolutionarily conserved genes indicate that intron insertion often might be adaptive, whereas some of the intron losses might be deleterious. This apparent functional importance of introns is likely to be due, at least in part, to their multiple effects on gene expression.
Collapse
Affiliation(s)
- Liran Carmel
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Yuri I. Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| |
Collapse
|
3
|
Whamond GS, Thornton JM. An analysis of intron positions in relation to nucleotides, amino acids, and protein secondary structure. J Mol Biol 2006; 359:238-47. [PMID: 16616935 DOI: 10.1016/j.jmb.2006.03.029] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2005] [Revised: 03/05/2006] [Accepted: 03/15/2006] [Indexed: 10/24/2022]
Abstract
We present an analysis of intron positions in relation to nucleotides, amino acid residues, and protein secondary structure. Previous work has shown that intron sites in proteins are not randomly distributed with respect to secondary structures. Here we show that this preference can be almost totally explained by the nucleotide bias of splice site machinery, and may well not relate to protein stability or conformation at all. Each intron phase is preferentially associated with its own set of residues: phase 0 introns with lysine, glutamine, and glutamic acid before the intron, and valine after; phase 1 introns with glycine, alanine, valine, aspartic acid, and glutamic acid; and phase 2 introns with arginine, serine, lysine, and tryptophan. These preferences can be explained principally on the basis of nucleotide bias at intron locations, which is in accordance with previous literature. Although this work does not prove that introns are inserted into genomes at specific proto-splice sites, it shows that the nucleotide bias surrounding introns, however it originally occurred, explains the observed correlations between introns and protein secondary structure.
Collapse
Affiliation(s)
- Gordon S Whamond
- Department of Biochemistry and Molecular Biology, University College London, UK.
| | | |
Collapse
|
4
|
Abstract
The evolutionary origin of spliceosomal introns remains elusive. The startling success of a new way of predicting intron sites suggests that the splicing machinery determines where introns are added to genes.
Collapse
Affiliation(s)
- Arlin Stoltzfus
- Center for Advanced Research in Biotechnology (CARB), 9600 Gudelsky Drive, Rockville, Maryland 20850, USA.
| |
Collapse
|
5
|
Sverdlov AV, Rogozin IB, Babenko VN, Koonin EV. Evidence of splice signal migration from exon to intron during intron evolution. Curr Biol 2004; 13:2170-4. [PMID: 14680632 DOI: 10.1016/j.cub.2003.12.003] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
A comparison of the nucleotide sequences around the splice junctions that flank old (shared by two or more major lineages of eukaryotes) and new (lineage-specific) introns in eukaryotic genes reveals substantial differences in the distribution of information between introns and exons. Old introns have a lower information content in the exon regions adjacent to the splice sites than new introns but have a corresponding higher information content in the intron itself. This suggests that introns insert into nonrandom (proto-splice) sites but, during the evolution of an intron after insertion, the splice signal shifts from the flanking exon regions to the ends of the intron itself. Accumulation of information inside the intron during evolution suggests that new introns largely emerge de novo rather than through propagation and migration of old introns.
Collapse
Affiliation(s)
- Alexander V Sverdlov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
6
|
Blasczyk R, Kotsch K, Wehling J. The nature of polymorphism of the HLA-DRB intron sequences is lineage specific. TISSUE ANTIGENS 1998; 52:19-26. [PMID: 9714470 DOI: 10.1111/j.1399-0039.1998.tb03019.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The sequence database of HLA-DRB genes is mainly derived from mRNA analysis or has focused exclusively on the polymorphism of the 2nd exon. Little is known about the non-coding sequences of the different DRB alleles which represent about 94% of the genes. In this study we have determined the sequence of the 3' 500 bp intron 1 fragment adjacent to exon 2 in all serologically defined HLA-DRB genes and their most frequent allelic subtypes. The intron sequences turned out to be highly polymorphic. Similar to the class I introns, this variability was not characterized by random point mutations but by a highly systematic diversity reflecting the lineage-specific relationship of the HLA-DR alleles. With a few exceptions in DRBI*15, 13 and 08 as well as DRB4 and 5, the variability mirrors the serological diversity. As well as delivering insight into the genetic relationship between the different DRB alleles, these sequences will provide an extremely valuable basis for developing advanced DRB sequencing strategies for clinical purposes.
Collapse
Affiliation(s)
- R Blasczyk
- Department of Internal Medicine, Blood Bank, Charité, Campus Virchow-Klinikum, Humboldt-University, Berlin, Germany.
| | | | | |
Collapse
|
7
|
Chen WM, Hsieh HM, Huang PC. Type 2 rice metallothionein-like gene has two introns. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1998; 8:223-8. [PMID: 10520450 DOI: 10.3109/10425179809008455] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
A type 2 rice metallothionein-like gene was isolated from root by PCR and sequenced. The PCR fragment was designated as pcr1460, which overlaps with OsMT-2, a cDNA sequence previously characterized, with the presence of two additional segments, 583 and 613 bp in length. These segments are recognized as introns which divide the coding sequence into three exons, 65, 78 and 106 base pairs in length. The sequences flanking the introns conform with the GT/AG rule for splice junctions, and one exonic open reading frame can be identified in each of the introns. The observation that MT-like gene has two introns is the first of such a finding obtained from monocotyledonous plants.
Collapse
Affiliation(s)
- W M Chen
- Department of Life Science, National Tsing Hua University, Taiwan, ROC
| | | | | |
Collapse
|
8
|
Bagavathi S, Malathi R. Introns and protein revolution--an analysis of the exon/intron organisation of actin genes. FEBS Lett 1996; 392:63-5. [PMID: 8769316 DOI: 10.1016/0014-5793(96)00769-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
A catalogue of intron positions obtained from a large number of actin genes has been compiled with a view to understanding the possible origin of intervening sequences. Actins are ubiquitous proteins conserved in evolution and an analysis of their gene structures from various organisms has revealed that there may be at least 25 intron positions distributed at different positions in the coding regions. A comparison of intron positions from a wide range of organisms from that of yeast to human actins shows that introns could be more ancestral in origin. The conservation in the observed intron patterns within the different tissue types hints at a possible functional significance of introns in present day actin genes.
Collapse
Affiliation(s)
- S Bagavathi
- Department of Genetics, Dr. ALM Post Graduate Institute of Basic Medical Sciences, University of Madras, India
| | | |
Collapse
|
9
|
Stoltzfus A, Spencer DF, Zuker M, Logsdon JM, Doolittle WF. Testing the exon theory of genes: the evidence from protein structure. Science 1994; 265:202-7. [PMID: 8023140 DOI: 10.1126/science.8023140] [Citation(s) in RCA: 158] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
A tendency for exons to correspond to discrete units of protein structure in protein-coding genes of ancient origin would provide clear evidence in favor of the exon theory of genes, which proposes that split genes arose not by insertion of introns into unsplit genes, but from combinations of primordial mini-genes (exons) separated by spacers (introns). Although putative examples of such correspondence have strongly influenced previous debate on the origin of introns, a general correspondence has not been rigorously proved. Objective methods for detecting correspondences were developed and applied to four examples that have been cited previously as evidence of the exon theory of genes. No significant correspondence between exons and units of protein structure was detected, suggesting that the putative correspondence does not exist and that the exon theory of genes is untenable.
Collapse
Affiliation(s)
- A Stoltzfus
- Department of Biochemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | | | | | | | | |
Collapse
|
10
|
Affiliation(s)
- D A Hickey
- Department of Biology, University of Ottawa, Ontario, Canada
| |
Collapse
|
11
|
Tittiger C, Whyard S, Walker VK. A novel intron site in the triosephosphate isomerase gene from the mosquito Culex tarsalis. Nature 1993; 361:470-2. [PMID: 8429888 DOI: 10.1038/361470a0] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
The origin and function of introns in eukaryotic genes has provoked considerable debate since their discovery in 1977. Central to this issue are studies on the highly conserved enzyme, triosephosphate isomerase (TPI, EC 5.3.1.1). The 'introns early' argument suggests that introns are as old as the genes themselves and that the apparent correlation of many of the intron sites in plant, animal and fungal TPI genes with the boundaries of modules is evidence of the assembly of ancient proteins by exon shuffling. In contrast, the 'introns late' view holds that ancient genomes contained few if any introns; introns were inserted into pre-existing genes during the last billion years. We have found that the TPI gene from the mosquito, Culex tarsalis, contains an intron in a unique position that was predicted by W. Gilbert and the exon shuffling hypothesis.
Collapse
Affiliation(s)
- C Tittiger
- Department of Biology, Queen's University, Kingston, Ontario, Canada
| | | | | |
Collapse
|
12
|
Stephens RM, Schneider TD. Features of spliceosome evolution and function inferred from an analysis of the information at human splice sites. J Mol Biol 1992; 228:1124-36. [PMID: 1474582 DOI: 10.1016/0022-2836(92)90320-j] [Citation(s) in RCA: 220] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
An information analysis of the 5' (donor) and 3' (acceptor) sequences spanning the ends of nearly 1800 human introns has provided evidence for structural features of splice sites that bear upon spliceosome evolution and function: (1) 82% of the sequence information (i.e. sequence conservation) at donor junctions and 97% of the sequence information at acceptor junctions is confined to the introns, allowing codon choices throughout exons to be largely unrestricted. The distribution of information at intron-exon junctions is also described in detail and compared with footprints. (2) Acceptor sites are found to possess enough information to be located in the transcribed portion of the human genome, whereas donor sites possess about one bit less than the information needed to locate them independently. This difference suggests that acceptor sites are located first in humans and, having been located, reduce by a factor of two the number of alternative sites available as donors. Direct experimental evidence exists to support this conclusion. (3) The sequences of donor and acceptor splice sites exhibit a striking similarity. This suggests that the two junctions derive from a common ancestor and that during evolution the information of both sites shifted onto the intron. If so, the protein and RNA components that are found in contemporary spliceosomes, and which are responsible for recognizing donor and acceptor sequences, should also be related. This conclusion is supported by the common structures found in different parts of the spliceosome.
Collapse
Affiliation(s)
- R M Stephens
- National Cancer Institute, Frederick Cancer Research and Development Center, Laboratory of Mathematical Biology, MD 21702-1201
| | | |
Collapse
|
13
|
Abstract
Nonhomologous fully sequenced human protein-coding genes were studied. Three sets of exon-exon junctions were formed defined by the intron (shadow) position relative to the reading frame. For the analysis of intron shadow signals in exons, information content and discrimination energy approaches were used with the correction allowing one to ignore the influence of a protein-coding message. The corrected formulas allow one to define the consensuses for the three types of intron shadow signals as a AG/guwn, cAG/GUnn, and cAG/gunU, and provide better recognition than the original formulas. The analysis of the codon usage in the signal positions leads to the conclusion that the prevalence of some amino acids in corresponding protein sites is caused by the signal requirements and not vice versa. The distribution of potential intron shadow signals in exons contradicts the hypothesis of intron insertion into suitable preexisting sites. There exists a correlation between the intron types and/or the exon length modulo 3.
Collapse
Affiliation(s)
- M S Gelfand
- Institute of Protein Research, Russia Academy of Sciences, Pushchino, Moscow Region
| |
Collapse
|
14
|
Fedorov A, Suboch G, Bujakov M, Fedorova L. Analysis of nonuniformity in intron phase distribution. Nucleic Acids Res 1992; 20:2553-7. [PMID: 1598214 PMCID: PMC312392 DOI: 10.1093/nar/20.10.2553] [Citation(s) in RCA: 57] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling.
Collapse
Affiliation(s)
- A Fedorov
- Department of Molecular Basis of Human Genetics, Academy of Sciences of Russia, Moscow
| | | | | | | |
Collapse
|
15
|
Abstract
This paper summarizes some recent theories about the evolution of transposable genetic elements in outbreeding, sexual eukaryotic organisms. The evolutionary possibilities available to self-replicating transposable elements are shown to vary depending on the reproductive biology of the host genome. This effect can be used to explain, in part, the differences in abundance of transposable elements between prokaryotes and eukaryotes. It is argued that the pattern of sexual outbreeding seen in mammals and plants is especially favorable to the spread of transposons. Moreover, because transposon spread is facilitated by zygote formation, the evolutionary origin of sexual conjugation may have been due to selection on transposon-encoded genes. Finally, evidence is also presented that introns could have originated as transposable genetic elements.
Collapse
Affiliation(s)
- D A Hickey
- Department of Biology, University of Ottawa, Ontario, Canada
| |
Collapse
|
16
|
|
17
|
Abstract
Bacteriophage T4 possesses three self-splicing group I introns. Two of the three introns are mobile elements; the third, in the gene encoding a subunit of the phage nucleotide reductase (nrdB), is not mobile. Because intron mobility offers a reasonable explanation for the paradoxical occurrence of large intervening sequences in a space-efficient eubacterial phage, it is puzzling that the nrdB intron is not mobile like its compatriots. We have discovered a larger nrdB intron in a closely related phage, and we infer from comparative sequence data that the T4 intron is a deletion mutant derived from this larger intron. This larger nrdB intron encodes an open reading frame of 269 codons, which we have cloned and overexpressed. The overexpressed protein shows a dsDNA endonuclease activity specific for the intronless nrdB gene, typical of mobile introns. Thus, we believe that all three introns of T4 are or were mobile "infectious introns" and that they have entered into and been maintained in the phage population by virtue of this efficient mobility.
Collapse
Affiliation(s)
- S R Eddy
- Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder 80309-0347
| | | |
Collapse
|
18
|
Abstract
In a recent paper, Walter Gilbert's group has estimated the number of original exons from which all extant proteins might have been constructed. The approach used is subjected to a critical analysis here. It is shown that there are flawed assumptions about both the mechanism and generality of exon-shuffling and in the sequence comparison procedures employed, the latter failing to distinguish chance similarity from similarity due to common ancestry. These methodological errors lead to the omission of many known cases of exon-shuffling and the inclusion of others which may not be genuine. In consequence, the analysis from the Gilbert group cannot give a reliable estimate of those modules that actually participated in exon-shuffling and provides no information on the number of protein archetypes that did not participate in these processes.
Collapse
Affiliation(s)
- L Patthy
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest
| |
Collapse
|
19
|
Abstract
If genes have been assembled from exon subunits, the frequency with which exons are reused leads to an estimate of the size of the underlying exon universe. An exon database was constructed from available protein sequences, and homologous exons were identified on the basis of amino acid identity; statistically significant matches were determined by Monte Carlo methods. It is estimated that only 1000 to 7000 exons were needed to construct all proteins.
Collapse
Affiliation(s)
- R L Dorit
- Department of Cellular and Developmental Biology, Harvard University, Cambridge, MA 02138
| | | | | |
Collapse
|
20
|
Abstract
Previous theories have suggested that some introns with the ability to self-splice are derived from transposable elements. However, an interpretation is given here that suggests retrotransposons and retroviruses (transposable elements which move via RNA intermediates) have evolved from self-splicing introns. This is based on the involvement of RNA intermediates, the ancestral nature of the self-splicing reaction, and the assumed presence of introns in an RNA world. Conserved sequences within the introns, essential for splicing, and their wide phylogenetic distribution also make it unlikely that they are descended from transposable elements. Mitochondrial plasmids of Neurospora species containing features of both introns and retrotransposons have a central role in the resolution of the problem and are considered here to support the view that introns are, or have been, sources of mobile elements. The possibility of other transposable elements arising from introns is also considered.
Collapse
Affiliation(s)
- R E Hickson
- Department of Microbiology and Genetics, Massey University, Palmerston North, New Zealand
| |
Collapse
|
21
|
Quirk SM, Bell-Pedersen D, Belfort M. Intron mobility in the T-even phages: high frequency inheritance of group I introns promoted by intron open reading frames. Cell 1989; 56:455-65. [PMID: 2644046 DOI: 10.1016/0092-8674(89)90248-1] [Citation(s) in RCA: 118] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Intron mobility in the T-even phages has been demonstrated. Efficient nonreciprocal conversion of intron minus (In-) alleles to intron plus (In+) occurred for the td and sunY genes, but not for nrdB. Conversion to In+ was absolutely dependent on expression of the respective intron open reading frame (ORF). Introns were inserted at their cognate sites in an intronless phage genome via an RNA-independent, DNA-based, duplicative recombination event that was stimulated by exon homology. The td intron ORF product directs the endonucleolytic cleavage of DNA, targeting the site of intron integration. A 21 nucleotide deletion of the integration site abolished high frequency intron inheritance. These experiments provide a novel example of gene conversion in prokaryotes, while suggesting a molecular rationale for the inconsistent distribution of introns within highly conserved exon contexts of the T-even phage genomes.
Collapse
Affiliation(s)
- S M Quirk
- Wadsworth Center for Laboratories and Research, New York State Department of Health, Albany 12201-0509
| | | | | |
Collapse
|
22
|
Chauvet G. REMOVED: Bibliography. Mol Cells 1986. [DOI: 10.1016/b978-0-08-041992-3.50031-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|