1
|
Abstract
BACKGROUND The evolution of spliceosomal introns has been widely studied among various eukaryotic groups. Researchers nearly reached the consensuses on the pattern and the mechanisms of intron losses and gains across eukaryotes. However, according to previous studies that analyzed a few genes or genomes, Nematoda seems to be an eccentric group. RESULTS Taking advantage of the recent accumulation of sequenced genomes, we extensively analyzed the intron losses and gains using 104 nematode genomes across all the five Clades of the phylum. Nematodes have a wide range of intron density, from less than one to more than nine per kbp coding sequence. The rates of intron losses and gains exhibit significant heterogeneity both across different nematode lineages and across different evolutionary stages of the same lineage. The frequency of intron losses far exceeds that of intron gains. Five pieces of evidence supporting the model of cDNA-mediated intron loss have been observed in ten Caenorhabditis species, the dominance of the precise intron losses, frequent loss of adjacent introns, high-level expression of the intron-lost genes, preferential losses of short introns, and the preferential losses of introns close to 3'-ends of genes. Like studies in most eukaryotic groups, we cannot find the source sequences for the limited number of intron gains detected in the Caenorhabditis genomes. CONCLUSIONS These results indicate that nematodes are a typical eukaryotic group rather than an outlier in intron evolution.
Collapse
Affiliation(s)
- Ming-Yue Ma
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - Ji Xia
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - Kun-Xian Shu
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
2
|
Abbas MN, Chlastáková A, Jmel MA, Iliaki-Giannakoudaki E, Chmelař J, Kotsyfakis M. Serpins in Tick Physiology and Tick-Host Interaction. Front Cell Infect Microbiol 2022; 12:892770. [PMID: 35711658 PMCID: PMC9195624 DOI: 10.3389/fcimb.2022.892770] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 04/22/2022] [Indexed: 11/13/2022] Open
Abstract
Tick saliva has been extensively studied in the context of tick-host interactions because it is involved in host homeostasis modulation and microbial pathogen transmission to the host. Accumulated knowledge about the tick saliva composition at the molecular level has revealed that serine protease inhibitors play a key role in the tick-host interaction. Serpins are one highly expressed group of protease inhibitors in tick salivary glands, their expression can be induced during tick blood-feeding, and they have many biological functions at the tick-host interface. Indeed, tick serpins have an important role in inhibiting host hemostatic processes and in the modulation of the innate and adaptive immune responses of their vertebrate hosts. Tick serpins have also been studied as potential candidates for therapeutic use and vaccine development. In this review, we critically summarize the current state of knowledge about the biological role of tick serpins in shaping tick-host interactions with emphasis on the mechanisms by which they modulate host immunity. Their potential use in drug and vaccine development is also discussed.
Collapse
Affiliation(s)
- Muhammad Nadeem Abbas
- State Key Laboratory of Silkworm Genome Biology, Key Laboratory of Sericultural Biology and Genetic Breeding, Ministry of Agriculture, Southwest University, Chongqing, China
| | - Adéla Chlastáková
- Department of Medical Biology, Faculty of Science, University of South Bohemia in České Budějovice, České Budějovice, Czechia
- Laboratory of Molecular Biology of Ticks, Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
| | - Mohamed Amine Jmel
- Laboratory of Genomics and Proteomics of Disease Vectors, Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
| | | | - Jindřich Chmelař
- Department of Medical Biology, Faculty of Science, University of South Bohemia in České Budějovice, České Budějovice, Czechia
- *Correspondence: Jindřich Chmelař, ; Michail Kotsyfakis,
| | - Michail Kotsyfakis
- Laboratory of Genomics and Proteomics of Disease Vectors, Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czechia
- *Correspondence: Jindřich Chmelař, ; Michail Kotsyfakis,
| |
Collapse
|
3
|
Abstract
Junctophilins (JPHs) comprise a family of structural proteins that connect the plasma membrane to intracellular organelles such as the endo/sarcoplasmic reticulum. Tethering of these membrane structures results in the formation of highly organized subcellular junctions that play important signaling roles in all excitable cell types. There are four JPH isoforms, expressed primarily in muscle and neuronal cell types. Each JPH protein consists of 6 'membrane occupation and recognition nexus' (MORN) motifs, a joining region connecting these to another set of 2 MORN motifs, a putative alpha-helical region, a divergent region exhibiting low homology between JPH isoforms, and a carboxy-terminal transmembrane region anchoring into the ER/SR membrane. JPH isoforms play essential roles in developing and maintaining subcellular membrane junctions. Conversely, inherited mutations in JPH2 cause hypertrophic or dilated cardiomyopathy, while trinucleotide expansions in the JPH3 gene cause Huntington Disease-Like 2. Loss of JPH1 protein levels can cause skeletal myopathy, while loss of cardiac JPH2 levels causes heart failure and atrial fibrillation, among other disease. This review will provide a comprehensive overview of the JPH gene family, phylogeny, and evolutionary analysis of JPH genes and other MORN domain proteins. JPH biogenesis, membrane tethering, and binding partners will be discussed, as well as functional roles of JPH isoforms in excitable cells. Finally, potential roles of JPH isoform deficits in human disease pathogenesis will be reviewed.
Collapse
Affiliation(s)
- Stephan E Lehnart
- Cellular Biophysics and Translational Cardiology Section, Heart Research Center Göttingen, University Medical Center Göttingen, Department of Cardiology and Pneumology, Georg-August University Göttingen, Göttingen, Germany.,Cluster of Excellence "Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells" (MBExC), University of Göttingen, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Göttingen, Germany
| | - Xander H T Wehrens
- Cardiovascular Research Institute, Baylor College of Medicine, Houston, Texas, United States; Departments of Molecular Physiology and Biophysics, Medicine (Cardiology), Pediatrics (Cardiology), Neuroscience, and Center for Space Medicine, Baylor College of Medicine, Houston, Texas, United States
| |
Collapse
|
4
|
Sinha S, Manoj N. Molecular evolution of proteins mediating mitochondrial fission-fusion dynamics. FEBS Lett 2019; 593:703-718. [PMID: 30861107 DOI: 10.1002/1873-3468.13356] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 03/02/2019] [Accepted: 03/07/2019] [Indexed: 01/24/2023]
Abstract
Eukaryotes employ a subset of dynamins to mediate mitochondrial fusion and fission dynamics. Here we report the molecular evolution and diversification of the dynamin-related mitochondrial proteins that drive the fission (Drp1) and the fusion processes (mitofusin and OPA1). We demonstrate that the three paralogs emerged concurrently in an early mitochondriate eukaryotic ancestor. Furthermore, multiple independent duplication events from an ancestral bifunctional fission protein gave rise to specialized fission proteins. The evolutionary history of these proteins is marked by transformations that include independent gain and loss events occurring at the levels of entire genes, specific functional domains, and intronic regions. The domain level variations primarily comprise loss-gain of lineage specific domains that are present in the terminal regions of the sequences.
Collapse
Affiliation(s)
- Sansrity Sinha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - Narayanan Manoj
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| |
Collapse
|
5
|
Ma MY, Lan XR, Niu DK. Intron gain by tandem genomic duplication: a novel case in a potato gene encoding RNA-dependent RNA polymerase. PeerJ 2016; 4:e2272. [PMID: 27547574 PMCID: PMC4974935 DOI: 10.7717/peerj.2272] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Accepted: 06/29/2016] [Indexed: 01/15/2023] Open
Abstract
The origin and subsequent accumulation of spliceosomal introns are prominent events in the evolution of eukaryotic gene structure. However, the mechanisms underlying intron gain remain unclear because there are few proven cases of recently gained introns. In an RNA-dependent RNA polymerase (RdRp) gene, we found that a tandem duplication occurred after the divergence of potato and its wild relatives among other Solanum plants. The duplicated sequence crosses the intron-exon boundary of the first intron and the second exon. A new intron was detected at this duplicated region, and it includes a small previously exonic segment of the upstream copy of the duplicated sequence and the intronic segment of the downstream copy of the duplicated sequence. The donor site of this new intron was directly obtained from the small previously exonic segment. Most of the splicing signals were inherited directly from the parental intron/exon structure, including a putative branch site, the polypyrimidine tract, the 3' splicing site, two putative exonic splicing enhancers, and the GC contents differed between the intron and exon. In the widely cited model of intron gain by tandem genomic duplication, the duplication of an AGGT-containing exonic segment provides the GT and AG splicing sites for the new intron. Our results illustrate that the tandem duplication model of intron gain should be diverse in terms of obtaining the proper splicing signals.
Collapse
Affiliation(s)
- Ming-Yue Ma
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University , Beijing , China
| | - Xin-Ran Lan
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University , Beijing , China
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University , Beijing , China
| |
Collapse
|
6
|
Ma MY, Che XR, Porceddu A, Niu DK. Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium. BMC Evol Biol 2015; 15:286. [PMID: 26678305 PMCID: PMC4683709 DOI: 10.1186/s12862-015-0567-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 12/13/2015] [Indexed: 11/18/2022] Open
Abstract
Background Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. Results We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5′ end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Conclusions Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0567-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ming-Yue Ma
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China. .,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| | - Xun-Ru Che
- The High School Affiliated to Renmin University of China, Beijing, 100080, China.
| | - Andrea Porceddu
- Department of Agricultural Sciences, University of Sassari, Viale Italia, 39, 07100, Sassari, Italy.
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China. .,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
7
|
Milia G, Camiolo S, Avesani L, Porceddu A. The dynamic loss and gain of introns during the evolution of the Brassicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2015; 82:915-924. [PMID: 25899207 DOI: 10.1111/tpj.12860] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Revised: 04/02/2015] [Accepted: 04/09/2015] [Indexed: 06/04/2023]
Abstract
Sequence comparison allows the detailed analysis of evolution at the nucleotide and amino acid levels, but much less information is known about the structural evolution of genes, i.e. how the number, length and distribution of introns change over time. We constructed a parsimonious model for the evolutionary rate of intron loss (IL) and intron gain (IG) within the Brassicaceae and found that IL/IG has been highly dynamic, with substantial differences between and even within lineages. The divergence of the Brassicaceae lineages I and II marked a dramatic change in the IL rate, with the common ancestor of lineage I losing introns three times more rapidly than the common ancestor of lineage II. Our data also indicate a subsequent declining trend in the rate of IL, although in Arabidopsis thaliana introns continue to be lost at approximately the ancestral rate. Variations in the rate of IL/IG within lineage II have been even more remarkable. Brassica rapa appears to have lost introns approximately 15 times more rapidly than the common ancestor of B. rapa and Schenkiella parvula, and approximately 25 times more rapidly than its sister species Eutrema salsugineum. Microhomology was detected at the splice sites of several dynamic introns suggesting that the non-homologous end-joining and double-strand break repair is a common pathway underlying IL/IG in these species. We also detected molecular signatures typical of mRNA-mediated IL, but only in B. rapa.
Collapse
Affiliation(s)
- Giampiera Milia
- Department of Agricultural Sciences, University of Sassari, Viale Italia, 39, 07100, Sassari, Italy
| | - Salvatore Camiolo
- Department of Agricultural Sciences, University of Sassari, Viale Italia, 39, 07100, Sassari, Italy
| | - Linda Avesani
- Department of Biotechnology, University of Verona, Strada Le Grazie, 15, 37134, Verona, Italy
| | - Andrea Porceddu
- Department of Agricultural Sciences, University of Sassari, Viale Italia, 39, 07100, Sassari, Italy
| |
Collapse
|
8
|
Jiang L, Li XN, Niu DK. Higher frequency of intron loss from the promoter proximally paused genes of Drosophila melanogaster. Fly (Austin) 2014; 8:120-5. [PMID: 25483256 DOI: 10.4161/fly.29489] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Although intron losses have been widely reported, it is not clear whether they are neutral and therefore random or driven by positive selection. Intron transcription and splicing are time-consuming and can delay the expression of its host gene. For genes that must be activated quickly to respond to physiological or stress signals, intron delay may be deleterious. Promoter proximally paused (PPP) genes are a group of rapidly expressed genes. To respond quickly to activation signals, they generally initiate transcription competently but stall after synthesizing a short RNA. In this study, performed in Drosophila melanogaster, the PPP genes were found to have a significantly higher rate of intron loss than control genes. However, further analysis did not find more significant shrinkage of intron size in PPP genes. Referring to previous studies on the rates of transcription and splicing and to the time saved by deletion of the introns from mouse gene Hes7, it is here suggested that transcription delay is comparable to splicing delay only when the intron is 28.5 kb or larger, which is greater in size than 95% of vertebrate introns, 99.5% of Drosophila introns, and all the annotated introns of Saccharomyces cerevisiae and Arabidopsis thaliana. Delays in intron splicing are probably a selective force, promoting intron loss from quickly expressed genes. In other genes, it may have been an exaptation during the emergency of developmental clocks.
Collapse
Affiliation(s)
- Li Jiang
- a MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development; College of Life Sciences; Beijing Normal University; Beijing, PR China
| | | | | |
Collapse
|
9
|
Reichman JR, Vize PD. Separate introns gained within short and long soluble peridinin-chlorophyll a-protein genes during radiation of Symbiodinium (Dinophyceae) clade A and B lineages. PLoS One 2014; 9:e110608. [PMID: 25330037 PMCID: PMC4201569 DOI: 10.1371/journal.pone.0110608] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Accepted: 09/22/2014] [Indexed: 11/18/2022] Open
Abstract
Here we document introns in two Symbiodinium clades that were most likely gained following divergence of this genus from other peridinin-containing dinoflagellate lineages. Soluble peridinin-chlorophyll a-proteins (sPCP) occur in short and long forms in different species. Duplication and fusion of short sPCP genes produced long sPCP genes. All short and long sPCP genes characterized to date, including those from free living species and Symbiodinium sp. 203 (clade C/type C2) are intronless. However, we observed that long sPCP genes from two Caribbean Symbiodinium clade B isolates each contained two introns. To test the hypothesis that introns were gained during radiation of clade B, we compared sPCP genomic and cDNA sequences from 13 additional distinct Caribbean and Pacific Symbiodinium clade A, B, and F isolates. Long sPCP genes from all clade B/B1 and B/B19 descendants contain orthologs of both introns. Short sPCP genes from S. pilosum (A/A2) and S. muscatinei (B/B4) plus long sPCP genes from S. microadriaticum (A/A1) and S. kawagutii (F/F1) are intronless. Short sPCP genes of S. microadriaticum have a third unique intron. Symbiodinium clade B long sPCP sequences are useful for assessing divergence among B1 and B19 descendants. Phylogenetic analyses of coding sequences from four dinoflagellate orders indicate that introns were gained independently during radiation of Symbiodinium clades A and B. Long sPCP introns were present in the most recent common ancestor of Symbiodinium clade B core types B1 and B19, which apparently diverged sometime during the Miocene. The clade A short sPCP intron was either gained by S. microadriaticum or possibly by the ancestor of Symbiodinium types A/A1, A3, A4 and A5. The timing of short sPCP intron gain in Symbiodinium clade A is less certain. But, all sPCP introns were gained after fusion of ancestral short sPCP genes, which we confirm as occurring once in dinoflagellate evolution.
Collapse
Affiliation(s)
- Jay R. Reichman
- US Environmental Protection Agency, Western Ecology Division, Corvallis, Oregon, United States of America
- Oregon State University, Department of Botany and Plant Pathology, Corvallis, Oregon, United States of America
- * E-mail:
| | - Peter D. Vize
- University of Calgary, Department of Biological Sciences, Calgary, Alberta, Canada
| |
Collapse
|
10
|
Yan H, Zhang W, Lin Y, Dong Q, Peng X, Jiang H, Zhu S, Cheng B. Different evolutionary patterns among intronless genes in maize genome. Biochem Biophys Res Commun 2014; 449:146-50. [PMID: 24820954 DOI: 10.1016/j.bbrc.2014.05.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 05/02/2014] [Indexed: 02/06/2023]
Abstract
Intronless genes, as a characteristic feature of prokaryotes, are an important resource for the study of the evolution of gene architecture in eukaryotes. In the study, 14,623 (36.87%) intronless genes in maize were identified and the percentage is greater than that of other monocots and algae. The number of maize intronless genes on each chromosome has a significant linear correlation with the number of total genes on the chromosome and the length of the chromosomes. Intronless genes in maize play important roles in translation and energy metabolism. Evolutionary analysis revealed that 2601 intronless genes conserved among the three domains of life and 2323 intronless genes that had no homology with genes of other species. These two sets of intronless genes were distinct in genetic features, physical locations and function. These results provided a useful source to understand the evolutionary patterns of related genes and genomes and some intronless genes are good candidates for subsequent functional analyses specifically.
Collapse
Affiliation(s)
- Hanwei Yan
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Wei Zhang
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Yongxiang Lin
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Qing Dong
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Xiaojian Peng
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Haiyang Jiang
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Suwen Zhu
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China
| | - Beijiu Cheng
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China.
| |
Collapse
|
11
|
Heit C, Jackson BC, McAndrews M, Wright MW, Thompson DC, Silverman GA, Nebert DW, Vasiliou V. Update of the human and mouse SERPIN gene superfamily. Hum Genomics 2013; 7:22. [PMID: 24172014 PMCID: PMC3880077 DOI: 10.1186/1479-7364-7-22] [Citation(s) in RCA: 167] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Accepted: 10/15/2013] [Indexed: 12/14/2022] Open
Abstract
The serpin family comprises a structurally similar, yet functionally diverse, set of proteins. Named originally for their function as serine proteinase inhibitors, many of its members are not inhibitors but rather chaperones, involved in storage, transport, and other roles. Serpins are found in genomes of all kingdoms, with 36 human protein-coding genes and five pseudogenes. The mouse has 60 Serpin functional genes, many of which are orthologous to human SERPIN genes and some of which have expanded into multiple paralogous genes. Serpins are found in tissues throughout the body; whereas most are extracellular, there is a class of intracellular serpins. Serpins appear to have roles in inflammation, immune function, tumorigenesis, blood clotting, dementia, and cancer metastasis. Further characterization of these proteins will likely reveal potential biomarkers and therapeutic targets for disease.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Daniel W Nebert
- Department of Pharmaceutical Sciences, Molecular Toxicology and Environmental Health Sciences Program, University of Colorado Anschutz Medical Center, Aurora, CO 80045, USA.
| | | |
Collapse
|
12
|
Koonin EV, Csuros M, Rogozin IB. Whence genes in pieces: reconstruction of the exon-intron gene structures of the last eukaryotic common ancestor and other ancestral eukaryotes. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 4:93-105. [PMID: 23139082 DOI: 10.1002/wrna.1143] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
In eukaryotes, protein-coding sequences are interrupted by non-coding sequences known as introns. During mRNA maturation, introns are excised by the spliceosome and the coding regions, exons, are spliced to form the mature coding region. The intron densities widely differ between eukaryotic lineages, from 6 to 7 introns per kb of coding sequence in vertebrates, some invertebrates and green plants, to only a few introns across the entire genome in many unicellular eukaryotes. Evolutionary reconstructions using maximum likelihood methods suggest intron-rich ancestors for each major group of eukaryotes. For the last common ancestor of animals, the highest intron density of all extant and extinct eukaryotes was inferred, at 120-130% of the human intron density. Furthermore, an intron density within 53-74% of the human values was inferred for the last eukaryotic common ancestor. Accordingly, evolution of eukaryotic genes in all lines of descent involved primarily intron loss, with substantial gain only at the bases of several branches including plants and animals. These conclusions have substantial biological implications indicating that the common ancestor of all modern eukaryotes was a complex organism with a gene architecture resembling those in multicellular organisms. Alternative splicing most likely initially appeared as an inevitable result of splicing errors and only later was employed to generate structural and functional diversification of proteins.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information NLM/NIH, Bethesda, MD, USA.
| | | | | |
Collapse
|
13
|
Wei K, Chen J, Chen Y, Wu L, Xie D. Multiple-strategy analyses of ZmWRKY subgroups and functional exploration of ZmWRKY genes in pathogen responses. MOLECULAR BIOSYSTEMS 2012; 8:1940-9. [PMID: 22569521 DOI: 10.1039/c2mb05483c] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The WRKY transcription factor family plays crucial roles in biotic responses, such as fungi, bacteria, viruses and nematode infections and insect attacks. In this article, multiple-strategy analyses of the three subgroups were performed in order to gain structural and evolutionary proofs of the overall WRKY family and unravel the functions possessed by each group or subgroup. Thus we analyzed the similarity of WRKY factors between maize and Arabidopsis based on homology modelling. The gene structure and motif analyses of Group II demonstrated that specific motifs existing in the given subgroups may contribute to the functional diversification of WRKY proteins and the two types of conserved intron splice sites suggest their evolutionary conservation. The evolutionary divergence time estimation of Group III proteins indicated that the divergence of Group III occurred during the Neogene period. Further, we focused on the roles of maize WRKYs in pathogen responses based on publicly available microarray experiments. The result suggested that some ZmWRKYs are expressed specifically under the infection of certain fungus, among which some are up-regulated and some are down-regulated, indicating their positive or negative roles in pathogen response. Also, some genes remain unchanged upon fungal infection. Pearson correlation coefficient (PCC) analysis was performed using 62 fungal infection experiments to calculate the correlation between each pair of genes. A PCC value higher than 0.6 was regarded as strong correlation - in these circumstances, ninety pairs of genes showed a strong positive correlation, while fifteen pairs of genes displayed a strong negative correlation. These correlated genes form a co-regulatory network and help us investigate the existence of interactions between WRKY proteins.
Collapse
Affiliation(s)
- Kaifa Wei
- Department of Biological Sciences and Biotechnology, Zhangzhou Normal University, 36 Xian Qian Zhi Street, Zhangzhou 363000, Fujian, China.
| | | | | | | | | |
Collapse
|
14
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 224] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
15
|
Abou Mossallam AA. Identification of Cu/Zn superoxide dismutase in cattle and river buffaloes. J Genet Eng Biotechnol 2011. [DOI: 10.1016/j.jgeb.2011.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
16
|
Teixeira KN, Souza KN, Vidigal THDA, Brito CA, Santos AMC, Santoro MM. Size polymorphism in alleles of the myoglobin gene from biomphalaria mollusks. Genes (Basel) 2010; 1:357-70. [PMID: 24710092 PMCID: PMC3966218 DOI: 10.3390/genes1030357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Revised: 10/07/2010] [Accepted: 10/14/2010] [Indexed: 11/17/2022] Open
Abstract
Introns are common among all eukaryotes, while only a limited number of introns are found in prokaryotes. Globin and globin-like proteins are widely distributed in nature, being found even in prokaryotes and a wide range of patterns of intron-exon have been reported in several eukaryotic globin genes. Globin genes in invertebrates show considerable variation in the positions of introns; globins can be found without introns, with only one intron or with three introns in different positions. In this work we analyzed the introns in the myoglobin gene from Biomphalaria glabrata, B. straminea and B. tenagophila. In the Biomphalaria genus, the myoglobin gene has three introns; these were amplified by PCR and analyzed by PCR-RFLP. Results showed that the size (number or nucleotides) and the nucleotide sequence of the coding gene of the myoglobin are variable in the three species. We observed the presence of size polymorphisms in intron 2 and 3; this characterizes a homozygous/heterozygous profile and it indicates the existence of two alleles which are different in size in each species of Biomphalaria. This polymorphism could be explored for specific identification of Biomphalaria individuals.
Collapse
Affiliation(s)
- Kádima N Teixeira
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Karyne N Souza
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Teofânia H D A Vidigal
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | | | - Alexandre M C Santos
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Marcelo M Santoro
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| |
Collapse
|
17
|
Catania F, Gao X, Scofield DG. Endogenous mechanisms for the origins of spliceosomal introns. J Hered 2009; 100:591-6. [PMID: 19635762 DOI: 10.1093/jhered/esp062] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Over 30 years since their discovery, the origin of spliceosomal introns remains uncertain. One nearly universally accepted hypothesis maintains that spliceosomal introns originated from self-splicing group-II introns that invaded the uninterrupted genes of the last eukaryotic common ancestor (LECA) and proliferated by "insertion" events. Although this is a possible explanation for the original presence of introns and splicing machinery, the emphasis on a high number of insertion events in the genome of the LECA neglects a considerable body of empirical evidence showing that spliceosomal introns can simply arise from coding or, more generally, nonintronic sequences within genes. After presenting a concise overview of some of the most common hypotheses and mechanisms for intron origin, we propose two further hypotheses that are broadly based on central cellular processes: 1) internal gene duplication and 2) the response to aberrant and fortuitously spliced transcripts. These two nonmutually exclusive hypotheses provide a powerful way to explain the establishment of spliceosomal introns in eukaryotes without invoking an exogenous source.
Collapse
Affiliation(s)
- Francesco Catania
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| | | | | |
Collapse
|
18
|
Bae YA, Cai GB, Kim SH, Zo YG, Kong Y. Modular evolution of glutathione peroxidase genes in association with different biochemical properties of their encoded proteins in invertebrate animals. BMC Evol Biol 2009; 9:72. [PMID: 19344533 PMCID: PMC2679728 DOI: 10.1186/1471-2148-9-72] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2008] [Accepted: 04/06/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phospholipid hydroperoxide glutathione peroxidases (PHGPx), the most abundant isoforms of GPx families, interfere directly with hydroperoxidation of lipids. Biochemical properties of these proteins vary along with their donor organisms, which has complicated the phylogenetic classification of diverse PHGPx-like proteins. Despite efforts for comprehensive analyses, the evolutionary aspects of GPx genes in invertebrates remain largely unknown. RESULTS We isolated GPx homologs via in silico screening of genomic and/or expressed sequence tag databases of eukaryotic organisms including protostomian species. Genes showing strong similarity to the mammalian PHGPx genes were commonly found in all genomes examined. GPx3- and GPx7-like genes were additionally detected from nematodes and platyhelminths, respectively. The overall distribution of the PHGPx-like proteins with different biochemical properties was biased across taxa; selenium- and glutathione (GSH)-dependent proteins were exclusively detected in platyhelminth and deuterostomian species, whereas selenium-independent and thioredoxin (Trx)-dependent enzymes were isolated in the other taxa. In comparison of genomic organization, the GSH-dependent PHGPx genes showed a conserved architectural pattern, while their Trx-dependent counterparts displayed complex exon-intron structures. A codon for the resolving Cys engaged in reductant binding was found to be substituted in a series of genes. Selection pressure to maintain the selenocysteine codon in GSH-dependent genes also appeared to be relaxed during their evolution. With the dichotomized fashion in genomic organizations, a highly polytomic topology of their phylogenetic trees implied that the GPx genes have multiple evolutionary intermediate forms. CONCLUSION Comparative analysis of invertebrate GPx genes provides informative evidence to support the modular pathways of GPx evolution, which have been accompanied with sporadic expansion/deletion and exon-intron remodeling. The differentiated enzymatic properties might be acquired by the evolutionary relaxation of selection pressure and/or biochemical adaptation to the acting environments. Our present study would be beneficial to get detailed insights into the complex GPx evolution, and to understand the molecular basis of the specialized physiological implications of this antioxidant system in their respective donor organisms.
Collapse
Affiliation(s)
- Young-An Bae
- Department of Molecular Parasitology, Sungkyunkwan University School of Medicine and Center for Molecular Medicine, Samsung Biomedical Research Institute, Suwon, Gyeonggi-do 440-746, Korea.
| | | | | | | | | |
Collapse
|
19
|
del Campo EM, Casano LM. Degradation of plastid unspliced transcripts and lariat group II introns. Biochimie 2008; 90:474-83. [DOI: 10.1016/j.biochi.2007.10.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2007] [Accepted: 10/15/2007] [Indexed: 11/16/2022]
|
20
|
Abstract
Although introns were first discovered almost 30 years ago, their evolutionary origin remains elusive. In this work, we used multispecies whole-genome alignments to map Drosophila melanogaster introns onto 10 other fully sequenced Drosophila genomes. We were able to find 1,944 sites where an intron was missing in one or more species. We show that for most (>80%) of these cases, there is no leftover intronic sequence or any missing exonic sequence, indicating exact intron loss or gain events. We used parsimony to classify these differences as 1,754 intron loss events and 213 gain events. We show that lost and gained introns are significantly shorter than average and flanked by longer than average exons. They also display quite distinct phase distributions and show greater than average similarity between the 5' splice site and its 3' partner splice site. Introns that have been lost in one or more species evolve faster than other introns, occur in slowly evolving genes, and are found adjacent to each other more often than would be expected for independent single losses. Our results support the cDNA recombination mechanism of intron loss, suggest that selective pressures affect site-specific loss rates, and show conclusively that intron gain has occurred within the Drosophila lineage, solidifying the "introns-middle" hypothesis and providing some hints about the gain mechanism.
Collapse
|
21
|
Stergiopoulos I, Groenewald M, Staats M, Lindhout P, Crous PW, De Wit PJGM. Mating-type genes and the genetic structure of a world-wide collection of the tomato pathogen Cladosporium fulvum. Fungal Genet Biol 2007; 44:415-29. [PMID: 17178244 DOI: 10.1016/j.fgb.2006.11.004] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Revised: 11/06/2006] [Accepted: 11/07/2006] [Indexed: 11/17/2022]
Abstract
Two mating-type genes, designated MAT1-1-1 and MAT1-2-1, were cloned and sequenced from the presumed asexual ascomycete Cladosporium fulvum (syn. Passalora fulva). The encoded products are highly homologous to mating-type proteins from members of the Mycosphaerellaceae, such as Mycosphaerella graminicola and Cercospora beticola. In addition, the two MAT idiomorphs of C. fulvum showed regions of homology and each contained one additional putative ORF without significant similarity to known sequences. The distribution of the two mating-type genes in a world-wide collection of 86 C. fulvum strains showed a departure from a 1:1 ratio (chi(2)=4.81, df=1). AFLP analysis revealed a high level of genotypic diversity, while strains of the fungus were identified with similar virulence spectra but distinct AFLP patterns and opposite mating-types. These features could suggest the occurrence of recombination in C. fulvum.
Collapse
MESH Headings
- Amino Acid Sequence
- Cladosporium/genetics
- Cladosporium/growth & development
- Cloning, Molecular
- DNA, Fungal/chemistry
- DNA, Fungal/genetics
- Gene Expression Regulation, Fungal
- Genes, Mating Type, Fungal/genetics
- Genetic Variation
- Haplotypes
- Solanum lycopersicum/microbiology
- Models, Genetic
- Molecular Sequence Data
- Phylogeny
- Polymorphism, Restriction Fragment Length
- Sequence Alignment
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
Collapse
Affiliation(s)
- Ioannis Stergiopoulos
- Laboratory of Phytopathology, Wageningen University and Research Centre, Binnenhaven 5, 6709 PD Wageningen, The Netherlands
| | | | | | | | | | | |
Collapse
|
22
|
Stajich JE, Dietrich FS, Roy SW. Comparative genomic analysis of fungal genomes reveals intron-rich ancestors. Genome Biol 2007; 8:R223. [PMID: 17949488 PMCID: PMC2246297 DOI: 10.1186/gb-2007-8-10-r223] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Revised: 10/12/2007] [Accepted: 10/19/2007] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Eukaryotic protein-coding genes are interrupted by spliceosomal introns, which are removed from transcripts before protein translation. Many facets of spliceosomal intron evolution, including age, mechanisms of origins, the role of natural selection, and the causes of the vast differences in intron number between eukaryotic species, remain debated. Genome sequencing and comparative analysis has made possible whole genome analysis of intron evolution to address these questions. RESULTS We analyzed intron positions in 1,161 sets of orthologous genes across 25 eukaryotic species. We find strong support for an intron-rich fungus-animal ancestor, with more than four introns per kilobase, comparable to the highest known modern intron densities. Indeed, the fungus-animal ancestor is estimated to have had more introns than any of the extant fungi in this study. Thus, subsequent fungal evolution has been characterized by widespread and recurrent intron loss occurring in all fungal clades. These results reconcile three previously proposed methods for estimation of ancestral intron number, which previously gave very different estimates of ancestral intron number for eight eukaryotic species, as well as a fourth more recent method. We do not find a clear inverse correspondence between rates of intron loss and gain, contrary to the predictions of selection-based proposals for interspecific differences in intron number. CONCLUSION Our results underscore the high intron density of eukaryotic ancestors and the widespread importance of intron loss through eukaryotic evolution.
Collapse
Affiliation(s)
- Jason E Stajich
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
- Miller Institute for Basic Research and Department of Plant and Microbial Biology, 111 Koshland Hall #3102, University of California, Berkeley, CA 94720-3102, USA
| | - Fred S Dietrich
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
| | - Scott W Roy
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
23
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
24
|
Fridmanis D, Fredriksson R, Kapa I, Schiöth HB, Klovins J. Formation of new genes explains lower intron density in mammalian Rhodopsin G protein-coupled receptors. Mol Phylogenet Evol 2006; 43:864-80. [PMID: 17188520 DOI: 10.1016/j.ympev.2006.11.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2006] [Revised: 10/06/2006] [Accepted: 11/02/2006] [Indexed: 10/23/2022]
Abstract
Mammalian G protein-coupled receptor (GPCR) genes are characterised by a large proportion of intronless genes or a lower density of introns when compared with GPCRs of invertebrates. It is unclear which mechanisms have influenced intron density in this protein family, which is one of the largest in the mammalian genomes. We used a combination of Hidden Markov Models (HMM) and BLAST searches to establish the comprehensive repertoire of Rhodopsin GPCRs from seven species and performed overall alignments and phylogenetic analysis using the maximum parsimony method for over 1400 receptors in 12 subgroups. We identified 14 different Ancestral Receptor Groups (ARGs) that have members in both vertebrate and invertebrate species. We found that there exists a remarkable difference in the intron density among ancestral and new Rhodopsin GPCRs. The intron density among ARGs members was more than 3.5-fold higher than that within non-ARG members and more than 2-fold higher when considering only the 7TM region. This suggests that the new GPCR genes have been predominantly formed intronless while the ancestral receptors likely accumulated introns during their evolution. Many of the intron positions found in mammalian ARG receptor sequences were found to be present in orthologue invertebrate receptors suggesting that these intron positions are ancient. This analysis also revealed that one intron position is much more frequent than any other position and it is common for a number of phylogenetically different Rhodopsin GPCR groups. This intron position lies within a functionally important, conserved, DRY motif which may form a proto-splice site that could contribute to positional intron insertion. Moreover, we have found that other receptor motifs, similar to DRY, also contain introns between the second and third nucleotide of the arginine codon which also forms a proto-splice site. Our analysis presents compelling evidence that there was not a major loss of introns in mammalian GPCRs and formation of new GPCRs among mammals explains why these have fewer introns compared to invertebrate GPCRs. We also discuss and speculate about the possible role of different RNA- and DNA-based mechanisms of intron insertion and loss.
Collapse
Affiliation(s)
- Davids Fridmanis
- Biomedical Research and Study Centre, University of Latvia, Ratsupites 1, Riga, Latvia
| | | | | | | | | |
Collapse
|
25
|
Abstract
The exon/intron structure of eukaryotic genes differs extensively across species, but the mechanisms and relative rates of intron loss and gain are still poorly understood. Here, we used whole-genome sequence alignments of human, mouse, rat, and dog to perform a genome-wide analysis of intron loss and gain events in >17,000 mammalian genes. We found no evidence for intron gain and 122 cases of intron loss, most of which occurred within the rodent lineage. The majority (68%) of the deleted introns were extremely small (<150 bp), significantly smaller than average. The intron losses occurred almost exclusively within highly expressed, housekeeping genes, supporting the hypothesis that intron loss is mediated via germline recombination of genomic DNA with intronless cDNA. This study constitutes the largest scale analysis for intron dynamics in vertebrates to date and allows us to confirm and extend several hypotheses previously based on much smaller samples. Our results in mammals show that intron gain has not been a factor in the evolution of gene structure during the past 95 Myr and has likely been restricted to more ancient history.
Collapse
Affiliation(s)
| | - Jacek Majewski
- Department of Human Genetics, McGill University, Montreal, Quebec H3A 1A4, Canada
- Corresponding author.E-mail ; fax (514) 398-1790
| |
Collapse
|
26
|
Stajich JE, Dietrich FS. Evidence of mRNA-mediated intron loss in the human-pathogenic fungus Cryptococcus neoformans. EUKARYOTIC CELL 2006; 5:789-93. [PMID: 16682456 PMCID: PMC1459680 DOI: 10.1128/ec.5.5.789-793.2006] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Introns are a defining feature of eukaryotic genomes, though the mechanism of intron gain or loss is not well understood. Reverse transcription of mRNA followed by homologous recombination with the genome has been posited as a mechanism of intron loss, though little direct evidence of recent loss events has been described to support this model. We find supporting evidence for an mRNA-mediated mechanism of loss through comparative genome analyses that revealed a recent loss of 10 adjacent introns in a 22-exon gene in the human-pathogenic fungus Cryptococcus neoformans. We surveyed the gene structures of the entire genomes of Cryptococcus gattii, which diverged from the C. neoformans lineage 37 million years ago (Mya), and C. neoformans var. grubii and var. neoformans, which diverged 18 Mya. Our comparison revealed greater than 99.9% intron conservation, with evidence from 20 genes showing evidence of intron loss, but no convincing evidence of intron gain. Our findings confirm that Cryptococcus introns have been quite stable over recent evolutionary time, with occasional mRNA-mediated intron loss events.
Collapse
Affiliation(s)
- Jason E Stajich
- Department of Molecular Genetics and Microbiology, Center for Applied Genomics and Technology, and Institute for Genome Sciences and Policy, Duke University, Box 3568, Durham, NC 27710, USA
| | | |
Collapse
|
27
|
Abstract
The origins and importance of spliceosomal introns comprise one of the longest-abiding mysteries of molecular evolution. Considerable debate remains over several aspects of the evolution of spliceosomal introns, including the timing of intron origin and proliferation, the mechanisms by which introns are lost and gained, and the forces that have shaped intron evolution. Recent important progress has been made in each of these areas. Patterns of intron-position correspondence between widely diverged eukaryotic species have provided insights into the origins of the vast differences in intron number between eukaryotic species, and studies of specific cases of intron loss and gain have led to progress in understanding the underlying molecular mechanisms and the forces that control intron evolution.
Collapse
Affiliation(s)
- Scott William Roy
- Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
| | | |
Collapse
|
28
|
Abstract
We recently reported the unprecedented occurrence of a hemoglobin gene (glob1) in the fruitfly Drosophila melanogaster. Here we investigate the structure and evolution of the glob1 gene in other Drosophila species. We cloned and sequenced glob1 genes and cDNA from D. pseudoobscura and D. virilis, and identified the glob1 gene sequences of D. simulans, D. yakuba, D. erecta, D. ananassae, D. mojavensis and D. grimshawi in the databases. Gene structure (introns in helix positions D7.0 and G7.0), gene synteny and sequence of glob1 are highly conserved, with high ds/dn ratios indicating strong purifying selection. The data suggest an important role of the glob1 protein in Drosophila, which may be the control of oxygen flow from the tracheal system. Furthermore, we identified two additional globin genes (glob2 and glob3) in the Drosophilidae. Although the sequences are highly derived, the amino acids required for heme- and oxygen-binding are conserved. In contrast to other known insect globin, the glob2 and glob3 genes harbour both globin-typical introns at positions B12.2 and G7.0. Both genes are conserved in various drosophilid species, but only expression of glob2 could be demonstrated by western blotting and RT-PCR. Phylogenetic analyses show that the clade leading to glob2 and glob3, which are sistergroups, diverged first in the evolution of dipteran globins. glob1 is closely related to the intracellular hemoglobin of the botfly Gasterophilus intestinalis, and the extracellular hemoglobins from the chironomid midges derive from this clade.
Collapse
|
29
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that today living organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to be originated by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always match detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous, if expression patterns were different at the time the introns arose. In fact, this may likely be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
30
|
Jin S, Hu GA, Qian YH, Zhang L, Zhang J, Qiu G, Zeng QT, Gui JF. Identification of one intron loss and phylogenetic evolution of Dfak gene in the Drosophila melanogaster species group. Genetica 2006; 125:223-30. [PMID: 16247694 DOI: 10.1007/s10709-005-8693-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2005] [Accepted: 06/13/2005] [Indexed: 11/26/2022]
Abstract
Intron loss and its evolutionary significance have been noted in Drosophila. The current study provides another example of intron loss within a single-copy Dfak gene in Drosophila. By using polymerase chain reaction (PCR), we amplified about 1.3 kb fragment spanning intron 5-10, located in the position of Tyr kinase (TyK) domain of Dfak gene from Drosophila melanogaster species group, and observed size difference among the amplified DNA fragments from different species. Further sequencing analysis revealed that D. melanogaster and D. simulans deleted an about 60 bp of DNA fragment relative to other 7 Drosophila species, such as D. elegans, D. ficusphila, D. biarmipes, D. takahashii, D. jambulina, D. prostipennis and D. pseudoobscura, and the deleted fragment located precisely in the position of one intron. The data suggested that intron loss might have occurred in the Dfak gene evolutionary process of D. melanogaster and D. simulans of Drosophila melanogaster species group. In addition, the constructed phylogenetic tree based on the Dfak TyK domains clearly revealed the evolutionary relationships between subgroups of Drosophila melanogaster species group, and the intron loss identified from D. melanogaster and D. simulans provides a unique diagnostic tool for taxonomic classification of the melanogaster subgroup from other group of genus Drosophila.
Collapse
Affiliation(s)
- S Jin
- Wuhan Center for Developmental Biology, Institute of Hydrobiology, Graduate School of the Chinese Academy of Sciences, Chinese Academy of Sciences, Wuhan 430072, China
| | | | | | | | | | | | | | | |
Collapse
|
31
|
Nguyen HD, Yoshihama M, Kenmochi N. New maximum likelihood estimators for eukaryotic intron evolution. PLoS Comput Biol 2005; 1:e79. [PMID: 16389300 PMCID: PMC1323467 DOI: 10.1371/journal.pcbi.0010079] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2005] [Accepted: 11/29/2005] [Indexed: 11/19/2022] Open
Abstract
The evolution of spliceosomal introns remains poorly understood. Although many approaches have been used to infer intron evolution from the patterns of intron position conservation, the results to date have been contradictory. In this paper, we address the problem using a novel maximum likelihood method, which allows estimation of the frequency of intron insertion target sites, together with the rates of intron gain and loss. We analyzed the pattern of 10,044 introns (7,221 intron positions) in the conserved regions of 684 sets of orthologs from seven eukaryotes. We determined that there is an average of one target site per 11.86 base pairs (bp) (95% confidence interval, 9.27 to 14.39 bp). In addition, our results showed that: (i) overall intron gains are ~25% greater than intron losses, although specific patterns vary with time and lineage; (ii) parallel gains account for ~18.5% of shared intron positions; and (iii) reacquisition following loss accounts for ~0.5% of all intron positions. Our results should assist in resolving the long-standing problem of inferring the evolution of spliceosomal introns. When did spliceosomal introns originate, and what is their role? These questions are the central subject of the introns-early versus introns-late debate. Inference of intron evolution from the pattern of intron position conservation is vital for resolving this debate. So far, different methods of two approaches, maximum parsimony (MP) and maximum likelihood (ML), have been developed, but the results are contradictory. The differences between previous ML results are due predominantly to differing assumptions concerning the frequency of target sites for intron insertion. This paper describes a new ML method that treats this frequency as a parameter requiring optimization. Using the pattern of intron position in conserved regions of 684 clusters of gene orthologs from seven eukaryotes, the authors found that, on average, there is one target site per ~12 base pairs. The results of intron evolution inferred using this optimal frequency are more definitive than previous ML results. Since the ML method is preferred to the MP one for large datasets, the current results should be the most reliable ones to date. The results show that during the course of evolution there have been slightly more intron gains than losses, and thus they favor introns-late. These results should shed new light on our understanding of intron evolution.
Collapse
Affiliation(s)
- Hung D Nguyen
- Frontier Science Research Center, University of Miyazaki, Kiyotake, Miyazaki, Japan
| | - Maki Yoshihama
- Frontier Science Research Center, University of Miyazaki, Kiyotake, Miyazaki, Japan
| | - Naoya Kenmochi
- Frontier Science Research Center, University of Miyazaki, Kiyotake, Miyazaki, Japan
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
32
|
Galagan JE, Henn MR, Ma LJ, Cuomo CA, Birren B. Genomics of the fungal kingdom: Insights into eukaryotic biology. Genome Res 2005; 15:1620-31. [PMID: 16339359 DOI: 10.1101/gr.3767105] [Citation(s) in RCA: 233] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The last decade has witnessed a revolution in the genomics of the fungal kingdom. Since the sequencing of the first fungus in 1996, the number of available fungal genome sequences has increased by an order of magnitude. Over 40 complete fungal genomes have been publicly released with an equal number currently being sequenced--representing the widest sampling of genomes from any eukaryotic kingdom. Moreover, many of these sequenced species form clusters of related organisms designed to enable comparative studies. These data provide an unparalleled opportunity to study the biology and evolution of this medically, industrially, and environmentally important kingdom. In addition, fungi also serve as model organisms for all eukaryotes. The available fungal genomic resource, coupled with the experimental tractability of the fungi, is accelerating research into the fundamental aspects of eukaryotic biology. We provide here an overview of available fungal genomes and highlight some of the biological insights that have been derived through their analysis. We also discuss insights into the fundamental cellular biology shared between fungi and other eukaryotic organisms.
Collapse
Affiliation(s)
- James E Galagan
- The Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02141, USA.
| | | | | | | | | |
Collapse
|
33
|
Haugen P, Wikmark OG, Vader A, Coucheron DH, Sjøttem E, Johansen SD. The recent transfer of a homing endonuclease gene. Nucleic Acids Res 2005; 33:2734-41. [PMID: 15891115 PMCID: PMC1110740 DOI: 10.1093/nar/gki564] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The myxomycete Didymium iridis (isolate Panama 2) contains a mobile group I intron named Dir.S956-1 after position 956 in the nuclear small subunit (SSU) rRNA gene. The intron is efficiently spread through homing by the intron-encoded homing endonuclease I-DirI. Homing endonuclease genes (HEGs) usually spread with their associated introns as a unit, but infrequently also spread independent of introns (or inteins). Clear examples of HEG mobility are however sparse. Here, we provide evidence for the transfer of a HEG into a group I intron named Dir.S956-2 that is inserted into the SSU rDNA of the Costa Rica 8 isolate of D.iridis. Similarities between intron sequences that flank the HEG and rDNA sequences that flank the intron (the homing endonuclease recognition sequence) suggest that the HEG invaded the intron during the recent evolution in a homing-like event. Dir.S956-2 is inserted into the same SSU site as Dir.S956-1. Remarkably, the two group I introns encode distantly related splicing ribozymes with phylogenetically related HEGs inserted on the opposite strands of different peripheral loop regions. The HEGs are both interrupted by small spliceosomal introns that must be removed during RNA maturation.
Collapse
Affiliation(s)
| | | | | | | | | | - Steinar D. Johansen
- To whom correspondence should be addressed. Tel: +47 77 64 53 67; Fax: +47 77 64 53 50;
| |
Collapse
|
34
|
Abstract
The discovery of RNA-based enzymatic activity by Thomas Cech's and Sidney Altman's laboratories was a momentous event that led Walter Gilbert to the concept of an "RNA world"--a primitive ancient stage of life that existed before the appearance of DNA and protein molecules. A year later, Gilbert formulated "the exon theory of genes," which hypothesized that introns are very ancient genetic elements present at the earliest stages of life in the RNA world. This theory has been fiercely debated and still has vigorous supporters and opponents. In this communication, we explore peculiarities in the RNA-protein world and their effect on intron-exon structures. We demonstrate that these peculiarities, which exist in the absence of DNA, could shed light on introns' original functions as well as the important role they might have played in the origin of life. For ancient DNA-lacking cells, a crucial problem existed in distinguishing two distinct subsets of RNAs: those messenger molecules coding for proteins and those heritable genetic molecules complementary to messenger RNAs that propagate the genetic information through generations. We propose that ancient introns could act as markers of RNA subsets, directing them to different functions.
Collapse
|
35
|
Vanácová S, Yan W, Carlton JM, Johnson PJ. Spliceosomal introns in the deep-branching eukaryote Trichomonas vaginalis. Proc Natl Acad Sci U S A 2005; 102:4430-5. [PMID: 15764705 PMCID: PMC554003 DOI: 10.1073/pnas.0407500102] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Eukaryotes have evolved elaborate splicing mechanisms to remove introns that would otherwise destroy the protein-coding capacity of genes. Nuclear premRNA splicing requires sequence motifs in the intron and is mediated by a ribonucleoprotein complex, the spliceosome. Here we demonstrate the presence of a splicing apparatus in the protist Trichomonas vaginalis and show that RNA motifs found in yeast and metazoan introns are required for splicing. We also describe the first introns in this deep-branching lineage. The positions of these introns are often conserved in orthologous genes, indicating they were present in a common ancestor of trichomonads, yeast, and metazoa. All examined T. vaginalis introns have a highly conserved 12-nt 3' splice-site motif that encompasses the branch point and is necessary for splicing. This motif is also found in the only described intron in a gene from another deep-branching eukaryote, Giardia intestinalis. These studies demonstrate the conservation of intron splicing signals across large evolutionary distances, reveal unexpected motif conservation in deep-branching lineages that suggest a simplified mechanism of splicing in primitive unicellular eukaryotes, and support the presence of introns in the earliest eukaryote.
Collapse
Affiliation(s)
- Stepánka Vanácová
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095, USA
| | | | | | | |
Collapse
|
36
|
Nielsen CB, Friedman B, Birren B, Burge CB, Galagan JE. Patterns of intron gain and loss in fungi. PLoS Biol 2004; 2:e422. [PMID: 15562318 PMCID: PMC532390 DOI: 10.1371/journal.pbio.0020422] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2004] [Accepted: 10/05/2004] [Indexed: 11/19/2022] Open
Abstract
Little is known about the patterns of intron gain and loss or the relative contributions of these two processes to gene evolution. To investigate the dynamics of intron evolution, we analyzed orthologous genes from four filamentous fungal genomes and determined the pattern of intron conservation. We developed a probabilistic model to estimate the most likely rates of intron gain and loss giving rise to these observed conservation patterns. Our data reveal the surprising importance of intron gain. Between about 150 and 250 gains and between 150 and 350 losses were inferred in each lineage. We discuss one gene in particular (encoding 1-phosphoribosyl-5-pyrophosphate synthetase) that displays an unusually high rate of intron gain in multiple lineages. It has been recognized that introns are biased towards the 5' ends of genes in intron-poor genomes but are evenly distributed in intron-rich genomes. Current models attribute this bias to 3' intron loss through a poly-adenosine-primed reverse transcription mechanism. Contrary to standard models, we find no increased frequency of intron loss toward the 3' ends of genes. Thus, recent intron dynamics do not support a model whereby 5' intron positional bias is generated solely by 3'-biased intron loss.
Collapse
Affiliation(s)
- Cydney B Nielsen
- 1Department of Biology, Massachusetts Institute of TechnologyCambridge, MassachusettsUnited States of America
- 2The Broad Institute of Massachusetts Institute of Technology and Harvard UniversityCambridge, MassachusettsUnited States of America
| | - Brad Friedman
- 1Department of Biology, Massachusetts Institute of TechnologyCambridge, MassachusettsUnited States of America
- 3Department of Mathematics, Massachusetts Institute of TechnologyCambridge, MassachusettsUnited States of America
| | - Bruce Birren
- 2The Broad Institute of Massachusetts Institute of Technology and Harvard UniversityCambridge, MassachusettsUnited States of America
| | - Christopher B Burge
- 1Department of Biology, Massachusetts Institute of TechnologyCambridge, MassachusettsUnited States of America
| | - James E Galagan
- 2The Broad Institute of Massachusetts Institute of Technology and Harvard UniversityCambridge, MassachusettsUnited States of America
| |
Collapse
|
37
|
Tordai H, Patthy L. Insertion of spliceosomal introns in proto-splice sites: the case of secretory signal peptides. FEBS Lett 2004; 575:109-11. [PMID: 15388343 DOI: 10.1016/j.febslet.2004.08.045] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2004] [Revised: 08/19/2004] [Accepted: 08/24/2004] [Indexed: 11/20/2022]
Abstract
Analysis of the exon-intron structures of 2208 human genes has revealed that there is a statistically highly significant excess of phase 1 introns in the vicinity of the signal peptide cleavage sites. It is suggested that amino acid sequences surrounding signal peptide cleavage sites are significantly enriched in phase 1 proto-splice sites and this has favored insertion of spliceosomal introns in these sites.
Collapse
Affiliation(s)
- Hedvig Tordai
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Karolina út 29, P.O. Box 7, Budapest H-1518, Hungary
| | | |
Collapse
|
38
|
Sverdlov AV, Rogozin IB, Babenko VN, Koonin EV. Reconstruction of ancestral protosplice sites. Curr Biol 2004; 14:1505-8. [PMID: 15324669 DOI: 10.1016/j.cub.2004.08.027] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2004] [Revised: 06/24/2004] [Accepted: 06/24/2004] [Indexed: 11/20/2022]
Abstract
Most of the eukaryotic protein-coding genes are interrupted by multiple introns. A substantial fraction of introns occupy the same position in orthologous genes from distant eukaryotes, such as plants and animals, and consequently are inferred to have been inherited from the common ancestor of these organisms. In contrast to these conserved introns, many other introns appear to have been gained during evolution of each major eukaryotic lineage. The mechanism(s) of insertion of new introns into genes remains unknown. Because the nucleotides that flank splice junctions are nonrandom, it has been proposed that introns are preferentially inserted into specific target sequences termed protosplice sites. However, it remains unclear whether the consensus nucleotides flanking the splice junctions are remnants of the original protosplice sites or if they evolved convergently after intron insertion. Here, we directly address the existence of protosplice sites by examining the context of introns inserted within codons that encode amino acids conserved in all eukaryotes and accordingly are not subject to selection for splicing efficiency. We show that introns are either predominantly inserted into specific protosplice sites, which have the consensus sequence (A/C)AG/Gt, or that they are inserted randomly but are preferentially fixed at such sites.
Collapse
Affiliation(s)
- Alexander V Sverdlov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
39
|
Krauss V, Pecyna M, Kurz K, Sass H. Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ. Mol Biol Evol 2004; 22:74-84. [PMID: 15356279 DOI: 10.1093/molbev/msh255] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic translation initiation factor 2 (eIF2) is a G protein that delivers the methionyl initiator tRNA to the small ribosomal subunit and releases it upon GTP hydrolysis after the recognition of the initiation codon. eIF2 is composed of three subunits, alpha, beta, and gamma. Subunit gamma shows the strongest conservation, and it confers both tRNA and GTP/GDP binding. Using intron positioning and protein sequence alignment, here we show that eIF2gamma is a suitable phylogenetic marker for eukaryotes. We determined or completed the sequences of 13 arthropod eIF2gamma genes. Analyzing the phylogenetic distribution of 52 different intron positions in 55 distantly related eIF2gamma genes, we identified ancient ones and shared derived introns in our data set. Obviously, intron positioning in eIF2gamma is evolutionarily conserved. However, there were episodes of complete and partial intron losses followed by intron gains. We identified 17 clusters of intron positions based on their distribution. The evolution of these clusters appears to be connected with preferred exon length and can be used to estimate the relative timing of intron gain because nearby precursor introns had to be erased from the gene before the new introns could be inserted. Moreover, we identified a putative case of intron sliding that constitutes a synapomorphic character state supporting monophyly of Coleoptera, Lepidoptera, and Diptera excluding Hymenoptera. We also performed tree reconstructions using the eIF2gamma protein sequences and intron positioning as phylogenetic information. Our results support the monophyly of Viridoplantae, Ascomycota, Homobasidiomyceta, and Apicomplexa.
Collapse
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Leipzig, Germany.
| | | | | | | |
Collapse
|
40
|
Roesner A, Fuchs C, Hankeln T, Burmester T. A globin gene of ancient evolutionary origin in lower vertebrates: evidence for two distinct globin families in animals. Mol Biol Evol 2004; 22:12-20. [PMID: 15356282 DOI: 10.1093/molbev/msh258] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Hemoglobin, myoglobin, neuroglobin, and cytoglobin are four types of vertebrate globins with distinct tissue distributions and functions. Here, we report the identification of a fifth and novel globin gene from fish and amphibians, which has apparently been lost in the evolution of higher vertebrates (Amniota). Because its function is presently unknown, we tentatively call it globin X (GbX). Globin X sequences were obtained from three fish species, the zebrafish Danio rerio, the goldfish Carassius auratus, and the pufferfish Tetraodon nigroviridis, and the clawed frog Silurana tropicalis. Globin X sequences are distinct from vertebrate hemoglobins, myoglobins, neuroglobins, and cytoglobins. Globin X displays the highest identity scores with neuroglobin (approximately 26% to 35%), although it is not a neuronal protein, as revealed by RT-PCR experiments on goldfish RNA from various tissues. The distal ligand-binding and the proximal heme-binding histidines (E7 and F8), as well as the conserved phenylalanine CD1 are present in the globin X sequences, but because of extensions at the N-terminal and C-terminal, the globin X proteins are longer than the typical eight alpha-helical globins and comprise about 200 amino acids. In addition to the conserved globin introns at helix positions B12.2 and G7.0, the globin X genes contain two introns in E10.2 and H10.0. The intron in E10.2 is shifted by 1 bp in respect to the vertebrate neuroglobin gene (E11.0), providing possible evidence for an intron sliding event. Phylogenetic analyses confirm an ancient evolutionary relationship of globin X with neuroglobin and suggest the existence of two distinct globin types in the last common ancestor of Protostomia and Deuterostomia.
Collapse
Affiliation(s)
- Anja Roesner
- Institute of Zoology, Johannes Gutenberg University, Mainz, Germany
| | | | | | | |
Collapse
|
41
|
Coghlan A, Wolfe KH. Origins of recently gained introns in Caenorhabditis. Proc Natl Acad Sci U S A 2004; 101:11362-7. [PMID: 15243155 PMCID: PMC509176 DOI: 10.1073/pnas.0308192101] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2003] [Indexed: 11/18/2022] Open
Abstract
The genomes of the nematodes Caenorhabditis elegans and Caenorhabditis briggsae both contain approximately 100,000 introns, of which >6,000 are unique to one or the other species. To study the origins of new introns, we used a conservative method involving phylogenetic comparisons to animal orthologs and nematode paralogs to identify cases where an intron content difference between C. elegans and C. briggsae was caused by intron insertion rather than deletion. We identified 81 recently gained introns in C. elegans and 41 in C. briggsae. Novel introns have a stronger exon splice site consensus sequence than the general population of introns and show the same preference for phase 0 sites in codons over phases 1 and 2. More of the novel introns are inserted in genes that are expressed in the C. elegans germ line than expected by chance. Thirteen of the 122 gained introns are in genes whose protein products function in premRNA processing, including three gains in the gene for spliceosomal protein SF3B1 and two in the nonsense-mediated decay gene smg-2. Twenty-eight novel introns have significant DNA sequence identity to other introns, including three that are similar to other introns in the same gene. All of these similarities involve minisatellites or palindromes in the intron sequences. Our results suggest that at least some of the intron gains were caused by reverse splicing of a preexisting intron.
Collapse
Affiliation(s)
- Avril Coghlan
- Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland
| | | |
Collapse
|
42
|
Affiliation(s)
- John M Logsdon
- Department of Biological Sciences, Roy J. Carver Center for Comparative Genomics, University of Iowa, Iowa City, IA 52242, USA.
| |
Collapse
|
43
|
Haugen P, Runge HJ, Bhattacharya D. Long-term evolution of the S788 fungal nuclear small subunit rRNA group I introns. RNA (NEW YORK, N.Y.) 2004; 10:1084-96. [PMID: 15208444 PMCID: PMC1370599 DOI: 10.1261/rna.5202704] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
More than 1000 group I introns have been identified in fungal rDNA. Little is known, however, of the splicing and secondary structure evolution of these ribozymes. Here, we use a combination of comparative and biochemical methods to address the evolution and splicing of a vertically inherited group I intron found at position 788 in the fungal small subunit (S) rRNA. The ancestral state of the S788 intron contains a highly conserved core and an extended P5 domain typical of IC1 introns. In contrast, the more derived introns have lost most of P5, and have an accelerated divergence rate within the core region with three functionally important substitutions that unambiguously separate them from the ancestral pool. Of 14 S788 group I introns that were tested for splicing, five, all of the ancestral type, were able to self-splice and produced intron RNA circles in vitro. The more derived S788 introns did not self-splice, and potentially rely on fungal-specific factors to facilitate splicing. In summary, we demonstrate one possible fate of vertically inherited group I introns, the loss of secondary structure elements, lessened selective constraints in the intron core, and ultimately, dependence on host-mediated splicing.
Collapse
Affiliation(s)
- Peik Haugen
- Department of Biological Sciences and Center for Comparative Genomics, University of Iowa, 210 Biology Building, Iowa City, IA 52242-1324, USA
| | | | | |
Collapse
|
44
|
Kiontke K, Gavin NP, Raynes Y, Roehrig C, Piano F, Fitch DHA. Caenorhabditis phylogeny predicts convergence of hermaphroditism and extensive intron loss. Proc Natl Acad Sci U S A 2004; 101:9003-8. [PMID: 15184656 PMCID: PMC428462 DOI: 10.1073/pnas.0403094101] [Citation(s) in RCA: 265] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2004] [Indexed: 11/18/2022] Open
Abstract
Despite the prominence of Caenorhabditis elegans as a major developmental and genetic model system, its phylogenetic relationship to its closest relatives has not been resolved. Resolution of these relationships is necessary for studying the steps that underlie life history, genomic, and morphological evolution of this important system. By using data from five different nuclear genes from 10 Caenorhabditis species currently in culture, we find a well resolved phylogeny that reveals three striking patterns in the evolution of this animal group: (i) Hermaphroditism has evolved independently in C. elegans and its close relative Caenorhabditis briggsae; (ii) there is a large degree of intron turnover within Caenorhabditis, and intron losses are much more frequent than intron gains; and (iii) despite the lack of marked morphological diversity, more genetic disparity is present within this one genus than has occurred within all vertebrates.
Collapse
Affiliation(s)
- Karin Kiontke
- Department of Biology, New York University, 100 Washington Square East, New York, NY 10003, USA
| | | | | | | | | | | |
Collapse
|
45
|
Sverdlov AV, Rogozin IB, Babenko VN, Koonin EV. Evidence of splice signal migration from exon to intron during intron evolution. Curr Biol 2004; 13:2170-4. [PMID: 14680632 DOI: 10.1016/j.cub.2003.12.003] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
A comparison of the nucleotide sequences around the splice junctions that flank old (shared by two or more major lineages of eukaryotes) and new (lineage-specific) introns in eukaryotic genes reveals substantial differences in the distribution of information between introns and exons. Old introns have a lower information content in the exon regions adjacent to the splice sites than new introns but have a corresponding higher information content in the intron itself. This suggests that introns insert into nonrandom (proto-splice) sites but, during the evolution of an intron after insertion, the splice signal shifts from the flanking exon regions to the ends of the intron itself. Accumulation of information inside the intron during evolution suggests that new introns largely emerge de novo rather than through propagation and migration of old introns.
Collapse
Affiliation(s)
- Alexander V Sverdlov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
46
|
Klovins J, Haitina T, Fridmanis D, Kilianova Z, Kapa I, Fredriksson R, Gallo-Payet N, Schiöth HB. The melanocortin system in Fugu: determination of POMC/AGRP/MCR gene repertoire and synteny, as well as pharmacology and anatomical distribution of the MCRs. Mol Biol Evol 2003; 21:563-79. [PMID: 14694081 DOI: 10.1093/molbev/msh050] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The G-protein-coupled melanocortin receptors (MCRs) play an important role in a variety of essential functions such as the regulation of pigmentation, energy homeostasis, and steroid production. We performed a comprehensive characterization of the MC system in Fugu (Takifugu rubripes). We show that Fugu has an AGRP gene with high degree of conservation in the C-terminal region in addition to a POMC gene lacking gamma-MSH. The Fugu genome contains single copies of four MCRs, whereas the MC3R is missing. The MC2R and MC5R are found in tandem and remarkably contain one and two introns, respectively. We suggest that these introns were inserted through a reverse splicing mechanism into the DRY motif that is widely conserved through GPCRs. We were able to assemble large blocks around the MCRs in Fugu, showing remarkable synteny with human chromosomes 16 and 18. Detailed pharmacological characterization showed that ACTH had surprisingly high affinity for the Fugu MC1R and MC4R, whereas alpha-MSH had lower affinity. We also showed that the MC2R gene in Fugu codes for an ACTH receptor, which did not respond to alpha-MSH. All the Fugu receptors were able to couple functionally to cAMP production in line with the mammalian orthologs. The anatomical characterization shows that the MC2R is expressed in the brain in addition to the head-kidney, whereas the MC4R and MC5R are found in both brain regions and peripheral tissues. This is the first comprehensive genomic and functional characterization of a GPCR family within the Fugu genome. The study shows that some parts of the MC system are highly conserved through vertebrate evolution, such as regions in POMC coding for ACTH, alpha-MSH, and beta-MSH, the C-terminal region of AGRP, key binding units within the MC1R, MC2R, MC4R, and MC5R, synteny blocks around the MCRs, pharmacological properties of the MC2R, whereas other parts in the system are either missing, such as the MC3R and gamma-MSH, or different as compared to mammals, such as the affinity of ACTH and MSH peptides to MC1R and MC4R and the anatomical expression pattern of the MCRs.
Collapse
Affiliation(s)
- Janis Klovins
- Department of Neuroscience, Uppsala University, Uppsala, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
47
|
Abstract
For nearly 15 years, it has been widely believed that many introns were recently acquired by the genes of multicellular organisms. However, the mechanism of acquisition has yet to be described for a single animal intron. Here, we report a large-scale computational analysis of the human, Drosophila melanogaster, Caenorhabditis elegans, and Arabidopsis thaliana genomes. We divided 147,796 human intron sequences into batches of similar lengths and aligned them with each other. Different types of homologies between introns were found, but none showed evidence of simple intron transposition. Also, 106,902 plant, 39,624 Drosophila, and 6021 C. elegans introns were examined. No single case of homologous introns in nonhomologous genes was detected. Thus, we found no example of transposition of introns in the last 50 million years in humans, in 3 million years in Drosophila and C. elegans, or in 5 million years in Arabidopsis. Either new introns do not arise via transposition of other introns or intron transposition must have occurred so early in evolution that all traces of homology have been lost.
Collapse
Affiliation(s)
- Alexei Fedorov
- Department of Medicine, Medical College of Ohio, Toledo, Ohio 43614, USA.
| | | | | | | |
Collapse
|
48
|
Rogozin IB, Wolf YI, Sorokin AV, Mirkin BG, Koonin EV. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution. Curr Biol 2003; 13:1512-7. [PMID: 12956953 DOI: 10.1016/s0960-9822(03)00558-x] [Citation(s) in RCA: 301] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|
49
|
Roy SW, Fedorov A, Gilbert W. Large-scale comparison of intron positions in mammalian genes shows intron loss but no gain. Proc Natl Acad Sci U S A 2003; 100:7158-62. [PMID: 12777620 PMCID: PMC165846 DOI: 10.1073/pnas.1232297100] [Citation(s) in RCA: 152] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We compared intron-exon structures in 1,560 human-mouse orthologs and 360 mouse-rat orthologs. The origin of differences in intron positions between species was inferred by comparison with an outgroup, Fugu for human-mouse and human for mouse-rat. Among 10,020 intron positions in the human-mouse comparison, we found unequivocal evidence for five independent intron losses in the mouse lineage but no evidence for intron loss in humans or for intron gain in either lineage. Among 1,459 positions in rat-mouse comparisons, we found evidence for one loss in rat but neither loss in mouse nor gain in either lineage. In each case, the intron losses were exact, without change in the surrounding coding sequence, and involved introns that are extremely short, with an average of 200 bp, an order of magnitude shorter than the mammalian average. These results favor a model whereby introns are lost through gene conversion with intronless copies of the gene. In addition, the finding of widespread conservation of intron-exon structure, even over large evolutionary distances, suggests that comparative methods employing information about gene structures should be very successful in correctly predicting exon boundaries in genomic sequences.
Collapse
Affiliation(s)
- Scott W Roy
- Biological Laboratories, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | | | | |
Collapse
|
50
|
Tarrio R, Rodríguez-Trelles F, Ayala FJ. A new Drosophila spliceosomal intron position is common in plants. Proc Natl Acad Sci U S A 2003; 100:6580-3. [PMID: 12750476 PMCID: PMC164489 DOI: 10.1073/pnas.0731952100] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The 25-year-old debate about the origin of introns between proponents of "introns early" and "introns late" has yielded significant advances, yet important questions remain to be ascertained. One question concerns the density of introns in the last common ancestor of the three multicellular kingdoms. Approaches to this issue thus far have relied on counts of the numbers of identical intron positions across present-day taxa on the assumption that the introns at those sites are orthologous. However, dismissing parallel intron gain for those sites may be unwarranted, because various factors can potentially constrain the site of intron insertion. Demonstrating parallel intron gain is severely handicapped, because intron sequences often evolve exceedingly fast and intron phylogenetic distributions are usually ambiguous, such that alternative loss and gain scenarios cannot be clearly distinguished. We have identified an intron position that was gained independently in animals and plants in the xanthine dehydrogenase gene. The extremely disjointed phylogenetic distribution of the intron argues strongly for separate gain rather than recurrent loss. If the observed phylogenetic pattern had resulted from recurrent loss, all observational support previously gathered for the introns-late theory of intron origins based on the phylogenetic distribution of introns would be invalidated.
Collapse
Affiliation(s)
- Rosa Tarrio
- Department of Ecology and Evolutionary Biology, University of California, Irvine 92697-2525, USA
| | | | | |
Collapse
|