51
|
Su Z, Gu X. Revisit on the evolutionary relationship between alternative splicing and gene duplication. Gene 2012; 504:102-6. [PMID: 22621894 DOI: 10.1016/j.gene.2012.05.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Revised: 03/18/2012] [Accepted: 05/07/2012] [Indexed: 11/19/2022]
Abstract
Gene duplications and alternative splicing (AS) isoforms are two widespread types of genetic variations that can facilitate diversification of protein function. A number of studies claimed that after gene duplication, two AS isoforms with differential functions can be 'fixed', respectively, in each of the duplicate copies. This simple 'functional-sharing' hypothesis was recently challenged by Roux and Robinson-Rechavi (2011). Instead, they proposed a more sophisticated hypothesis, invoking that less alternative splicing genes tend to be duplicated more frequently, and single-copy genes are younger than duplicate genes, or the 'duplicability-age' hypothesis for short. In this letter, we show that all these genome-wide analyses of AS isoforms actually did not provide clear-cut evidence to nullify the basic idea of functional-sharing hypothesis. After updating our understanding of genome-wide alternative splicing, duplicability and CNV (copy number variation), we argue that the foundation of the duplicability-age hypothesis remains to be justified carefully. Finally, we suggest that a better approach to resolving this controversy is the correspondence analysis of indels (insertions and deletions) between duplicate genes to the genomic exon-intron structure, which can be used to experimentally test the effect of functional-sharing hypothesis.
Collapse
Affiliation(s)
- Zhixi Su
- MOE Key Laboratory of Contemporary Anthropology and Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | | |
Collapse
|
52
|
Davis MJ, Shin CJ, Jing N, Ragan MA. Rewiring the dynamic interactome. MOLECULAR BIOSYSTEMS 2012; 8:2054-66, 2013. [DOI: 10.1039/c2mb25050k] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
53
|
Evolution of microRNA genes in Oryza sativa and Arabidopsis thaliana: an update of the inverted duplication model. PLoS One 2011; 6:e28073. [PMID: 22194805 PMCID: PMC3237417 DOI: 10.1371/journal.pone.0028073] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2011] [Accepted: 10/31/2011] [Indexed: 12/12/2022] Open
Abstract
The origin and evolution of microRNA (miRNA) genes, which are of significance in tuning and buffering gene expressions in a number of critical cellular processes, have long attracted evolutionary biologists. However, genome-wide perspectives on their origins, potential mechanisms of their de novo generation and subsequent evolution remain largely unsolved in flowering plants. Here, genome-wide analyses of Oryza sativa and Arabidopsis thaliana revealed apparently divergent patterns of miRNA gene origins. A large proportion of miRNA genes in O. sativa were TE-related and MITE-related miRNAs in particular, whereas the fraction of these miRNA genes much decreased in A. thaliana. Our results show that the majority of TE-related and pseudogene-related miRNA genes have originated through inverted duplication instead of segmental or tandem duplication events. Based on the presented findings, we hypothesize and illustrate the four likely molecular mechanisms to de novo generate novel miRNA genes from TEs and pseudogenes. Our rice genome analysis demonstrates that non-MITEs and MITEs mediated inverted duplications have played different roles in de novo generating miRNA genes. It is confirmed that the previously proposed inverted duplication model may give explanations for non-MITEs mediated duplication events. However, many other miRNA genes, known from the earlier proposed model, were rather arisen from MITE transpositions into target genes to yield binding sites. We further investigated evolutionary processes spawned from de novo generated to maturely-formed miRNA genes and their regulatory systems. We found that miRNAs increase the tunability of some gene regulatory systems with low gene copy numbers. The results also suggest that gene balance effects may have largely contributed to the evolution of miRNA regulatory systems.
Collapse
|
54
|
Chen TW, Wu TH, Ng WV, Lin WC. Interrogation of alternative splicing events in duplicated genes during evolution. BMC Genomics 2011; 12 Suppl 3:S16. [PMID: 22369477 PMCID: PMC3333175 DOI: 10.1186/1471-2164-12-s3-s16] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Background Gene duplication provides resources for developing novel genes and new functions while retaining the original functions. In addition, alternative splicing could increase the complexity of expression at the transcriptome and proteome level without increasing the number of gene copy in the genome. Duplication and alternative splicing are thought to work together to provide the diverse functions or expression patterns for eukaryotes. Previously, it was believed that duplication and alternative splicing were negatively correlated and probably interchangeable. Results We look into the relationship between occurrence of alternative splicing and duplication at different time after duplication events. We found duplication and alternative splicing were indeed inversely correlated if only recently duplicated genes were considered, but they became positively correlated when we took those ancient duplications into account. Specifically, for slightly or moderately duplicated genes with gene families containing 2 - 7 paralogs, genes were more likely to evolve alternative splicing and had on average a greater number of alternative splicing isoforms after long-term evolution compared to singleton genes. On the other hand, those large gene families (contain at least 8 paralogs) had a lower proportion of alternative splicing, and fewer alternative splicing isoforms on average even when ancient duplicated genes were taken into consideration. We also found these duplicated genes having alternative splicing were under tighter evolutionary constraints compared to those having no alternative splicing, and had an enrichment of genes that participate in molecular transducer activities. Conclusions We studied the association between occurrences of alternative splicing and gene duplication. Our results implicate that there are key differences in functions and evolutionary constraints among singleton genes or duplicated genes with or without alternative splicing incidences. It implies that the gene duplication and alternative splicing may have different functional significance in the evolution of speciation diversity.
Collapse
Affiliation(s)
- Ting-Wen Chen
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
| | | | | | | |
Collapse
|
55
|
Abstract
In the canonical version of evolution by gene duplication, one copy is kept unaltered while the other is free to evolve. This process of evolutionary experimentation can persist for millions of years. Since it is so short lived in comparison to the lifetime of the core genes that make up the majority of most genomes, a substantial fraction of the genome and the transcriptome may—in principle—be attributable to what we will refer to as “evolutionary transients”, referring here to both the process and the genes that have gone or are undergoing this process. Using the rice gene set as a test case, we argue that this phenomenon goes a long way towards explaining why there are so many more rice genes than Arabidopsis genes, and why most excess rice genes show low similarity to eudicots.
Collapse
|
56
|
Díez D, Sánchez-Jiménez F, Ranea JAG. Evolutionary expansion of the Ras switch regulatory module in eukaryotes. Nucleic Acids Res 2011; 39:5526-37. [PMID: 21447561 PMCID: PMC3141262 DOI: 10.1093/nar/gkr154] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Ras proteins control many aspects of eukaryotic cell homeostasis by switching between active (GTP-bound) and inactive (GDP-bound) conformations, a reaction catalyzed by GTPase exchange factors (GEF) and GTPase activating proteins (GAP) regulators, respectively. Here, we show that the complexity, measured as number of genes, of the canonical Ras switch genetic system (including Ras, RasGEF, RasGAP and RapGAP families) from 24 eukaryotic organisms is correlated with their genome size and is inversely correlated to their evolutionary distances from humans. Moreover, different gene subfamilies within the Ras switch have contributed unevenly to the module’s expansion and speciation processes during eukaryote evolution. The Ras system remarkably reduced its genetic expansion after the split of the Euteleostomi clade and presently looks practically crystallized in mammals. Supporting evidence points to gene duplication as the predominant mechanism generating functional diversity in the Ras system, stressing the leading role of gene duplication in the Ras family expansion. Domain fusion and alternative splicing are significant sources of functional diversity in the GAP and GEF families but their contribution is limited in the Ras family. An evolutionary model of the Ras system expansion is proposed suggesting an inherent ‘decision making’ topology with the GEF input signal integrated by a homologous molecular mechanism and bifurcation in GAP signaling propagation.
Collapse
Affiliation(s)
- Diego Díez
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011 Japan.
| | | | | |
Collapse
|
57
|
Evolution of exon-intron structure and alternative splicing. PLoS One 2011; 6:e18055. [PMID: 21464961 PMCID: PMC3064661 DOI: 10.1371/journal.pone.0018055] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Accepted: 02/19/2011] [Indexed: 12/22/2022] Open
Abstract
Despite significant advances in high-throughput DNA sequencing, many important
species remain understudied at the genome level. In this study we addressed a
question of what can be predicted about the genome-wide characteristics of less
studied species, based on the genomic data from completely sequenced species.
Using NCBI databases we performed a comparative genome-wide analysis of such
characteristics as alternative splicing, number of genes, gene products and
exons in 36 completely sequenced model species. We created statistical
regression models to fit these data and applied them to loblolly pine
(Pinus taeda L.), an example of an important species whose
genome has not been completely sequenced yet. Using these models, the
genome-wide characteristics, such as total number of genes and exons, can be
roughly predicted based on parameters estimated from available limited genomic
data, e.g. exon length and exon/gene ratio.
Collapse
|
58
|
Zhang ZG, Ye ZQ, Yu L, Shi P. Phylogenomic reconstruction of lactic acid bacteria: an update. BMC Evol Biol 2011; 11:1. [PMID: 21194491 PMCID: PMC3024227 DOI: 10.1186/1471-2148-11-1] [Citation(s) in RCA: 109] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Accepted: 01/01/2011] [Indexed: 01/28/2023] Open
Abstract
Background Lactic acid bacteria (LAB) are important in the food industry for the production of fermented food products and in human health as commensals in the gut. However, the phylogenetic relationships among LAB species remain under intensive debate owing to disagreements among different data sets. Results We performed a phylogenetic analysis of LAB species based on 232 genes from 28 LAB genome sequences. Regardless of the tree-building methods used, combined analyses yielded an identical, well-resolved tree topology with strong supports for all nodes. The LAB species examined were divided into two groups. Group 1 included families Enterococcaceae and Streptococcaceae. Group 2 included families Lactobacillaceae and Leuconostocaceae. Within Group 2, the LAB species were divided into two clades. One clade comprised of the acidophilus complex of genus Lactobacillus and two other species, Lb. sakei and Lb. casei. In the acidophilus complex, Lb. delbrueckii separated first, while Lb. acidophilus/Lb. helveticus and Lb. gasseri/Lb. johnsonii were clustered into a sister group. The other clade within Group 2 consisted of the salivarius subgroup, including five species, Lb. salivarius, Lb. plantarum, Lb. brevis, Lb. reuteri, Lb. fermentum, and the genera Pediococcus, Oenococcus, and Leuconostoc. In this clade, Lb. salivarius was positioned most basally, followed by two clusters, one corresponding to Lb. plantarum/Lb. brevis pair and Pediococcus, and the other including Oenococcus/Leuconostoc pair and Lb. reuteri/Lb. fermentum pair. In addition, phylogenetic utility of the 232 genes was analyzed to identify those that may be more useful than others. The genes identified as useful were related to translation and ribosomal structure and biogenesis (TRSB), and a three-gene set comprising genes encoding ultra-violet resistance protein B (uvrB), DNA polymerase III (polC) and penicillin binding protein 2B (pbpB). Conclusions Our phylogenomic analyses provide important insights into the evolution and diversification of LAB species, and also revealed the phylogenetic utility of several genes. We infer that the occurrence of multiple, independent adaptation events in LAB species, have resulted in their occupation of various habitats. Further analyses of more genes from additional, representative LAB species are needed to reveal the molecular mechanisms underlying adaptation of LAB species to various environmental niches.
Collapse
Affiliation(s)
- Zhi-Gang Zhang
- State Key Laboratory of Genetic Resources and Evolution, Laboratory of Evolutionary and Functional Genomics, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, PR China
| | | | | | | |
Collapse
|
59
|
Zhan Z, Ren J, Zhang Y, Zhao R, Yang S, Wang W. Evolution of alternative splicing in newly evolved genes of Drosophila. Gene 2011; 470:1-6. [DOI: 10.1016/j.gene.2010.09.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2010] [Revised: 08/18/2010] [Accepted: 09/02/2010] [Indexed: 12/25/2022]
|
60
|
Roux J, Robinson-Rechavi M. Age-dependent gain of alternative splice forms and biased duplication explain the relation between splicing and duplication. Genome Res 2010; 21:357-63. [PMID: 21173032 DOI: 10.1101/gr.113803.110] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
We analyze here the relation between alternative splicing and gene duplication in light of recent genomic data, with a focus on the human genome. We show that the previously reported negative correlation between level of alternative splicing and family size no longer holds true. We clarify this pattern and show that it is sufficiently explained by two factors. First, genes progressively gain new splice variants with time. The gain is consistent with a selectively relaxed regime, until purifying selection slows it down as aging genes accumulate a large number of variants. Second, we show that duplication does not lead to a loss of splice forms, but rather that genes with low levels of alternative splicing tend to duplicate more frequently. This leads us to reconsider the role of alternative splicing in duplicate retention.
Collapse
Affiliation(s)
- Julien Roux
- University of Lausanne, Department of Ecology and Evolution, Quartier Sorge, 1015 Lausanne, Switzerland
| | | |
Collapse
|
61
|
Bagowski CP, Bruins W, te Velthuis AJ. The nature of protein domain evolution: shaping the interaction network. Curr Genomics 2010; 11:368-76. [PMID: 21286315 PMCID: PMC2945003 DOI: 10.2174/138920210791616725] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2010] [Revised: 06/04/2010] [Accepted: 06/13/2010] [Indexed: 11/30/2022] Open
Abstract
The proteomes that make up the collection of proteins in contemporary organisms evolved through recombination and duplication of a limited set of domains. These protein domains are essentially the main components of globular proteins and are the most principal level at which protein function and protein interactions can be understood. An important aspect of domain evolution is their atomic structure and biochemical function, which are both specified by the information in the amino acid sequence. Changes in this information may bring about new folds, functions and protein architectures. With the present and still increasing wealth of sequences and annotation data brought about by genomics, new evolutionary relationships are constantly being revealed, unknown structures modeled and phylogenies inferred. Such investigations not only help predict the function of newly discovered proteins, but also assist in mapping unforeseen pathways of evolution and reveal crucial, co-evolving inter- and intra-molecular interactions. In turn this will help us describe how protein domains shaped cellular interaction networks and the dynamics with which they are regulated in the cell. Additionally, these studies can be used for the design of new and optimized protein domains for therapy. In this review, we aim to describe the basic concepts of protein domain evolution and illustrate recent developments in molecular evolution that have provided valuable new insights in the field of comparative genomics and protein interaction networks.
Collapse
Affiliation(s)
- Christoph P Bagowski
- German University Cairo, Faculty of Pharmacy and Biotechnology, New Cairo City, Egypt
| | - Wouter Bruins
- Institute of Biology, Leiden University, 2333 AL Leiden, The Netherlands
| | - Aartjan J.W te Velthuis
- Department of Medical Microbiology, Molecular Virology Laboratory, Leiden University Medical Center, Albinusdreef 2, 2333 ZA Leiden, The Netherlands
- Department of Bionanoscience, Delft University of Technology, Lorentzweg 1, 2628 CJ, Delft, The Netherlands
| |
Collapse
|
62
|
Santos ME, Athanasiadis A, Leitão AB, DuPasquier L, Sucena E. Alternative splicing and gene duplication in the evolution of the FoxP gene subfamily. Mol Biol Evol 2010; 28:237-47. [PMID: 20651048 PMCID: PMC3002244 DOI: 10.1093/molbev/msq182] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The FoxP gene subfamily of transcription factors is defined by its characteristic 110 amino acid long DNA-binding forkhead domain and plays essential roles in vertebrate biology. Its four members, FoxP1–P4, have been extensively characterized functionally. FoxP1, FoxP2, and FoxP4 are involved in lung, heart, gut, and central nervous system (CNS) development. FoxP3 is necessary and sufficient for the specification of regulatory T cells (Tregs) of the adaptive immune system. In Drosophila melanogaster, in silico predictions identify one unique FoxP subfamily gene member (CG16899) with no described function. We characterized this gene and established that it generates by alternative splicing two isoforms that differ in the forkhead DNA-binding domain. In D. melanogaster, both isoforms are expressed in the embryonic CNS, but in hemocytes, only isoform A is expressed, hinting to a putative modulation through alternative splicing of FoxP1 function in immunity and/or other hemocyte-dependent processes. Furthermore, we show that in vertebrates, this novel alternative splicing pattern is conserved for FoxP1. In mice, this new FoxP1 isoform is expressed in brain, liver, heart, testes, thymus, and macrophages (equivalent in function to hemocytes). This alternative splicing pattern has arisen at the base of the Bilateria, probably through exon tandem duplication. Moreover, our phylogenetic analysis suggests that in vertebrates, FoxP1 is more related to the FoxP gene ancestral form and the other three paralogues, originated through serial duplications, which only retained one of the alternative exons. Also, the newly described isoform differs from the other in amino acids critical for DNA-binding specificity. The integrity of its fold is maintained, but the molecule has lost the direct hydrogen bonding to DNA bases leading to a putatively lower specificity and possibly affinity toward DNA. With the present comparative study, through the integration of experimental and in silico studies of the FoxP gene subfamily across the animal kingdom, we establish a new model for the FoxP gene in invertebrates and for the vertebrate FoxP1 paralogue. Furthermore, we present a scenario for the structural evolution of this gene class and reveal new previously unsuspected levels of regulation for FoxP1 in the vertebrate system.
Collapse
Affiliation(s)
- M Emília Santos
- Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Lisboa, Portugal
| | | | | | | | | |
Collapse
|
63
|
Ozernyuk ND. Correlation of ontogenetic and evolutionary processes in view of achievements of modern genetics: Role of gene duplication. BIOL BULL+ 2010. [DOI: 10.1134/s1062359010020020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
64
|
Alternative splicing of the histone demethylase LSD1/KDM1 contributes to the modulation of neurite morphogenesis in the mammalian nervous system. J Neurosci 2010; 30:2521-32. [PMID: 20164337 DOI: 10.1523/jneurosci.5500-09.2010] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
A variety of chromatin remodeling complexes are thought to orchestrate transcriptional programs that lead neuronal precursors from earliest commitment to terminal differentiation. Here we show that mammalian neurons have a specialized chromatin remodeling enzyme arising from a neurospecific splice variant of LSD1/KDM1, histone lysine specific demethylase 1, whose demethylase activity on Lys4 of histone H3 has been related to gene repression. We found that alternative splicing of LSD1 transcript generates four full-length isoforms from combinatorial retention of two identified exons: the 4 aa exon E8a is internal to the amine oxidase domain, and its inclusion is restricted to the nervous system. Remarkably, the expression of LSD1 splice variants is dynamically regulated throughout cortical development, particularly during perinatal stages, with a progressive increase of LSD1 neurospecific isoforms over the ubiquitous ones. Notably, the same LSD1 splice dynamics can be fairly recapitulated in cultured cortical neurons. Functionally, LSD1 isoforms display in vitro a comparable demethylase activity, yet the inclusion of the sole exon E8a reduces LSD1 repressor activity on a reporter gene. Additional distinction among isoforms is supported by the knockdown of neurospecific variants in cortical neurons resulting in the inhibition of neurite maturation, whereas overexpression of the same variants enhances it. Instead, perturbation of LSD1 isoforms that are devoid of the neurospecific exon elicits no morphogenic effect. Collectively, results demonstrate that the arousal of neuronal LSD1 isoforms pacemakes early neurite morphogenesis, conferring a neurospecific function to LSD1 epigenetic activity.
Collapse
|
65
|
Zhang PG, Huang SZ, Pin AL, Adams KL. Extensive divergence in alternative splicing patterns after gene and genome duplication during the evolutionary history of Arabidopsis. Mol Biol Evol 2010; 27:1686-97. [PMID: 20185454 DOI: 10.1093/molbev/msq054] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Gene duplication at various scales, from single gene duplication to whole-genome (WG) duplication, has occurred throughout eukaryotic evolution and contributed greatly to the large number of duplicated genes in the genomes of many eukaryotes. Previous studies have shown divergence in expression patterns of many duplicated genes at various evolutionary time scales and cases of gain of a new function or expression pattern by one duplicate or partitioning of functions or expression patterns between duplicates. Alternative splicing (AS) is a fundamental aspect of the expression of many genes that can increase gene product diversity and affect gene regulation. However, the evolution of AS patterns of genes duplicated by polyploidy, as well as in a sizable number of duplicated gene pairs in plants, has not been examined. Here, we have characterized conservation and divergence in AS patterns in genes duplicated by a polyploidy event during the evolutionary history of Arabidopsis thaliana. We used reverse transcription-polymerase chain reaction to assay 104 WG duplicates in six organ types and in plants grown under three abiotic stress treatments to detect organ- and stress-specific patterns of AS. Differences in splicing patterns in one or more organs, or under stress conditions, were found between the genes in a large majority of the duplicated pairs. In a few cases, AS patterns were the same between duplicates only under one or more abiotic stress treatments and not under normal growing conditions or vice versa. We also examined AS in 42 tandem duplicates and we found patterns of AS roughly comparable with the genes duplicated by polyploidy. The alternatively spliced forms in some of the genes created premature stop codons that would result in missing or partial functional domains if the transcripts are translated, which could affect gene function and cause functional divergence between duplicates. Our results indicate that AS patterns have diverged considerably after gene and genome duplication during the evolutionary history of the Arabidopsis lineage, sometimes in an organ- or stress-specific manner. AS divergence between duplicated genes may have contributed to gene functional evolution and led to preservation of some duplicated genes.
Collapse
Affiliation(s)
- Peter G Zhang
- UBC Botanical Garden and Centre for Plant Research, Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | | | |
Collapse
|
66
|
Irimia M, Maeso I, Gunning PW, Garcia-Fernàndez J, Roy SW. Internal and external paralogy in the evolution of tropomyosin genes in metazoans. Mol Biol Evol 2010; 27:1504-17. [PMID: 20147436 DOI: 10.1093/molbev/msq018] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Nature contains a tremendous diversity of forms both at the organismal and genomic levels. This diversity motivates the twin central questions of molecular evolution: what are the molecular mechanisms of adaptation, and what are the functional consequences of genomic diversity. We report a 22-species comparative analysis of tropomyosin (PPM) genes, which exist in a variety of forms and are implicated in the emergence of a wealth of cellular functions, including the novel muscle functions integral to the functional diversification of bilateral animals. TPM genes encode either or both of long-form [284 amino acid (aa)] and short-form (approximately 248 aa) proteins. Consistent with a role of TPM diversification in the origins and radiation of bilaterians, we find evidence that the muscle-specific long-form protein arose in proximal bilaterian ancestors (the bilaterian 'stem'). Duplication of the 5' end of the gene led to alternative promoters encoding long- and short-form transcripts with distinct functions. This dual-function gene then underwent strikingly parallel evolution in different bilaterian lineages. In each case, recurrent tandem exon duplication and mutually exclusive alternative splicing of the duplicates, with further association between these alternatively spliced exons along the gene, led to long- and short-form-specific exons, allowing for gradual emergence of alternative "internal paralogs" within the same gene. We term these Mutually exclusively Alternatively spliced Tandemly duplicated Exon sets "MATEs". This emergence of internal paralogs in various bilaterians has employed every single TPM exon in at least one lineage and reaches striking levels of divergence with up to 77% of long- and short-form transcripts being transcribed from different genomic regions. Interestingly, in some lineages, these internal alternatively spliced paralogs have subsequently been "externalized" by full gene duplication and reciprocal retention/loss of the two transcript isoforms, a particularly clear case of evolution by subfunctionalization. This parallel evolution of TPM genes in diverse metazoans attests to common selective forces driving divergence of different gene transcripts and represents a striking case of emergence of evolutionary novelty by alternative splicing.
Collapse
Affiliation(s)
- Manuel Irimia
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
| | | | | | | | | |
Collapse
|
67
|
Zhang Z, Zhou L, Wang P, Liu Y, Chen X, Hu L, Kong X. Divergence of exonic splicing elements after gene duplication and the impact on gene structures. Genome Biol 2009; 10:R120. [PMID: 19883501 PMCID: PMC3091315 DOI: 10.1186/gb-2009-10-11-r120] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2009] [Revised: 09/28/2009] [Accepted: 11/02/2009] [Indexed: 12/18/2022] Open
Abstract
An analysis of human exonic splicing elements in duplicated genes reveals their important role in the generation of new gene structures. Background The origin of new genes and their contribution to functional novelty has been the subject of considerable interest. There has been much progress in understanding the mechanisms by which new genes originate. Here we examine a novel way that new gene structures could originate, namely through the evolution of new alternative splicing isoforms after gene duplication. Results We studied the divergence of exonic splicing enhancers and silencers after gene duplication and the contributions of such divergence to the generation of new splicing isoforms. We found that exonic splicing enhancers and exonic splicing silencers diverge especially fast shortly after gene duplication. About 10% and 5% of paralogous exons undergo significantly asymmetric evolution of exonic splicing enhancers and silencers, respectively. When compared to pre-duplication ancestors, we found that there is a significant overall loss of exonic splicing enhancers and the magnitude increases with duplication age. Detailed examination reveals net gains and losses of exonic splicing enhancers and silencers in different copies and paralog clusters after gene duplication. Furthermore, we found that exonic splicing enhancer and silencer changes are mainly caused by synonymous mutations, though nonsynonymous changes also contribute. Finally, we found that exonic splicing enhancer and silencer divergence results in exon splicing state transitions (from constitutive to alternative or vice versa), and that the proportion of paralogous exon pairs with different splicing states also increases over time, consistent with previous predictions. Conclusions Our results suggest that exonic splicing enhancer and silencer changes after gene duplication have important roles in alternative splicing divergence and that these changes contribute to the generation of new gene structures.
Collapse
Affiliation(s)
- Zhenguo Zhang
- The Key Laboratory of Stem Cell Biology, Institute of Health Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS) and Shanghai Jiao Tong University School of Medicine (SJTUSM), 225 South Chong Qing Road, Shanghai 200025, PR China.
| | | | | | | | | | | | | |
Collapse
|
68
|
Kandul NP, Noor MAF. Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3. BMC Genet 2009; 10:67. [PMID: 19840385 PMCID: PMC2767349 DOI: 10.1186/1471-2156-10-67] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Accepted: 10/19/2009] [Indexed: 01/12/2023] Open
Abstract
Background Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution.
Collapse
Affiliation(s)
- Nikolai P Kandul
- Biology Department, Duke University, PO Box 90338, FFSC 4244, Durham, NC 27708, USA.
| | | |
Collapse
|
69
|
Nurtdinov RN, Mironov AA, Gelfand MS. Rodent-specific alternative exons are more frequent in rapidly evolving genes and in paralogs. BMC Evol Biol 2009; 9:142. [PMID: 19558667 PMCID: PMC2711938 DOI: 10.1186/1471-2148-9-142] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2008] [Accepted: 06/26/2009] [Indexed: 01/01/2023] Open
Abstract
Background Alternative splicing is an important mechanism for generating functional and evolutionary diversity of proteins in eukaryotes. Here, we studied the frequency and functionality of recently gained, rodent-specific alternative exons. Results We projected the data about alternative splicing of mouse genes to the rat, human, and dog genomes, and identified exons conserved in the rat genome, but missing in more distant genomes. We estimated the frequency of rodent-specific exons while controlling for possible residual conservation of spurious exons. The frequency of rodent-specific exons is higher among predominantly skipped exons and exons disrupting the reading frame. Separation of all genes by the rate of sequence evolution and by gene families has demonstrated that rodent-specific cassette exons are more frequent in rapidly evolving genes and in rodent-specific paralogs. Conclusion Thus we demonstrated that recently gained exons tend to occur in fast-evolving genes, and their inclusion rate tends to be lower than that of older exons. This agrees with the theory that gain of alternative exons is one of the major mechanisms of gene evolution.
Collapse
Affiliation(s)
- Ramil N Nurtdinov
- Departament of Bioengineering and Bioinformatics, M V Lomonosov Moscow State University, Moscow, Russia.
| | | | | |
Collapse
|
70
|
Comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome. BMC Genomics 2009; 10:154. [PMID: 19358722 PMCID: PMC2674458 DOI: 10.1186/1471-2164-10-154] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 04/09/2009] [Indexed: 11/23/2022] Open
Abstract
Background Alternative splicing (AS) is a widespread phenomenon in higher eukaryotes but the extent to which it leads to functional protein isoforms and to proteome expansion at large is still a matter of debate. In contrast to animal species, for which AS has been studied extensively at the protein and functional level, protein-centered studies of AS in plant species are scarce. Here we investigate the functional impact of AS in dicot and monocot plant species using a comparative approach. Results Detailed comparison of AS events in alternative spliced orthologs from the dicot Arabidopsis thaliana and the monocot Oryza sativa (rice) revealed that the vast majority of AS events in both species do not result from functional conservation. Transcript isoforms that are putative targets for the nonsense-mediated decay (NMD) pathway are as likely to contain conserved AS events as isoforms that are translated into proteins. Similar results were obtained when the same comparison was performed between the two more closely related monocot species rice and Zea mays (maize). Genome-wide computational analysis of functional protein domains encoded in alternatively and constitutively spliced genes revealed that only the RNA recognition motif (RRM) is overrepresented in alternatively spliced genes in all species analyzed. In contrast, three domain types were overrepresented in constitutively spliced genes. AS events were found to be less frequent within than outside predicted protein domains and no domain type was found to be enriched with AS introns. Analysis of AS events that result in the removal of complete protein domains revealed that only a small number of domain types is spliced-out in all species analyzed. Finally, in a substantial fraction of cases where a domain is completely removed, this domain appeared to be a unit of a tandem repeat. Conclusion The results from the ortholog comparisons suggest that the ability of a gene to produce more than one functional protein through AS does not persist during evolution. Cross-species comparison of the results of the protein-domain oriented analyses indicates little correspondence between the analyzed species. Based on the premise that functional genetic features are most likely to be conserved during evolution, we conclude that AS has only a limited role in functional expansion of the proteome in plants.
Collapse
|
71
|
Bertossa RC, van de Zande L, Beukeboom LW. The Fruitless gene in Nasonia displays complex sex-specific splicing and contains new zinc finger domains. Mol Biol Evol 2009; 26:1557-69. [PMID: 19349644 DOI: 10.1093/molbev/msp067] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The transcription factor Fruitless exerts a broad range of functions during Drosophila development, the most apparent of which is the determination of sexual behavior in males. Although fruitless sequences are found in other insect orders, little is known about fruitless structure and function outside Diptera. We have performed a thorough analysis of fruitless transcripts in the haplo-diploid wasp Nasonia vitripennis and found both sex-specific and non-sex-specific transcripts similar to those found in Drosophila. In Nasonia, however, a novel, large fruitless transcript is present in females only. Putative binding sites for sex-specific splicing factors found in Nasonia fruitless and doublesex as well as Apis mellifera doublesex transcripts were sufficient to identify a corresponding female-specific fruitless exon in A. mellifera, suggesting that similar factors in both hymenopteran species could be responsible for sex-specific splicing of both genes. Furthermore, new C(2)H(2) zinc finger domains found in Nasonia fruitless transcripts were also identified in the fruitless locus of major holometabolous insect species but not in drosophilids. Conservation of important domains and sex-specific splicing in Diptera and Hymenoptera support the hypothesis that fruitless is an ancient gene and has conserved functions in insects. Considerable divergences in other parts of the gene are expected to underlie species-specific differences and may help to explain diversity observed in insect sexual behaviors.
Collapse
Affiliation(s)
- Rinaldo C Bertossa
- Evolutionary Genetics, Centre for Ecological and Evolutionary Studies, The Netherlands.
| | | | | |
Collapse
|
72
|
Peng T, Li Y. Tandem exon duplication tends to propagate rather than to create de novo alternative splicing. Biochem Biophys Res Commun 2009; 383:163-6. [PMID: 19351527 DOI: 10.1016/j.bbrc.2009.03.162] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2009] [Accepted: 03/22/2009] [Indexed: 11/15/2022]
Abstract
Tandem gene duplication is one of the most prevalent ways of generating genes with new function. Similarly, tandem exon duplication is an important source of new exons. Tandem exon duplication is often associated with alternative splicing to reduce the possible deleterious impacts on transcript/protein structure. However, how alternative splicing is established on two new exons from duplication remains controversial. By analyzing the duplication of human-mouse conserved exons, we illustrated that newly duplicated exons tend to preserve the splicing status of their parent exon. That is, the exons duplicated from an alternative exon are usually alternatively spliced, while those from constitutive parents are more likely to be constitutively spliced. Newly generated, constitutively spliced exons showed a higher percentage of frame preservation and protein domain preference, indicating some evolutionary scenarios other than alternative splicing operates for the relief of negative selection pressure. These results suggest that alternative splicing is usually established before the tandem duplication. The duplication therefore propagates, rather than creates de novo, alternative splicing.
Collapse
Affiliation(s)
- Tao Peng
- TNLIST/Department of Automation, Bioinformatics Division, MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing, China
| | | |
Collapse
|
73
|
Rohmann KN, Deitcher DL, Bass AH. Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes. Mol Biol Evol 2009; 26:1509-21. [PMID: 19321796 DOI: 10.1093/molbev/msp060] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.
Collapse
Affiliation(s)
- Kevin N Rohmann
- Department of Neurobiology and Behavior, Cornell University.
| | | | | |
Collapse
|
74
|
Abstract
It has been known for more than 35 years that, during evolution, new proteins are formed by gene duplications, sequence and structural divergence and, in many cases, gene combinations. The genome projects have produced complete, or almost complete, descriptions of the protein repertoires of over 600 distinct organisms. Analyses of these data have dramatically increased our understanding of the formation of new proteins. At the present time, we can accurately trace the evolutionary relationships of about half the proteins found in most genomes, and it is these proteins that we discuss in the present review. Usually, the units of evolution are protein domains that are duplicated, diverge and form combinations. Small proteins contain one domain, and large proteins contain combinations of two or more domains. Domains descended from a common ancestor are clustered into superfamilies. In most genomes, the net growth of superfamily members means that more than 90% of domains are duplicates. In a section on domain duplications, we discuss the number of currently known superfamilies, their size and distribution, and superfamily expansions related to biological complexity and to specific lineages. In a section on divergence, we describe how sequences and structures diverge, the changes in stability produced by acceptable mutations, and the nature of functional divergence and selection. In a section on domain combinations, we discuss their general nature, the sequential order of domains, how combinations modify function, and the extraordinary variety of the domain combinations found in different genomes. We conclude with a brief note on other forms of protein evolution and speculations of the origins of the duplication, divergence and combination processes.
Collapse
|
75
|
Unique signatures of natural background radiation on human Y chromosomes from Kerala, India. PLoS One 2009; 4:e4541. [PMID: 19242544 PMCID: PMC2644265 DOI: 10.1371/journal.pone.0004541] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2008] [Accepted: 12/09/2008] [Indexed: 11/25/2022] Open
Abstract
Background The most frequently observed major consequences of ionizing radiation are chromosomal lesions and cancers, although the entire genome may be affected. Owing to its haploid status and absence of recombination, the human Y chromosome is an ideal candidate to be assessed for possible genetic alterations induced by ionizing radiation. We studied the human Y chromosome in 390 males from the South Indian state of Kerala, where the level of natural background radiation (NBR) is ten-fold higher than the worldwide average, and that from 790 unexposed males as control. Results We observed random microdeletions in the Azoospermia factor (AZF) a, b and c regions in >90%, and tandem duplication and copy number polymorphism (CNP) of 11 different Y-linked genes in about 80% of males exposed to NBR. The autosomal homologues of Y-linked CDY genes largely remained unaffected. Multiple polymorphic copies of the Y-linked genes showing single Y-specific signals suggested their tandem duplication. Some exposed males showed unilocus duplication of DAZ genes resulting in six copies. Notably, in the AZFa region, approximately 25% of exposed males showed deletion of the DBY gene, whereas flanking genes USP9Y and UTY remained unaffected. All these alterations were detected in blood samples but not in the germline (sperm) samples. Conclusions Exposure to high levels of NBR correlated with several interstitial polymorphisms of the human Y chromosome. CNPs and enhanced transcription of the SRY gene after duplication are envisaged to compensate for the loss of Y chromosome in some cells. The aforesaid changes, confined to peripheral blood lymphocytes, suggest a possible innate mechanism protecting the germline DNA from the NBR. Genome analysis of a larger population focusing on greater numbers of genes may provide new insights into the mechanisms and risks of the resultant genetic damages. The present work demonstrates unique signatures of NBR on human Y chromosomes from Kerala, India.
Collapse
|
76
|
Irimia M, Rukov JL, Roy SW, Vinther J, Garcia-Fernandez J. Quantitative regulation of alternative splicing in evolution and development. Bioessays 2009; 31:40-50. [DOI: 10.1002/bies.080092] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
77
|
Molecular evolution and diversification of plant cysteine proteinase inhibitors: New insights after the poplar genome. Mol Phylogenet Evol 2008; 49:349-55. [DOI: 10.1016/j.ympev.2008.04.025] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2008] [Accepted: 04/12/2008] [Indexed: 12/24/2022]
|
78
|
Jin L, Kryukov K, Clemente JC, Komiyama T, Suzuki Y, Imanishi T, Ikeo K, Gojobori T. The evolutionary relationship between gene duplication and alternative splicing. Gene 2008; 427:19-31. [PMID: 18835337 DOI: 10.1016/j.gene.2008.09.002] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2008] [Revised: 09/03/2008] [Accepted: 09/03/2008] [Indexed: 10/21/2022]
Abstract
Gene duplication and alternative splicing (AS) are the two major evolutionary mechanisms that can bring the functional variation by increasing gene diversification. The purpose of this research is to understand the evolutionary relationship between these two different mechanisms, utilizing available data resources. We found the proportion of AS loci and the average number of AS isoforms per locus to be larger in duplicated genes compared to those in singleton genes. However we also found that small gene families have larger proportion of AS loci and larger average number of AS isoforms per locus than large gene families. These results suggest that gene duplication allows for more alternative splicing events to occur on newly duplicated copies than on singletons, probably due to the reduced functional constraint on the duplicates. Smaller average number of AS isoforms in the larger gene families can be explained by the decreased possibility for new useful function to be created via a new alternative splicing event.
Collapse
Affiliation(s)
- Lihua Jin
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima, Shizuoka, 411-8540, Japan
| | | | | | | | | | | | | | | |
Collapse
|
79
|
Abstract
The evolutionary process leading to the fixation of newly duplicated genes is not well understood. It was recently proposed that the fixation of duplicate genes is frequently driven by positive selection for increased gene dosage (i.e., the gene dosage hypothesis), because haploinsufficient genes were reported to have more paralogs than haplosufficient genes in the human genome. However, the previous analysis incorrectly assumed that the presence of dominant abnormal alleles of a human gene means that the gene is haploinsufficient, ignoring the fact that many dominant abnormal alleles arise from gain-of-function mutations. Here we show in both humans and yeast that haploinsufficient genes generally do not duplicate more frequently than haplosufficient genes. Yeast haploinsufficient genes do exhibit enhanced retention after whole-genome duplication compared to haplosufficient genes if they encode members of stable protein complexes, but the same phenomenon is absent if the genes do not encode protein complex members, suggesting that the dosage balance effect rather than the dosage effect is the underlying cause of the phenomenon. On the basis of these and other results, we conclude that selection for higher gene dosage does not play a major role in driving the fixation of duplication genes.
Collapse
|
80
|
Baek JM, Han P, Iandolino A, Cook DR. Characterization and comparison of intron structure and alternative splicing between Medicago truncatula, Populus trichocarpa, Arabidopsis and rice. PLANT MOLECULAR BIOLOGY 2008; 67:499-510. [PMID: 18438730 DOI: 10.1007/s11103-008-9334-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2007] [Accepted: 04/01/2008] [Indexed: 05/26/2023]
Abstract
Alignment of transcripts and genome sequences yielded a set of alternatively spliced transcripts in four angiosperm genomes: three dicotyledon species Medicago truncatula (Medicago), Populus trichocarpa (poplar) and Arabidopsis thaliana (Arabidopsis), and the monocotyledon Oryzae sativa (rice). Intron retention was the predominant mode of alternative splicing (AS) in each species, consistent with previous reports for Arabidopsis and rice. We analyzed the structure of 5'-splice junctions and observed commonalities between species. There was dependency of base composition between sites flanking the 5'-splice junction, with the potential to create a subset of splice sites that interact more weakly or strongly than average with U1 snRNA. Such altered nucleotide composition was correlated with splicing fidelity in all four species. For Medicago, poplar and Arabidopsis, but not in rice, alternative splicing was most prevalent for introns with decreased UA content, consistent with lower UA content for monocot introns and potentially reflecting evolved differences in splicing mechanisms. Similarly, the occurrence of AS between transcript Gene Ontology categories was positively correlated between Arabidopsis and Medicago, with no correlation between dicots and rice. Analysis of within-species paralogs and between-species reciprocal best-hit homologs yielded rare cases of potentially conserved AS events. Reverse transcriptase PCR and amplicon sequencing were used to confirm a subset of the in silico-predicted AS events within Medicago, as well as to characterize conserved AS events between Medicago and Arabidopsis.
Collapse
Affiliation(s)
- Jong-Min Baek
- College of Agricultural and Environmental Sciences Genomics Facility, University of California, 117 Robbins hall, Davis, CA 95616, USA.
| | | | | | | |
Collapse
|
81
|
Korbel JO, Kim PM, Chen X, Urban AE, Weissman S, Snyder M, Gerstein MB. The current excitement about copy-number variation: how it relates to gene duplications and protein families. Curr Opin Struct Biol 2008; 18:366-74. [PMID: 18511261 DOI: 10.1016/j.sbi.2008.02.005] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2008] [Accepted: 02/13/2008] [Indexed: 01/28/2023]
Abstract
Following recent technological advances there has been an increasing interest in genome structural variants (SVs), in particular copy-number variants (CNVs)--large-scale duplications and deletions. Although not immediately evident, CNV surveys make a conceptual connection between the fields of population genetics and protein families, in particular with regard to the stability and expandability of families. The mechanisms giving rise to CNVs can be considered as fundamental processes underlying gene duplication and loss; duplicated genes being the results of 'successful' copies, fixed and maintained in the population. Conversely, many 'unsuccessful' duplicates remain in the genome as pseudogenes. Here, we survey studies on CNVs, highlighting issues related to protein families. In particular, CNVs tend to affect specific gene functional categories, such as those associated with environmental response, and are depleted in genes related to basic cellular processes. Furthermore, CNVs occur more often at the periphery of the protein interaction network. In comparison, protein families associated with successful and unsuccessful duplicates are associated with similar functional categories but are differentially placed in the interaction network. These trends are likely reflective of CNV formation biases and natural selection, both of which differentially influence distinct protein families.
Collapse
Affiliation(s)
- Jan O Korbel
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA
| | | | | | | | | | | | | |
Collapse
|
82
|
Shakhnovich BE, Shakhnovich EI. Improvisation in evolution of genes and genomes: whose structure is it anyway? Curr Opin Struct Biol 2008; 18:375-81. [PMID: 18487041 DOI: 10.1016/j.sbi.2008.02.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2008] [Accepted: 02/13/2008] [Indexed: 01/31/2023]
Abstract
Significant progress has been made in recent years in a variety of seemingly unrelated fields such as sequencing, protein structure prediction, and high-throughput transcriptomics and metabolomics. At the same time, new microscopic models have been developed that made it possible to analyze the evolution of genes and genomes from first principles. The results from these efforts enable, for the first time, a comprehensive insight into the evolution of complex systems and organisms on all scales--from sequences to organisms and populations. Every newly sequenced genome uncovers new genes, families, and folds. Where do these new genes come from? How do gene duplication and subsequent divergence of sequence and structure affect the fitness of the organism? What role does regulation play in the evolution of proteins and folds? Emerging synergism between data and modeling provides first robust answers to these questions.
Collapse
Affiliation(s)
- Boris E Shakhnovich
- Department of Molecular and Cellular Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, United States
| | | |
Collapse
|
83
|
The Evolution of Alternative Splicing in the Pax Family: The View from the Basal Chordate Amphioxus. J Mol Evol 2008; 66:605-20. [DOI: 10.1007/s00239-008-9113-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2008] [Revised: 04/16/2008] [Accepted: 04/22/2008] [Indexed: 10/22/2022]
|
84
|
Selection on Length Mutations After Frameshift Can Explain the Origin and Retention of the AP3/DEF-Like Paralogues in Impatiens. J Mol Evol 2008; 66:424-35. [DOI: 10.1007/s00239-008-9085-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2007] [Revised: 01/19/2008] [Accepted: 02/08/2008] [Indexed: 10/22/2022]
|
85
|
Castrignanò T, D’Antonio M, Anselmo A, Carrabino D, D’Onorio De Meo A, D’Erchia AM, Licciulli F, Mangiulli M, Mignone F, Pavesi G, Picardi E, Riva A, Rizzi R, Bonizzoni P, Pesole G. ASPicDB: A database resource for alternative splicing analysis. Bioinformatics 2008; 24:1300-4. [DOI: 10.1093/bioinformatics/btn113] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
|
86
|
Abstract
Alternative splicing is a well-characterized mechanism by which multiple transcripts are generated from a single mRNA precursor. By allowing production of several protein isoforms from one pre-mRNA, alternative splicing contributes to proteomic diversity. But what do we know about the origin of this mechanism? Do the same evolutionary forces apply to alternatively and constitutively splice exons? Do similar forces act on all types of alternative splicing? Are the products generated by alternative splicing functional? Why is "improper" recognition of exons and introns allowed by the splicing machinery? In this review, we summarize the current knowledge regarding these issues from an evolutionary perspective.
Collapse
Affiliation(s)
- Eddo Kim
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv, Israel
| | | | | |
Collapse
|
87
|
Lin H, Ouyang S, Simons R, Nobuta K, Haas BJ, Zhu W, Gu X, Silva JC, Meyers BC, Buell CR. Characterization of paralogous protein families in rice. BMC PLANT BIOLOGY 2008; 8:18. [PMID: 18284697 PMCID: PMC2275729 DOI: 10.1186/1471-2229-8-18] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2007] [Accepted: 02/19/2008] [Indexed: 05/19/2023]
Abstract
BACKGROUND High gene numbers in plant genomes reflect polyploidy and major gene duplication events. Oryza sativa, cultivated rice, is a diploid monocotyledonous species with a ~390 Mb genome that has undergone segmental duplication of a substantial portion of its genome. This, coupled with other genetic events such as tandem duplications, has resulted in a substantial number of its genes, and resulting proteins, occurring in paralogous families. RESULTS Using a computational pipeline that utilizes Pfam and novel protein domains, we characterized paralogous families in rice and compared these with paralogous families in the model dicotyledonous diploid species, Arabidopsis thaliana. Arabidopsis, which has undergone genome duplication as well, has a substantially smaller genome (~120 Mb) and gene complement compared to rice. Overall, 53% and 68% of the non-transposable element-related rice and Arabidopsis proteins could be classified into paralogous protein families, respectively. Singleton and paralogous family genes differed substantially in their likelihood of encoding a protein of known or putative function; 26% and 66% of singleton genes compared to 73% and 96% of the paralogous family genes encode a known or putative protein in rice and Arabidopsis, respectively. Furthermore, a major skew in the distribution of specific gene function was observed; a total of 17 Gene Ontology categories in both rice and Arabidopsis were statistically significant in their differential distribution between paralogous family and singleton proteins. In contrast to mammalian organisms, we found that duplicated genes in rice and Arabidopsis tend to have more alternative splice forms. Using data from Massively Parallel Signature Sequencing, we show that a significant portion of the duplicated genes in rice show divergent expression although a correlation between sequence divergence and correlation of expression could be seen in very young genes. CONCLUSION Collectively, these data suggest that while co-regulation and conserved function are present in some paralogous protein family members, evolutionary pressures have resulted in functional divergence with differential expression patterns.
Collapse
Affiliation(s)
- Haining Lin
- The Institute for Genomic Research, 9712 Medical Center Dr. , Rockville, MD 20850, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
88
|
Pahari S, Mackessy SP, Kini RM. The venom gland transcriptome of the Desert Massasauga rattlesnake (Sistrurus catenatus edwardsii): towards an understanding of venom composition among advanced snakes (Superfamily Colubroidea). BMC Mol Biol 2007; 8:115. [PMID: 18096037 PMCID: PMC2242803 DOI: 10.1186/1471-2199-8-115] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2007] [Accepted: 12/20/2007] [Indexed: 11/14/2022] Open
Abstract
Background Snake venoms are complex mixtures of pharmacologically active proteins and peptides which belong to a small number of superfamilies. Global cataloguing of the venom transcriptome facilitates the identification of new families of toxins as well as helps in understanding the evolution of venom proteomes. Results We have constructed a cDNA library of the venom gland of a threatened rattlesnake (a pitviper), Sistrurus catenatus edwardsii (Desert Massasauga), and sequenced 576 ESTs. Our results demonstrate a high abundance of serine proteinase and metalloproteinase transcripts, indicating that the disruption of hemostasis is a principle mechanism of action of the venom. In addition to the transcripts encoding common venom proteins, we detected two varieties of low abundance unique transcripts in the library; these encode for three-finger toxins and a novel toxin possibly generated from the fusion of two genes. We also observed polyadenylated ribosomal RNAs in the venom gland library, an interesting preliminary obsevation of this unusual phenomenon in a reptilian system. Conclusion The three-finger toxins are characteristic of most elapid venoms but are rare in viperid venoms. We detected several ESTs encoding this group of toxins in this study. We also observed the presence of a transcript encoding a fused protein of two well-characterized toxins (Kunitz/BPTI and Waprins), and this is the first report of this kind of fusion in a snake toxin transcriptome. We propose that these new venom proteins may have ancillary functions for envenomation. The presence of a fused toxin indicates that in addition to gene duplication and accelerated evolution, exon shuffling or transcriptional splicing may also contribute to generating the diversity of toxins and toxin isoforms observed among snake venoms. The detection of low abundance toxins, as observed in this and other studies, indicates a greater compositional similarity of venoms (though potency will differ) among advanced snakes than has been previously recognized.
Collapse
Affiliation(s)
- Susanta Pahari
- Center for Post Graduate Studies, Sri Bhagawan Mahaveer Jain College, 18/3, 9th Main, Jayanagar 3rd Block, Bangalore, India.
| | | | | |
Collapse
|
89
|
Amit M, Sela N, Keren H, Melamed Z, Muler I, Shomron N, Izraeli S, Ast G. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene. BMC Mol Biol 2007; 8:109. [PMID: 18047649 PMCID: PMC2231382 DOI: 10.1186/1471-2199-8-109] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2007] [Accepted: 11/29/2007] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. RESULTS Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. CONCLUSION The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Collapse
Affiliation(s)
- Maayan Amit
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| | - Noa Sela
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| | - Hadas Keren
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| | - Ze'ev Melamed
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| | - Inna Muler
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
- Chaim Sheba Cancer Research Center, Tel Hashomer, Israel
- Pediatric Hemato-Oncology, Sheba Medical Center, Tel Hashomer, Israel
| | - Noam Shomron
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Shai Izraeli
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
- Chaim Sheba Cancer Research Center, Tel Hashomer, Israel
- Pediatric Hemato-Oncology, Sheba Medical Center, Tel Hashomer, Israel
| | - Gil Ast
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| |
Collapse
|
90
|
Irimia M, Rukov JL, Penny D, Garcia-Fernandez J, Vinther J, Roy SW. Widespread evolutionary conservation of alternatively spliced exons in Caenorhabditis. Mol Biol Evol 2007; 25:375-82. [PMID: 18048400 DOI: 10.1093/molbev/msm262] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Alternative splicing (AS) contributes to increased transcriptome and proteome diversity in various eukaryotic lineages. Previous studies showed low levels of conservation of alternatively spliced (cassette) exons within mammals and within dipterans. We report a strikingly different pattern in Caenorhabditis nematodes-more than 92% of cassette exons from Caenorhabditis elegans are conserved in Caenorhabditis briggsae and/or Caenorhabditis remanei. High levels of conservation extend to minor-form exons (present in a minority of transcripts) and are particularly pronounced for exons showing complex patterns of splicing. The functionality of the vast majority of cassette exons is underscored by various other features. We suggest that differences in conservation between lineages reflect differences in levels of functionality and further suggest that these differences are due to differences in intron length and the strength of consensus boundaries across lineages. Finally, we demonstrate an inverse relationship between AS and gene duplication, suggesting that the latter may be primarily responsible for the emergence of new functional transcripts in nematodes.
Collapse
Affiliation(s)
- Manuel Irimia
- Departament de Genètica, Universitat de Barcelona, Barcelona, Spain.
| | | | | | | | | | | |
Collapse
|
91
|
Hughes AL, Friedman R. Alternative splicing, gene duplication and connectivity in the genetic interaction network of the nematode worm Caenorhabditis elegans. Genetica 2007; 134:181-6. [PMID: 18026854 DOI: 10.1007/s10709-007-9223-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 11/01/2007] [Indexed: 11/27/2022]
Abstract
We examined the relationship between gene duplication, alternative splicing, and connectedness in a predicted genetic interaction network using published data from the nematode worm Caenorhabditis elegans. Similar to previous results from mammals, genes belonging to families with only one member ("singletons") were significantly more likely to lack alternative splicing than were members of large multi-gene families. Genes belonging to multi-gene families lacking alternative splicing tended to have higher connectedness in the genetic interaction network than did genes in families that included one or more alternatively spliced members. Moreover, alternatively spliced genes were significantly more likely to interact with other alternatively spliced genes. These results support the hypothesis that certain key proteins with high degrees of network connectedness are subject to selection opposing the occurrence of alternatively spliced forms.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Coker Life Sciences Bldg., 700 Sumter St., Columbia, SC 29208, USA.
| | | |
Collapse
|
92
|
Conrad B, Antonarakis SE. Gene Duplication: A Drive for Phenotypic Diversity and Cause of Human Disease. Annu Rev Genomics Hum Genet 2007; 8:17-35. [PMID: 17386002 DOI: 10.1146/annurev.genom.8.021307.110233] [Citation(s) in RCA: 182] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Gene duplication is one of the key factors driving genetic innovation, i.e., producing novel genetic variants. Although the contribution of whole-genome and segmental duplications to phenotypic diversity across species is widely appreciated, the phenotypic spectrum and potential pathogenicity of small-scale duplications in individual genomes are less well explored. This review discusses the nature of small-scale duplications and the phenotypes produced by such duplications. Phenotypic variation and disease phenotypes induced by duplications are more diverse and widespread than previously anticipated, and duplications are a major class of disease-related genomic variation. Pathogenic duplications particularly involve dosage-sensitive genes with both similar and dissimilar over- and underexpression phenotypes, and genes encoding proteins with a propensity to aggregate. Phenotypes related to human-specific copy number variation in genes regulating environmental responses and immunity are increasingly recognized. Small genomic duplications containing defense-related genes also contribute to complex common phenotypes.
Collapse
Affiliation(s)
- Bernard Conrad
- Department of Genetic Medicine & Development, University of Geneva Medical School and Geneva University Hospitals, CH-1211 Geneva 4, Switzerland.
| | | |
Collapse
|
93
|
Tirosh I, Bilu Y, Barkai N. Comparative biology: beyond sequence analysis. Curr Opin Biotechnol 2007; 18:371-7. [PMID: 17693073 DOI: 10.1016/j.copbio.2007.07.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2007] [Accepted: 07/12/2007] [Indexed: 12/18/2022]
Abstract
Comparative analysis is a fundamental tool in biology. Conservation among species greatly assists the detection and characterization of functional elements, whereas inter-species differences are probably the best indicators of biological adaptation. Traditionally, comparative approaches were applied to the analysis of genomic sequences. With the growing availability of functional genomic data, comparative paradigms are now being extended also to the study of other functional attributes, most notably the gene expression. Here we review recent works applying comparative analysis to large-scale gene expression datasets and discuss the central principles and challenges of such approaches.
Collapse
Affiliation(s)
- Itay Tirosh
- Department of Molecular Genetics, Weizmann Institute of Science, 76100 Rehovot, Israel
| | | | | |
Collapse
|
94
|
Artamonova II, Gelfand MS. Comparative Genomics and Evolution of Alternative Splicing: The Pessimists' Science. Chem Rev 2007; 107:3407-30. [PMID: 17645315 DOI: 10.1021/cr068304c] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Irena I Artamonova
- Group of Bioinformatics, Vavilov Institute of General Genetics, RAS, Gubkina 3, Moscow 119991, Russia
| | | |
Collapse
|
95
|
A procedure for identifying homologous alternative splicing events. BMC Bioinformatics 2007; 8:260. [PMID: 17640387 PMCID: PMC1950890 DOI: 10.1186/1471-2105-8-260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Accepted: 07/19/2007] [Indexed: 01/11/2023] Open
Abstract
Background The study of the functional role of alternative splice isoforms of a gene is a very active area of research in biology. The difficulty of the experimental approach (in particular, in its high-throughput version) leaves ample room for the development of bioinformatics tools that can provide a useful first picture of the problem. Among the possible approaches, one of the simplest is to follow classical protein function annotation protocols and annotate target alternative splice events with the information available from conserved events in other species. However, the application of this protocol requires a procedure capable of recognising such events. Here we present a simple but accurate method developed for this purpose. Results We have developed a method for identifying homologous, or equivalent, alternative splicing events, based on the combined use of neural networks and sequence searches. The procedure comprises four steps: (i) BLAST search for homologues of the two isoforms defining the target alternative splicing event; (ii) construction of all possible candidate events; (iii) scoring of the latter with a series of neural networks; and (iv) filtering of the results. When tested in a set of 473 manually annotated pairs of homologous events, our method showed a good performance, with an accuracy of 0.99, a precision of 0.98 and a sensitivity of 0.93. When no candidates were available, the specificity of our method varied between 0.81 and 0.91. Conclusion The method described in this article allows the identification of homologous alternative splicing events, with a good success rate, indicating that such method could be used for the development of functional annotation of alternative splice isoforms.
Collapse
|
96
|
Zhang D, Boulware M, Pendleton M, Nogi T, Marchant J. The inositol 1,4,5-trisphosphate receptor (Itpr) gene family in Xenopus: identification of type 2 and type 3 inositol 1,4,5-trisphosphate receptor subtypes. Biochem J 2007; 404:383-91. [PMID: 17338679 PMCID: PMC1896291 DOI: 10.1042/bj20070101] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Studies in the Xenopus model system have provided considerable insight into the developmental role of intracellular Ca2+ signals produced by activation of IP3Rs (inositol 1,4,5-trisphosphate receptors). However, unlike mammalian systems where three IP3R subtypes have been well characterized, our molecular understanding of the IP3Rs that underpin Ca2+ signalling during Xenopus embryogenesis relate solely to the original characterization of the 'Xenopus IP3R' cloned and purified from Xenopus laevis oocytes several years ago. In the present study, we have identified Xenopus type 2 and type 3 IP3Rs and report the full-length sequence, genomic architecture and developmental expression profile of these additional IP3R subtypes. In the light of the emerging genomic resources and opportunities for genetic manipulation in the diploid frog Xenopus tropicalis, these data will facilitate manipulations to resolve the contribution of IP3R diversity in Ca2+ signalling events observed during vertebrate development.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Calcium/metabolism
- Calcium Signaling/physiology
- Evolution, Molecular
- Female
- Humans
- Inositol 1,4,5-Trisphosphate Receptors/classification
- Inositol 1,4,5-Trisphosphate Receptors/genetics
- Inositol 1,4,5-Trisphosphate Receptors/metabolism
- Molecular Sequence Data
- Phylogeny
- Protein Isoforms/classification
- Protein Isoforms/genetics
- Protein Isoforms/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Sequence Alignment
- Sequence Homology, Amino Acid
- Xenopus/embryology
- Xenopus/genetics
Collapse
Affiliation(s)
- Dan Zhang
- Department of Pharmacology, University of Minnesota Medical School, MN 55455, U.S.A
| | - Michael J. Boulware
- Department of Pharmacology, University of Minnesota Medical School, MN 55455, U.S.A
| | - Matthew R. Pendleton
- Department of Pharmacology, University of Minnesota Medical School, MN 55455, U.S.A
| | - Taisaku Nogi
- Department of Pharmacology, University of Minnesota Medical School, MN 55455, U.S.A
| | - Jonathan S. Marchant
- Department of Pharmacology, University of Minnesota Medical School, MN 55455, U.S.A
- To whom correspondence should be addressed (email )
| |
Collapse
|
97
|
Talavera D, Vogel C, Orozco M, Teichmann SA, de la Cruz X. The (in)dependence of alternative splicing and gene duplication. PLoS Comput Biol 2007; 3:e33. [PMID: 17335345 PMCID: PMC1808492 DOI: 10.1371/journal.pcbi.0030033] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Accepted: 01/04/2007] [Indexed: 11/18/2022] Open
Abstract
Alternative splicing (AS) and gene duplication (GD) both are processes that diversify the protein repertoire. Recent examples have shown that sequence changes introduced by AS may be comparable to those introduced by GD. In addition, the two processes are inversely correlated at the genomic scale: large gene families are depleted in splice variants and vice versa. All together, these data strongly suggest that both phenomena result in interchangeability between their effects. Here, we tested the extent to which this applies with respect to various protein characteristics. The amounts of AS and GD per gene are anticorrelated even when accounting for different gene functions or degrees of sequence divergence. In contrast, the two processes appear to be independent in their influence on variation in mRNA expression. Further, we conducted a detailed comparison of the effect of sequence changes in both alternative splice variants and gene duplicates on protein structure, in particular the size, location, and types of sequence substitutions and insertions/deletions. We find that, in general, alternative splicing affects protein sequence and structure in a more drastic way than gene duplication and subsequent divergence. Our results reveal an interesting paradox between the anticorrelation of AS and GD at the genomic level, and their impact at the protein level, which shows little or no equivalence in terms of effects on protein sequence, structure, and function. We discuss possible explanations that relate to the order of appearance of AS and GD in a gene family, and to the selection pressure imposed by the environment.
Collapse
Affiliation(s)
- David Talavera
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
| | - Christine Vogel
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
- Institute for Cellular and Molecular Biology, University of Texas Austin, Austin, Texas, United States of America
- * To whom correspondence should be addressed. E-mail: (XdlC); (CV)
| | - Modesto Orozco
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Departament de Bioquímica i Biologia Molecular, Universitat de Barcelona, Barcelona, Spain
- Institut per a la Recerca i Estudis Avançats (IRCA), Barcelona, Spain
| | - Sarah A Teichmann
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Xavier de la Cruz
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Computational Biology Program, Barcelona Supercomputing Center, Barcelona, Spain
- * To whom correspondence should be addressed. E-mail: (XdlC); (CV)
| |
Collapse
|
98
|
Hiller M, Huse K, Szafranski K, Rosenstiel P, Schreiber S, Backofen R, Platzer M. Phylogenetically widespread alternative splicing at unusual GYNGYN donors. Genome Biol 2007; 7:R65. [PMID: 16869967 PMCID: PMC1779574 DOI: 10.1186/gb-2006-7-7-r65] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2006] [Accepted: 07/25/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Splice donor sites have a highly conserved GT or GC dinucleotide and an extended intronic consensus sequence GTRAGT that reflects the sequence complementarity to the U1 snRNA. Here, we focus on unusual donor sites with the motif GYNGYN (Y stands for C or T; N stands for A, C, G, or T). RESULTS While only one GY functions as a splice donor for the majority of these splice sites in human, we provide computational and experimental evidence that 110 (1.3%) allow alternative splicing at both GY donors. The resulting splice forms differ in only three nucleotides, which results mostly in the insertion/deletion of one amino acid. However, we also report the insertion of a stop codon in four cases. Investigating what distinguishes alternatively from not alternatively spliced GYNGYN donors, we found differences in the binding to U1 snRNA, a strong correlation between U1 snRNA binding strength and the preferred donor, over-represented sequence motifs in the adjacent introns, and a higher conservation of the exonic and intronic flanks between human and mouse. Extending our genome-wide analysis to seven other eukaryotic species, we found alternatively spliced GYNGYN donors in all species from mouse to Caenorhabditis elegans and even in Arabidopsis thaliana. Experimental verification of a conserved GTAGTT donor of the STAT3 gene in human and mouse reveals a remarkably similar ratio of alternatively spliced transcripts in both species. CONCLUSION In contrast to alternative splicing in general, GYNGYN donors in addition to NAGNAG acceptors enable subtle protein variations.
Collapse
Affiliation(s)
- Michael Hiller
- Institute of Computer Science, Chair for Bioinformatics, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Klaus Huse
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany
| | - Karol Szafranski
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany
| | - Philip Rosenstiel
- Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, Schittenhelmstr. 12, 24105 Kiel, Germany
- Max Planck Institute for Molecular Genetics, Ihnestr. 63, 14195 Berlin, Germany
| | - Stefan Schreiber
- Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, Schittenhelmstr. 12, 24105 Kiel, Germany
| | - Rolf Backofen
- Institute of Computer Science, Chair for Bioinformatics, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Matthias Platzer
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany
| |
Collapse
|
99
|
Spokony RF, Restifo LL. Anciently duplicated Broad Complex exons have distinct temporal functions during tissue morphogenesis. Dev Genes Evol 2007; 217:499-513. [PMID: 17530286 DOI: 10.1007/s00427-007-0159-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2006] [Accepted: 04/20/2007] [Indexed: 12/22/2022]
Abstract
Broad Complex (BRC) is an essential ecdysone-pathway gene required for entry into and progression through metamorphosis in Drosophila melanogaster. Mutations of three BRC complementation groups cause numerous phenotypes, including a common suite of morphogenesis defects involving central nervous system (CNS), adult salivary glands (aSG), and male genitalia. These defects are phenocopied by the juvenile hormone mimic methoprene. Four BRC isoforms are produced by alternative splicing of a protein-binding BTB/POZ-encoding exon (BTBBRC) to one of four tandemly duplicated, DNA-binding zinc-finger-encoding exons (Z1BRC, Z2BRC, Z3BRC, Z4BRC). Highly conserved orthologs of BTBBRC and all four ZBRC were found among published cDNA sequences or genome databases from Diptera, Lepidoptera, Hymenoptera, and Coleoptera, indicating that BRC arose and underwent internal exon duplication before the split of holometabolous orders. Tramtrack subfamily members, abrupt, tramtrack, fruitless, longitudinals lacking (lola), and CG31666 were characterized throughout Holometabola and used to root phylogenetic analyses of ZBRC exons, which revealed that the ZBRC clade includes Zabrupt. All four ZBRC domains, including Z4BRC, which has no known essential function, are evolving in a manner consistent with selective constraint. We used transgenic rescue to explore how different BRC isoforms contribute to shared tissue-morphogenesis functions. As predicted from earlier studies, the common CNS and aSG phenotypes were rescued by BRC-Z1 in rbp mutants, BRC-Z2 in br mutants, and BRC-Z3 in 2Bc mutants. However, the isoforms are required at two different developmental stages, with BRC-Z2 and -Z3 required earlier than BRC-Z1. The sequential action of BRC isoforms indicates subfunctionalization of duplicated ZBRC exons even when they contribute to common developmental processes.
Collapse
Affiliation(s)
- Rebecca F Spokony
- Graduate Interdisciplinary Program in Insect Science, University of Arizona, Tucson, AZ 85721-0108, USA.
| | | |
Collapse
|
100
|
Radhakrishnan Y, Fares MA, French FS, Hall SH. Comparative genomic analysis of a mammalian beta-defensin gene cluster. Physiol Genomics 2007; 30:213-22. [PMID: 17456736 DOI: 10.1152/physiolgenomics.00263.2006] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Comparative genomic analyses have yielded valuable insights into conserved and divergent aspects of gene function, regulation, and evolution. Herein, we describe the characterization of a mouse beta-defensin gene cluster locus on chromosome 2F6. In addition, we present the evolutionary analysis of this cluster and its human, rhesus, and rat orthologs. Expression analysis in mouse revealed the occurrence of defensin cluster transcripts in multiple tissues, with the highest abundance in the urogenital tract. Molecular evolutionary analysis suggests that this cluster originated by a series of duplication events, and by positive selection occurring even after the rodent-primate split. In addition, the constraints analysis showed higher positive selection in rodents than in primates, especially distal to the six-cysteine array. Positive selection in the evolution of these defensins may relate not only to the evolving enhancement of ancestral host defense but also to functional innovations in reproduction. The multiplicity of defensins and their preferential overexpression in the urogenital tract indicate that defensins function in the protection and maintenance of fertility.
Collapse
Affiliation(s)
- Yashwanth Radhakrishnan
- Laboratories for Reproductive Biology, Department of Pediatrics, University of North Carolina, Chapel Hill, North Carolina, USA
| | | | | | | |
Collapse
|