1
|
Spence MA, Mortimer MD, Buckle AM, Minh BQ, Jackson CJ. A Comprehensive Phylogenetic Analysis of the Serpin Superfamily. Mol Biol Evol 2021; 38:2915-2929. [PMID: 33744972 PMCID: PMC8233489 DOI: 10.1093/molbev/msab081] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Serine protease inhibitors (serpins) are found in all kingdoms of life and play essential roles in multiple physiological processes. Owing to the diversity of the superfamily, phylogenetic analysis is challenging and prokaryotic serpins have been speculated to have been acquired from Metazoa through horizontal gene transfer due to their unexpectedly high homology. Here, we have leveraged a structural alignment of diverse serpins to generate a comprehensive 6,000-sequence phylogeny that encompasses serpins from all kingdoms of life. We show that in addition to a central “hub” of highly conserved serpins, there has been extensive diversification of the superfamily into many novel functional clades. Our analysis indicates that the hub proteins are ancient and are similar because of convergent evolution, rather than the alternative hypothesis of horizontal gene transfer. This work clarifies longstanding questions in the evolution of serpins and provides new directions for research in the field of serpin biology.
Collapse
Affiliation(s)
- Matthew A Spence
- Research School of Chemistry, Australian National University, Canberra, ACT, Australia
| | - Matthew D Mortimer
- Research School of Chemistry, Australian National University, Canberra, ACT, Australia
| | - Ashley M Buckle
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, Melbourne, VIC, Australia
| | - Bui Quang Minh
- Research School of Computing and Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, ACT, Australia.,Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT, Australia.,Australian Research Council Centre of Excellence in Synthetic Biology, Research School of Chemistry, Australian National University, Canberra, ACT, Australia
| |
Collapse
|
2
|
Genetic Diversity of Serine Protease Inhibitors in Myxozoan (Cnidaria, Myxozoa) Fish Parasites. Microorganisms 2020; 8:microorganisms8101502. [PMID: 33003479 PMCID: PMC7650755 DOI: 10.3390/microorganisms8101502] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 09/25/2020] [Accepted: 09/27/2020] [Indexed: 01/02/2023] Open
Abstract
We studied the genetic variability of serine protease inhibitors (serpins) of Myxozoa, microscopic endoparasites of fish. Myxozoans affect the health of both farmed and wild fish populations, causing diseases and mortalities. Despite their global impact, no effective protection exists against these parasites. Serpins were reported as important factors for host invasion and immune evasion, and as promising targets for the development of antiparasitic therapies. For the first time, we identified and aligned serpin sequences from high throughput sequencing datasets of ten myxozoan species, and analyzed 146 serpins from this parasite group together with those of other taxa phylogenetically, to explore their relationship and origins. High intra- and interspecific variability was detected among the examined serpins. The average sequence identity was 25–30% only. The conserved domains (i.e., motif and signature) showed taxon-level differences. Serpins clustered according to taxonomy rather than to serpin types, and myxozoan serpins seemed to be highly divergent from that of other taxa. None of them clustered with their closest relative free-living cnidarians. The genetic distinction of myxozoan serpins further strengthens the idea of an independent origin of Myxozoa, and may indicate novel protein functions potentially related to parasitism in this animal group.
Collapse
|
3
|
Kumar A, Bhandari A, Sarde SJ, Goswami C. Ancestry & molecular evolutionary analyses of heat shock protein 47 kDa (HSP47/SERPINH1). Sci Rep 2017; 7:10394. [PMID: 28871169 PMCID: PMC5583329 DOI: 10.1038/s41598-017-10740-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 08/14/2017] [Indexed: 11/25/2022] Open
Abstract
HSP47/SERPINH1 is key-regulator for collagen biosynthesis and its structural assembly. To date, there is no comprehensive study on the phylogenetic history of HSP47. Herein we illustrate the evolutionary history of HSP47/SERPINH1 along with sequence, structural and syntenic traits for HSP47/SERPINH1. We have identified ancestral HSP47/SERPINH1 locus in Japanese lamprey (Lethenteron japonicum). This gene remains on the same or similar locus for ~500 million years (MY), but chromosomal duplication was observed in ray-finned fishes, leading into three sets of three sets (I-III) of HSP47/SERPINH1. Two novel introns were inserted at the positions 36b and 102b in the first exon of only HSP47_1 gene from the selected ray-finned fishes. On the evolutionary time scale, the events of HSP47 duplications took placed between 416–360 MY ago (MYA) while intron insertion dates back to 231–190 MYA after early divergence of ray-finned fishes.
Collapse
Affiliation(s)
- Abhishek Kumar
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany. .,Division of Molecular Genetic Epidemiology German Cancer Research Center, Heidelberg, Germany.
| | - Anita Bhandari
- Institute for Cardiogenetics, University of Lübeck, Lübeck, Germany
| | - Sandeep J Sarde
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany.,Laboratory of Entomology, Department of Plant Sciences, Wageningen University, Wageningen, Netherlands
| | - Chandan Goswami
- National Institute of Science Education and Research, Bhubaneswar, Orissa, India
| |
Collapse
|
4
|
Kumar A, Bhandari A, Sarde SJ, Muppavarapu S, Tandon R. Data on the evolutionary history of the V(D)J recombination-activating protein 1 - RAG1 coupled with sequence and variant analyses. Data Brief 2016; 8:87-92. [PMID: 27284568 PMCID: PMC4887553 DOI: 10.1016/j.dib.2016.05.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 05/05/2016] [Accepted: 05/12/2016] [Indexed: 11/28/2022] Open
Abstract
RAG1 protein is one of the key component of RAG complex regulating the V(D)J recombination. There are only few studies for RAG1 concerning evolutionary history, detailed sequence and mutational hotspots. Herein, we present out datasets used for the recent comprehensive study of RAG1 based on sequence, phylogenetic and genetic variant analyses (Kumar et al., 2015) [1]. Protein sequence alignment helped in characterizing the conserved domains and regions of RAG1. It also aided in unraveling ancestral RAG1 in the sea urchin. Human genetic variant analyses revealed 751 mutational hotspots, located both in the coding and the non-coding regions. For further analysis and discussion, see (Kumar et al., 2015) [1].
Collapse
Affiliation(s)
- Abhishek Kumar
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
- Division of Molecular Genetic Epidemiology German Cancer Research Center, Heidelberg, Germany
| | - Anita Bhandari
- Molecular Physiology, Institute of Zoology, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Sandeep J. Sarde
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
- Agrigenomics, Christian-Albrechts-University at Kiel, Kiel, Germany
| | | | - Ravi Tandon
- School of Biotechnology, Jawaharlal Nehru University, New Delhi, India
| |
Collapse
|
5
|
Kumar A, Bhandari A, Sarde SJ, Muppavarapu S, Tandon R. Understanding V(D)J recombination initiator RAG1 gene using molecular phylogenetic and genetic variant analyses and upgrading missense and non-coding variants of clinical importance. Biochem Biophys Res Commun 2015; 462:301-13. [DOI: 10.1016/j.bbrc.2015.04.125] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 04/26/2015] [Indexed: 11/27/2022]
|
6
|
Kumar A. Bayesian phylogeny analysis of vertebrate serpins illustrates evolutionary conservation of the intron and indels based six groups classification system from lampreys for ∼500 MY. PeerJ 2015; 3:e1026. [PMID: 26157611 PMCID: PMC4476131 DOI: 10.7717/peerj.1026] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 05/26/2015] [Indexed: 11/20/2022] Open
Abstract
The serpin superfamily is characterized by proteins that fold into a conserved tertiary structure and exploits a sophisticated and irreversible suicide-mechanism of inhibition. Vertebrate serpins are classified into six groups (V1-V6), based on three independent biological features-genomic organization, diagnostic amino acid sites and rare indels. However, this classification system was based on the limited number of mammalian genomes available. In this study, several non-mammalian genomes are used to validate this classification system using the powerful Bayesian phylogenetic method. This method supports the intron and indel based vertebrate classification and proves that serpins have been maintained from lampreys to humans for about 500 MY. Lampreys have fewer than 10 serpins, which expand into 36 serpins in humans. The two expanding groups V1 and V2 have SERPINB1/SERPINB6 and SERPINA8/SERPIND1 as the ancestral serpins, respectively. Large clusters of serpins are formed by local duplications of these serpins in tetrapod genomes. Interestingly, the ancestral HCII/SERPIND1 locus (nested within PIK4CA) possesses group V4 serpin (A2APL1, homolog of α 2-AP/SERPINF2) of lampreys; hence, pointing to the fact that group V4 might have originated from group V2. Additionally in this study, details of the phylogenetic history and genomic characteristics of vertebrate serpins are revisited.
Collapse
Affiliation(s)
- Abhishek Kumar
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
| |
Collapse
|
7
|
The aggregation-prone intracellular serpin SRP-2 fails to transit the ER in Caenorhabditis elegans. Genetics 2015; 200:207-19. [PMID: 25786854 DOI: 10.1534/genetics.115.176180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 03/17/2015] [Indexed: 11/18/2022] Open
Abstract
Familial encephalopathy with neuroserpin inclusions bodies (FENIB) is a serpinopathy that induces a rare form of presenile dementia. Neuroserpin contains a classical signal peptide and like all extracellular serine proteinase inhibitors (serpins) is secreted via the endoplasmic reticulum (ER)-Golgi pathway. The disease phenotype is due to gain-of-function missense mutations that cause neuroserpin to misfold and aggregate within the ER. In a previous study, nematodes expressing a homologous mutation in the endogenous Caenorhabditis elegans serpin, srp-2, were reported to model the ER proteotoxicity induced by an allele of mutant neuroserpin. Our results suggest that SRP-2 lacks a classical N-terminal signal peptide and is a member of the intracellular serpin family. Using confocal imaging and an ER colocalization marker, we confirmed that GFP-tagged wild-type SRP-2 localized to the cytosol and not the ER. Similarly, the aggregation-prone SRP-2 mutant formed intracellular inclusions that localized to the cytosol. Interestingly, wild-type SRP-2, targeted to the ER by fusion to a cleavable N-terminal signal peptide, failed to be secreted and accumulated within the ER lumen. This ER retention phenotype is typical of other obligate intracellular serpins forced to translocate across the ER membrane. Neuroserpin is a secreted protein that inhibits trypsin-like proteinase. SRP-2 is a cytosolic serpin that inhibits lysosomal cysteine peptidases. We concluded that SRP-2 is neither an ortholog nor a functional homolog of neuroserpin. Furthermore, animals expressing an aggregation-prone mutation in SRP-2 do not model the ER proteotoxicity associated with FENIB.
Collapse
|
8
|
Kumar A, Bhandari A, Goswami C. Surveying genetic variants and molecular phylogeny of cerebral cavernous malformation gene, CCM3/PDCD10. Biochem Biophys Res Commun 2014; 455:98-106. [DOI: 10.1016/j.bbrc.2014.10.105] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 10/21/2014] [Indexed: 11/29/2022]
|
9
|
Kumar A, Bhandari A. Urochordate serpins are Classified into Six Groups Encoded by Exon-Intron Structures, Microsynteny and Bayesian Phylogenetic Analyses. J Genomics 2014; 2:131-40. [PMID: 25184006 PMCID: PMC4150122 DOI: 10.7150/jgen.9437] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Members of serpin superfamily are involved in wide array of cellular processes to control proteolytic activities of eukaryotic organisms. Vertebrate serpins are extensively studied and reported to be classified into six groups (V1-V6) based on gene structures. However, there is no study conducted for serpins in urochordates (the closest living invertebrates related to vertebrates) to date. To unravel further the phylogenetic history of serpin genes, we characterized serpin genes from two urochordates (Ciona intestinalis and Ciona savignyi). There are 11 and 5 serpins in the C. intestinalis and C. savignyi, respectively. The exon/intron structures and genomic locus comparisons together with sequence phylogenetic analysis, suggested that urochordate serpins are classified into six groups (U1-U6), different from six groups (V1-V6) of vertebrate serpins. Human α1-antitrypsin shared lower sequence identities and similarities with urochordates serpins ranged from 14-29% and 30-49%, respectively. Based on protein sequences, genes and genomic architectures, we conclude that these two urochordates do not contain a single copy of genuine ortholog of the vertebrate serpins.
Collapse
Affiliation(s)
- Abhishek Kumar
- 1. Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Anita Bhandari
- 2. Molecular Physiology, Zoological Institute, Christian-Albrechts-University at Kiel, Kiel, Germany
| |
Collapse
|
10
|
Kumar A, Bhandari A, Sarde SJ, Goswami C. Molecular phylogeny of C1 inhibitor depicts two immunoglobulin-like domains fusion in fishes and ray-finned fishes specific intron insertion after separation from zebrafish. Biochem Biophys Res Commun 2014; 450:219-26. [DOI: 10.1016/j.bbrc.2014.05.097] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Accepted: 05/21/2014] [Indexed: 01/26/2023]
|
11
|
Kumar A, Bhandari A, Sarde SJ, Goswami C. Genetic variants and evolutionary analyses of heparin cofactor II. Immunobiology 2014; 219:713-28. [PMID: 24950623 DOI: 10.1016/j.imbio.2014.05.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Revised: 04/13/2014] [Accepted: 05/19/2014] [Indexed: 11/25/2022]
Abstract
Heparin cofactor II (HCII) belongs to serpin superfamily and it acts as a thrombin inhibitor in the coagulation cascade, in a glycosaminoglycan-dependent pathway using the release of a sequestered hirudin-like N-terminal tail for interaction with thrombin. This serpin belongs to multiple member group V2 of vertebrate serpin classification. However, there is no comprehensive study illustrating the exact phylogenetic history of HCII, to date. Herein, we explored phylogenetic traits of HCII genes. Structures of HCII gene from selected ray-finned fishes and lamprey varied in exon I and II with insertions of novel introns of which one in core domain for ray-finned fishes in exon II at the position 241c. We found HCII remain nested in the largest intron of phosphatidylinositol (PI) 4-kinase (PIK4CA) gene (genetic variants of this gene cause schizophrenia) at the origin of vertebrates, dated about 500MY old. We found that sequence features such as two acidic repeats (AR1-II), GAG-binding helix-D, three serpin motifs and inhibitory reactive center loop (RCL) of HCII protein are highly conserved in 55 vertebrates analyzed. We identified 985 HCII variants by analysis of 1092 human genomes with top three variation classes belongs to SNPs (84.3%), insertion (7.1%) and deletion (5.0%). We identified 37 deleterious mutations in the human HCII protein and we have described these mutations in relation to HCII sequence-structure-function relationships. These understandings may have clinical and medical importance as well.
Collapse
Affiliation(s)
- Abhishek Kumar
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany.
| | - Anita Bhandari
- Molecular Physiology, Zoological Institute, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Sandeep J Sarde
- Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany; Master Program Agrigenomics, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Chandan Goswami
- National Institute of Science Education and Research, Bhubaneswar, Orissa, India
| |
Collapse
|
12
|
Kumar A, Sarde SJ, Bhandari A. Revising angiotensinogen from phylogenetic and genetic variants perspectives. Biochem Biophys Res Commun 2014; 446:504-18. [DOI: 10.1016/j.bbrc.2014.02.139] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2014] [Accepted: 02/28/2014] [Indexed: 11/29/2022]
|
13
|
Sequencing, De Novo assembly and annotation of the Colorado Potato Beetle, Leptinotarsa decemlineata, Transcriptome. PLoS One 2014; 9:e86012. [PMID: 24465841 PMCID: PMC3900453 DOI: 10.1371/journal.pone.0086012] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2012] [Accepted: 12/09/2013] [Indexed: 12/18/2022] Open
Abstract
Background The Colorado potato beetle (Leptinotarsa decemlineata) is a major pest and a serious threat to potato cultivation throughout the northern hemisphere. Despite its high importance for invasion biology, phenology and pest management, little is known about L. decemlineata from a genomic perspective. We subjected European L. decemlineata adult and larval transcriptome samples to 454-FLX massively-parallel DNA sequencing to characterize a basal set of genes from this species. We created a combined assembly of the adult and larval datasets including the publicly available midgut larval Roche 454 reads and provided basic annotation. We were particularly interested in diapause-specific genes and genes involved in pesticide and Bacillus thuringiensis (Bt) resistance. Results Using 454-FLX pyrosequencing, we obtained a total of 898,048 reads which, together with the publicly available 804,056 midgut larval reads, were assembled into 121,912 contigs. We established a repository of genes of interest, with 101 out of the 108 diapause-specific genes described in Drosophila montana; and 621 contigs involved in insecticide resistance, including 221 CYP450, 45 GSTs, 13 catalases, 15 superoxide dismutases, 22 glutathione peroxidases, 194 esterases, 3 ADAM metalloproteases, 10 cadherins and 98 calmodulins. We found 460 putative miRNAs and we predicted a significant number of single nucleotide polymorphisms (29,205) and microsatellite loci (17,284). Conclusions This report of the assembly and annotation of the transcriptome of L. decemlineata offers new insights into diapause-associated and insecticide-resistance-associated genes in this species and provides a foundation for comparative studies with other species of insects. The data will also open new avenues for researchers using L. decemlineata as a model species, and for pest management research. Our results provide the basis for performing future gene expression and functional analysis in L. decemlineata and improve our understanding of the biology of this invasive species at the molecular level.
Collapse
|
14
|
Kumar A, Bhandari A, Sarde SJ, Goswami C. Sequence, phylogenetic and variant analyses of antithrombin III. Biochem Biophys Res Commun 2013; 440:714-24. [DOI: 10.1016/j.bbrc.2013.09.134] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2013] [Accepted: 09/29/2013] [Indexed: 10/26/2022]
|
15
|
Characterization of bud emergence 46 (BEM46) protein: sequence, structural, phylogenetic and subcellular localization analyses. Biochem Biophys Res Commun 2013; 438:526-32. [PMID: 23916612 DOI: 10.1016/j.bbrc.2013.07.103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 07/25/2013] [Indexed: 02/04/2023]
Abstract
The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved "known unknown" eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional analyses. The evolutionary history of BEM46 proteins is characterized by exonic indels in lineage specific manner.
Collapse
|
16
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 217] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
17
|
Conservation of tubulin-binding sequences in TRPV1 throughout evolution. PLoS One 2012; 7:e31448. [PMID: 22496727 PMCID: PMC3322131 DOI: 10.1371/journal.pone.0031448] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2011] [Accepted: 01/08/2012] [Indexed: 01/10/2023] Open
Abstract
Background Transient Receptor Potential Vanilloid sub type 1 (TRPV1), commonly known as capsaicin receptor can detect multiple stimuli ranging from noxious compounds, low pH, temperature as well as electromagnetic wave at different ranges. In addition, this receptor is involved in multiple physiological and sensory processes. Therefore, functions of TRPV1 have direct influences on adaptation and further evolution also. Availability of various eukaryotic genomic sequences in public domain facilitates us in studying the molecular evolution of TRPV1 protein and the respective conservation of certain domains, motifs and interacting regions that are functionally important. Methodology and Principal Findings Using statistical and bioinformatics tools, our analysis reveals that TRPV1 has evolved about ∼420 million years ago (MYA). Our analysis reveals that specific regions, domains and motifs of TRPV1 has gone through different selection pressure and thus have different levels of conservation. We found that among all, TRP box is the most conserved and thus have functional significance. Our results also indicate that the tubulin binding sequences (TBS) have evolutionary significance as these stretch sequences are more conserved than many other essential regions of TRPV1. The overall distribution of positively charged residues within the TBS motifs is conserved throughout evolution. In silico analysis reveals that the TBS-1 and TBS-2 of TRPV1 can form helical structures and may play important role in TRPV1 function. Conclusions and Significance Our analysis identifies the regions of TRPV1, which are important for structure – function relationship. This analysis indicates that tubulin binding sequence-1 (TBS-1) near the TRP-box forms a potential helix and the tubulin interactions with TRPV1 via TBS-1 have evolutionary significance. This interaction may be required for the proper channel function and regulation and may also have significance in the context of Taxol®-induced neuropathy.
Collapse
|
18
|
Kumar A, Bhandari A, Sinha R, Goyal P, Grapputo A. Spliceosomal intron insertions in genome compacted ray-finned fishes as evident from phylogeny of MC receptors, also supported by a few other GPCRs. PLoS One 2011; 6:e22046. [PMID: 21850219 PMCID: PMC3151243 DOI: 10.1371/journal.pone.0022046] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2010] [Accepted: 06/16/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Insertions of spliceosomal introns are very rare events during evolution of vertebrates and the mechanisms governing creation of novel intron(s) remain obscure. Largely, gene structures of melanocortin (MC) receptors are characterized by intron-less architecture. However, recently a few exceptions have been reported in some fishes. This warrants a systematic survey of MC receptors for understanding intron insertion events during vertebrate evolution. METHODOLOGY/PRINCIPAL FINDINGS We have compiled an extended list of MC receptors from different vertebrate genomes with variations in fishes. Notably, the closely linked MC2Rs and MC5Rs from a group of ray-finned fishes have three and one intron insertion(s), respectively, with conserved positions and intron phase. In both genes, one novel insertion was in the highly conserved DRY motif at the end of helix TM3. Further, the proto-splice site MAG↑R is maintained at intron insertion sites in these two genes. However, the orthologs of these receptors from zebrafish and tetrapods are intron-less, suggesting these introns are simultaneously created in selected fishes. Surprisingly, these novel introns are traceable only in four fish genomes. We found that these fish genomes are severely compacted after the separation from zebrafish. Furthermore, we also report novel intron insertions in P2Y receptors and in CHRM3. Finally, we report ultrasmall introns in MC2R genes from selected fishes. CONCLUSIONS/SIGNIFICANCE The current repository of MC receptors illustrates that fishes have no MC3R ortholog. MC2R, MC5R, P2Y receptors and CHRM3 have novel intron insertions only in ray-finned fishes that underwent genome compaction. These receptors share one intron at an identical position suggestive of being inserted contemporaneously. In addition to repetitive elements, genome compaction is now believed to be a new hallmark that promotes intron insertions, as it requires rapid DNA breakage and subsequent repair processes to gain back normal functionality.
Collapse
Affiliation(s)
- Abhishek Kumar
- Department of Biology, University of Padua, Padova, Italy.
| | | | | | | | | |
Collapse
|
19
|
Ragg H, Kumar A, Köster K, Bentele C, Wang Y, Frese MA, Prib N, Krüger O. Multiple gains of spliceosomal introns in a superfamily of vertebrate protease inhibitor genes. BMC Evol Biol 2009; 9:208. [PMID: 19698129 PMCID: PMC2746811 DOI: 10.1186/1471-2148-9-208] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2009] [Accepted: 08/22/2009] [Indexed: 01/13/2023] Open
Abstract
Background Intron gains reportedly are very rare during evolution of vertebrates, and the mechanisms underlying their creation are largely unknown. Previous investigations have shown that, during metazoan radiation, the exon-intron patterns of serpin superfamily genes were subject to massive changes, in contrast to many other genes. Results Here we investigated intron dynamics in the serpin superfamily in lineages pre- and postdating the split of vertebrates. Multiple intron gains were detected in a group of ray-finned fishes, once the canonical groups of vertebrate serpins had been established. In two genes, co-occurrence of non-standard introns was observed, implying that intron gains in vertebrates may even happen concomitantly or in a rapidly consecutive manner. DNA breakage/repair processes associated with genome compaction are introduced as a novel factor potentially favoring intron gain, since all non-canonical introns were found in a lineage of ray-finned fishes that experienced genomic downsizing. Conclusion Multiple intron acquisitions were identified in serpin genes of a lineage of ray-finned fishes, but not in any other vertebrates, suggesting that insertion rates for introns may be episodically increased. The co-occurrence of non-standard introns within the same gene discloses the possibility that introns may be gained simultaneously. The sequences flanking the intron insertion points correspond to the proto-splice site consensus sequence MAG↑N, previously proposed to serve as intron insertion site. The association of intron gains in the serpin superfamily with a group of fishes that underwent genome compaction may indicate that DNA breakage/repair processes might foster intron birth.
Collapse
Affiliation(s)
- Hermann Ragg
- Department of Biotechnology, Faculty of Technology and Center for Biotechnology, University of Bielefeld, D-33501 Bielefeld, Germany.
| | | | | | | | | | | | | | | |
Collapse
|