1
|
Teterina AA, Willis JH, Baer CF, Phillips PC. Pervasive conservation of intron number and other genetic elements revealed by a chromosome-level genomic assembly of the hyper-polymorphic nematode Caenorhabditis brenneri. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600681. [PMID: 38979286 PMCID: PMC11230420 DOI: 10.1101/2024.06.25.600681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
With within-species genetic diversity estimates that span the gambit of that seen across the entirety of animals, the Caenorhabditis genus of nematodes holds unique potential to provide insights into how population size and reproductive strategies influence gene and genome organization and evolution. Our study focuses on Caenorhabditis brenneri, currently known as one of the most genetically diverse nematodes within its genus and metazoan phyla. Here, we present a high-quality gapless genome assembly and annotation for C. brenneri, revealing a common nematode chromosome arrangement characterized by gene-dense central regions and repeat rich peripheral parts. Comparison of C. brenneri with other nematodes from the 'Elegans' group revealed conserved macrosynteny but a lack of microsynteny, characterized by frequent rearrangements and low correlation iof orthogroup sizes, indicative of high rates of gene turnover. We also assessed genome organization within corresponding syntenic blocks in selfing and outcrossing species, affirming that selfing species predominantly experience loss of both genes and intergenic DNA. Comparison of gene structures revealed strikingly small number of shared introns across species, yet consistent distributions of intron number and length, regardless of population size or reproductive mode, suggesting that their evolutionary dynamics are primarily reflective of functional constraints. Our study provides valuable insights into genome evolution and expands the nematode genome resources with the highly genetically diverse C. brenneri, facilitating research into various aspects of nematode biology and evolutionary processes.
Collapse
Affiliation(s)
- Anastasia A Teterina
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
- Center of Parasitology, Severtsov Institute of Ecology and Evolution RAS, Moscow, Russia
| | - John H Willis
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
| | - Charles F Baer
- Department of Biology, University of Florida, Gainesville, USA
| | - Patrick C Phillips
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
| |
Collapse
|
2
|
Abstract
BACKGROUND The evolution of spliceosomal introns has been widely studied among various eukaryotic groups. Researchers nearly reached the consensuses on the pattern and the mechanisms of intron losses and gains across eukaryotes. However, according to previous studies that analyzed a few genes or genomes, Nematoda seems to be an eccentric group. RESULTS Taking advantage of the recent accumulation of sequenced genomes, we extensively analyzed the intron losses and gains using 104 nematode genomes across all the five Clades of the phylum. Nematodes have a wide range of intron density, from less than one to more than nine per kbp coding sequence. The rates of intron losses and gains exhibit significant heterogeneity both across different nematode lineages and across different evolutionary stages of the same lineage. The frequency of intron losses far exceeds that of intron gains. Five pieces of evidence supporting the model of cDNA-mediated intron loss have been observed in ten Caenorhabditis species, the dominance of the precise intron losses, frequent loss of adjacent introns, high-level expression of the intron-lost genes, preferential losses of short introns, and the preferential losses of introns close to 3'-ends of genes. Like studies in most eukaryotic groups, we cannot find the source sequences for the limited number of intron gains detected in the Caenorhabditis genomes. CONCLUSIONS These results indicate that nematodes are a typical eukaryotic group rather than an outlier in intron evolution.
Collapse
Affiliation(s)
- Ming-Yue Ma
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - Ji Xia
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - Kun-Xian Shu
- Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
3
|
Chicote JU, DeSalle R, García-España A. Phosphotyrosine phosphatase R3 receptors: Origin, evolution and structural diversification. PLoS One 2017; 12:e0172887. [PMID: 28257417 PMCID: PMC5336234 DOI: 10.1371/journal.pone.0172887] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 02/10/2017] [Indexed: 11/18/2022] Open
Abstract
Subtype R3 phosphotyrosine phosphatase receptors (R3 RPTPs) are single-spanning membrane proteins characterized by a unique modular composition of extracellular fibronectin repeats and a single cytoplasmatic protein tyrosine phosphatase (PTP) domain. Vertebrate R3 RPTPs consist of five members: PTPRB, PTPRJ, PTPRH and PTPRO, which dephosphorylate tyrosine residues, and PTPRQ, which dephosphorylates phophoinositides. R3 RPTPs are considered novel therapeutic targets in several pathologies such as ear diseases, nephrotic syndromes and cancer. R3 RPTP vertebrate receptors, as well as their known invertebrate counterparts from animal models: PTP52F, PTP10D and PTP4e from the fruitfly Drosophila melanogaster and F44G4.8/DEP-1 from the nematode Caenorhabditis elegans, participate in the regulation of cellular activities including cell growth and differentiation. Despite sharing structural and functional properties, the evolutionary relationships between vertebrate and invertebrate R3 RPTPs are not fully understood. Here we gathered R3 RPTPs from organisms covering a broad evolutionary distance, annotated their structure and analyzed their phylogenetic relationships. We show that R3 RPTPs (i) have probably originated in the common ancestor of animals (metazoans), (ii) are variants of a single ancestral gene in protostomes (arthropods, annelids and nematodes); (iii) a likely duplication of this ancestral gene in invertebrate deuterostomes (echinodermes, hemichordates and tunicates) generated the precursors of PTPRQ and PTPRB genes, and (iv) R3 RPTP groups are monophyletic in vertebrates and have specific conserved structural characteristics. These findings could have implications for the interpretation of past studies and provide a framework for future studies and functional analysis of this important family of proteins.
Collapse
Affiliation(s)
- Javier U. Chicote
- Hospital Universitari de Tarragona Joan XXIII, Institut d’Investigació Sanitària Pere Virgili, Universitat Rovira i Virgili, Tarragona, Spain
| | - Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NewYork, United States of America
| | - Antonio García-España
- Hospital Universitari de Tarragona Joan XXIII, Institut d’Investigació Sanitària Pere Virgili, Universitat Rovira i Virgili, Tarragona, Spain
- * E-mail:
| |
Collapse
|
4
|
Peterson SW, Vega FE, Posada F, Nagai C. Penicillium coffeae,a new endophytic species isolated from a coffee plant and its phylogenetic relationship toP. fellutanum, P. thiersiiandP. brocaebased on parsimony analysis of multilocus DNA sequences. Mycologia 2017. [DOI: 10.1080/15572536.2006.11832796] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Stephen W. Peterson
- Microbial Genomics and Bioprocessing Research Unit, National Center for Agricultural Utilization Research, Agricultural Research Service, U.S. Department of Agriculture, 1815 North University Street, Peoria, Illinois 61604
| | | | - Francisco Posada
- Insect Biocontrol Laboratory, Agricultural Research Service, U.S. Department of Agriculture, Beltsville, Maryland 20705
| | - Chifumi Nagai
- Hawaii Agriculture Research Center (HARC), 99-193 Aiea Heights Drive, Suite 300 Aiea, Hawaii 96701-3911
| |
Collapse
|
5
|
Affiliation(s)
- Stephen W. Peterson
- Microbial Genomics and Bioprocessing Research Unit, National Center for Agricultural Utilization Research, Agricultural Research Service, U.S. Department of Agriculture, 1815 North University Street, Peoria, Illinois 61604
| |
Collapse
|
6
|
Behura SK, Severson DW. Overlapping genes of Aedes aegypti: evolutionary implications from comparison with orthologs of Anopheles gambiae and other insects. BMC Evol Biol 2013; 13:124. [PMID: 23777277 PMCID: PMC3689595 DOI: 10.1186/1471-2148-13-124] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Accepted: 06/12/2013] [Indexed: 11/11/2022] Open
Abstract
Background Although gene overlapping is a common feature of prokaryote and mitochondria genomes, such genes have also been identified in many eukaryotes. The overlapping genes in eukaryotes are extensively rearranged even between closely related species. In this study, we investigated retention and rearrangement of positionally overlapping genes between the mosquitoes Aedes aegypti (dengue virus vector) and Anopheles gambiae (malaria vector). The overlapping gene pairs of A. aegypti were further compared with orthologs of other selected insects to conduct several hypothesis driven investigations relating to the evolution and rearrangement of overlapping genes. Results The results show that as much as ~10% of the predicted genes of A. aegypti and A. gambiae are localized in positional overlapping manner. Furthermore, the study shows that differential abundance of introns and simple sequence repeats have significant association with positional rearrangement of overlapping genes between the two species. Gene expression analysis further suggests that antisense transcripts generated from the oppositely oriented overlapping genes are differentially regulated and may have important regulatory functions in these mosquitoes. Our data further shows that synonymous and non-synonymous mutations have differential but non-significant effect on overlapping localization of orthologous genes in other insect genomes. Conclusion Gene overlapping in insects may be a species-specific evolutionary process as evident from non-dependency of gene overlapping with species phylogeny. Based on the results, our study suggests that overlapping genes may have played an important role in genome evolution of insects.
Collapse
Affiliation(s)
- Susanta K Behura
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556, USA
| | | |
Collapse
|
7
|
Irimia M, Maeso I, Burguera D, Hidalgo-Sánchez M, Puelles L, Roy SW, Garcia-Fernàndez J, Ferran JL. Contrasting 5' and 3' evolutionary histories and frequent evolutionary convergence in Meis/hth gene structures. Genome Biol Evol 2011; 3:551-64. [PMID: 21680890 PMCID: PMC3140891 DOI: 10.1093/gbe/evr056] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Organisms show striking differences in genome structure; however, the functional implications and fundamental forces that govern these differences remain obscure. The intron–exon organization of nuclear genes is involved in a particularly large variety of structures and functional roles. We performed a 22-species study of Meis/hth genes, intron-rich homeodomain-containing transcription factors involved in a wide range of developmental processes. Our study revealed three surprising results that suggest important and very different functions for Meis intron–exon structures. First, we find unexpected conservation across species of intron positions and lengths along most of the Meis locus. This contrasts with the high degree of structural divergence found in genome-wide studies and may attest to conserved regulatory elements residing within these conserved introns. Second, we find very different evolutionary histories for the 5′ and 3′ regions of the gene. The 5′-most 10 exons, which encode the highly conserved Meis domain and homeodomain, show striking conservation. By contrast, the 3′ of the gene, which encodes several domains implicated in transcriptional activation and response to cell signaling, shows a remarkably active evolutionary history, with diverse isoforms and frequent creation and loss of new exons and splice sites. This region-specific diversity suggests evolutionary “tinkering,” with alternative splicing allowing for more subtle regulation of protein function. Third, we find a large number of cases of convergent evolution in the 3′ region, including 1) parallel losses of ancestral coding sequence, 2) parallel gains of external and internal splice sites, and 3) recurrent truncation of C-terminal coding regions. These results attest to the importance of locus-specific splicing functions in differences in structural evolution across genes, as well as to commonalities of forces shaping the evolution of individual genes along different lineages.
Collapse
Affiliation(s)
- Manuel Irimia
- Department of Genetics, School of Biology, University of Barcelona, Barcelona, Spain.
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Rebscher N, Deichmann C, Sudhop S, Fritzenwanker JH, Green S, Hassel M. Conserved intron positions in FGFR genes reflect the modular structure of FGFR and reveal stepwise addition of domains to an already complex ancestral FGFR. Dev Genes Evol 2009; 219:455-68. [PMID: 20016912 DOI: 10.1007/s00427-009-0309-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2009] [Accepted: 11/22/2009] [Indexed: 11/26/2022]
Abstract
We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.
Collapse
Affiliation(s)
- Nicole Rebscher
- FB 17, Morphology and Evolution of Invertebrates, Philipps Universitaet Marburg, Karl von Frisch Str. 8, 35032, Marburg, Germany
| | | | | | | | | | | |
Collapse
|
9
|
Garcia-España A, Mares R, Sun TT, DeSalle R. Intron evolution: testing hypotheses of intron evolution using the phylogenomics of tetraspanins. PLoS One 2009; 4:e4680. [PMID: 19262691 PMCID: PMC2650405 DOI: 10.1371/journal.pone.0004680] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2008] [Accepted: 12/30/2008] [Indexed: 11/20/2022] Open
Abstract
Background Although large scale informatics studies on introns can be useful in making broad inferences concerning patterns of intron gain and loss, more specific questions about intron evolution at a finer scale can be addressed using a gene family where structure and function are well known. Genome wide surveys of tetraspanins from a broad array of organisms with fully sequenced genomes are an excellent means to understand specifics of intron evolution. Our approach incorporated several new fully sequenced genomes that cover the major lineages of the animal kingdom as well as plants, protists and fungi. The analysis of exon/intron gene structure in such an evolutionary broad set of genomes allowed us to identify ancestral intron structure in tetraspanins throughout the eukaryotic tree of life. Methodology/Principal Findings We performed a phylogenomic analysis of the intron/exon structure of the tetraspanin protein family. In addition, to the already characterized tetraspanin introns numbered 1 through 6 found in animals, three additional ancient, phase 0 introns we call 4a, 4b and 4c were found. These three novel introns in combination with the ancestral introns 1 to 6, define three basic tetraspanin gene structures which have been conserved throughout the animal kingdom. Our phylogenomic approach also allows the estimation of the time at which the introns of the 33 human tetraspanin paralogs appeared, which in many cases coincides with the concomitant acquisition of new introns. On the other hand, we observed that new introns (introns other than 1–6, 4a, b and c) were not randomly inserted into the tetraspanin gene structure. The region of tetraspanin genes corresponding to the small extracellular loop (SEL) accounts for only 10.5% of the total sequence length but had 46% of the new animal intron insertions. Conclusions/Significance Our results indicate that tests of intron evolution are strengthened by the phylogenomic approach with specific gene families like tetraspanins. These tests add to our understanding of genomic innovation coupled to major evolutionary divergence events, functional constraints and the timing of the appearance of evolutionary novelty.
Collapse
Affiliation(s)
- Antonio Garcia-España
- Unitat de Recerca, Hospital Joan XXIII, Institut de Investigacio Sanitaria Rovira I Virgili (IISPV), Universitat Rovira i Virgili, Tarragona, Spain
- CIBER de Diabetes y Enfermedades Metabólicas Asociadas (CIBERDEM), Universitat Rovira i Virgili, Tarragona, Spain
- * E-mail: (AG); (RD)
| | - Roso Mares
- Unitat de Recerca, Hospital Joan XXIII, Institut de Investigacio Sanitaria Rovira I Virgili (IISPV), Universitat Rovira i Virgili, Tarragona, Spain
| | - Tung-Tien Sun
- Department of Cell Biology, New York University School of Medicine, New York, New York, United States of America
- Department of Dermatology, New York University School of Medicine, New York, New York, United States of America
- Department of Pharmacology, New York University School of Medicine, New York, New York, United States of America
- Department of Urology, New York University School of Medicine, New York, New York, United States of America
| | - Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
- * E-mail: (AG); (RD)
| |
Collapse
|
10
|
Regulapati R, Bhasi A, Singh CK, Senapathy P. Origination of the split structure of spliceosomal genes from random genetic sequences. PLoS One 2008; 3:e3456. [PMID: 18941625 PMCID: PMC2565106 DOI: 10.1371/journal.pone.0003456] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2008] [Accepted: 08/01/2008] [Indexed: 11/18/2022] Open
Abstract
The mechanism by which protein-coding portions of eukaryotic genes came to be separated by long non-coding stretches of DNA, and the purpose for this perplexing arrangement, have remained unresolved fundamental biological problems for three decades. We report here a plausible solution to this problem based on analysis of open reading frame (ORF) length constraints in the genomes of nine diverse species. If primordial nucleic acid sequences were random in sequence, functional proteins that are innately long would not be encoded due to the frequent occurrence of stop codons. The best possible way that a long protein-coding sequence could have been derived was by evolving a split-structure from the random DNA (or RNA) sequence. Results of the systematic analyses of nine complete genome sequences presented here suggests that perhaps the major underlying structural features of split-genes have evolved due to the indigenous occurrence of split protein-coding genes in primordial random nucleotide sequence. The results also suggest that intron-rich genes containing short exons may have been the original form of genes intrinsically occurring in random DNA, and that intron-poor genes containing long exons were perhaps derived from the original intron-rich genes.
Collapse
Affiliation(s)
- Rahul Regulapati
- Department of Biotechnology, Indian Institute of Technology Madras, Chennai, India
| | - Ashwini Bhasi
- Department of Human Genetics, Genome International Corporation, Madison, Wisconsin, United States of America
| | - Chandan Kumar Singh
- Department of Bioinformatics, International Center for Advanced Genomics and Proteomics, Nehru Nagar, Chennai, India
| | - Periannan Senapathy
- Department of Human Genetics, Genome International Corporation, Madison, Wisconsin, United States of America
- Department of Bioinformatics, International Center for Advanced Genomics and Proteomics, Nehru Nagar, Chennai, India
- * E-mail:
| |
Collapse
|
11
|
Irimia M, Rukov JL, Penny D, Vinther J, Garcia-Fernandez J, Roy SW. Origin of introns by 'intronization' of exonic sequences. Trends Genet 2008; 24:378-81. [PMID: 18597887 DOI: 10.1016/j.tig.2008.05.007] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2007] [Revised: 05/19/2008] [Accepted: 05/20/2008] [Indexed: 11/24/2022]
Abstract
The mechanisms of spliceosomal intron creation have proved elusive. Here we describe a new mechanism: the recruitment of internal exonic sequences ('intronization') in Caenorhabditis species. The numbers of intronization events and introns gained by other mechanisms are similar, suggesting that intronization significantly contributes to recent intron creation in nematodes. Intronization is more common than the reverse process, loss of splicing of retained introns. Finally, these findings link alternative splicing with modern intron creation.
Collapse
Affiliation(s)
- Manuel Irimia
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Barcelona 08028, Spain
| | | | | | | | | | | |
Collapse
|
12
|
Lin H, Zhu W, Silva JC, Gu X, Buell CR. Intron gain and loss in segmentally duplicated genes in rice. Genome Biol 2006; 7:R41. [PMID: 16719932 PMCID: PMC1779517 DOI: 10.1186/gb-2006-7-5-r41] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2006] [Revised: 03/21/2006] [Accepted: 04/24/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Introns are under less selection pressure than exons, and consequently, intronic sequences have a higher rate of gain and loss than exons. In a number of plant species, a large portion of the genome has been segmentally duplicated, giving rise to a large set of duplicated genes. The recent completion of the rice genome in which segmental duplication has been documented has allowed us to investigate intron evolution within rice, a diploid monocotyledonous species. RESULTS Analysis of segmental duplication in rice revealed that 159 Mb of the 371 Mb genome and 21,570 of the 43,719 non-transposable element-related genes were contained within a duplicated region. In these duplicated regions, 3,101 collinear paired genes were present. Using this set of segmentally duplicated genes, we investigated intron evolution from full-length cDNA-supported non-transposable element-related gene models of rice. Using gene pairs that have an ortholog in the dicotyledonous model species Arabidopsis thaliana, we identified more intron loss (49 introns within 35 gene pairs) than intron gain (5 introns within 5 gene pairs) following segmental duplication. We were unable to demonstrate preferential intron loss at the 3' end of genes as previously reported in mammalian genomes. However, we did find that the four nucleotides of exons that flank lost introns had less frequently used 4-mers. CONCLUSION We observed that intron evolution within rice following segmental duplication is largely dominated by intron loss. In two of the five cases of intron gain within segmentally duplicated genes, the gained sequences were similar to transposable elements.
Collapse
Affiliation(s)
- Haining Lin
- The Institute for Genomic Research, Medical Center Drive, Rockville, MD 20850, USA
| | - Wei Zhu
- The Institute for Genomic Research, Medical Center Drive, Rockville, MD 20850, USA
| | - Joana C Silva
- The Institute for Genomic Research, Medical Center Drive, Rockville, MD 20850, USA
| | - Xun Gu
- Department of Genetics, Development, and Cell Biology, Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, IA 50011, USA
| | - C Robin Buell
- The Institute for Genomic Research, Medical Center Drive, Rockville, MD 20850, USA
| |
Collapse
|
13
|
Roy SW, Hartl DL. Very little intron loss/gain in Plasmodium: intron loss/gain mutation rates and intron number. Genome Res 2006; 16:750-6. [PMID: 16702411 PMCID: PMC1473185 DOI: 10.1101/gr.4845406] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We compared intron positions in conserved regions of 3479 orthologous gene pairs from Plasmodium falciparum and Plasmodium yoelii, which likely diverged >or=100 million years ago (Mya). Only 27 out of 2212 positions were specific to one of the two species. Intron presence in related species shows that at least 19 and possibly 26 of the changes are due to intron loss, depending on phylogeny. The implied intron loss and gain rates are much lower than previously estimated for nematodes, arthropods, fungi, and plants, and are comparable only with the rates in vertebrates. That all observed changes were exact, occurring without loss or gain of flanking coding sequence, suggests intron loss via an mRNA intermediate, as does a nonsignificant trend toward loss of introns at adjacent positions. Many of the intron changes occurred in genes encoding proteins involved in nucleic acid-related processes, as previously found for intron gains in nematodes. Two changes occurred in the chloroquine resistance transporter, suggesting a role for positive selection in intron loss in Plasmodium. The dearth of intron loss and gain could be explained by the lack of known transposable elements in Plasmodium, since transposable elements and/or reverse transcriptase are thought to be necessary for both processes. The observed pattern suggests that the availability of stochastic intron loss and gain mutations can be a major determinant of changes in intron number.
Collapse
Affiliation(s)
- Scott William Roy
- Department of Organismic and Evolutionary Biology, Harvard, Cambridge, Massachusetts 02138, USA.
| | | |
Collapse
|
14
|
Goetze E. Elongation factor 1-alpha in marine copepods (Calanoida: Eucalanidae): phylogenetic utility and unique intron structure. Mol Phylogenet Evol 2006; 40:880-6. [PMID: 16725351 DOI: 10.1016/j.ympev.2006.04.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2005] [Revised: 04/05/2006] [Accepted: 04/06/2006] [Indexed: 11/23/2022]
Affiliation(s)
- Erica Goetze
- Department of Marine Ecology and Aquaculture, Danish Institute for Fisheries Research, Kavalergården 6, DK-2920 Charlottenlund, Denmark.
| |
Collapse
|
15
|
Abstract
In this work, 21 completely sequenced eukaryotic genomes were analyzed using an intragene comparison approach. We found that all of these genomes show a significant 5'-biased distribution of introns of protein-coding genes. Our findings are different from previous studies based on the intergene method, where introns are biased towards the 5' end of genes only in intron-poor genomes, but are evenly distributed in intron-rich genomes. In addition, by analyzing the patterns of intron distribution of a set of well-compiled housekeeping genes from human and their respective orthologs identified by a bidirectional best BLAST hit method from the other genomes, we found that the trend of 5'-biased intron positions of the set of housekeeping genes for each genome is much more skewed than that of all genes of the same genome, and rarely if any of the housekeeping genes examined have an extremely 3'-biased position distribution in which all introns of a gene are located only at the 3' portion of the gene. The most parsimonious explanation for our findings may be the model in which intron loss is caused by homologous recombination between the genomic copy of a gene and a reverse transcriptase product of a spliced mRNA.
Collapse
Affiliation(s)
- Kui Lin
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and College of Life Sciences, Beijing Normal University, Beijing 100875, China.
| | | |
Collapse
|
16
|
Froy O. Convergent evolution of invertebrate defensins and nematode antibacterial factors. Trends Microbiol 2005; 13:314-9. [PMID: 15914006 DOI: 10.1016/j.tim.2005.05.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2005] [Revised: 04/20/2005] [Accepted: 05/06/2005] [Indexed: 10/25/2022]
Abstract
Antibacterial factors (ABFs) are secreted polypeptides that have an important role in the innate immune system of nematodes. Comparison of these polypeptides revealed similarity in bioactivity, protein sequence and 3D structure, suggesting that they originated from a common ancestor. Comparison of gene organization of nematode ABF genes revealed that all except one contain a Phase 0 intron at a conserved location. The intron phase and location are congruent with the postulated intron gain rules, suggesting a gain of intron before duplication and divergence of the ancestral gene. Although nematode ABFs display similarity in activity and structure to invertebrate (arthropod and mollusk) defensins, lack of sequence similarity and the different genomic organization suggest that these two polypeptide families exhibit convergent evolution.
Collapse
Affiliation(s)
- Oren Froy
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agricultural, Food and Environmental Quality, The Hebrew University of Jerusalem, P.O. Box 12, Rehovot 76100, Israel.
| |
Collapse
|
17
|
Doolittle RF. Evolutionary aspects of whole-genome biology. Curr Opin Struct Biol 2005; 15:248-53. [PMID: 15963888 DOI: 10.1016/j.sbi.2005.04.001] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2005] [Revised: 02/08/2005] [Accepted: 04/12/2005] [Indexed: 11/28/2022]
Abstract
A decade of access to whole-genome sequences has been increasingly revealing about the informational network relating all living organisms. Although at one point there was concern that extensive horizontal gene transfer might hopelessly muddle phylogenies, it has not proved a severe hindrance. The melding of sequence and structural information is being used to great advantage, and the prospect exists that some of the earliest aspects of life on Earth can be reconstructed, including the invention of biosynthetic and metabolic pathways. Still, some fundamental phylogenetic problems remain, including determining the root--if there is one--of the historical relationship between Archaea, Bacteria and Eukarya.
Collapse
Affiliation(s)
- Russell F Doolittle
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, CA 92093-0314, USA.
| |
Collapse
|
18
|
Roy SW, Gilbert W. Rates of intron loss and gain: implications for early eukaryotic evolution. Proc Natl Acad Sci U S A 2005; 102:5773-8. [PMID: 15827119 PMCID: PMC556292 DOI: 10.1073/pnas.0500383102] [Citation(s) in RCA: 150] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We study the intron-exon structures of 684 groups of orthologs from seven diverse eukaryotic genomes and provide maximum likelihood estimates for rates and numbers of intron losses and gains in these same genes for a variety of lineages. Rates of intron loss vary from approximately 2 x 10(-9) to 2 x 10(-10) per year. Rates of gain vary from 6 x 10(-13) to 4 x 10(-12) per possible intron insertion site per year. There is an inverse correspondence between rates of intron loss and gain, leading to a 20-fold variation among lineages in the ratio of the rates of the two processes. The observed rates of intron gain are insufficient to explain the large number of introns estimated to have been present in the plant-animal ancestor, suggesting that introns present in early eukaryotes may have been created by a fundamentally different process than more recently gained introns.
Collapse
Affiliation(s)
- Scott William Roy
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
19
|
Abstract
We use the pattern of intron conservation in 684 groups of orthologs from seven fully sequenced eukaryotic genomes to provide maximum likelihood estimates of the number of introns present in the same orthologs in various eukaryotic ancestors. We find: (i) intron density in the plant-animal ancestor was high, perhaps two-thirds that of humans and three times that of Drosophila; and (ii) intron density in the ancestral bilateran was also high, equaling that of humans and four times that of Drosophila. We further find that modern introns are generally very old, with two-thirds of modern bilateran introns dating to the ancestral bilateran and two-fifths of modern plant, animal, and fungus introns dating to the plant-animal ancestor. Intron losses outnumber gains over a large range of eukaryotic lineages. These results show that early eukaryotic gene structures were very complex, and that simplification, not embellishment, has dominated subsequent evolution.
Collapse
Affiliation(s)
- Scott W Roy
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
20
|
Abstract
A recent study makes considerable progress towards answering the question of how genes acquire introns by identifying numerous recently gained introns in nematodes. The long-standing question of how genes acquire introns has provoked much debate. A recent study makes considerable progress by identifying numerous recently gained introns in nematodes - although it remains difficult to distinguish definitively between models of intron gain.
Collapse
Affiliation(s)
- Scott W Roy
- Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| |
Collapse
|