1
Ware TB, Hsu KL. Advances in chemical proteomic evaluation of lipid kinases-DAG kinases as a case study. Curr Opin Chem Biol 2021; 65:101-108. [PMID: 34311404 PMCID: PMC8671151 DOI: 10.1016/j.cbpa.2021.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2021] [Revised: 05/24/2021] [Accepted: 06/18/2021] [Indexed: 10/20/2022]
Abstract
Advancements in chemical proteomics and mass spectrometry lipidomics are providing new opportunities to understand lipid kinase activity, specificity, and regulation on a global cellular scale. Here, we describe recent developments in chemical biology of lipid kinases with a focus on those members that phosphorylate diacylglycerols. We further discuss future implications of how these mass spectrometry-based approaches can be adapted for studies of additional lipid kinase members with the aim of bridging the gap between protein and lipid kinase-focused investigations.
Affiliation(s)
- Timothy B Ware
- Department of Chemistry, University of Virginia, Charlottesville, VA 22904, United States
- Ku-Lung Hsu
- Department of Chemistry, University of Virginia, Charlottesville, VA 22904, United States; Department of Pharmacology, University of Virginia School of Medicine, Charlottesville, VA 22908, United States; Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, United States; University of Virginia Cancer Center, University of Virginia, Charlottesville, VA 22903, USA.
2
Kang S, Tice AK, Stairs CW, Jones RE, Lahr DJG, Brown MW. The integrin-mediated adhesive complex in the ancestor of animals, fungi, and amoebae. Curr Biol 2021; 31:3073-3085.e3. [PMID: 34077702 DOI: 10.1016/j.cub.2021.04.076] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 04/30/2020] [Revised: 03/17/2021] [Accepted: 04/28/2021] [Indexed: 11/25/2022]
Abstract
Integrins are transmembrane receptors that activate signal transduction pathways upon extracellular matrix binding. The integrin-mediated adhesive complex (IMAC) mediates various cell physiological processes. Although the IMAC was thought to be specific to animals, in the past ten years these complexes were discovered in other lineages of Obazoa, the group containing animals, fungi, and several microbial eukaryotes. Very recently, many genomes and transcriptomes from Amoebozoa (the eukaryotic supergroup sister to Obazoa), other obazoans, orphan protist lineages, and the eukaryotes' closest prokaryotic relatives, have become available. To increase the resolution of where and when IMAC proteins exist and have emerged, we surveyed these newly available genomes and transcriptomes for the presence of IMAC proteins. Our results highlight that many of these proteins appear to have evolved earlier in eukaryote evolution than previously thought and that co-option of this apparently ancient protein complex was key to the emergence of animal-type multicellularity. The role of the IMACs in amoebozoans is unknown, but they play critical adhesive roles in at least some unicellular organisms.
Affiliation(s)
- Seungho Kang
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Starkville, MS, USA
- Alexander K Tice
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Starkville, MS, USA
- Courtney W Stairs
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden; Department of Biology, Lund University, Lund, Sweden
- Robert E Jones
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Starkville, MS, USA
- Daniel J G Lahr
- Department of Zoology, University of São Paulo, São Paulo, Brazil
- Matthew W Brown
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Starkville, MS, USA.
3
McCartney AM, Hyland EM, Cormican P, Moran RJ, Webb AE, Lee KD, Hernandez-Rodriguez J, Prado-Martinez J, Creevey CJ, Aspden JL, McInerney JO, Marques-Bonet T, O'Connell MJ. Gene Fusions Derived by Transcriptional Readthrough are Driven by Segmental Duplication in Human. Genome Biol Evol 2020; 11:2678-2690. [PMID: 31400206 PMCID: PMC6764479 DOI: 10.1093/gbe/evz163] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Accepted: 07/17/2019] [Indexed: 12/14/2022]
Abstract
Gene fusion occurs when two or more individual genes with independent open reading frames become juxtaposed under the same open reading frame, creating a new fused gene. A small number of gene fusions described in detail have been associated with novel functions, for example, the hominid-specific PIPSL gene, TNFSF12, and the TWE-PRIL gene family. We use Sequence Similarity Networks and species level comparisons of great ape genomes to identify 45 new genes that have emerged by transcriptional readthrough, that is, transcription-derived gene fusion. For 35 of these putative gene fusions, we have been able to assess available RNAseq data to determine whether there are reads that map to each breakpoint. A total of 29 of the putative gene fusions had annotated transcripts (9/29 of which are human-specific). We carried out RT-qPCR in a range of human tissues (placenta, lung, liver, brain, and testes) and found that 23 of the putative gene fusion events were expressed in at least one tissue. Examining the available ribosome foot-printing data, we find evidence for translation of three of the fused genes in human. Finally, we find enrichment for transcription-derived gene fusions in regions of known segmental duplication in human. Together, our results implicate chromosomal structural variation brought about by segmental duplication with the emergence of novel transcripts and translated protein products.
Affiliation(s)
- Ann M McCartney
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland; Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- Edel M Hyland
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland; Institute for Global Food Security, Queens University Belfast, United Kingdom
- Paul Cormican
- Teagasc Animal and Bioscience Research Department, Animal & Grassland Research and Innovation Centre, Teagasc, Grange, Dunsany, County Meath, Ireland
- Raymond J Moran
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland; Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- Andrew E Webb
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
- Kate D Lee
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland; School of Biological Sciences, University of Auckland, New Zealand; School of Fundamental Sciences, Massey University, New Zealand
- Javier Prado-Martinez
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain; Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom
- Christopher J Creevey
- Institute for Global Food Security, Queens University Belfast, United Kingdom; Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, United Kingdom
- Julie L Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- James O McInerney
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, M13 9PL, United Kingdom; School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
- Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain; NAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallés, Barcelona, Spain
- Mary J O'Connell
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland; Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom; School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
4
Paço A, Freitas R, Vieira-da-Silva A. Conversion of DNA Sequences: From a Transposable Element to a Tandem Repeat or to a Gene. Genes (Basel) 2019; 10:E1014. [PMID: 31817529 PMCID: PMC6947457 DOI: 10.3390/genes10121014] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Received: 10/19/2019] [Revised: 11/18/2019] [Accepted: 11/29/2019] [Indexed: 01/24/2023]
Abstract
Eukaryotic genomes are rich in repetitive DNA sequences grouped in two classes regarding their genomic organization: tandem repeats and dispersed repeats. In tandem repeats, copies of a short DNA sequence are positioned one after another within the genome, while in dispersed repeats, these copies are randomly distributed. In this review we provide evidence that both tandem and dispersed repeats can have a similar organization, which leads us to suggest an update to their classification based on the sequence features, concretely regarding the presence or absence of retrotransposons/transposon specific domains. In addition, we analyze several studies that show that a repetitive element can be remodeled into repetitive non-coding or coding sequences, suggesting (1) an evolutionary relationship among DNA sequences, and (2) that the evolution of the genomes involved frequent repetitive sequence reshuffling, a process that we have designated as a "DNA remodeling mechanism". The alternative classification of the repetitive DNA sequences here proposed will provide a novel theoretical framework that recognizes the importance of DNA remodeling for the evolution and plasticity of eukaryotic genomes.
Affiliation(s)
- Ana Paço
- MED-Mediterranean Institute for Agriculture, Environment and Development, University of Évora, 7002–554 Évora, Portugal;
- Renata Freitas
- IBMC-Institute for Molecular and Cell Biology, University of Porto, R. Campo Alegre 823, 4150–180 Porto, Portugal;
- I3S-Institute for Innovation and Health Research, University of Porto, Rua Alfredo Allen, 208, 4200–135 Porto, Portugal
- ICBAS-Institute of Biomedical Sciences Abel Salazar, University of Porto, 4050-313 Porto, Portugal
- Ana Vieira-da-Silva
- MED-Mediterranean Institute for Agriculture, Environment and Development, University of Évora, 7002–554 Évora, Portugal;
5
Wu H, Singh S, Shi X, Xie Z, Lin E, Li X, Li H. Functional heritage: the evolution of chimeric RNA into a gene. RNA Biol 2019; 17:125-134. [PMID: 31566065 DOI: 10.1080/15476286.2019.1670038] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Indexed: 12/29/2022]
Abstract
Once believed to be unique features of neoplasia, chimeric RNAs are now being discovered in normal physiology. We speculated that some chimeric RNAs may be functional precursors of genes, and that forming chimeric RNA at the transcriptional level may be a 'trial' mechanism before the functional element is fixed into the genome. Supporting this idea, we identified a chimeric RNA, HNRNPA1L2-SUGT1 (H-S), whose sequence is highly similar to that of a 'pseudogene' MRPS31P5. Sequence analysis revealed that the MRPS31P5 transcript is more similar to the H-S chimeric RNA than to its 'parent' gene, MRPS31. Evolutionarily, H-S precedes MRPS31P5, as it can be detected bioinformatically and experimentally in marmosets, which do not yet possess MRPS31P5 in their genome. Conversely, H-S is minimally expressed in humans, whereas MRPS31P5 is abundantly expressed. Silencing H-S in marmoset cells resulted in a phenotype similar to that of silencing MRPS31P5 in human cells. In addition, whole transcriptome analysis and candidate downstream target validation revealed common signalling pathways shared by the two transcripts. Interestingly, H-S failed to rescue the phenotype caused by silencing MRPS31P5 in human and rhesus cells, whereas MRPS31P5 can at least partially rescue the phenotype caused by silencing H-S in marmoset cells, suggesting that MRPS31P5 may have further evolved into a distinct entity. Thus, multiple lines of evidence support that MRPS31P5 is not truly a pseudogene of MRPS31, but a likely functional descendant of the H-S chimera. Instead of being a gene fusion product, H-S is a product of cis-splicing between adjacent genes, while MRPS31P5 is likely produced by genome rearrangement.
Affiliation(s)
- Hao Wu
- Department of Gastrointestinal Surgery, The Third Xiangya Hospital of Central South University, Changsha, Hunan, China; Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
- Sandeep Singh
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
- Xinrui Shi
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA, USA
- Zhongqiu Xie
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
- Emily Lin
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
- Xiaorong Li
- Department of Gastrointestinal Surgery, The Third Xiangya Hospital of Central South University, Changsha, Hunan, China
- Hui Li
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA; Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA, USA
6
Tang Y, Ma S, Wang X, Xing Q, Huang T, Liu H, Li Q, Zhang Y, Zhang K, Yao M, Yang GL, Li H, Zang X, Yang B, Guan F. Identification of chimeric RNAs in human infant brains and their implications in neural differentiation. Int J Biochem Cell Biol 2019; 111:19-26. [DOI: 10.1016/j.biocel.2019.03.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 10/31/2018] [Revised: 03/06/2019] [Accepted: 03/30/2019] [Indexed: 02/07/2023]
7
Matsumura K, Imai H, Go Y, Kusuhara M, Yamaguchi K, Shirai T, Ohshima K. Transcriptional activation of a chimeric retrogene PIPSL in a hominoid ancestor. Gene 2018; 678:318-323. [PMID: 30096459 DOI: 10.1016/j.gene.2018.08.033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/02/2018] [Revised: 08/05/2018] [Accepted: 08/07/2018] [Indexed: 01/09/2023]
Abstract
Retrogenes are a class of functional genes derived from the mRNA of various intron-containing genes. PIPSL was created through a unique mechanism, whereby distinct genes were assembled at the RNA level, and the resulting chimera was then reverse transcribed and integrated into the genome by the L1 retrotransposon. Expression of PIPSL RNA via its transcription start sites (TSSs) has been confirmed in the testes of humans and chimpanzee. Here, we demonstrated that PIPSL RNA is expressed in the testis of the white-handed gibbon. The 5'-end positions of gibbon RNAs were confined to a narrow range upstream of the PIPSL start codon and overlapped with those of orangutan and human, suggesting that PIPSL TSSs are similar among hominoid species. Reporter assays using a luciferase gene and the flanking sequences of human PIPSL showed that an upstream sequence exhibits weak promoter activity in human cells. Our findings suggest that PIPSL might have acquired a promoter at an early stage of hominoid evolution before the divergence of gibbons and ultimately retained similar TSSs in all of the lineages. Moreover, the upstream sequence derived from the phosphatidylinositol-4-phosphate 5-kinase, type I, alpha 5' untranslated region and/or neighboring repetitive sequences in the genome possibly exhibits promoter activity. Furthermore, we observed that a TATA-box-like sequence has emerged by nucleotide substitution in a lineage leading to humans, with this possibly responsible for a broader distribution of the human PIPSL TSSs.
Affiliation(s)
- Kenya Matsumura
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga, Japan; Shizuoka Cancer Center Research Institute, Sunto, Shizuoka, Japan
- Hiroo Imai
- Department of Cellular and Molecular Biology, Primate Research Institute, Kyoto University, Inuyama, Aichi, Japan
- Yasuhiro Go
- Cognitive Genomics Research Group, Exploratory Research Center on Life and Living Systems, National Institutes of Natural Sciences, Okazaki, Aichi, Japan; Department of Physiological Sciences, National Institute for Physiological Sciences, Okazaki, Aichi, Japan; School of Life Science, SOKENDAI (The Graduate University for Advanced Studies), Okazaki, Aichi, Japan
- Ken Yamaguchi
- Shizuoka Cancer Center Research Institute, Sunto, Shizuoka, Japan
- Tsuyoshi Shirai
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga, Japan
- Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga, Japan.
8
Yadav U, Arya R, Kundu S, Sundd M. The “Recognition Helix” of the Type II Acyl Carrier Protein (ACP) Utilizes a “Ubiquitin Interacting Motif (UIM)”-like Surface To Bind Its Partners. Biochemistry 2018; 57:3690-3701. [DOI: 10.1021/acs.biochem.8b00220] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 11/29/2022]
Affiliation(s)
- Usha Yadav
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110 067, India
- Richa Arya
- Department of Biochemistry, University of Delhi South Campus, Benito Juarez Road, New Delhi 110 021, India
- Suman Kundu
- Department of Biochemistry, University of Delhi South Campus, Benito Juarez Road, New Delhi 110 021, India
- Monica Sundd
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110 067, India
9
Hagel JM, Facchini PJ. Tying the knot: occurrence and possible significance of gene fusions in plant metabolism and beyond. J Exp Bot 2017; 68:4029-4043. [PMID: 28521055 DOI: 10.1093/jxb/erx152] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Indexed: 05/22/2023]
Abstract
Gene fusions have recently attracted attention especially in the field of plant specialized metabolism. The occurrence of a gene fusion, in which originally separate gene products are combined into a single polypeptide, often corresponds to the functional association of individual components within a single metabolic pathway. Examples include gene fusions implicated in benzylisoquinoline alkaloid (BIA), terpenoid, and amino acid biosynthetic pathways, in which distinct domains within a fusion catalyze consecutive, yet independent reactions. Both genomic and transcriptional mechanisms result in the fusion of gene products, which can include partial or complete domain repeats and extensive domain shuffling as evident in the BIA biosynthetic enzyme norcoclaurine synthase. Artificial gene fusions are commonly deployed in attempts to engineer new or improved pathways in plants or microorganisms, based on the premise that fusions are advantageous. However, a survey of functionally characterized fusions in microbial systems shows that the functional impact of fused gene products is not straightforward. For example, whereas enzyme fusions might facilitate the metabolic channeling of unstable intermediates, this channeling can also occur between tightly associated independent enzymes. The frequent occurrence of both fused and unfused enzymes in plant and microbial metabolism adds additional complexity, in terms of both pathway functionality and evolution.
Affiliation(s)
- Jillian M Hagel
- Department of Biological Sciences, University of Calgary, 2500 University Dr N.W., Alberta T2N 1N4, Canada
- Peter J Facchini
- Department of Biological Sciences, University of Calgary, 2500 University Dr N.W., Alberta T2N 1N4, Canada
10
Tandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba. PLoS Genet 2017; 13:e1006795. [PMID: 28531189 PMCID: PMC5460883 DOI: 10.1371/journal.pgen.1006795] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Received: 06/08/2015] [Revised: 06/06/2017] [Accepted: 05/03/2017] [Indexed: 01/06/2023]
Abstract
One common hypothesis to explain the impacts of tandem duplications is that whole gene duplications commonly produce additive changes in gene expression due to copy number changes. Here, we use genome wide RNA-seq data from a population sample of Drosophila yakuba to test this ‘gene dosage’ hypothesis. We observe little evidence of expression changes in response to whole transcript duplication capturing 5′ and 3′ UTRs. Among whole gene duplications, we observe evidence that dosage sharing across copies is likely to be common. The lack of expression changes after whole gene duplication suggests that the majority of genes are subject to tight regulatory control and therefore not sensitive to changes in gene copy number. Rather, we observe changes in expression level due to both shuffling of regulatory elements and the creation of chimeric structures via tandem duplication. Additionally, we observe 30 de novo gene structures arising from tandem duplications, 23 of which form with expression in the testes. Thus, the value of tandem duplications is likely to be more intricate than simple changes in gene dosage. The common regulatory effects from chimeric gene formation after tandem duplication may explain their contribution to genome evolution. The enclosed work shows that whole gene duplications rarely affect gene expression, in contrast to widely held views that the adaptive value of duplicate genes is related to additive changes in gene expression due to gene copy number. We further explain how tandem duplications that create shuffled gene structures can force upregulation of gene sequences, de novo gene creation, and multifold changes in transcript levels. These results show that tandem duplications can produce new genes that are a source of immediate novelty associated with more extreme expression changes than previously suggested by theory. 
Further, these gene expression changes are a potential source of both beneficial and pathogenic mutations, immediately relevant to clinical and medical genetics in humans and other metazoans.
11
It Is Imperative to Establish a Pellucid Definition of Chimeric RNA and to Clear Up a Lot of Confusion in the Relevant Research. Int J Mol Sci 2017; 18:ijms18040714. [PMID: 28350330 PMCID: PMC5412300 DOI: 10.3390/ijms18040714] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Received: 02/07/2017] [Revised: 03/15/2017] [Accepted: 03/17/2017] [Indexed: 12/27/2022]
Abstract
There have been tens of thousands of RNAs deposited in different databases that contain sequences of two genes and are coined chimeric RNAs, or chimeras. However, "chimeric RNA" has never been lucidly defined, partly because "gene" itself is still ill-defined and because the means of production for many RNAs is unclear. Since the number of putative chimeras is soaring, it is imperative to establish a pellucid definition for it, in order to differentiate chimeras from regular RNAs. Otherwise, not only will chimeric RNA studies be misled but also characterization of fusion genes and unannotated genes will be hindered. We propose that only those RNAs that are formed by joining two RNA transcripts together without a fusion gene as a genomic basis should be regarded as authentic chimeras, whereas those RNAs transcribed as, and cis-spliced from, single transcripts should not be deemed as chimeras. Many RNAs containing sequences of two neighboring genes may be transcribed via a readthrough mechanism, and thus are actually RNAs of unannotated genes or RNA variants of known genes, but not chimeras. In today's chimeric RNA research, there are still several key flaws, technical constraints and understudied tasks, which are also described in this perspective essay.
12
Kozlov AP. Expression of evolutionarily novel genes in tumors. Infect Agent Cancer 2016; 11:34. [PMID: 27437030 PMCID: PMC4949931 DOI: 10.1186/s13027-016-0077-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Received: 01/25/2016] [Accepted: 05/18/2016] [Indexed: 01/29/2023]
Abstract
Evolutionarily novel genes that originated through different molecular mechanisms are expressed in tumors. Sometimes the expression of evolutionarily novel genes in tumors is highly specific. Moreover, positive selection of many human tumor-related genes in the primate lineage suggests their involvement in the origin of new functions beneficial to organisms. It is suggested that the expression of evolutionarily young or novel genes in tumors be considered a new biological phenomenon, that of TSEEN (tumor specifically expressed, evolutionarily novel) genes.
Affiliation(s)
- A. P. Kozlov
- The Biomedical Center and Peter the Great St. Petersburg Polytechnic University, St. Petersburg, Russia
13
Zhang ZN, Wu QY, Zhang GZ, Zhu YY, Murphy RW, Liu Z, Zou CG. Systematic analyses reveal uniqueness and origin of the CFEM domain in fungi. Sci Rep 2015; 5:13032. [PMID: 26255557 PMCID: PMC4530338 DOI: 10.1038/srep13032] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Received: 03/26/2015] [Accepted: 07/16/2015] [Indexed: 11/25/2022]
Abstract
The CFEM domain commonly occurs in fungal extracellular membrane proteins. To provide insights into the putative functions of CFEM, we investigate the evolutionary dynamics of CFEM domains by systematic comparative genomic analyses among diverse animals, plants, and more than 100 fungal species, which are representative across the entire group of fungi. Here we show that the CFEM domain is unique to fungi. Experiments using tissue culture demonstrate that the CFEM-containing ESTs in some plants originate from endophytic fungi. We also find that the CFEM domain does not occur in all fungi. It has a single origin, dating to the most recent common ancestor of Ascomycota and Basidiomycota, rather than multiple origins. Although the length and architecture of CFEM domains are relatively conserved, the number of domains varies significantly among different fungal species. In general, pathogenic fungi have a larger number of domains compared to other species. Domain expansion across fungal genomes appears to be driven by domain duplication and gene duplication via recombination. These findings generate a clear evolutionary trajectory of CFEM domains and provide novel insights into the functional exchange of CFEM-containing proteins from cell-surface components to mediators in host-pathogen interactions.
Affiliation(s)
- Zhen-Na Zhang
- [1] Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, China [2] Xiamen Tobacco Industrial CO., LTD, Xiamen, China
- Qin-Yi Wu
- Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, China
- Yue-Yan Zhu
- Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, China
- Robert W Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Zhen Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Cheng-Gang Zou
- Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, China
14
Peng Z, Yuan C, Zellmer L, Liu S, Xu N, Liao DJ. Hypothesis: Artifacts, Including Spurious Chimeric RNAs with a Short Homologous Sequence, Caused by Consecutive Reverse Transcriptions and Endogenous Random Primers. J Cancer 2015; 6:555-67. [PMID: 26000048 PMCID: PMC4439942 DOI: 10.7150/jca.11997] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Received: 02/26/2015] [Accepted: 04/02/2015] [Indexed: 12/21/2022]
Abstract
Recent RNA-sequencing technology and associated bioinformatics have led to identification of tens of thousands of putative human chimeric RNAs, i.e. RNAs containing sequences from two different genes, most of which are derived from neighboring genes on the same chromosome. In this essay, we redefine "two neighboring genes" as those producing individual transcripts, and point out two known mechanisms for chimeric RNA formation, i.e. transcription from a fusion gene or trans-splicing of two RNAs. By our definition, most putative RNA chimeras derived from canonically-defined neighboring genes may either be technical artifacts or be cis-splicing products of 5'- or 3'-extended RNA of either partner that is redefined herein as an unannotated gene, whereas trans-splicing events are rare in human cells. Therefore, most authentic chimeric RNAs result from fusion genes, about 1,000 of which have been identified hitherto. We propose a hypothesis of "consecutive reverse transcriptions (RTs)", i.e. another RT reaction following the previous one, for how most spurious chimeric RNAs, especially those containing a short homologous sequence, may be generated during RT, especially in RNA-sequencing wherein RNAs are fragmented. We also point out that RNA samples contain numerous RNA and DNA shreds that can serve as endogenous random primers for RT and ensuing polymerase chain reactions (PCR), creating artifacts in RT-PCR.
Affiliation(s)
- Zhiyu Peng
- 1. Beijing Genomics Institute at Shenzhen, Building No.11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, P. R. China
- Chengfu Yuan
- 2. Hormel Institute, University of Minnesota, Austin, MN 55912, USA
- Lucas Zellmer
- 2. Hormel Institute, University of Minnesota, Austin, MN 55912, USA
- Siqi Liu
- 3. CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, P. R. China
- Ningzhi Xu
- 4. Laboratory of Cell and Molecular Biology, Cancer Institute, Chinese Academy of Medical Science, Beijing 100021, P. R. China
- D Joshua Liao
- 2. Hormel Institute, University of Minnesota, Austin, MN 55912, USA
|
15
|
Adelson DL, Raison JM, Garber M, Edgar RC. Interspersed repeats in the horse (Equus caballus); spatial correlations highlight conserved chromosomal domains. Anim Genet 2010; 41 Suppl 2:91-9. [PMID: 21070282 DOI: 10.1111/j.1365-2052.2010.02115.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Indexed: 12/25/2022]
Abstract
The interspersed repeat content of mammalian genomes has been best characterized in human, mouse and cow. In this study, we carried out de novo identification of repeated elements in the equine genome and identified previously unknown elements present at low copy number. The equine genome contains typical eutherian mammal repeats, but also has a significant number of hybrid repeats in addition to clade-specific Long Interspersed Nuclear Elements (LINE). Equus caballus clade specific LINE 1 (L1) repeats can be classified into approximately five subfamilies, three of which have undergone significant expansion. There are 1115 full-length copies of these equine L1, but of the 103 presumptive active copies, 93 fall within a single subfamily, indicating a rapid recent expansion of this subfamily. We also analysed both interspersed and simple sequence repeats (SSR) genome-wide, finding that some repeat classes are spatially correlated with each other as well as with G+C content and gene density. Based on these spatial correlations, we have confirmed that recently-described ancestral vs. clade-specific genome territories can be defined by their repeat content. The clade-specific Short Interspersed Nuclear Element correlations were scattered over the genome and appear to have been extensively remodelled. In contrast, territories enriched for ancestral repeats tended to be contiguous domains. To determine if the latter territories were evolutionarily conserved, we compared these results with a similar analysis of the human genome, and observed similar ancestral repeat enriched domains. These results indicate that ancestral, evolutionarily conserved mammalian genome territories can be identified on the basis of repeat content alone. Interspersed repeats of different ages appear to be analogous to geologic strata, allowing identification of ancient vs. newly remodelled regions of mammalian genomes.
Affiliation(s)
- D L Adelson
- School of Molecular and Biomedical Science, University of Adelaide, North Terrace, Adelaide, South Australia, Australia.
|
16
|
Livnat A. Interaction-based evolution: how natural selection and nonrandom mutation work together. Biol Direct 2013; 8:24. [PMID: 24139515 PMCID: PMC4231362 DOI: 10.1186/1745-6150-8-24] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 04/25/2013] [Accepted: 09/26/2013] [Indexed: 12/30/2022]
Abstract
BACKGROUND: The modern evolutionary synthesis leaves unresolved some of the most fundamental, long-standing questions in evolutionary biology: What is the role of sex in evolution? How does complex adaptation evolve? How can selection operate effectively on genetic interactions? More recently, the molecular biology and genomics revolutions have raised a host of critical new questions, through empirical findings that the modern synthesis fails to explain: for example, the discovery of de novo genes; the immense constructive role of transposable elements in evolution; genetic variance and biochemical activity that go far beyond what traditional natural selection can maintain; perplexing cases of molecular parallelism; and more. PRESENTATION OF THE HYPOTHESIS: Here I address these questions from a unified perspective, by means of a new mechanistic view of evolution that offers a novel connection between selection on the phenotype and genetic evolutionary change (while relying, like the traditional theory, on natural selection as the only source of feedback on the fit between an organism and its environment). I hypothesize that the mutation that is of relevance for the evolution of complex adaptation, while not Lamarckian or "directed" to increase fitness, is not random, but is instead the outcome of a complex and continually evolving biological process that combines information from multiple loci into one. This allows selection on a fleeting combination of interacting alleles at different loci to have a hereditary effect according to the combination's fitness. TESTING AND IMPLICATIONS OF THE HYPOTHESIS: This proposed mechanism addresses the problem of how beneficial genetic interactions can evolve under selection, and also offers an intuitive explanation for the role of sex in evolution, which focuses on sex as the generator of genetic combinations. Importantly, it also implies that genetic variation that has appeared neutral through the lens of traditional theory can actually experience selection on interactions and thus has a much greater adaptive potential than previously considered. Empirical evidence for the proposed mechanism from both molecular evolution and evolution at the organismal level is discussed, and multiple predictions are offered by which it may be tested. REVIEWERS: This article was reviewed by Nigel Goldenfeld (nominated by Eugene V. Koonin), Jürgen Brosius and W. Ford Doolittle.
Affiliation(s)
- Adi Livnat
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA
|
17
|
Zhang Q. The role of mRNA-based duplication in the evolution of the primate genome. FEBS Lett 2013; 587:3500-7. [DOI: 10.1016/j.febslet.2013.08.042] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 06/07/2013] [Revised: 08/24/2013] [Accepted: 08/30/2013] [Indexed: 12/28/2022]
|
18
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. Int J Evol Biol 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
|
19
|
Graur D, Zheng Y, Price N, Azevedo RBR, Zufall RA, Elhaik E. On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE. Genome Biol Evol 2013; 5:578-90. [PMID: 23431001 PMCID: PMC3622293 DOI: 10.1093/gbe/evt028] [Citation(s) in RCA: 302] [Impact Index Per Article: 27.5] [Accepted: 02/16/2013] [Indexed: 12/11/2022]
Abstract
A recent slew of ENCyclopedia Of DNA Elements (ENCODE) Consortium publications, specifically the article signed by all Consortium members, put forward the idea that more than 80% of the human genome is functional. This claim flies in the face of current estimates according to which the fraction of the genome that is evolutionarily conserved through purifying selection is less than 10%. Thus, according to the ENCODE Consortium, a biological function can be maintained indefinitely without selection, which implies that at least 80 - 10 = 70% of the genome is perfectly invulnerable to deleterious mutations, either because no mutation can ever occur in these "functional" regions or because no mutation in these regions can ever be deleterious. This absurd conclusion was reached through various means, chiefly by employing the seldom used "causal role" definition of biological function and then applying it inconsistently to different biochemical properties, by committing a logical fallacy known as "affirming the consequent," by failing to appreciate the crucial difference between "junk DNA" and "garbage DNA," by using analytical methods that yield biased errors and inflate estimates of functionality, by favoring statistical sensitivity over specificity, and by emphasizing statistical significance rather than the magnitude of the effect. Here, we detail the many logical and methodological transgressions involved in assigning functionality to almost every nucleotide in the human genome. The ENCODE results were predicted by one of its authors to necessitate the rewriting of textbooks. We agree, many textbooks dealing with marketing, mass-media hype, and public relations may well have to be rewritten.
Affiliation(s)
- Dan Graur
- Department of Biology and Biochemistry, University of Houston, TX, USA.
|
20
|
Zou M, Wang G, He S. Evolutionary patterns of RNA-based gene duplicates in Caenorhabditis nematodes coincide with their genomic features. BMC Res Notes 2012; 5:398. [PMID: 22853807 PMCID: PMC3532220 DOI: 10.1186/1756-0500-5-398] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 05/18/2012] [Accepted: 07/18/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND: RNA-based gene duplicates (retrocopies) play pivotal roles in many physiological processes. Functional retrocopies have been systematically identified in several mammals, fruit flies, plants, zebrafish and other chordates. However, studies of this kind of duplication in Caenorhabditis nematodes have not been reported. FINDINGS: We identified 43, 48, 43, 9, and 42 retrocopies, of which 6, 15, 18, 3, and 13 formed chimeric genes, in C. brenneri, C. briggsae, C. elegans, C. japonica, and C. remanei, respectively. At least 5 chimeric types exist in Caenorhabditis species, of which a retrocopy recruiting both N and C termini is the most common. Evidence from different analyses indicates that many retrocopies and almost all chimeric genes may be functional in these species. About half of the retrocopies in each species have coordinates in other species, and we suggest that retrocopies in closely related species may be helpful in identifying retrocopies for a given species. CONCLUSIONS: A number of retrocopies and chimeric genes exist in Caenorhabditis genomes, and some of them may be functional. The evolutionary patterns of these genes may correlate with their genomic features, such as the activity of retroelements, the high rates of mutation and deletion, and a large proportion of genes subject to trans-splicing.
Affiliation(s)
- Ming Zou
- The Key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
- University of the Chinese Academy of Sciences, Beijing 100039, PR China
- Guoxiu Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, HuaZhong Normal University, Wuhan, Hubei, China
- Shunping He
- The Key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
|
21
|
Novel genes from formation to function. Int J Evol Biol 2012; 2012:821645. [PMID: 22811949 PMCID: PMC3395120 DOI: 10.1155/2012/821645] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 02/07/2012] [Accepted: 04/26/2012] [Indexed: 11/29/2022]
Abstract
The study of the evolution of novel genes generally focuses on the formation of new coding sequences. However, equally important in the evolution of novel functional genes are the formation of regulatory regions that allow the expression of the genes and the effects of the new genes in the organism as well. Herein, we discuss the current knowledge on the evolution of novel functional genes, and we examine in more detail the youngest genes discovered. We examine the existing data on a very recent and rapidly evolving cluster of duplicated genes, the Sdic gene cluster. This cluster of genes is an excellent model for the evolution of novel genes, as it is very recent and may still be in the process of evolving.
|
22
|
Ohshima K. Parallel relaxation of stringent RNA recognition in plant and mammalian L1 retrotransposons. Mol Biol Evol 2012; 29:3255-9. [PMID: 22675029 PMCID: PMC3472496 DOI: 10.1093/molbev/mss147] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Indexed: 01/01/2023]
Abstract
L1 elements are mammalian non-long terminal repeat retrotransposons, or long interspersed elements (LINEs), that significantly influence the dynamics and fluidity of the genome. A series of observations suggest that plant L1-clade LINEs, just as mammalian L1s, mobilize both short interspersed elements (SINEs) and certain messenger RNA by recognizing the 3'-poly(A) tail of RNA. However, one L1 lineage in monocots was shown to possess a conserved 3'-end sequence with a solid RNA structure also observed in maize and sorghum SINEs. This strongly suggests that plant LINEs require a particular 3'-end sequence during initiation of reverse transcription. As one L1-clade LINE was also found to share the 3'-end sequence with a SINE in a green algal genome, I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized the specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
|
23
|
Beck CR, Garcia-Perez JL, Badge RM, Moran JV. LINE-1 elements in structural variation and disease. Annu Rev Genomics Hum Genet 2011; 12:187-215. [PMID: 21801021 DOI: 10.1146/annurev-genom-082509-141802] [Citation(s) in RCA: 394] [Impact Index Per Article: 30.3] [Indexed: 12/14/2022]
Abstract
The completion of the human genome reference sequence ushered in a new era for the study and discovery of human transposable elements. It now is undeniable that transposable elements, historically dismissed as junk DNA, have had an instrumental role in sculpting the structure and function of our genomes. In particular, long interspersed element-1 (LINE-1 or L1) and short interspersed elements (SINEs) continue to affect our genome, and their movement can lead to sporadic cases of disease. Here, we briefly review the types of transposable elements present in the human genome and their mechanisms of mobility. We next highlight how advances in DNA sequencing and genomic technologies have enabled the discovery of novel retrotransposons in individual genomes. Finally, we discuss how L1-mediated retrotransposition events impact human genomes.
Affiliation(s)
- Christine R Beck
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109-5618, USA.
|
24
|
Hedges DJ, Belancio VP. Restless genomes: humans as a model organism for understanding host-retrotransposable element dynamics. Adv Genet 2011; 73:219-62. [PMID: 21310298 DOI: 10.1016/b978-0-12-380860-8.00006-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Indexed: 12/25/2022]
Abstract
Since their initial discovery in maize, there have been various attempts to categorize the relationship between transposable elements (TEs) and their host organisms. These have ranged from TEs being selfish parasites to their role as essential, functional components of organismal biology. Research over the past several decades has, in many respects, only served to complicate the issue even further. On the one hand, investigators have amassed substantial evidence concerning the negative effects that TE-mutagenic activity can have on host genomes and organismal fitness. On the other hand, we find an increasing number of examples, across several taxa, of TEs being incorporated into functional biological roles for their host organism. Some 45% of our own genomes are comprised of TE copies. While many of these copies are dormant, having lost their ability to mobilize, several lineages continue to actively proliferate in modern human populations. With its complement of ancestral and active TEs, the human genome exhibits key aspects of the host-TE dynamic that has played out since early on in organismal evolution. In this review, we examine what insights the particularly well-characterized human system can provide regarding the nature of the host-TE interaction.
Affiliation(s)
- Dale J Hedges
- Hussman Institute for Human Genomics, Dr. John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, USA
|
25
|
Oliver KR, Greene WK. Mobile DNA and the TE-Thrust hypothesis: supporting evidence from the primates. Mob DNA 2011; 2:8. [PMID: 21627776 PMCID: PMC3123540 DOI: 10.1186/1759-8753-2-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Received: 02/23/2011] [Accepted: 05/31/2011] [Indexed: 02/07/2023]
Abstract
Transposable elements (TEs) are increasingly being recognized as powerful facilitators of evolution. We propose the TE-Thrust hypothesis to encompass TE-facilitated processes by which genomes self-engineer coding, regulatory, karyotypic or other genetic changes. Although TEs are occasionally harmful to some individuals, genomic dynamism caused by TEs can be very beneficial to lineages. This can result in differential survival and differential fecundity of lineages. Lineages with an abundant and suitable repertoire of TEs have enhanced evolutionary potential and, if all else is equal, tend to be fecund, resulting in species-rich adaptive radiations, and/or they tend to undergo major evolutionary transitions. Many other mechanisms of genomic change are also important in evolution, and whether the evolutionary potential of TE-Thrust is realized is heavily dependent on environmental and ecological factors. The large contribution of TEs to evolutionary innovation is particularly well documented in the primate lineage. In this paper, we review numerous cases of beneficial TE-caused modifications to the genomes of higher primates, which strongly support our TE-Thrust hypothesis.
Affiliation(s)
- Keith R Oliver
- School of Biological Sciences and Biotechnology, Faculty of Science and Engineering, Murdoch University, Perth W. A. 6150, Australia
- Wayne K Greene
- School of Veterinary and Biomedical Sciences, Faculty of Health Sciences, Murdoch University, Perth W. A. 6150, Australia
|
26
|
Hancks DC, Kazazian H. SVA retrotransposons: Evolution and genetic instability. Semin Cancer Biol 2010; 20:234-45. [PMID: 20416380 PMCID: PMC2945828 DOI: 10.1016/j.semcancer.2010.04.001] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.2] [Received: 01/07/2010] [Revised: 04/01/2010] [Accepted: 04/14/2010] [Indexed: 01/21/2023]
Abstract
SINE-VNTR-Alus (SVA) are non-autonomous hominid specific retrotransposons that are associated with disease in humans. SVAs are evolutionarily young and presumably mobilized by the LINE-1 reverse transcriptase in trans. SVAs are currently active and may impact the host through a variety of mechanisms including insertional mutagenesis, exon shuffling, alternative splicing, and the generation of differentially methylated regions (DMR). Here we review SVA biology, including SVA insertions associated with known diseases. Further, we discuss a model describing the initial formation of SVA and the mechanisms by which SVA may impact the host.
Affiliation(s)
- Dustin C. Hancks
- Department of Genetics, The University of Pennsylvania School of Medicine
- Haig Kazazian
- Department of Genetics, The University of Pennsylvania School of Medicine
|
27
|
Abstract
Ever since the pre-molecular era, the birth of new genes with novel functions has been considered to be a major contributor to adaptive evolutionary innovation. Here, I review the origin and evolution of new genes and their functions in eukaryotes, an area of research that has made rapid progress in the past decade thanks to the genomics revolution. Indeed, recent work has provided initial whole-genome views of the different types of new genes for a large number of different organisms. The array of mechanisms underlying the origin of new genes is compelling, extending way beyond the traditionally well-studied source of gene duplication. Thus, it was shown that novel genes also regularly arose from messenger RNAs of ancestral genes, protein-coding genes metamorphosed into new RNA genes, genomic parasites were co-opted as new genes, and that both protein and RNA genes were composed from scratch (i.e., from previously nonfunctional sequences). These mechanisms then also contributed to the formation of numerous novel chimeric gene structures. Detailed functional investigations uncovered different evolutionary pathways that led to the emergence of novel functions from these newly minted sequences and, with respect to animals, attributed a potentially important role to one specific tissue, the testis, in the process of gene birth. Remarkably, these studies also demonstrated that novel genes of the various types significantly impacted the evolution of cellular, physiological, morphological, behavioral, and reproductive phenotypic traits. Consequently, it is now firmly established that new genes have indeed been major contributors to the origin of adaptive evolutionary novelties.
Affiliation(s)
- Henrik Kaessmann
- Center for Integrative Genomics, University of Lausanne, CH-1015 Lausanne, Switzerland.
|
28
|
Buljan M, Frankish A, Bateman A. Quantifying the mechanisms of domain gain in animal proteins. Genome Biol 2010; 11:R74. [PMID: 20633280 PMCID: PMC2926785 DOI: 10.1186/gb-2010-11-7-r74] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Received: 04/23/2010] [Revised: 06/04/2010] [Accepted: 07/15/2010] [Indexed: 11/21/2022]
Abstract
Background: Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Results: Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. Conclusions: The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
Affiliation(s)
- Marija Buljan
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
|
29
|
Rogers RL, Bedford T, Lyons AM, Hartl DL. Adaptive impact of the chimeric gene Quetzalcoatl in Drosophila melanogaster. Proc Natl Acad Sci U S A 2010; 107:10943-8. [PMID: 20534482 PMCID: PMC2890713 DOI: 10.1073/pnas.1006503107] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Indexed: 11/18/2022]
Abstract
Chimeric genes, which form through the genomic fusion of two protein-coding genes, are a significant source of evolutionary novelty in Drosophila melanogaster. However, the propensity of chimeric genes to produce adaptive phenotypic changes is not fully understood. Here, we describe the chimeric gene Quetzalcoatl (Qtzl; CG31864), which formed in the recent past and swept to fixation in D. melanogaster. Qtzl arose through a duplication on chromosome 2L that united a portion of the mitochondrially targeted peptide CG12264 with a segment of the polycomb gene escl. The 3' segment of the gene, which is derived from escl, is inherited out of frame, producing a unique peptide sequence. Nucleotide diversity is drastically reduced and site frequency spectra are significantly skewed surrounding the duplicated region, a finding consistent with a selective sweep on the duplicate region containing Qtzl. Qtzl has an expression profile that largely resembles that of escl, with expression in early pupae, adult females, and male testes. However, expression patterns appear to have been decoupled from both parental genes during later embryonic development and in head tissues of adult males, indicating that Qtzl has developed a distinct regulatory profile through the rearrangement of different 5' and 3' regulatory domains. Furthermore, misexpression of Qtzl suppresses defects in the formation of the neuromuscular junction in larvae, demonstrating that Qtzl can produce phenotypic effects in cells. Together, these results show that chimeric genes can produce structural and regulatory changes in a single mutational step and may be a major factor in adaptive evolution.
Affiliation(s)
- Rebekah L. Rogers
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
- Trevor Bedford
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109
- Ana M. Lyons
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
- Daniel L. Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
|
30
|
Ohshima K, Igarashi K. Inference for the initial stage of domain shuffling: tracing the evolutionary fate of the PIPSL retrogene in hominoids. Mol Biol Evol 2010; 27:2522-33. [PMID: 20525901 DOI: 10.1093/molbev/msq138] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 11/13/2022]
Abstract
Domain shuffling has provided extraordinarily diverse functions to proteins. Nevertheless, how newly combined domains are coordinated to create novel functions remains a fundamental question of genetic and phenotypic evolution. Previously, we reported a unique mechanism of gene creation, whereby new combinations of functional domains are assembled from distinct genes at the RNA level, reverse transcribed, and integrated into the genome by the L1 retrotransposon. The novel gene PIPSL, created by the fusion of phosphatidylinositol-4-phosphate 5-kinase (PIP5K1A) and 26S proteasome subunit (S5a/PSMD4) genes, is specifically transcribed in human and chimpanzee testes. We present the first evidence for the translation of PIPSL in humans. The human PIPSL locus showed a low nucleotide diversity within 11 populations (125 individuals) compared with other genomic regions such as introns and overall chromosomes. It was equivalent to the average for coding sequences or exons from other genes, suggesting that human PIPSL has some function and is conserved among modern populations. Two linked amino acid-altering single-nucleotide polymorphisms were found in the PIPSL kinase domain of non-African populations. They are positioned in the vicinity of the substrate-binding cavity of the parental PIP5K1A protein and change the charge of both residues. The relatively rapid expansion of this haplotype might indicate a selective advantage for it in modern humans. We determined the evolutionary fate of PIPSL domains created by domain shuffling. During hominoid diversification, the S5a-derived domain was retained in all lineages, whereas the ubiquitin-interacting motif (UIM) 1 in the domain experienced critical amino acid replacements at an early stage, being conserved under subsequent high levels of nonsynonymous substitutions to UIM2 and other domains, suggesting that adaptive evolution diversified these functional compartments. 
Conversely, the PIP5K1A-derived domain is degenerated in gibbons and gorillas. These observations provide a possible scheme of domain shuffling in which the combined parental domains are not tightly linked in the novel chimeric protein, allowing for changes in their functional roles, leading to their fine-tuning. Selective pressure toward a novel function initially acted on one domain, whereas the other experienced a nearly neutral state. Over time, the latter also gained a new function or was degenerated.
Affiliation(s)
- Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama, Japan.
|
31
|
Evolution in health and medicine Sackler colloquium: Genomic disorders: a window into human gene and genome evolution. Proc Natl Acad Sci U S A 2010; 107 Suppl 1:1765-71. [PMID: 20080665 DOI: 10.1073/pnas.0906222107] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Indexed: 12/13/2022] Open
Abstract
Gene duplications alter the genetic constitution of organisms and can be a driving force of molecular evolution in humans and the great apes. In this context, the study of genomic disorders has uncovered the essential role played by the genomic architecture, especially low copy repeats (LCRs) or segmental duplications (SDs). In fact, regardless of the mechanism, LCRs can mediate or stimulate rearrangements, inciting genomic instability and generating dynamic and unstable regions prone to rapid molecular evolution. In humans, copy-number variation (CNV) has been implicated in common traits such as neuropathy, hypertension, color blindness, infertility, and behavioral traits including autism and schizophrenia, as well as disease susceptibility to HIV, lupus nephritis, and psoriasis among many other clinical phenotypes. The same mechanisms implicated in the origin of genomic disorders may also play a role in the emergence of segmental duplications and the evolution of new genes by means of genomic and gene duplication and triplication, exon shuffling, exon accretion, and fusion/fission events.
|
32
|
Young J, Ménétrey J, Goud B. RAB6C is a retrogene that encodes a centrosomal protein involved in cell cycle progression. J Mol Biol 2010; 397:69-88. [PMID: 20064528 DOI: 10.1016/j.jmb.2010.01.009] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Received: 08/28/2009] [Revised: 12/08/2009] [Accepted: 01/05/2010] [Indexed: 10/20/2022]
Abstract
Rab-GTPases are key regulators of membrane transport, and growing evidence indicates that their expression levels are altered in certain human malignancies, including cancer. Rab6C, a newly identified Rab6 subfamily member, has attracted recent attention because its reduced expression might confer a selective advantage to drug-resistant breast cancer cells. Here, we report that RAB6C is a primate-specific retrogene derived from a RAB6A' transcript. RAB6C is transcribed in a limited number of human tissues including brain, testis, prostate, and breast. Endogenous Rab6C is considerably less abundant and has a much shorter half-life than Rab6A'. Comparison of the GTP-binding motifs of Rab6C and Rab6A', homology modeling, and GTP-blot overlay assays indicate that amino acid changes in Rab6C have greatly reduced its GTP-binding affinity. Instead, the noncanonical GTP-binding domain of Rab6C mediates localization of the protein to the centrosome. Overexpression of Rab6C results in G1 arrest, and its specific depletion generates tetraploid cells with supernumerary centrosomes, revealing a role of Rab6C in events related to the centrosome and cell cycle progression. Thus, RAB6C is a rare example of a recently emerged retrogene that has acquired the status of a new gene, encoding a functional protein with altered characteristics compared to Rab6A'.
Affiliation(s)
- Joanne Young
- Molecular Mechanisms of Intracellular Transport, CNRS, UMR144, Institut Curie, 26 rue d'Ulm, 75248 Paris Cedex 05, France
|
33
|
Ryan FP. An alternative approach to medical genetics based on modern evolutionary biology. Part 2: retroviral symbiosis. J R Soc Med 2009; 102:324-31. [PMID: 19679734 DOI: 10.1258/jrsm.2009.090183] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022] Open
Affiliation(s)
- Frank P Ryan
- Sheffield Primary Care Trust and Department of Animal and Plant Sciences, Sheffield University, Sheffield, UK.
|
34
|
Zhang Y, Lu S, Zhao S, Zheng X, Long M, Wei L. Positive selection for the male functionality of a co-retroposed gene in the hominoids. BMC Evol Biol 2009; 9:252. [PMID: 19832993 PMCID: PMC2773790 DOI: 10.1186/1471-2148-9-252] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Received: 03/19/2009] [Accepted: 10/15/2009] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND: New genes generated by retroposition are widespread in humans and other mammalian species. Usually, this process copies a single parental gene and inserts it into a distant genomic location. However, retroposition of two adjacent parental genes, i.e. co-retroposition, had not been reported until the hominoid chimeric gene, PIPSL, was identified recently. It was shown how two genes linked in tandem (phosphatidylinositol-4-phosphate 5-kinase, type I, alpha, PIP5K1A and proteasome 26S subunit, non-ATPase, 4, PSMD4) could be co-retroposed from a single RNA molecule to form this novel chimeric gene. However, understanding of the origination and biological function of PIPSL requires determination of the coding potential of this gene as well as the evolutionary forces acting on its hominoid copies. RESULTS: We tackled these problems by analyzing the evolutionary signature in both within-species variation and between-species divergence in the sequence and structure of the gene. We revealed a significant evolutionary signature: the coding region has significantly lower sequence variation, especially insertions and deletions, suggesting that the human copy may encode a protein. Moreover, a survey across five different hominoid species revealed that all adaptive changes of PSMD4-derived regions occurred on branches leading to human and chimp rather than other hominoid lineages. Finally, computational analysis suggests testis-specific transcription of PIPSL is regulated by tissue-dependent methylation rather than some transcriptional leakage. CONCLUSION: Therefore, this set of analyses showed that PIPSL is an extraordinary co-retroposed protein-coding gene that may participate in the male functions of humans and their close relatives.
Affiliation(s)
- Yong Zhang
- Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing, 100871, PR China.
|
35
|
Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nat Rev Genet 2009; 10:691-703. [PMID: 19763152 DOI: 10.1038/nrg2640] [Citation(s) in RCA: 1127] [Impact Index Per Article: 75.1] [Indexed: 02/07/2023]
Abstract
Their ability to move within genomes gives transposable elements an intrinsic propensity to affect genome evolution. Non-long terminal repeat (LTR) retrotransposons, including LINE-1, Alu and SVA elements, have proliferated over the past 80 million years of primate evolution and now account for approximately one-third of the human genome. In this Review, we focus on this major class of elements and discuss the many ways that they affect the human genome: from generating insertion mutations and genomic instability to altering gene expression and contributing to genetic innovation. Increasingly detailed analyses of human and other primate genomes are revealing the scale and complexity of the past and current contributions of non-LTR retrotransposons to genomic change in the human lineage.
Affiliation(s)
- Richard Cordaux
- CNRS UMR 6556 Ecologie, Evolution, Symbiose, Université de Poitiers, 40 Avenue du Recteur Pineau, Poitiers, France
|
36
|
Iida K, Fukami-Kobayashi K, Toyoda A, Sakaki Y, Kobayashi M, Seki M, Shinozaki K. Analysis of multiple occurrences of alternative splicing events in Arabidopsis thaliana using novel sequenced full-length cDNAs. DNA Res 2009; 16:155-64. [PMID: 19423640 PMCID: PMC2695776 DOI: 10.1093/dnares/dsp009] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Indexed: 12/22/2022] Open
Abstract
Alternative splicing (AS) is a mechanism by which multiple types of mature mRNAs are generated from a single pre-mature mRNA. In this study, we completely sequenced 1800 full-length cDNAs from Arabidopsis thaliana, which had 5′ and/or 3′ sequences that were previously found to have AS events or alternative transcription start sites. Unexpectedly, these sequences gave us further evidence of AS, as 601 out of 1800 transcripts showed novel AS events. We focused on the combination patterns of multiple AS events within individual genes. Interestingly, some specific AS event combination patterns tended to appear more frequently than expected. The two most common patterns were: (i) alternative donor–0∼12 times of exon skips–alternative acceptor and (ii) several times (∼8) of retained introns. We also found that multiple AS events in a transcript tend to have the same effects concerning the length of the mature mRNA. Our current results are consistent with our previous observations, which showed changes in AS profiles under different conditions, and suggest the involvement of hypothetical cis- and trans-acting factors in the regulation of AS events.
Affiliation(s)
- Kei Iida
- Nagahama Institute of Bio-Science and Technology, 1266 Tamura, Nagahama, Shiga 526-0829, Japan
|
37
|
Abstract
Gene copies that stem from the mRNAs of parental source genes have long been viewed as evolutionary dead-ends with little biological relevance. Here we review a range of recent studies that have unveiled a significant number of functional retroposed gene copies in both mammalian and some non-mammalian genomes. These studies have not only revealed previously unknown mechanisms for the emergence of new genes and their functions but have also provided fascinating general insights into molecular and evolutionary processes that have shaped genomes. For example, analyses of chromosomal gene movement patterns via RNA-based gene duplication have shed fresh light on the evolutionary origin and biology of our sex chromosomes.
|
38
|
Abstract
Retrotransposons, mainly LINEs, SINEs, and endogenous retroviruses, make up roughly 40% of the mammalian genome and have played an important role in genome evolution. Their prevalence in genomes reflects a delicate balance between their further expansion and the restraint imposed by the host. In any human genome only a small number of LINE1s (L1s) are active, moving their own and SINE sequences into new genomic locations and occasionally causing disease. Recent insights and new technologies promise answers to fundamental questions about the biology of transposable elements.
Affiliation(s)
- John L Goodier
- Department of Genetics, University of Pennsylvania School of Medicine, 415 Curie Boulevard, Philadelphia, PA 19104, USA.
|
39
|
Belancio VP, Hedges DJ, Deininger P. Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health. Genome Res 2008; 18:343-58. [PMID: 18256243 DOI: 10.1101/gr.5558208] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.0] [Indexed: 11/24/2022]
Abstract
Transposable elements (TEs) have shared an exceptionally long coexistence with their host organisms and have come to occupy a significant fraction of eukaryotic genomes. The bulk of the expansion occurring within mammalian genomes has arisen from the activity of type I retrotransposons, which amplify in a "copy-and-paste" fashion through an RNA intermediate. For better or worse, the sequences of these retrotransposons are now wedded to the genomes of their mammalian hosts. Although there are several reported instances of the positive contribution of mobile elements to their host genomes, these discoveries have occurred alongside growing evidence of the role of TEs in human disease and genetic instability. Here we examine, with a particular emphasis on human retrotransposon activity, several newly discovered aspects of mammalian retrotransposon biology. We consider their potential impact on host biology as well as their ultimate implications for the nature of the TE-host relationship.
Affiliation(s)
- Victoria P Belancio
- Tulane Cancer Center and Department of Epidemiology, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA
|