1
|
Lee DH, Park EG, Kim JM, Shin HJ, Lee YJ, Jeong HS, Roh HY, Kim WR, Ha H, Kim SW, Choi YH, Kim HS. Genomic analyses of intricate interaction of TE-lncRNA overlapping genes with miRNAs in human diseases. Genes Genomics 2024; 46:1313-1325. [PMID: 39215947 DOI: 10.1007/s13258-024-01547-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 07/09/2024] [Indexed: 09/04/2024]
Abstract
BACKGROUND Transposable elements (TEs) are known to be inserted into genome to create transcript isoforms or to generate long non-coding RNA (lncRNA) sequences. The insertion of TEs generates a gene protein sequence within the genome, but also provides a microRNA (miRNA) regulatory region. OBJECTIVE To determine the effect of gene sequence changes caused by TE insertion on miRNA binding and to investigate the formation of an overlapping lncRNA that represses it. METHODS The distribution of overlapping regions between exons and TE regions with lncRNA was examined using the Bedtools. miRNAs that can bind to those overlapping regions were identified through the miRDB web program. For TE-lncRNA overlapping genes, bioinformatic analysis was conducted using DAVID web database. Differential expression analysis was conducted using data from the GEO dataset and TCGA. RESULTS Most TEs were distributed more frequently in untranslated regions than open reading frames. There were 30 annotated TE-lncRNA overlapping genes with same strand that could bind to the same miRNA. As a result of identifying the association between these 30 genes and diseases, TGFB2, FCGR2A, DCTN5, and IFI6 were associated with breast cancer, and HMGCS1, FRMD4A, EDNRB, and SNCA were associated with Alzheimer's disease. Analysis of the GEO and TCGA data showed that the relevant expression of miR-891a and miR-28, which bind to the TE overlapping region of DCTN5 and HMGCS1, decreased. CONCLUSION This study indicates that the interaction between TE-lncRNA overlapping genes and miRNAs can affect disease progression.
Collapse
Affiliation(s)
- Du Hyeong Lee
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Eun Gyung Park
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Jung-Min Kim
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Hae Jin Shin
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Yun Ju Lee
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Hyeon-Su Jeong
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Hyun-Young Roh
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, 46241, Republic of Korea
| | - Woo Ryung Kim
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea
| | - Hongseok Ha
- Institute of Endemic Disease, Medical Research Center, Seoul National University, Seoul, 03080, Republic of Korea
| | - Sang-Woo Kim
- Department of Integrated Biological Sciences, Pusan National University, Busan, 46241, Republic of Korea
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, 46241, Republic of Korea
| | - Yung Hyun Choi
- Department of Biochemistry, College of Oriental Medicine, Dong-Eui University, Busan, 47227, Republic of Korea
| | - Heui-Soo Kim
- Institute of Systems Biology, Pusan National University, Busan, 46241, Republic of Korea.
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, 46241, Republic of Korea.
| |
Collapse
|
2
|
Grundy EE, Diab N, Chiappinelli KB. Transposable element regulation and expression in cancer. FEBS J 2022; 289:1160-1179. [PMID: 33471418 PMCID: PMC11577309 DOI: 10.1111/febs.15722] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 01/08/2021] [Accepted: 01/14/2021] [Indexed: 12/11/2022]
Abstract
Approximately 45% of the human genome is composed of transposable elements (TEs). Expression of these elements is tightly regulated during normal development. TEs may be expressed at high levels in embryonic stem cells but are epigenetically silenced in terminally differentiated cells. As part of the global 'epigenetic dysregulation' that cells undergo during transformation from normal to cancer, TEs can lose epigenetic silencing and become transcribed, and, in some cases, active. Here, we summarize recent advances detailing the consequences of TE activation in cancer and describe how these understudied residents of our genome can both aid tumorigenesis and potentially be harnessed for anticancer therapies.
Collapse
Affiliation(s)
- Erin E Grundy
- Department of Microbiology, Immunology, & Tropical Medicine, The George Washington University, Washington, DC, USA
- The GW Cancer Center, The George Washington University, Washington, DC, USA
- The Institute for Biomedical Sciences at The George Washington University, Washington, DC, USA
| | - Noor Diab
- Department of Microbiology, Immunology, & Tropical Medicine, The George Washington University, Washington, DC, USA
- The GW Cancer Center, The George Washington University, Washington, DC, USA
| | - Katherine B Chiappinelli
- Department of Microbiology, Immunology, & Tropical Medicine, The George Washington University, Washington, DC, USA
- The GW Cancer Center, The George Washington University, Washington, DC, USA
| |
Collapse
|
3
|
Etchegaray E, Naville M, Volff JN, Haftek-Terreau Z. Transposable element-derived sequences in vertebrate development. Mob DNA 2021; 12:1. [PMID: 33407840 PMCID: PMC7786948 DOI: 10.1186/s13100-020-00229-5] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 12/15/2020] [Indexed: 12/14/2022] Open
Abstract
Transposable elements (TEs) are major components of all vertebrate genomes that can cause deleterious insertions and genomic instability. However, depending on the specific genomic context of their insertion site, TE sequences can sometimes get positively selected, leading to what are called "exaptation" events. TE sequence exaptation constitutes an important source of novelties for gene, genome and organism evolution, giving rise to new regulatory sequences, protein-coding exons/genes and non-coding RNAs, which can play various roles beneficial to the host. In this review, we focus on the development of vertebrates, which present many derived traits such as bones, adaptive immunity and a complex brain. We illustrate how TE-derived sequences have given rise to developmental innovations in vertebrates and how they thereby contributed to the evolutionary success of this lineage.
Collapse
Affiliation(s)
- Ema Etchegaray
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364, Lyon, France.
| | - Magali Naville
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364, Lyon, France
| | - Jean-Nicolas Volff
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364, Lyon, France
| | - Zofia Haftek-Terreau
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d'Italie, F-69364, Lyon, France
| |
Collapse
|
4
|
Alvarez MEV, Chivers M, Borovska I, Monger S, Giannoulatou E, Kralovicova J, Vorechovsky I. Transposon clusters as substrates for aberrant splice-site activation. RNA Biol 2020; 18:354-367. [PMID: 32965162 PMCID: PMC7951965 DOI: 10.1080/15476286.2020.1805909] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Transposed elements (TEs) have dramatically shaped evolution of the exon-intron structure and significantly contributed to morbidity, but how recent TE invasions into older TEs cooperate in generating new coding sequences is poorly understood. Employing an updated repository of new exon-intron boundaries induced by pathogenic mutations, termed DBASS, here we identify novel TE clusters that facilitated exon selection. To explore the extent to which such TE exons maintain RNA secondary structure of their progenitors, we carried out structural studies with a composite exon that was derived from a long terminal repeat (LTR78) and AluJ and was activated by a C > T mutation optimizing the 5ʹ splice site. Using a combination of SHAPE, DMS and enzymatic probing, we show that the disease-causing mutation disrupted a conserved AluJ stem that evolved from helix 3.3 (or 5b) of 7SL RNA, liberating a primordial GC 5ʹ splice site from the paired conformation for interactions with the spliceosome. The mutation also reduced flexibility of conserved residues in adjacent exon-derived loops of the central Alu hairpin, revealing a cross-talk between traditional and auxilliary splicing motifs that evolved from opposite termini of 7SL RNA and were approximated by Watson-Crick base-pairing already in organisms without spliceosomal introns. We also identify existing Alu exons activated by the same RNA rearrangement. Collectively, these results provide valuable TE exon models for studying formation and kinetics of pre-mRNA building blocks required for splice-site selection and will be useful for fine-tuning auxilliary splicing motifs and exon and intron size constraints that govern aberrant splice-site activation.
Collapse
Affiliation(s)
| | - Martin Chivers
- School of Medicine, University of Southampton, Southampton, UK
| | - Ivana Borovska
- Slovak Academy of Sciences, Institute of Molecular Physiology and Genetics, Bratislava, Slovak Republic
| | - Steven Monger
- Computational Genomics Laboratory, Victor Chang Cardiac Research Institute, Darlinghurst, Australia
| | - Eleni Giannoulatou
- Computational Genomics Laboratory, Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,St. Vincent's Clinical School, University of New South Wales, Sydney, Australia
| | - Jana Kralovicova
- School of Medicine, University of Southampton, Southampton, UK.,Slovak Academy of Sciences, Institute of Molecular Physiology and Genetics, Bratislava, Slovak Republic
| | | |
Collapse
|
5
|
Carducci F, Biscotti MA, Barucca M, Canapa A. Transposable elements in vertebrates: species evolution and environmental adaptation. EUROPEAN ZOOLOGICAL JOURNAL 2019. [DOI: 10.1080/24750263.2019.1695967] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- F. Carducci
- Dipartimento di Scienze della Vita e dell’Ambiente, Università Politecnica delle Marche, Ancona, Italy
| | - M. A. Biscotti
- Dipartimento di Scienze della Vita e dell’Ambiente, Università Politecnica delle Marche, Ancona, Italy
| | - M. Barucca
- Dipartimento di Scienze della Vita e dell’Ambiente, Università Politecnica delle Marche, Ancona, Italy
| | - A. Canapa
- Dipartimento di Scienze della Vita e dell’Ambiente, Università Politecnica delle Marche, Ancona, Italy
| |
Collapse
|
6
|
Investigation of somatic single nucleotide variations in human endogenous retrovirus elements and their potential association with cancer. PLoS One 2019; 14:e0213770. [PMID: 30934003 PMCID: PMC6443178 DOI: 10.1371/journal.pone.0213770] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 02/28/2019] [Indexed: 11/19/2022] Open
Abstract
Human endogenous retroviruses (HERVs) have been investigated for potential links with human cancer. However, the distribution of somatic nucleotide variations in HERV elements has not been explored in detail. This study aims to identify HERV elements with an over-representation of somatic mutations (hot spots) in cancer patients. Four HERV elements with mutation hotspots were identified that overlap with exons of four human protein coding genes. These hotspots were identified based on the significant over-representation (p<8.62e-4) of non-synonymous single-nucleotide variations (nsSNVs). These genes are TNN (HERV-9/LTR12), OR4K15 (HERV-IP10F/LTR10F), ZNF99 (HERV-W/HERV17/LTR17), and KIR2DL1 (MST/MaLR). In an effort to identify mutations that effect survival, all nsSNVs were further evaluated and it was found that kidney cancer patients with mutation C2270G in ZNF99 have a significantly lower survival rate (hazard ratio = 2.6) compared to those without it. Among HERV elements in the human non-protein coding regions, we found 788 HERVs with significantly elevated numbers of somatic single-nucleotide variations (SNVs) (p<1.60e-5). From this category the top three HERV elements with significantly over-represented SNVs are HERV-H/LTR7, HERV-9/LTR12 and HERV-L/MLT2. Majority of the SNVs in these 788 HERV elements are located in three DNA functional groups: long non-coding RNAs (lncRNAs) (60%), introns (22.2%) and transcriptional factor binding sites (TFBS) (14.8%). This study provides a list of mutational hotspots in HERVs, which could potentially be used as biomarkers and therapeutic targets.
Collapse
|
7
|
Schumann GG, Fuchs NV, Tristán-Ramos P, Sebe A, Ivics Z, Heras SR. The impact of transposable element activity on therapeutically relevant human stem cells. Mob DNA 2019; 10:9. [PMID: 30899334 PMCID: PMC6408843 DOI: 10.1186/s13100-019-0151-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 02/27/2019] [Indexed: 12/11/2022] Open
Abstract
Human stem cells harbor significant potential for basic and clinical translational research as well as regenerative medicine. Currently ~ 3000 adult and ~ 30 pluripotent stem cell-based, interventional clinical trials are ongoing worldwide, and numbers are increasing continuously. Although stem cells are promising cell sources to treat a wide range of human diseases, there are also concerns regarding potential risks associated with their clinical use, including genomic instability and tumorigenesis concerns. Thus, a deeper understanding of the factors and molecular mechanisms contributing to stem cell genome stability are a prerequisite to harnessing their therapeutic potential for degenerative diseases. Chemical and physical factors are known to influence the stability of stem cell genomes, together with random mutations and Copy Number Variants (CNVs) that accumulated in cultured human stem cells. Here we review the activity of endogenous transposable elements (TEs) in human multipotent and pluripotent stem cells, and the consequences of their mobility for genomic integrity and host gene expression. We describe transcriptional and post-transcriptional mechanisms antagonizing the spread of TEs in the human genome, and highlight those that are more prevalent in multipotent and pluripotent stem cells. Notably, TEs do not only represent a source of mutations/CNVs in genomes, but are also often harnessed as tools to engineer the stem cell genome; thus, we also describe and discuss the most widely applied transposon-based tools and highlight the most relevant areas of their biomedical applications in stem cells. Taken together, this review will contribute to the assessment of the risk that endogenous TE activity and the application of genetically engineered TEs constitute for the biosafety of stem cells to be used for substitutive and regenerative cell therapies.
Collapse
Affiliation(s)
- Gerald G Schumann
- 1Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Str.51-59, 63225 Langen, Germany
| | - Nina V Fuchs
- 2Host-Pathogen Interactions, Paul-Ehrlich-Institut, Paul-Ehrlich-Str. 51-59, 63225 Langen, Germany
| | - Pablo Tristán-Ramos
- 3GENYO. Centre for Genomics and Oncological Research, Pfizer/University of Granada/Andalusian Regional Government, PTS Granada-Avenida de la Ilustración, 114, 18016 Granada, Spain.,4Department of Biochemistry and Molecular Biology II, Faculty of Pharmacy, University of Granada, Campus Universitario de Cartuja, 18071 Granada, Spain
| | - Attila Sebe
- 1Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Str.51-59, 63225 Langen, Germany
| | - Zoltán Ivics
- 1Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Str.51-59, 63225 Langen, Germany
| | - Sara R Heras
- 3GENYO. Centre for Genomics and Oncological Research, Pfizer/University of Granada/Andalusian Regional Government, PTS Granada-Avenida de la Ilustración, 114, 18016 Granada, Spain.,4Department of Biochemistry and Molecular Biology II, Faculty of Pharmacy, University of Granada, Campus Universitario de Cartuja, 18071 Granada, Spain
| |
Collapse
|
8
|
Gómez-Fernández P, Urtasun A, Paton AW, Paton JC, Borrego F, Dersh D, Argon Y, Alloza I, Vandenbroeck K. Long Interleukin-22 Binding Protein Isoform-1 Is an Intracellular Activator of the Unfolded Protein Response. Front Immunol 2018; 9:2934. [PMID: 30619294 PMCID: PMC6302113 DOI: 10.3389/fimmu.2018.02934] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Accepted: 11/29/2018] [Indexed: 12/26/2022] Open
Abstract
The human IL22RA2 gene co-produces three protein isoforms in dendritic cells [IL-22 binding protein isoform-1 (IL-22BPi1), IL-22BPi2, and IL-22BPi3]. Two of these, IL-22BPi2 and IL-22BPi3, are capable of neutralizing the biological activity of IL-22. The function of IL-22BPi1, which differs from IL-22BPi2 through an in-frame 32-amino acid insertion provided by an alternatively spliced exon, remains unknown. Using transfected human cell lines, we demonstrate that IL-22BPi1 is secreted detectably, but at much lower levels than IL-22BPi2, and unlike IL-22BPi2 and IL-22BPi3, is largely retained in the endoplasmic reticulum (ER). As opposed to IL-22BPi2 and IL-22BPi3, IL-22BPi1 is incapable of neutralizing or binding to IL-22 measured in bioassay or assembly-induced IL-22 co-folding assay. We performed interactome analysis to disclose the mechanism underlying the poor secretion of IL-22BPi1 and identified GRP78, GRP94, GRP170, and calnexin as main interactors. Structure-function analysis revealed that, like IL-22BPi2, IL-22BPi1 binds to the substrate-binding domain of GRP78 as well as to the middle domain of GRP94. Ectopic expression of wild-type GRP78 enhanced, and ATPase-defective GRP94 mutant decreased, secretion of both IL-22BPi1 and IL-22BPi2, while neither of both affected IL-22BPi3 secretion. Thus, IL-22BPi1 and IL-22BPi2 are bona fide clients of the ER chaperones GRP78 and GRP94. However, only IL-22BPi1 activates an unfolded protein response (UPR) resulting in increased protein levels of GRP78 and GRP94. Cloning of the IL22RA2 alternatively spliced exon into an unrelated cytokine, IL-2, bestowed similar characteristics on the resulting protein. We also found that CD14++/CD16+ intermediate monocytes produced a higher level of IL22RA2 mRNA than classical and non-classical monocytes, but this difference disappeared in immature dendritic cells (moDC) derived thereof. Upon silencing of IL22RA2 expression in moDC, GRP78 levels were significantly reduced, suggesting that native IL22RA2 expression naturally contributes to upregulating GRP78 levels in these cells. The IL22RA2 alternatively spliced exon was reported to be recruited through a single mutation in the proto-splice site of a Long Terminal Repeat retrotransposon sequence in the ape lineage. Our work suggests that positive selection of IL-22BPi1 was not driven by IL-22 antagonism as in the case of IL-22BPi2 and IL-22BPi3, but by capacity for induction of an UPR response.
Collapse
Affiliation(s)
- Paloma Gómez-Fernández
- Neurogenomiks Group, Department of Neuroscience, University of the Basque Country (UPV/EHU), Leioa, Spain
- Achucarro Basque Center for Neuroscience, Leioa, Spain
| | - Andoni Urtasun
- Neurogenomiks Group, Department of Neuroscience, University of the Basque Country (UPV/EHU), Leioa, Spain
- Achucarro Basque Center for Neuroscience, Leioa, Spain
| | - Adrienne W. Paton
- Research for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA, Australia
| | - James C. Paton
- Research for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA, Australia
| | - Francisco Borrego
- Biocruces Bizkaia Health Research Institute, Barakaldo, Spain
- Basque Center for Transfusion and Human Tissues, Galdakao, Spain
- IKERBASQUE, Basque Foundation for Science, Bilbao, Spain
| | - Devin Dersh
- Division of Cell Pathology, Children's Hospital of Philadelphia and Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Yair Argon
- Division of Cell Pathology, Children's Hospital of Philadelphia and Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Iraide Alloza
- Neurogenomiks Group, Department of Neuroscience, University of the Basque Country (UPV/EHU), Leioa, Spain
- Achucarro Basque Center for Neuroscience, Leioa, Spain
| | - Koen Vandenbroeck
- Neurogenomiks Group, Department of Neuroscience, University of the Basque Country (UPV/EHU), Leioa, Spain
- Achucarro Basque Center for Neuroscience, Leioa, Spain
- IKERBASQUE, Basque Foundation for Science, Bilbao, Spain
| |
Collapse
|
9
|
Zeng L, Pederson SM, Cao D, Qu Z, Hu Z, Adelson DL, Wei C. Genome-Wide Analysis of the Association of Transposable Elements with Gene Regulation Suggests that Alu Elements Have the Largest Overall Regulatory Impact. J Comput Biol 2018; 25:551-562. [DOI: 10.1089/cmb.2017.0228] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Lu Zeng
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Stephen M. Pederson
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Danfeng Cao
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhipeng Qu
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Zhiqiang Hu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - David L. Adelson
- School of Biological Sciences, The University of Adelaide, Adelaide, Australia
| | - Chaochun Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
10
|
Jung J, Lee S, Cho HS, Park K, Ryu JW, Jung M, Kim J, Kim H, Kim DS. Bioinformatic analysis of regulation of natural antisense transcripts by transposable elements in human mRNA. Genomics 2018; 111:159-166. [PMID: 29366860 DOI: 10.1016/j.ygeno.2018.01.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 01/16/2018] [Accepted: 01/17/2018] [Indexed: 12/19/2022]
Abstract
Non-coding RNA is no longer considered to be "junk" DNA, based on evidence uncovered in recent decades. In particular, the important role played by natural antisense transcripts (NATs) in regulating the expression of genes is receiving increasing attention. However, the regulatory mechanisms of NATs remain incompletely understood. It is well-known that the insertion of transposable elements (TEs) can affect gene transcription. Using a bioinformatics approach, we identified NATs using human mRNA sequences from the UCSC Genome Browser Database. Our in silico analysis identified 1079 NATs and 700 sense-antisense gene pairs. We identified 179 NATs that showed evidence of having been affected by TEs during cellular gene expression. These findings may provide an understanding of the complex regulation mechanisms of NATs. If our understanding of NATs as modulators of gene expression is further enhanced, we can develop ways to control gene expression.
Collapse
Affiliation(s)
- Jaeeun Jung
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Sugi Lee
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Hyun-Soo Cho
- Department of Stem Cell Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Kunhyang Park
- Department of Core Facility Management Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Jea-Woon Ryu
- Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Minah Jung
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Jeongkil Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - HyeRan Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Plant Systems Engineering Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Dae-Soo Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea.
| |
Collapse
|
11
|
Abstract
Despite often being classified as selfish or junk DNA, transposable elements (TEs) are a group of abundant genetic sequences that have a significant impact on mammalian development and genome regulation. In recent years, our understanding of how pre-existing TEs affect genome architecture, gene regulatory networks and protein function during mammalian embryogenesis has dramatically expanded. In addition, the mobilization of active TEs in selected cell types has been shown to generate genetic variation during development and in fully differentiated tissues. Importantly, the ongoing domestication and evolution of TEs appears to provide a rich source of regulatory elements, functional modules and genetic variation that fuels the evolution of mammalian developmental processes. Here, we review the functional impact that TEs exert on mammalian developmental processes and discuss how the somatic activity of TEs can influence gene regulatory networks.
Collapse
Affiliation(s)
- Jose L Garcia-Perez
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
- Department of Genomic Medicine, GENYO, Centre for Genomics & Oncology (Pfizer - University of Granada & Andalusian Regional Government), PTS Granada, Avda. de la Ilustración 114, Granada 18016, Spain
| | - Thomas J Widmann
- Department of Genomic Medicine, GENYO, Centre for Genomics & Oncology (Pfizer - University of Granada & Andalusian Regional Government), PTS Granada, Avda. de la Ilustración 114, Granada 18016, Spain
| | - Ian R Adams
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| |
Collapse
|
12
|
Abstract
The cytokine interleukin-22 (IL-22), which is a member of the IL-10 family, is produced exclusively by immune cells and activates signal transducer and activator of transcription 3 (STAT3) in nonimmune cells, such as hepatocytes, keratinocytes, and colonic epithelial cells, to drive various processes central to tissue homeostasis and immunosurveillance. Dysregulation of IL-22 signaling causes inflammatory diseases. IL-22 binding protein (IL-22BP; encoded by IL22RA2) is a soluble IL-22 receptor, which antagonizes IL-22 activity and has genetic associations with autoimmune diseases. Humans have three IL-22BP isoforms, IL-22BPi1 to IL-22BPi3, which are generated by alternative splicing; mice only have an IL-22BPi2 homolog. We showed that, although IL-22BPi3 had less inhibitory activity than IL-22BPi2, IL-22BPi3 was more abundant in various human tissues under homeostatic conditions. IL-22BPi2 was more effective than IL-22BPi3 at blocking the contribution of IL-22 to cooperative gene induction with the inflammatory cytokine IL-17, which is often present with IL-22 in autoimmune settings. In addition, we found that IL-22BPi1 was not secreted and therefore failed to antagonize IL-22 signaling. Furthermore, IL-22BPi2 was the only isoform that was increased in abundance when myeloid cells were activated by Toll-like receptor 2 signaling or retinoic acid, a maturation factor for myeloid cells. These data suggest that the human IL-22BP isoforms have distinct spatial and temporal roles and coordinately fine-tune IL-22-dependent STAT3 responses in tissues as a type of rheostat.
Collapse
Affiliation(s)
- Chrissie Lim
- Department of Immunology, University of Washington, Seattle, WA 98109, USA
| | - MeeAe Hong
- Department of Immunology, University of Washington, Seattle, WA 98109, USA
| | - Ram Savan
- Department of Immunology, University of Washington, Seattle, WA 98109, USA.
| |
Collapse
|
13
|
Warren IA, Naville M, Chalopin D, Levin P, Berger CS, Galiana D, Volff JN. Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates. Chromosome Res 2016; 23:505-31. [PMID: 26395902 DOI: 10.1007/s10577-015-9493-5] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Since their discovery, a growing body of evidence has emerged demonstrating that transposable elements are important drivers of species diversity. These mobile elements exhibit a great variety in structure, size and mechanisms of transposition, making them important putative actors in organism evolution. The vertebrates represent a highly diverse and successful lineage that has adapted to a wide range of different environments. These animals also possess a rich repertoire of transposable elements, with highly diverse content between lineages and even between species. Here, we review how transposable elements are driving genomic diversity and lineage-specific innovation within vertebrates. We discuss the large differences in TE content between different vertebrate groups and then go on to look at how they affect organisms at a variety of levels: from the structure of chromosomes to their involvement in the regulation of gene expression, as well as in the formation and evolution of non-coding RNAs and protein-coding genes. In the process of doing this, we highlight how transposable elements have been involved in the evolution of some of the key innovations observed within the vertebrate lineage, driving the group's diversity and success.
Collapse
Affiliation(s)
- Ian A Warren
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Magali Naville
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Domitille Chalopin
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France.,Department of Genetics, University of Georgia, Athens, Georgia, 30602, USA
| | - Perrine Levin
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Chloé Suzanne Berger
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Delphine Galiana
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, CNRS UMR5242, Ecole Normale Supérieure de Lyon, Lyon, France.
| |
Collapse
|
14
|
Grandi N, Cadeddu M, Blomberg J, Tramontano E. Contribution of type W human endogenous retroviruses to the human genome: characterization of HERV-W proviral insertions and processed pseudogenes. Retrovirology 2016; 13:67. [PMID: 27613107 PMCID: PMC5016936 DOI: 10.1186/s12977-016-0301-x] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Accepted: 08/23/2016] [Indexed: 12/21/2022] Open
Abstract
Background Human endogenous retroviruses (HERVs) are ancient sequences integrated in the germ line cells and vertically transmitted through the offspring constituting about 8 % of our genome. In time, HERVs accumulated mutations that compromised their coding capacity. A prominent exception is HERV-W locus 7q21.2, producing a functional Env protein (Syncytin-1) coopted for placental syncytiotrophoblast formation. While expression of HERV-W sequences has been investigated for their correlation to disease, an exhaustive description of the group composition and characteristics is still not available and current HERV-W group information derive from studies published a few years ago that, of course, used the rough assemblies of the human genome available at that time. This hampers the comparison and correlation with current human genome assemblies. Results In the present work we identified and described in detail the distribution and genetic composition of 213 HERV-W elements. The bioinformatics analysis led to the characterization of several previously unreported features and provided a phylogenetic classification of two main subgroups with different age and structural characteristics. New facts on HERV-W genomic context of insertion and co-localization with sequences putatively involved in disease development are also reported. Conclusions The present work is a detailed overview of the HERV-W contribution to the human genome and provides a robust genetic background useful to clarify HERV-W role in pathologies with poorly understood etiology, representing, to our knowledge, the most complete and exhaustive HERV-W dataset up to date. Electronic supplementary material The online version of this article (doi:10.1186/s12977-016-0301-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nicole Grandi
- Department of Life and Environmental Sciences, University of Cagliari, Cittadella Universitaria di Monserrato SS554, 09042, Monserrato, Cagliari, Italy
| | - Marta Cadeddu
- Department of Life and Environmental Sciences, University of Cagliari, Cittadella Universitaria di Monserrato SS554, 09042, Monserrato, Cagliari, Italy
| | - Jonas Blomberg
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
| | - Enzo Tramontano
- Department of Life and Environmental Sciences, University of Cagliari, Cittadella Universitaria di Monserrato SS554, 09042, Monserrato, Cagliari, Italy. .,Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Cagliari, Italy.
| |
Collapse
|
15
|
Guizard S, Piégu B, Arensburger P, Guillou F, Bigot Y. Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools. BMC Genomics 2016; 17:659. [PMID: 27542599 PMCID: PMC4992247 DOI: 10.1186/s12864-016-3015-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2016] [Accepted: 08/12/2016] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). RESULTS We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. CONCLUSIONS Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.
Collapse
Affiliation(s)
- Sébastien Guizard
- Physiologie de la Reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly, France
| | - Benoît Piégu
- Physiologie de la Reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly, France
| | - Peter Arensburger
- Physiologie de la Reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly, France
- Biological Sciences Department, California State Polytechnic University, Pomona, CA 91768 USA
| | - Florian Guillou
- Physiologie de la Reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly, France
| | - Yves Bigot
- Physiologie de la Reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly, France
| |
Collapse
|
16
|
Shapiro JA. Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process. BIOLOGY 2016; 5:E27. [PMID: 27338490 PMCID: PMC4929541 DOI: 10.3390/biology5020027] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 02/12/2016] [Revised: 05/20/2016] [Accepted: 06/02/2016] [Indexed: 01/15/2023]
Abstract
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago, GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
17
|
Park SJ, Kim YH, Lee SR, Choe SH, Kim MJ, Kim SU, Kim JS, Sim BW, Song BS, Jeong KJ, Jin YB, Lee Y, Park YH, Park YI, Huh JW, Chang KT. Gain of a New Exon by a Lineage-Specific Alu Element-Integration Event in the BCS1L Gene during Primate Evolution. Mol Cells 2015; 38:950-8. [PMID: 26537194 PMCID: PMC4673409 DOI: 10.14348/molcells.2015.0121] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Revised: 07/17/2015] [Accepted: 07/20/2015] [Indexed: 11/27/2022] Open
Abstract
BCS1L gene encodes mitochondrial protein and is a member of conserved AAA protein family. This gene is involved in the incorporation of Rieske FeS and Qcr10p into complex III of respiratory chain. In our previous study, AluYRa2-derived alternative transcript in rhesus monkey genome was identified. However, this transcript has not been reported in human genome. In present study, we conducted evolutionary analysis of AluYRa2-exonized transcript with various primate genomic DNAs and cDNAs from humans, rhesus monkeys, and crab-eating monkeys. Remarkably, our results show that AluYRa2 element has only been integrated into genomes of Macaca species. This Macaca lineage-specific integration of AluYRa2 element led to exonization event in the first intron region of BCS1L gene by producing a conserved 3' splice site. Intriguingly, in rhesus and crab-eating monkeys, more diverse transcript variants by alternative splicing (AS) events, including exon skipping and different 5' splice sites from humans, were identified. Alignment of amino acid sequences revealed that AluYRa2-exonized transcript has short N-terminal peptides. Therefore, AS events play a major role in the generation of various transcripts and proteins during primate evolution. In particular, lineage-specific integration of Alu elements and species-specific Alu-derived exonization events could be important sources of gene diversification in primates.
Collapse
Affiliation(s)
- Sang-Je Park
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Young-Hyun Kim
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
- University of Science & Technology, National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Sang-Rae Lee
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
- University of Science & Technology, National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Se-Hee Choe
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
- University of Science & Technology, National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Myung-Jin Kim
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Sun-Uk Kim
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Ji-Su Kim
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Bo-Woong Sim
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Bong-Seok Song
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Kang-Jin Jeong
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Yeung-Bae Jin
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Youngjeon Lee
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Young-Ho Park
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Young Il Park
- Graduate School Department of Digital Media, Ewha Womans University, Seoul 120-750,
Korea
| | - Jae-Won Huh
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
- University of Science & Technology, National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| | - Kyu-Tae Chang
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
- University of Science & Technology, National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 363-883,
Korea
| |
Collapse
|
18
|
Sun W, Zhao XW, Zhang Z. Identification and evolution of the orphan genes in the domestic silkworm, Bombyx mori. FEBS Lett 2015; 589:2731-8. [PMID: 26296317 DOI: 10.1016/j.febslet.2015.08.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 07/24/2015] [Accepted: 08/01/2015] [Indexed: 10/23/2022]
Abstract
Orphan genes (OGs) which have no recognizable homology to any sequences in other species could contribute to the species specific adaptations. In this study, we identified 738 OGs in the silkworm genome. About 31% of the silkworm OGs is derived from transposable elements, and 5.1% of the silkworm OGs emerged from gene duplication followed by divergence of paralogs. Five de novo silkworm OGs originated from non-coding regions. Microarray data suggested that most of the silkworm OGs were expressed in limited tissues. RNA interference experiments suggested that five de novo OGs are not essential to the silkworm, implying that they may contribute to genetic redundancy or species-specific adaptation. Our results provide some new insights into the evolutionary significance of the silkworm OGs.
Collapse
Affiliation(s)
- Wei Sun
- Laboratory of Evolutionary and Functional Genomics, School of Life Sciences, Chongqing University, Chongqing 400044, China
| | - Xin-Wei Zhao
- Laboratory of Evolutionary and Functional Genomics, School of Life Sciences, Chongqing University, Chongqing 400044, China
| | - Ze Zhang
- Laboratory of Evolutionary and Functional Genomics, School of Life Sciences, Chongqing University, Chongqing 400044, China.
| |
Collapse
|
19
|
Abstract
Discoveries in cytogenetics, molecular biology, and genomics have revealed that genome change is an active cell-mediated physiological process. This is distinctly at variance with the pre-DNA assumption that genetic changes arise accidentally and sporadically. The discovery that DNA changes arise as the result of regulated cell biochemistry means that the genome is best modelled as a read-write (RW) data storage system rather than a read-only memory (ROM). The evidence behind this change in thinking and a consideration of some of its implications are the subjects of this article. Specific points include the following: cells protect themselves from accidental genome change with proofreading and DNA damage repair systems; localized point mutations result from the action of specialized trans-lesion mutator DNA polymerases; cells can join broken chromosomes and generate genome rearrangements by non-homologous end-joining (NHEJ) processes in specialized subnuclear repair centres; cells have a broad variety of natural genetic engineering (NGE) functions for transporting, diversifying and reorganizing DNA sequences in ways that generate many classes of genomic novelties; natural genetic engineering functions are regulated and subject to activation by a range of challenging life history events; cells can target the action of natural genetic engineering functions to particular genome locations by a range of well-established molecular interactions, including protein binding with regulatory factors and linkage to transcription; and genome changes in cancer can usefully be considered as consequences of the loss of homeostatic control over natural genetic engineering functions.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago, GCISW123B, 979 E. 57th Street, Chicago, IL 60637, USA
| |
Collapse
|
20
|
Martin JCJ, Bériou G, Heslan M, Chauvin C, Utriainen L, Aumeunier A, Scott CL, Mowat A, Cerovic V, Houston SA, Leboeuf M, Hubert FX, Hémont C, Merad M, Milling S, Josien R. Interleukin-22 binding protein (IL-22BP) is constitutively expressed by a subset of conventional dendritic cells and is strongly induced by retinoic acid. Mucosal Immunol 2014; 7:101-13. [PMID: 23653115 PMCID: PMC4291114 DOI: 10.1038/mi.2013.28] [Citation(s) in RCA: 118] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2012] [Accepted: 04/08/2013] [Indexed: 02/04/2023]
Abstract
Interleukin-22 (IL-22) is mainly produced at barrier surfaces by T cells and innate lymphoid cells and is crucial to maintain epithelial integrity. However, dysregulated IL-22 action leads to deleterious inflammation and is involved in diseases such as psoriasis, intestinal inflammation, and cancer. IL-22 binding protein (IL-22BP) is a soluble inhibitory IL-22 receptor and may represent a crucial regulator of IL-22. We show both in rats and mice that, in the steady state, the main source of IL-22BP is constituted by a subset of conventional dendritic cells (DCs) in lymphoid and non-lymphoid tissues. In mouse intestine, IL-22BP was specifically expressed in lamina propria CD103(+)CD11b(+) DC. In humans, IL-22BP was expressed in immature monocyte-derived DC and strongly induced by retinoic acid but dramatically reduced upon maturation. Our data suggest that a subset of immature DCs may actively participate in the regulation of IL-22 activity in the gut by producing high levels of IL-22BP.
Collapse
Affiliation(s)
- JCJ Martin
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France,CHU Nantes, Laboratoire d’immunologie, Nantes, F-44000, France,Université de Nantes, Faculté de Médecine, Nantes, F-44000, France
| | - G Bériou
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France
| | - M Heslan
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France
| | - C Chauvin
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France
| | - L Utriainen
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - A Aumeunier
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - CL Scott
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - A Mowat
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - V Cerovic
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - SA Houston
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - M Leboeuf
- Department of Gene and Cell medicine and the Department of Medicine, Mount Sinai School of Medicine, New York 10029, USA
| | - FX Hubert
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France,Université de Nantes, Faculté de Médecine, Nantes, F-44000, France
| | - C Hémont
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France,CHU Nantes, Laboratoire d’immunologie, Nantes, F-44000, France,Université de Nantes, Faculté de Médecine, Nantes, F-44000, France
| | - M Merad
- Department of Gene and Cell medicine and the Department of Medicine, Mount Sinai School of Medicine, New York 10029, USA
| | - S Milling
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8TA, UK
| | - R Josien
- INSERM Center of Research in Transplantation and Immunology, UMR1064, Nantes, F - 44000, France,CHU Nantes, Institut de Transplantation Urologie Néphrologie (ITUN), Nantes, F-44000, France,CHU Nantes, Laboratoire d’immunologie, Nantes, F-44000, France,Université de Nantes, Faculté de Médecine, Nantes, F-44000, France
| |
Collapse
|
21
|
Bellone RR, Holl H, Setaluri V, Devi S, Maddodi N, Archer S, Sandmeyer L, Ludwig A, Foerster D, Pruvost M, Reissmann M, Bortfeldt R, Adelson DL, Lim SL, Nelson J, Haase B, Engensteiner M, Leeb T, Forsyth G, Mienaltowski MJ, Mahadevan P, Hofreiter M, Paijmans JLA, Gonzalez-Fortes G, Grahn B, Brooks SA. Evidence for a retroviral insertion in TRPM1 as the cause of congenital stationary night blindness and leopard complex spotting in the horse. PLoS One 2013; 8:e78280. [PMID: 24167615 PMCID: PMC3805535 DOI: 10.1371/journal.pone.0078280] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 09/10/2013] [Indexed: 12/21/2022] Open
Abstract
Leopard complex spotting is a group of white spotting patterns in horses caused by an incompletely dominant gene (LP) where homozygotes (LP/LP) are also affected with congenital stationary night blindness. Previous studies implicated Transient Receptor Potential Cation Channel, Subfamily M, Member 1 (TRPM1) as the best candidate gene for both CSNB and LP. RNA-Seq data pinpointed a 1378 bp insertion in intron 1 of TRPM1 as the potential cause. This insertion, a long terminal repeat (LTR) of an endogenous retrovirus, was completely associated with LP, testing 511 horses (χ2=1022.00, p<<0.0005), and CSNB, testing 43 horses (χ2=43, p<<0.0005). The LTR was shown to disrupt TRPM1 transcription by premature poly-adenylation. Furthermore, while deleterious transposable element insertions should be quickly selected against the identification of this insertion in three ancient DNA samples suggests it has been maintained in the horse gene pool for at least 17,000 years. This study represents the first description of an LTR insertion being associated with both a pigmentation phenotype and an eye disorder.
Collapse
Affiliation(s)
- Rebecca R. Bellone
- Department of Biology, University of Tampa, Tampa, Florida, United States of America
- * E-mail:
| | - Heather Holl
- Department of Animal Science, Cornell University, Ithaca, New York, United States of America
| | - Vijayasaradhi Setaluri
- Department of Dermatology, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Sulochana Devi
- Department of Dermatology, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Nityanand Maddodi
- Department of Dermatology, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin, United States of America
| | | | - Lynne Sandmeyer
- Department of Small Animal Clinical Sciences, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Arne Ludwig
- Department of Evolutionary Genetics, Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
| | - Daniel Foerster
- Department of Evolutionary Genetics, Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
| | - Melanie Pruvost
- Department of Evolutionary Genetics, Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
- Epigenomic and Palaeogenomic Group, Institut Jacques Monod, Paris, France
| | - Monika Reissmann
- Department of Breeding Biology and Molecular Genetics, Humboldt University Berlin, Berlin, Germany
| | - Ralf Bortfeldt
- Department of Breeding Biology and Molecular Genetics, Humboldt University Berlin, Berlin, Germany
| | - David L. Adelson
- School of Molecular and Biomedical Science, the University of Adelaide, South Australia, Australia
| | - Sim Lin Lim
- School of Molecular and Biomedical Science, the University of Adelaide, South Australia, Australia
| | - Janelle Nelson
- Department of Biology, University of Tampa, Tampa, Florida, United States of America
| | - Bianca Haase
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales, Australia
| | | | - Tosso Leeb
- Institute of Genetics, University of Bern, Bern, Switzerland
| | - George Forsyth
- Department of Veterinary Biomedical Sciences, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Michael J. Mienaltowski
- Department of Molecular Pharmacology & Physiology, College of Medicine, University of South Florida, Tampa, Florida, United States of America
| | - Padmanabhan Mahadevan
- Department of Biology, University of Tampa, Tampa, Florida, United States of America
| | | | | | | | - Bruce Grahn
- Department of Small Animal Clinical Sciences, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Samantha A. Brooks
- Department of Animal Science, Cornell University, Ithaca, New York, United States of America
| |
Collapse
|
22
|
A New Exon Derived from a Mammalian Apparent LTR Retrotransposon of the SUPT16H Gene. Int J Genomics 2013; 2013:387594. [PMID: 23671841 PMCID: PMC3647538 DOI: 10.1155/2013/387594] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2012] [Accepted: 02/12/2013] [Indexed: 11/28/2022] Open
Abstract
The SUPT16H gene known as FACTP140 is required for the transcription of other genes. For transcription, genes need to be complexed with accessory factors, including transcription factors and RNA polymerase II. One such factor, FACT, interacts with histones H2A/H2B for nucleosome disassembly and transcription elongation. The SUPT16H gene has a transcript and many expressed sequence tags (ESTs). We were especially interested in an MaLR-derived transcript (EST, BX333035) that included a new exon introduced by a transposable element, a mammalian apparent LTR retrotransposon (MaLR). The MaLR was detected ranging from humans to galagos, indicating the MaLR in the SUPT16H gene is integrated into the primate ancestor genome. A new exon was created by alternative donor site provided by the MaLR. The original transcript and the MaLR-derived transcript were expressed in various human, rhesus monkey, and other primate tissues. Additionally, we identified a new alternative transcript that included the MaLR, but there was no significant difference in the expression of the original transcript and the MaLR-derived transcript. Interestingly, the new alternative transcript and the MaLR-derived transcript had the MaLR sequence in the new exon, but they had different structures by adopting different 3′ splice sites. From this study, we verified transposable elements that contributed to transcriptome diversity.
Collapse
|
23
|
Wei L, Xiao M, An Z, Ma B, Mason AS, Qian W, Li J, Fu D. New insights into nested long terminal repeat retrotransposons in Brassica species. MOLECULAR PLANT 2013; 6:470-482. [PMID: 22930733 DOI: 10.1093/mp/sss081] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Long terminal repeat (LTR) retrotransposons, one of the foremost types of transposons, continually change or modify gene function and reorganize the genome through bursts of dramatic proliferation. Many LTR-TEs preferentially insert within other LTR-TEs, but the cause and evolutionary significance of these nested LTR-TEs are not well understood. In this study, a total of 1.52Gb of Brassica sequence containing 2020 bacterial artificial chromosomes (BACs) was scanned, and six bacterial artificial chromosome (BAC) clones with extremely nested LTR-TEs (LTR-TEs density: 7.24/kb) were selected for further analysis. The majority of the LTR-TEs in four of the six BACs were found to be derived from the rapid proliferation of retrotransposons originating within the BAC regions, with only a few LTR-TEs originating from the proliferation and insertion of retrotransposons from outside the BAC regions approximately 5-23Mya. LTR-TEs also preferably inserted into TA-rich repeat regions. Gene prediction by Genescan identified 207 genes in the 0.84Mb of total BAC sequences. Only a few genes (3/207) could be matched to the Brassica expressed sequence tag (EST) database, indicating that most genes were inactive after retrotransposon insertion. Five of the six BACs were putatively centromeric. Hence, nested LTR-TEs in centromere regions are rapidly duplicated, repeatedly inserted, and act to suppress activity of genes and to reshuffle the structure of the centromeric sequences. Our results suggest that LTR-TEs burst and proliferate on a local scale to create nested LTR-TE regions, and that these nested LTR-TEs play a role in the formation of centromeres.
Collapse
Affiliation(s)
- Lijuan Wei
- Chongqing Engineering Research Center for Rapeseed, College of Agronomy and Biotechnology, Southwest University, Chongqing 400716, China
| | | | | | | | | | | | | | | |
Collapse
|
24
|
Fujii YR. The RNA gene information: retroelement-microRNA entangling as the RNA quantum code. Methods Mol Biol 2013; 936:47-67. [PMID: 23007498 DOI: 10.1007/978-1-62703-083-0_4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
MicroRNA (miRNA) and retroelements may be a master of regulator in our life, which are evolutionally involved in the origin of species. To support the Darwinism from the aspect of molecular evolution process, it has tremendously been interested in the molecular information of naive RNA. The RNA wave model 2000 consists of four concepts that have altered from original idea of the miRNA genes for crosstalk among embryonic stem cells, their niche cells, and retroelements as a carrier vesicle of the RNA genes. (1) the miRNA gene as a mobile genetic element induces transcriptional and posttranscriptional silencing via networking-processes (no hierarchical architecture); (2) the RNA information supplied by the miRNA genes expands to intracellular, intercellular, intraorgan, interorgan, intraspecies, and interspecies under the cycle of life into the global environment; (3) the mobile miRNAs can self-proliferate; and (4) cells contain two types information as resident and genomic miRNAs. Based on RNA wave, we have developed an interest in investigation of the transformation from RNA information to quantum bits as physicochemical characters of RNA with the measurement of RNA electron spin. When it would have been given that the fundamental bases for the acquired characters in genetics can be controlled by RNA gene information, it may be available to apply for challenging against RNA gene diseases, such as stress-induced diseases.
Collapse
|
25
|
Huda A, Bushel PR. Widespread Exonization of Transposable Elements in Human Coding Sequences is Associated with Epigenetic Regulation of Transcription. ACTA ACUST UNITED AC 2013; 1. [PMID: 24860841 PMCID: PMC4028971 DOI: 10.4172/2329-8936.1000101] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Background Transposable Elements (TEs) have long been regarded as selfish or junk DNA having little or no role in the regulation or functioning of the human genome. However, over the past several years this view came to be challenged as several studies provided anecdotal as well as global evidence for the contribution of TEs to the regulatory and coding needs of human genes. In this study, we explored the incorporation and epigenetic regulation of coding sequences donated by TEs using gene expression and other ancillary genomics data from two human hematopoietic cell-lines: GM12878 (a lymphoblastoid cell line) and K562 (a Chronic Myelogenous Leukemia cell line). In each cell line, we found several thousand instances of TEs donating coding sequences to human genes. We compared the transcriptome assembly of the RNA sequencing (RNA-Seq) reads with and without the aid of a reference transcriptome and found that the percentage of genes that incorporate TEs in their coding sequences is significantly greater than that obtained from the reference transcriptome assemblies using Refseq and Gencode gene models. We also used histone modifications chromatin immunoprecipitation sequencing (ChIP-Seq) data, Cap Analysis of Gene Expression (CAGE) data and DNAseI Hypersensitivity Site (DHS) data to demonstrate the epigenetic regulation of the TE derived coding sequences. Our results suggest that TEs form a significantly higher percentage of coding sequences than represented in gene annotation databases and these TE derived sequences are epigenetically regulated in accordance with their expression in the two cell types.
Collapse
Affiliation(s)
- Ahsan Huda
- Microarray and Genome Informatics Group, National Institute of Environmental Health Sciences, USA ; Kelly Government Solutions, Inc., USA
| | - Pierre R Bushel
- Microarray and Genome Informatics Group, National Institute of Environmental Health Sciences, USA ; Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| |
Collapse
|
26
|
Ha HS, Moon JW, Gim JA, Jung YD, Ahn K, Oh KB, Kim TH, Seong HH, Kim HS. Identification and characterization of transposable element-mediated chimeric transcripts from porcine Refseq and EST databases. Genes Genomics 2012. [DOI: 10.1007/s13258-011-0212-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
27
|
Rebollo R, Farivar S, Mager DL. C-GATE - catalogue of genes affected by transposable elements. Mob DNA 2012; 3:9. [PMID: 22621612 PMCID: PMC3472293 DOI: 10.1186/1759-8753-3-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Accepted: 04/20/2012] [Indexed: 01/07/2023] Open
Abstract
Background Functional regulatory sequences are present in many transposable element (TE) copies, resulting in TEs being frequently exapted by host genes. Today, many examples of TEs impacting host gene expression can be found in the literature and we believe a new catalogue of such exaptations would be useful for the field. Findings We have established the catalogue of genes affected by transposable elements (C-GATE), which can be found at https://sites.google.com/site/tecatalog/. To date, it holds 221 cases of biologically verified TE exaptations and more than 10,000 in silico TE-gene partnerships. C-GATE is interactive and allows users to include missed or new TE exaptation data. C-GATE provides a graphic representation of the entire library, which may be used for future statistical analysis of TE impact on host gene expression. Conclusions We hope C-GATE will be valuable for the TE community but also for others who have realized the role that TEs may have in their research.
Collapse
Affiliation(s)
- Rita Rebollo
- Terry Fox Laboratory, British Columbia Cancer Agency, 675 West 10th Avenue, Vancouver, BC, V5Z1L3, Canada.
| | | | | |
Collapse
|
28
|
Oliveira P, Sanges R, Huntsman D, Stupka E, Oliveira C. Characterization of the intronic portion of cadherin superfamily members, common cancer orchestrators. Eur J Hum Genet 2012; 20:878-83. [PMID: 22317972 DOI: 10.1038/ejhg.2012.11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Cadherins are cell-cell adhesion proteins essential for the maintenance of tissue architecture and integrity, and their impairment is often associated with human cancer. Knowledge regarding regulatory mechanisms associated with cadherin misexpression in cancer is scarce. Specific features of the intronic-structure and intronic-based regulatory mechanisms in the cadherin superfamily are unidentified. This study aims at systematically characterizing the intronic portion of cadherin superfamily members and the identification of intronic regions constituting putative targets/triggers of regulation, using a bioinformatic approach and biological data mining. Our study demonstrates that the cadherin superfamily genes harbour specific characteristics in comparison to all non-cadherin genes, both from the genomic and transcriptional standpoints. Cadherin superfamily genes display higher average total intron number and significantly longer introns than other genes and across the entire vertebrate lineage. Moreover, in the human genome, we observed an uncommon high frequency of MIR (mammalian-wide interspersed repeats) and MaLR (mammalian-wide interspersed repeats, a subtype of LTR) regulatory-associated repetitive elements at 5'-located introns, concomitantly with increased de novo intronic transcription. Using this approach, we identified cadherin intronic-specific sites that may constitute novel targets/triggers of cadherin superfamily expression regulation. These findings pinpoint the need to identify mechanisms affecting particularly MIR and MaLR elements located in introns 2 and 3 of human cadherin genes, possibly important in the expression modulation of this superfamily in homeostasis and cancer.
Collapse
Affiliation(s)
- Patrícia Oliveira
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Rua Dr Roberto Frias, s/n, Porto, Portugal
| | | | | | | | | |
Collapse
|
29
|
Franchini LF, López-Leal R, Nasif S, Beati P, Gelman DM, Low MJ, de Souza FJS, Rubinstein M. Convergent evolution of two mammalian neuronal enhancers by sequential exaptation of unrelated retroposons. Proc Natl Acad Sci U S A 2011; 108:15270-5. [PMID: 21876128 PMCID: PMC3174587 DOI: 10.1073/pnas.1104997108] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The proopiomelanocortin gene (POMC) is expressed in a group of neurons present in the arcuate nucleus of the hypothalamus. Neuron-specific POMC expression in mammals is conveyed by two distal enhancers, named nPE1 and nPE2. Previous transgenic mouse studies showed that nPE1 and nPE2 independently drive reporter gene expression to POMC neurons. Here, we investigated the evolutionary mechanisms that shaped not one but two neuron-specific POMC enhancers and tested whether nPE1 and nPE2 drive identical or complementary spatiotemporal expression patterns. Sequence comparison among representative genomes of most vertebrate classes and mammalian orders showed that nPE1 is a placental novelty. Using in silico paleogenomics we found that nPE1 originated from the exaptation of a mammalian-apparent LTR retrotransposon sometime between the metatherian/eutherian split (147 Mya) and the placental mammal radiation (≈ 90 Mya). Thus, the evolutionary origin of nPE1 differs, in kind and time, from that previously demonstrated for nPE2, which was exapted from a CORE-short interspersed nucleotide element (SINE) retroposon before the origin of prototherians, 166 Mya. Transgenic mice expressing the fluorescent markers tomato and EGFP driven by nPE1 or nPE2, respectively, demonstrated coexpression of both reporter genes along the entire arcuate nucleus. The onset of reporter gene expression guided by nPE1 and nPE2 was also identical and coincidental with the onset of Pomc expression in the presumptive mouse diencephalon. Thus, the independent exaptation of two unrelated retroposons into functional analogs regulating neuronal POMC expression constitutes an authentic example of convergent molecular evolution of cell-specific enhancers.
Collapse
Affiliation(s)
- Lucía F. Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
| | - Rodrigo López-Leal
- Centro de Estudios Científicos and Universidad Austral de Chile, Valdivia 5110466, Chile
| | - Sofía Nasif
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
| | - Paula Beati
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
| | - Diego M. Gelman
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
| | - Malcolm J. Low
- Department of Molecular and Integrative Physiology, University of Michigan, Ann Arbor, MI 48105; and
| | - Flávio J. S. de Souza
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EGA Buenos Aires, Argentina
| | - Marcelo Rubinstein
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, C1428ADN Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EGA Buenos Aires, Argentina
| |
Collapse
|
30
|
Oliver KR, Greene WK. Mobile DNA and the TE-Thrust hypothesis: supporting evidence from the primates. Mob DNA 2011; 2:8. [PMID: 21627776 PMCID: PMC3123540 DOI: 10.1186/1759-8753-2-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2011] [Accepted: 05/31/2011] [Indexed: 02/07/2023] Open
Abstract
Transposable elements (TEs) are increasingly being recognized as powerful facilitators of evolution. We propose the TE-Thrust hypothesis to encompass TE-facilitated processes by which genomes self-engineer coding, regulatory, karyotypic or other genetic changes. Although TEs are occasionally harmful to some individuals, genomic dynamism caused by TEs can be very beneficial to lineages. This can result in differential survival and differential fecundity of lineages. Lineages with an abundant and suitable repertoire of TEs have enhanced evolutionary potential and, if all else is equal, tend to be fecund, resulting in species-rich adaptive radiations, and/or they tend to undergo major evolutionary transitions. Many other mechanisms of genomic change are also important in evolution, and whether the evolutionary potential of TE-Thrust is realized is heavily dependent on environmental and ecological factors. The large contribution of TEs to evolutionary innovation is particularly well documented in the primate lineage. In this paper, we review numerous cases of beneficial TE-caused modifications to the genomes of higher primates, which strongly support our TE-Thrust hypothesis.
Collapse
Affiliation(s)
- Keith R Oliver
- School of Biological Sciences and Biotechnology, Faculty of Science and Engineering, Murdoch University, Perth W. A. 6150, Australia
| | - Wayne K Greene
- School of Veterinary and Biomedical Sciences, Faculty of Health Sciences, Murdoch University, Perth W. A. 6150, Australia
| |
Collapse
|
31
|
HERV-K Hypomethylation in Ovarian Clear Cell Carcinoma Is Associated With a Poor Prognosis and Platinum Resistance. Int J Gynecol Cancer 2011; 21:51-7. [DOI: 10.1097/igc.0b013e3182021c1a] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Introduction:In general, ovarian clear cell carcinoma (OCCC) has a history of poor response to standard platinum-based chemotherapy regimens, and advanced cases have short survival periods. Therefore, the discovery of a biomarker for the pretreatment prediction of OCCC is crucial. Loss of methylation of a retrotransposable sequence, such as long interspersed repetitive sequence 1 (LINE-1), frequently occurs in cancers, including ovarian cancer, and it has been proven to be associated with poor survival. The expressions of human endogenous retrovirus (HERV) K and E were found to be increased in tissues from patients with OCCC. Here, we propose that methylation levels of HERV are associated with treatment response and prognosis of OCCC.Methods:Twenty-nine patients with OCCC were enrolled. Methylation levels of HERV-K, HERV-E, and LINE-1 were measured from microdissected cancer and normal ovarian tissues. The methylation levels were correlated with stage, treatment response, and prognosis.Results:Methylation levels of HERV-K, HERV-E, and LINE-1 were decreased in tissues from patients with advanced stage cancer (P= 0.0179,P= 0.0021, andP= 0.0307, respectively). Human endogenous retrovirus K demonstrated significantly lower methylation levels in the platinum-resistant group (P= 0.0004). Patients with lower levels of methylated (hypomethylated) HERV-K had a shorter mean overall survival (P= 0.006). In advanced OCCC cases, patients with hypomethylated HERV-K had shorter mean progression-free survival (P= 0.018) and mean overall survival (P= 0.018) than did patients with higher methylation levels of HERV-K.Conclusions:Methylation levels of HERV-K, HERV-E, and LINE-1 are decreased during OCCC multistep carcinogenesis. Moreover, HERV-K hypomethylation is a promising biomarker for predicting OCCC treatment response and prognosis.
Collapse
|
32
|
Singh V, Mishra RK. RISCI--Repeat Induced Sequence Changes Identifier: a comprehensive, comparative genomics-based, in silico subtractive hybridization pipeline to identify repeat induced sequence changes in closely related genomes. BMC Bioinformatics 2010; 11:609. [PMID: 21184688 PMCID: PMC3024322 DOI: 10.1186/1471-2105-11-609] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2009] [Accepted: 12/26/2010] [Indexed: 01/19/2023] Open
Abstract
Background - The availability of multiple whole genome sequences has facilitated in silico identification of fixed and polymorphic transposable elements (TE). Whereas polymorphic loci serve as makers for phylogenetic and forensic analysis, fixed species-specific transposon insertions, when compared to orthologous loci in other closely related species, may give insights into their evolutionary significance. Besides, TE insertions are not isolated events and are frequently associated with subtle sequence changes concurrent with insertion or post insertion. These include duplication of target site, 3' and 5' flank transduction, deletion of the target locus, 5' truncation or partial deletion and inversion of the transposon, and post insertion changes like inter or intra element recombination, disruption etc. Although such changes have been studied independently, no automated platform to identify differential transposon insertions and the associated array of sequence changes in genomes of the same or closely related species is available till date. To this end, we have designed RISCI - 'Repeat Induced Sequence Changes Identifier' - a comprehensive, comparative genomics-based, in silico subtractive hybridization pipeline to identify differential transposon insertions and associated sequence changes using specific alignment signatures, which may then be examined for their downstream effects. Results - We showcase the utility of RISCI by comparing full length and truncated L1HS and AluYa5 retrotransposons in the reference human genome with the chimpanzee genome and the alternate human assemblies (Celera and HuRef). Comparison of the reference human genome with alternate human assemblies using RISCI predicts 14 novel polymorphisms in full length L1HS, 24 in truncated L1HS and 140 novel polymorphisms in AluYa5 insertions, besides several insertion and post insertion changes. We present comparison with two previous studies to show that RISCI predictions are broadly in agreement with earlier reports. We also demonstrate its versatility by comparing various strains of Mycobacterium tuberculosis for IS 6100 insertion polymorphism. Conclusions - RISCI combines comparative genomics with subtractive hybridization, inferring changes only when exclusive to one of the two genomes being compared. The pipeline is generic and may be applied to most transposons and to any two or more genomes sharing high sequence similarity. Such comparisons, when performed on a larger scale, may pull out a few critical events, which may have seeded the divergence between the two species under comparison.
Collapse
Affiliation(s)
- Vipin Singh
- Centre for Cellular and Molecular Biology, Hyderabad, India.
| | | |
Collapse
|
33
|
When one is better than two: RNA with dual functions. Biochimie 2010; 93:633-44. [PMID: 21111023 DOI: 10.1016/j.biochi.2010.11.004] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2010] [Accepted: 11/17/2010] [Indexed: 11/23/2022]
Abstract
The central dogma of biology, until not long ago, held that genetic information stored on DNA molecules was translated into the final protein products through RNA as intermediate molecules. Then, an additional level of complexity in the regulation of genome expression was added, implicating new classes of RNA molecules called non-coding RNA (ncRNA). These ncRNA are also often referred to as functional RNA in that, although they do not contain the capacity to encode proteins, do have a function as RNA molecules. They have been thus far considered as truly non-coding RNA since no ORF long enough to be considered, nor protein, have been associated with them. However, the recent identification and characterization of bifunctional RNA, i.e. RNA for which both coding capacity and activity as functional RNA have been reported, suggests that a definite categorization of some RNA molecules is far from being straightforward. Indeed, several RNA primarily classified as non-protein-coding RNA has been showed to hold coding capacities and associated peptides. Conversely, mRNA, usually regarded as strictly protein-coding, may act as functional RNA molecules. Here, we describe several examples of these bifunctional RNA that have been already characterized from bacteria to mammals. We also extend this concept to fortuitous acquisition of dual function in pathological conditions and to the recently highlighted duality between information carried by a gene and its pseudogenes counterparts.
Collapse
|
34
|
Vakhrusheva AA, Kazanov MD, Mironov AA, Bazykin GA. Evolution of prokaryotic genes by shift of stop codons. J Mol Evol 2010; 72:138-46. [PMID: 21082168 DOI: 10.1007/s00239-010-9408-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2010] [Accepted: 10/29/2010] [Indexed: 11/30/2022]
Abstract
De novo origin of coding sequence remains an obscure issue in molecular evolution. One of the possible paths for addition (subtraction) of DNA segments to (from) a gene is stop codon shift. Single nucleotide substitutions can destroy the existing stop codon, leading to uninterrupted translation up to the next stop codon in the gene's reading frame, or create a premature stop codon via a nonsense mutation. Furthermore, short indels-caused frameshifts near gene's end may lead to premature stop codons or to translation past the existing stop codon. Here, we describe the evolution of the length of coding sequence of prokaryotic genes by change of positions of stop codons. We observed cases of addition of regions of 3'UTR to genes due to mutations at the existing stop codon, and cases of subtraction of C-terminal coding segments due to nonsense mutations upstream of the stop codon. Many of the observed stop codon shifts cannot be attributed to sequencing errors or rare deleterious variants segregating within bacterial populations. The additions of regions of 3'UTR tend to occur in those genes in which they are facilitated by nearby downstream in-frame triplets which may serve as new stop codons. Conversely, subtractions of coding sequence often give rise to in-frame stop codons located nearby. The amino acid composition of the added region is significantly biased, compared to the overall amino acid composition of the genes. Our results show that in prokaryotes, shift of stop codon is an underappreciated contributor to functional evolution of gene length.
Collapse
Affiliation(s)
- Anna A Vakhrusheva
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Vorbyevy Gory 1-73, Moscow 119992, Russia
| | | | | | | |
Collapse
|
35
|
Huh JW, Kim YH, Kim DS, Park SJ, Lee SR, Kim SH, Kim E, Kim SU, Kim MS, Kim HS, Chang KT. Alu-derived old world monkeys exonization event and experimental validation of the LEPR gene. Mol Cells 2010; 30:201-7. [PMID: 20803091 DOI: 10.1007/s10059-010-0108-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2010] [Revised: 05/20/2010] [Accepted: 05/27/2010] [Indexed: 11/28/2022] Open
Abstract
The leptin receptor (LEPR) is a crucial regulatory protein that interacts with Leptin. In our analysis of LEPR, novel AluJb-derived alternative transcripts were identified in the genome of the rhesus monkey. In order to investigate the occurrence of AluJb-derived alternative transcripts and the mechanism underlying exonization events, we conducted analyses using a number of primate genomic DNAs and adipose RNAs of tissue and primary cells derived from the crab-eating monkey. Our results demonstrate that the AluJb element has been integrated into our common ancestor genome prior to the divergence of simians and prosimians. The lineage-specific exonization event of the LEPR gene in chimpanzees, orangutans, and Old World monkeys appear to have been accomplished via transition mutations of the 5' splicing site (second position of C to T). However, in New World monkeys and prosimians, the AluJb-related LEPR transcript should be silenced by the additional transversion mutation (fourth position of T to G). The AluJb-related transcript of human LEPR should also be silenced by a mutation of the 5' splicing site (first position of G to A) and the insertion of one nucleotide sequence (minus fourth position of A). Our data suggests that lineage-specific exonization events should be determined by the combination event of the formation of splicing sites and protection against site-specific mutation pressures. These evolutionary mechanisms could be major sources for primate diversification.
Collapse
Affiliation(s)
- Jae-Won Huh
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Ochang, 363-883, Korea
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Kim DS, Huh JW, Kim YH, Park SJ, Kim HS, Chang KT. Bioinformatic analysis of TE-spliced new exons within human, mouse and zebrafish genomes. Genomics 2010; 96:266-71. [PMID: 20728532 DOI: 10.1016/j.ygeno.2010.08.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Revised: 08/10/2010] [Accepted: 08/13/2010] [Indexed: 10/19/2022]
Abstract
Recent studies indicate major roles for transposable elements (TEs) in alternative splicing. In this study, we conducted genome-wide alternative splicing analyses focusing on new internal exon birth derived from TEs in human, mouse, and zebrafish genomes. We identified two different exon sets, TE-spliced exons and non-TE-spliced exons. The proportion of TE-spliced exons was nearly twice as high as the proportion of non-TE-spliced exons in the coding sequence (CDS) region. Detailed analysis of various families of TEs in three different species of TE-spliced exons revealed a different pattern in zebrafish. In our analysis, we could identify the functional role of TE insertions in the vertebrate genome affecting mRNA splicing machinery. Their effects can be directly linked to the shift from constitutive to alternative splicing during primate evolution. Our results indicate that TEs have a significant effect on shaping new internal exons in human, mouse, and zebrafish transcriptomes.
Collapse
Affiliation(s)
- Dae-Soo Kim
- National Primate Research Center (NPRC), KRIBB, Ochang, Chungbuk 363-883, Republic of Korea
| | | | | | | | | | | |
Collapse
|
37
|
Kumar RP, Senthilkumar R, Singh V, Mishra RK. Repeat performance: how do genome packaging and regulation depend on simple sequence repeats? Bioessays 2010; 32:165-74. [PMID: 20091758 DOI: 10.1002/bies.200900111] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Non-coding DNA has consistently increased during evolution of higher eukaryotes. Since the number of genes has remained relatively static during the evolution of complex organisms, it is believed that increased degree of sophisticated regulation of genes has contributed to the increased complexity. A higher proportion of non-coding DNA, including repeats, is likely to provide more complex regulatory potential. Here, we propose that repeats play a regulatory role by contributing to the packaging of the genome during cellular differentiation. Repeats, and in particular the simple sequence repeats, are proposed to serve as landmarks that can target regulatory mechanisms to a large number of genomic sites with the help of very few factors and regulate the linked loci in a coordinated manner. Repeats may, therefore, function as common target sites for regulatory mechanisms involved in the packaging and dynamic compartmentalization of the chromatin into active and inactive regions during cellular differentiation.
Collapse
Affiliation(s)
- Ram Parikshan Kumar
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500 007, India
| | | | | | | |
Collapse
|
38
|
Jintaridth P, Mutirangura A. Distinctive patterns of age-dependent hypomethylation in interspersed repetitive sequences. Physiol Genomics 2010; 41:194-200. [PMID: 20145203 DOI: 10.1152/physiolgenomics.00146.2009] [Citation(s) in RCA: 145] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Interspersed repetitive sequences (IRSs) are a major contributor to genome size and may contribute to cellular functions. IRSs are subdivided according to size and functionally related structures into short interspersed elements, long interspersed elements (LINEs), DNA transposons, and LTR-retrotransposons. Many IRSs may produce RNA and regulate genes by a variety of mechanisms. The majority of DNA methylation occurs in IRSs and is believed to suppress IRS activities. Global hypomethylation, or the loss of genome-wide methylation, is a common epigenetic event not only in senescent cells but also in cancer cells. Loss of LINE-1 methylation has been characterized in many cancers. Here, we evaluated the methylation levels of peripheral blood mononuclear cells of LINE-1, Alu, and human endogenous retrovirus K (HERV-K) in 177 samples obtained from volunteers between 20 and 88 yr of age. Age was negatively associated with methylation levels of Alu (r = -0.452, P < 10(-3)) and HERV-K (r = -0.326, P < 10(-3)) but not LINE-1 (r = 0.145, P = 0.055). Loss of methylation of Alu occurred during ages 34-68 yr, and loss of methylation of HERV-K occurred during ages 40-63 yr and again during ages 64-83 yr. Interestingly, methylation of Alu and LINE-1 are directly associated, particularly at ages 49 yr and older (r = 0.49, P < 10(-3)). Therefore, only some types of IRSs lose methylation at certain ages. Moreover, Alu and HERV-K become hypomethylated differently. Finally, there may be several mechanisms of global methylation. However, not all of these mechanisms are age-dependent. This finding may lead to a better understanding of not only the biological causes and consequences of genome-wide hypomethylation but also the role of IRSs in the aging process.
Collapse
Affiliation(s)
- Pornrutsami Jintaridth
- Department of Tropical Nutrition and Food Science, Faculty of Tropical Medicine, Mahidol University
| | | |
Collapse
|
39
|
Human endogenous retroviral long terminal repeat sequences as cell type-specific promoters in retroviral vectors. J Virol 2009; 83:12643-50. [PMID: 19741000 DOI: 10.1128/jvi.00858-09] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The human genome contains more than half a million human endogenous retrovirus (HERV) long terminal repeats (LTRs) that can be regarded as mobile regulatory modules. Many of these HERV LTRs have been recruited during evolution as transcriptional control elements for cellular gene expression. We have cloned LTR sequences from two HERV families, HERV-H and HERV-L, differing widely in their activity and tissue specificity into a murine leukemia virus (MLV)-based promoter conversion vector (ProCon). Various human cell lines were infected with the HERV-MLV hybrid vectors, and cell type-specific expression of the reporter gene was compared with the promoter specificity of the corresponding HERV LTRs in transient-transfection assays. Transcription start site analysis of HERV-MLV hybrid vectors revealed preferential use of the HERV promoter initiation site. Our data show that HERV LTRs function in the context of retroviral vectors in certain cell types and have the potential to be useful as cell type-specific promoters in vector construction.
Collapse
|
40
|
Chordate roots of the vertebrate nervous system: expanding the molecular toolkit. Nat Rev Neurosci 2009; 10:736-46. [PMID: 19738625 DOI: 10.1038/nrn2703] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The vertebrate brain is highly complex with millions to billions of neurons. During development, the neural plate border region gives rise to the neural crest, cranial placodes and, in anamniotes, to Rohon-Beard sensory neurons, whereas the boundary region of the midbrain and hindbrain develops organizer properties. Comparisons of developmental gene expression and neuroanatomy between vertebrates and the basal chordate amphioxus, which has only thousands of neurons and lacks a neural crest, most placodes and a midbrain-hindbrain organizer, indicate that these vertebrate features were built on a foundation already present in the ancestral chordate. Recent advances in genomics have provided insights into the elaboration of the molecular toolkit at the invertebrate-vertebrate transition that may have facilitated the evolution of these vertebrate characteristics.
Collapse
|
41
|
Ying M, Zhan Z, Wang W, Chen D. Origin and evolution of ubiquitin-conjugating enzymes from Guillardia theta nucleomorph to hominoid. Gene 2009; 447:72-85. [PMID: 19664694 DOI: 10.1016/j.gene.2009.07.021] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2009] [Revised: 07/24/2009] [Accepted: 07/29/2009] [Indexed: 11/19/2022]
Abstract
The origin of eukaryotic ubiquitin-conjugating enzymes (E2s) can be traced back to the Guillardia theta nucleomorph about 2500 million years ago (Mya). E2s are largely vertically inherited over eukaryotic evolution [Lespinet, O., Wolf, Y.I., Koonin, E.V., Aravind, L., 2002. The role of lineage-specific gene family expansion in the evolution of eukaryotes. Genome Res. 1048-1059], while mammal E2s experienced evolution of multigene families by gene duplications which have been accompanied by the increase in the species complexity. Because of alternatively splicing, primate-specific expansions of E2s happened once again at a transcriptional level. Both of them resulted in increasing genomic complexity and diversity of primate E2 proteomic function. The evolutionary processes of human E2 gene structure during expansions were accompanied by exon duplication and exonization of intronic sequences. Exonizations of Transposable Elements (TEs) in UBE2D3, UBE2L3 and UBE2V1 genes from primates indicate that exaptation of TEs also plays important roles in the structural innovation of primate-specific E2s and may create alternative splicing isoforms at a transcriptional level. Estimates for the ratio of dN/dS suggest that a strong purifying selection had acted upon protein-coding sequences of their orthologous UBE2D2, UBE2A, UBE2N, UBE2I and Rbx1 genes from animals, plants and fungi. The similar rates of synonymous substitutions are in accordance with the neutral mutation-random drift hypothesis of molecular evolution. Systematic detection of the origin and evolution of E2s, analyzing the evolution of E2 multigene families by gene duplications and the evolutionary processes of E2s during expansions, and testing its evolutionary force using E2s from distant phylogenetic lineages may advance our distinguishing of ancestral E2s from created E2s, and reveal previously unknown relationships between E2s and metazoan complexity. Analysis of these conserved proteins provides strong support for a close relationship between social amoeba and eukaryote, choanoflagellate and metazoan, and for the central roles of social amoeba and choanoflagellate in the origin and evolution of eukaryote and metazoan. Retracing the different stages of primate E2 exonization by monitoring genomic events over 63 Myr of primate evolution will advance our understanding of how TEs dynamically modified primate transcriptome and proteome in the past, and continue to do so.
Collapse
Affiliation(s)
- Muying Ying
- State Key Laboratory of Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, PR China
| | | | | | | |
Collapse
|
42
|
|
43
|
Corvelo A, Eyras E. Exon creation and establishment in human genes. Genome Biol 2009; 9:R141. [PMID: 18811936 PMCID: PMC2592719 DOI: 10.1186/gb-2008-9-9-r141] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2008] [Revised: 08/16/2008] [Accepted: 09/23/2008] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND A large proportion of species-specific exons are alternatively spliced. In primates, Alu elements play a crucial role in the process of exon creation but many new exons have appeared through other mechanisms. Despite many recent studies, it is still unclear which are the splicing regulatory requirements for de novo exonization and how splicing regulation changes throughout an exon's lifespan. RESULTS Using comparative genomics, we have defined sets of exons with different evolutionary ages. Younger exons have weaker splice-sites and lower absolute values for the relative abundance of putative splicing regulators between exonic and adjacent intronic regions, indicating a less consolidated splicing regulation. This relative abundance is shown to increase with exon age, leading to higher exon inclusion. We show that this local difference in the density of regulators might be of biological significance, as it outperforms other measures in real exon versus pseudo-exon classification. We apply this new measure to the specific case of the exonization of anti-sense Alu elements and show that they are characterized by a general lack of exonic splicing silencers. CONCLUSIONS Our results suggest that specific sequence environments are required for exonization and that these can change with time. We propose a model of exon creation and establishment in human genes, in which splicing decisions depend on the relative local abundance of regulatory motifs. Using this model, we provide further explanation as to why Alu elements serve as a major substrate for exon creation in primates. Finally, we discuss the benefits of integrating such information in gene prediction.
Collapse
Affiliation(s)
- André Corvelo
- Computational Genomics, Universitat Pompeu Fabra, Barcelona, Spain
| | | |
Collapse
|
44
|
Abstract
We are in the midst of a revolution in the genomic sciences that will forever change the way we view biology and medicine, particularly with respect to brain form, function, development, evolution, plasticity, neurological disease pathogenesis and neural regenerative potential. The application of epigenetic principles has already begun to identify and characterize previously unrecognized molecular signatures of disease latency, onset and progression, mechanisms underlying disease pathogenesis, and responses to new and evolving therapeutic modalities. Moreover, epigenomic medicine promises to usher in a new era of neurological therapeutics designed to promote disease prevention and recovery of seemingly lost neurological function via reprogramming of stem cells, redirecting cell fate decisions and dynamically modulating neural network plasticity and connectivity.
Collapse
Affiliation(s)
- Mark F Mehler
- Institute for Brain Disorders and Neural Regeneration, Albert Einstein College of Medicine, Bronx, NY 10461, USA.
| |
Collapse
|
45
|
Cooperative exonization of MaLR and AluJo elements contributed an alternative promoter and novel splice variants of RNF19. Gene 2008; 424:63-70. [PMID: 18721867 DOI: 10.1016/j.gene.2008.07.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2008] [Revised: 07/01/2008] [Accepted: 07/28/2008] [Indexed: 11/22/2022]
Abstract
The RNF19 protein, which contains RING-finger and IBR motifs, acts as an E3 ubiquitin ligase localized to Lewy bodies. RNF19 is located on human chromosome 8q22.2, has a 4.4-kb transcript, and is ubiquitously expressed in various tissues. Here, we identified an alternative RNF19 promoter region and alternative RNF19 transcripts derived from MaLR (mammalian apparent LTR-retrotransposon) and AluJo elements. Comparative analyses indicated human-specific expression of the MaLR- and AluJo-related transcripts. From the expression analysis of 72 tissue samples including human normal, tumor, and primate tissues, three different isoforms (V1, V2, and V3) of MaLR-derived transcripts were identified. Quantitative RT-PCR analysis showed a dominant expression pattern of the V2 MaLR-derived transcript. A reporter gene assay for MaLR element promoter activity indicated that pGL2-RNF19/MaLR in the forward orientation is capable of driving luciferase gene expression in Cos7 and HCT116 cells. These findings suggest that RNF19 has acquired a new promoter and alternative exons via continuous retrotransposition events of MaLR and AluJo elements during mammalian and primate evolution, respectively.
Collapse
|
46
|
Piriyapongsa J, Rutledge MT, Patel S, Borodovsky M, Jordan IK. Evaluating the protein coding potential of exonized transposable element sequences. Biol Direct 2007; 2:31. [PMID: 18036258 PMCID: PMC2203978 DOI: 10.1186/1745-6150-2-31] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2007] [Accepted: 11/26/2007] [Indexed: 11/10/2022] Open
Abstract
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.).
Collapse
|