1
|
Chen M, Yang C, Zhai X, Wang C, Liu M, Zhang B, Guo X, Wang Y, Li H, Liu Y, Han J, Wang X, Li J, Jia L, Li L. Comprehensive Identification and Characterization of HML-9 Group in Chimpanzee Genome. Viruses 2024; 16:892. [PMID: 38932184 PMCID: PMC11209481 DOI: 10.3390/v16060892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 05/23/2024] [Accepted: 05/27/2024] [Indexed: 06/28/2024] Open
Abstract
Endogenous retroviruses (ERVs) are related to long terminal repeat (LTR) retrotransposons, comprising gene sequences of exogenous retroviruses integrated into the host genome and inherited according to Mendelian law. They are considered to have contributed greatly to the evolution of host genome structure and function. We previously characterized HERV-K HML-9 in the human genome. However, the biological function of this type of element in the genome of the chimpanzee, which is the closest living relative of humans, largely remains elusive. Therefore, the current study aims to characterize HML-9 in the chimpanzee genome and to compare the results with those in the human genome. Firstly, we report the distribution and genetic structural characterization of the 26 proviral elements and 38 solo LTR elements of HML-9 in the chimpanzee genome. The results showed that the distribution of these elements displayed a non-random integration pattern, and only six elements maintained a relatively complete structure. Then, we analyze their phylogeny and reveal that the identified elements all cluster together with HML-9 references and with those identified in the human genome. The HML-9 integration time was estimated based on the 2-LTR approach, and the results showed that HML-9 elements were integrated into the chimpanzee genome between 14 and 36 million years ago and into the human genome between 18 and 49 mya. In addition, conserved motifs, cis-regulatory regions, and enriched PBS sequence features in the chimpanzee genome were predicted based on bioinformatics. The results show that pathways significantly enriched for ERV LTR-regulated genes found in the chimpanzee genome are closely associated with disease development, including neurological and neurodevelopmental psychiatric disorders. In summary, the identification, characterization, and genomics of HML-9 presented here not only contribute to our understanding of the role of ERVs in primate evolution but also to our understanding of their biofunctional significance.
Collapse
Affiliation(s)
- Mingyue Chen
- National 111 Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering, Hubei University of Technology, Wuhan 430068, China;
| | - Caiqin Yang
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Xiuli Zhai
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei 230032, China
| | - Chunlei Wang
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei 230032, China
| | - Mengying Liu
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
- School of Life Sciences, Tsinghua University, Beijing 100084, China
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Bohan Zhang
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Xing Guo
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei 230032, China
| | - Yanglan Wang
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Hanping Li
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Yongjian Liu
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Jingwan Han
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Xiaolin Wang
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Jingyun Li
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Lei Jia
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| | - Lin Li
- State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China; (C.Y.); (X.Z.); (C.W.); (M.L.); (B.Z.); (X.G.); (Y.W.); (H.L.); (Y.L.); (J.H.); (X.W.); (J.L.)
| |
Collapse
|
2
|
Wang B, Saleh AA, Yang N, Asare E, Chen H, Wang Q, Chen C, Song C, Gao B. High Diversity of Long Terminal Repeat Retrotransposons in Compact Vertebrate Genomes: Insights from Genomes of Tetraodontiformes. Animals (Basel) 2024; 14:1425. [PMID: 38791643 PMCID: PMC11117352 DOI: 10.3390/ani14101425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 05/04/2024] [Accepted: 05/07/2024] [Indexed: 05/26/2024] Open
Abstract
This study aimed to investigate the evolutionary profile (including diversity, activity, and abundance) of retrotransposons (RTNs) with long terminal repeats (LTRs) in ten species of Tetraodontiformes. These species, Arothron firmamentum, Lagocephalus sceleratus, Pao palembangensis, Takifugu bimaculatus, Takifugu flavidus, Takifugu ocellatus, Takifugu rubripes, Tetraodon nigroviridis, Mola mola, and Thamnaconus septentrionalis, are known for having the smallest genomes among vertebrates. Data mining revealed a high diversity and wide distribution of LTR retrotransposons (LTR-RTNs) in these compact vertebrate genomes, with varying abundances among species. A total of 819 full-length LTR-RTN sequences were identified across these genomes, categorized into nine families belonging to four different superfamilies: ERV (Orthoretrovirinae and Epsilon retrovirus), Copia, BEL-PAO, and Gypsy (Gmr, Mag, V-clade, CsRN1, and Barthez). The Gypsy superfamily exhibited the highest diversity. LTR family distribution varied among species, with Takifugu bimaculatus, Takifugu flavidus, Takifugu ocellatus, and Takifugu rubripes having the highest richness of LTR families and sequences. Additionally, evidence of recent invasions was observed in specific tetraodontiform genomes, suggesting potential transposition activity. This study provides insights into the evolution of LTR retrotransposons in Tetraodontiformes, enhancing our understanding of their impact on the structure and evolution of host genomes.
Collapse
Affiliation(s)
- Bingqing Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Ahmed A. Saleh
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
- Animal and Fish Production Department, Faculty of Agriculture (Al-Shatby), Alexandria University, Alexandria 11865, Egypt
| | - Naisu Yang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Emmanuel Asare
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Hong Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Quan Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Cai Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Chengyi Song
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| | - Bo Gao
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (B.W.); (A.A.S.); (N.Y.); (E.A.); (H.C.); (Q.W.); (C.C.); (C.S.)
| |
Collapse
|
3
|
Wang C, Zhai X, Wang S, Zhang B, Yang C, Song Y, Li H, Liu Y, Han J, Wang X, Li J, Chen M, Jia L, Li L. Comprehensive characterization of ERV-K (HML-8) in the chimpanzee genome revealed less genomic activity than humans. Front Cell Infect Microbiol 2024; 14:1349046. [PMID: 38456081 PMCID: PMC10918009 DOI: 10.3389/fcimb.2024.1349046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 02/06/2024] [Indexed: 03/09/2024] Open
Abstract
Endogenous retroviruses (ERVs) originate from ancestral germline infections caused by exogenous retroviruses. Throughout evolution, they have become fixed within the genome of the animals into which they were integrated. As ERV elements coevolve with the host, they are normally epigenetically silenced and can become upregulated in a series of physiological and pathological processes. Generally, a detailed ERV profile in the host genome is critical for understanding the evolutionary history and functional performance of the host genome. We previously characterized and cataloged all the ERV-K subtype HML-8 loci in the human genome; however, this has not been done for the chimpanzee, the nearest living relative of humans. In this study, we aimed to catalog and characterize the integration of HML-8 in the chimpanzee genome and compare it with the integration of HML-8 in the human genome. We analyzed the integration of HML-8 and found that HML-8 pervasively invaded the chimpanzee genome. A total of 76 proviral elements were characterized on 23/24 chromosomes, including detailed elements distribution, structure, phylogeny, integration time, and their potential to regulate adjacent genes. The incomplete structure of HML-8 proviral LTRs will undoubtedly affect their activity. Moreover, the results indicated that HML-8 integration occurred before the divergence between humans and chimpanzees. Furthermore, chimpanzees include more HML-8 proviral elements (76 vs. 40) and fewer solo long terminal repeats (LTR) (0 vs. 5) than humans. These results suggested that chimpanzee genome activity is less than the human genome and that humans may have a better ability to shape and screen integrated proviral elements. Our work is informative in both an evolutionary and a functional context for ERVs.
Collapse
Affiliation(s)
- Chunlei Wang
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei, Anhui, China
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Xiuli Zhai
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei, Anhui, China
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Shibo Wang
- National 111 Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering, Hubei University of Technology, Wuhan, Hubei, China
| | - Bohan Zhang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Caiqin Yang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Yanmei Song
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Hanping Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Yongjian Liu
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Jingwan Han
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Xiaolin Wang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Jingyun Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Mingyue Chen
- National 111 Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering, Hubei University of Technology, Wuhan, Hubei, China
| | - Lei Jia
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Lin Li
- Department of Microbiology, School of Basic Medicine, Anhui Medical University, Hefei, Anhui, China
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, China
- State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| |
Collapse
|
4
|
Chiu ES, McDonald CA, Gagne RB, Dunkleberger H, Moxcey M, VandeWoude S. Endogenous feline leukemia virus long terminal repeat integration site diversity is highly variable in related and unrelated domestic cats. Retrovirology 2024; 21:3. [PMID: 38347535 PMCID: PMC10863107 DOI: 10.1186/s12977-024-00635-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Accepted: 01/30/2024] [Indexed: 02/15/2024] Open
Abstract
Endogenous retroviruses (ERV) are indicators of vertebrate evolutionary history and play important roles as homeostatic regulators. ERV long terminal repeat (LTR) elements may act as cis-activating promoters or trans-activating enhancer elements modifying gene transcription distant from LTR insertion sites. We previously documented that endogenous feline leukemia virus (FeLV)-LTR copy number variation in individual cats tracks inversely with susceptibility to virulent FeLV disease. To evaluate FeLV-LTR insertion characteristics, we assessed enFeLV-LTR integration site diversity in 20 cats from three genetically distinct populations using a baited linker-mediated PCR approach. We documented 765 individual integration sites unequally represented among individuals. Only three LTR integration sites were shared among all individuals, while 412 sites were unique to a single individual. When primary fibroblast cultures were challenged with exogenous FeLV, we found significantly increased expression of both exogenous and endogenous FeLV orthologs, supporting previous findings of potential exFeLV-enFeLV interactions; however, viral challenge did not elicit transcriptional changes in genes associated with the vast majority of integration sites. This study assesses FeLV-LTR integration sites in individual animals, providing unique transposome genotypes. Further, we document substantial individual variation in LTR integration site locations, even in a highly inbred population, and provide a framework for understanding potential endogenous retroviral element position influence on host gene transcription.
Collapse
Affiliation(s)
- Elliott S Chiu
- Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, 80523, USA.
| | - Coby A McDonald
- Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Roderick B Gagne
- Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Henry Dunkleberger
- Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, 80523, USA
| | | | - Sue VandeWoude
- Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, 80523, USA.
| |
Collapse
|
5
|
Smith KC, Blanchong JA. Detection of lymphoproliferative disease virus in Iowa Wild Turkeys (Meleagris gallopavo): Comparison of two sections of the proviral genome. PLoS One 2024; 19:e0296856. [PMID: 38346036 PMCID: PMC10861079 DOI: 10.1371/journal.pone.0296856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 12/20/2023] [Indexed: 02/15/2024] Open
Abstract
An accurate diagnostic test is an essential aspect of successfully monitoring and managing wildlife diseases. Lymphoproliferative Disease Virus (LPDV) is an avian retrovirus that was first identified in domestic turkeys in Europe and was first reported in a Wild Turkey (Meleagris gallopavo) in the United States in 2009. It has since been found to be widely distributed throughout North America. The majority of studies have utilized bone marrow and PCR primers targeting a 413-nucleotide sequence of the gag gene of the provirus to detect infection. While prior studies have evaluated the viability of other tissues for LPDV detection (whole blood, spleen, liver, cloacal swabs) none to date have studied differences in detection rates when utilizing different genomic regions of the provirus. This study examined the effectiveness of another section of the provirus, a 335-nucleotide sequence starting in the U3 region of the LTR (Long Terminal Repeat) and extending into the Matrix of the gag region (henceforth LTR), for detecting LPDV. Bone marrow samples from hunter-harvested Wild Turkeys (n = 925) were tested for LPDV with the gag gene and a subset (n = 417) including both those testing positive and those where LPDV was not detected was re-tested with LTR. The positive percent agreement (PPA) was 97.1% (68 of 70 gag positive samples tested positive with LTR) while the negative percent agreement (NPA) was only 68.0% (236 of 347 gag negative samples tested negative with LTR). Cohen's Kappa (κ = 0.402, Z = 10.26, p<0.0001) and the McNemar test (OR = 55.5, p<0.0001) indicated weak agreement between the two gene regions. We found that in Iowa Wild Turkeys use of the LTR region identified LPDV in many samples in which we failed to detect LPDV using the gag region and that LTR may be more appropriate for LPDV surveillance and monitoring. However, neither region of the provirus resulted in perfect detection and additional work is necessary to determine if LTR is more reliable in other geographic regions where LPDV occurs.
Collapse
Affiliation(s)
- Kelsey C. Smith
- Department of Natural Resource Ecology and Management, Iowa State University, Ames, Iowa, United States of Ameria
| | - Julie A. Blanchong
- Department of Natural Resource Ecology and Management, Iowa State University, Ames, Iowa, United States of Ameria
| |
Collapse
|
6
|
Fandiño S, Gomez-Lucia E, Benítez L, Doménech A. Comparison of Endogenous Alpharetroviruses (ALV-like) across Galliform Species: New Distant Proviruses. Microorganisms 2023; 12:86. [PMID: 38257913 PMCID: PMC10820513 DOI: 10.3390/microorganisms12010086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Revised: 12/22/2023] [Accepted: 12/25/2023] [Indexed: 01/24/2024] Open
Abstract
The Genus Alpharetrovirus contains viruses pathogenic mainly for chickens, forming the Avian Sarcoma and Leukosis Virus group (ASLV). Cells of most Galliform species, besides chickens, contain genetic elements (endogenous retroviruses, ERVs) that could recombine with other alpharetroviruses or express proteins, complementing defective ASLV, which may successfully replicate and cause disease. However, they are quite unknown, and only ALV-F, from ring-necked pheasants, has been partially published. Upon scrutiny of 53 genomes of different avian species, we found Alpharetrovirus-like sequences only in 12 different Galliformes, including six full-length (7.4-7.6 Kbp) and 27 partial sequences. Phylogenetic studies of the regions studied (LTR, gag, pol, and env) consistently resulted in five almost identical clades containing the same ERVs: Clade I (presently known ASLVs); Clade II (Callipepla spp. ERVs); Clade IIIa (Phasianus colchicus ERVs); Clade IIIb (Alectoris spp. ERVs); and Clade IV (Centrocercus spp. ERVs). The low pol identity scores suggested that each of these Clades may be considered a different species. ORF analysis revealed that putatively encoded proteins would be very similar in length and domains to those of other alpharetroviruses and thus potentially functional. This will undoubtedly contribute to better understanding the biology of defective viruses, especially in wild Galliformes, their evolution, and the danger they may represent for other wild species and the poultry industry.
Collapse
Affiliation(s)
- Sergio Fandiño
- Department of Animal Health, Veterinary Faculty, Complutense University of Madrid, Av. Puerta de Hierro s/n, 28040 Madrid, Spain; (S.F.); (A.D.)
- Department of Genetics, Physiology and Microbiology, Faculty of Biological Sciences, Complutense University of Madrid (UCM), C. de José Antonio Novais 12, 28040 Madrid, Spain;
- Research Group, “Animal Viruses” of Complutense University of Madrid, 28040 Madrid, Spain
| | - Esperanza Gomez-Lucia
- Department of Animal Health, Veterinary Faculty, Complutense University of Madrid, Av. Puerta de Hierro s/n, 28040 Madrid, Spain; (S.F.); (A.D.)
- Research Group, “Animal Viruses” of Complutense University of Madrid, 28040 Madrid, Spain
| | - Laura Benítez
- Department of Genetics, Physiology and Microbiology, Faculty of Biological Sciences, Complutense University of Madrid (UCM), C. de José Antonio Novais 12, 28040 Madrid, Spain;
- Research Group, “Animal Viruses” of Complutense University of Madrid, 28040 Madrid, Spain
| | - Ana Doménech
- Department of Animal Health, Veterinary Faculty, Complutense University of Madrid, Av. Puerta de Hierro s/n, 28040 Madrid, Spain; (S.F.); (A.D.)
- Research Group, “Animal Viruses” of Complutense University of Madrid, 28040 Madrid, Spain
| |
Collapse
|
7
|
Ishihara S. Detection of long terminal repeat loci derived from endogenous retrovirus in junglefowl using whole-genome sequencing. Sci Rep 2023; 13:7380. [PMID: 37149699 PMCID: PMC10164170 DOI: 10.1038/s41598-023-34520-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 05/03/2023] [Indexed: 05/08/2023] Open
Abstract
Endogenous retroviruses (ERVs) are genetic elements present in the genome that retain traces of past viral infections. Characterization of ERVs can provide crucial insights into avian evolution. This study aimed to identify novel long terminal repeat (LTR) loci derived from ERVs (ERV-LTRs) absent in the reference genome using whole-genome sequencing data of red junglefowl, gray junglefowl, Ceylon junglefowl, and green junglefowl. In total, 835 ERV-LTR loci were identified across the four Gallus species. The numbers of ERV-LTRs loci detected in red junglefowl and its subspecies gray junglefowl, Ceylon junglefowl, and green junglefowl were 362, 216, 193, and 128, respectively. The phylogenetic tree was congruent with previously reported trees, suggesting the potential for inferring relationships among past junglefowl populations from the identified ERV-LTR loci. Of the detected loci, 306 ERV-LTRs were identified near or within the genes, and some were associated with cell adhesion. The detected ERV-LTR sequences were classified as endogenous avian retrovirus family, avian leukosis virus subgroup E, Ovex-1, and murine leukemia virus-related ERVs. In addition, the sequence of the EAV family was divided into four patterns by combining the U3, R, and U5 regions. These findings contribute to a more comprehensive understanding of the characteristics of junglefowl ERVs.
Collapse
Affiliation(s)
- Shinya Ishihara
- Department of Animal Science, Nippon Veterinary and Life Science University, 1-7-1 Kyonancho, Musashino, Tokyo, 180-8602, Japan.
| |
Collapse
|
8
|
Jia GS, Zhang WC, Liang Y, Liu XH, Rhind N, Pidoux A, Brysch-Herzberg M, Du LL. A high-quality reference genome for the fission yeast Schizosaccharomyces osmophilus. G3 (BETHESDA, MD.) 2023; 13:jkad028. [PMID: 36748990 PMCID: PMC10085805 DOI: 10.1093/g3journal/jkad028] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 01/23/2023] [Accepted: 01/23/2023] [Indexed: 02/08/2023]
Abstract
Fission yeasts are an ancient group of fungal species that diverged from each other from tens to hundreds of million years ago. Among them is the preeminent model organism Schizosaccharomyces pombe, which has significantly contributed to our understandings of molecular mechanisms underlying fundamental cellular processes. The availability of the genomes of S. pombe and 3 other fission yeast species S. japonicus, S. octosporus, and S. cryophilus has enabled cross-species comparisons that provide insights into the evolution of genes, pathways, and genomes. Here, we performed genome sequencing on the type strain of the recently identified fission yeast species S. osmophilus and obtained a complete mitochondrial genome and a nuclear genome assembly with gaps only at rRNA gene arrays. A total of 5,098 protein-coding nuclear genes were annotated and orthologs for more than 95% of them were identified. Genome-based phylogenetic analysis showed that S. osmophilus is most closely related to S. octosporus and these 2 species diverged around 16 million years ago. To demonstrate the utility of this S. osmophilus reference genome, we conducted cross-species comparative analyses of centromeres, telomeres, transposons, the mating-type region, Cbp1 family proteins, and mitochondrial genomes. These analyses revealed conservation of repeat arrangements and sequence motifs in centromere cores, identified telomeric sequences composed of 2 types of repeats, delineated relationships among Tf1/sushi group retrotransposons, characterized the evolutionary origins and trajectories of Cbp1 family domesticated transposases, and discovered signs of interspecific transfer of 2 types of mitochondrial selfish elements.
Collapse
Affiliation(s)
- Guo-Song Jia
- National Institute of Biological Sciences, Beijing 102206, China
| | - Wen-Cai Zhang
- National Institute of Biological Sciences, Beijing 102206, China
| | - Yue Liang
- National Institute of Biological Sciences, Beijing 102206, China
| | - Xi-Han Liu
- National Institute of Biological Sciences, Beijing 102206, China
| | - Nicholas Rhind
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Alison Pidoux
- Wellcome Centre for Cell Biology, Institute of Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3BF, Scotland, UK
| | - Michael Brysch-Herzberg
- Laboratory for Wine Microbiology, Department International Business, Heilbronn University, Heilbronn 74081, Germany
| | - Li-Lin Du
- National Institute of Biological Sciences, Beijing 102206, China
- Tsinghua Institute of Multidisciplinary Biomedical Research, Tsinghua University, Beijing 102206, China
| |
Collapse
|
9
|
Poretti M, Praz CR, Sotiropoulos AG, Wicker T. A survey of lineage-specific genes in Triticeae reveals de novo gene evolution from genomic raw material. PLANT DIRECT 2023; 7:e484. [PMID: 36937792 PMCID: PMC10020141 DOI: 10.1002/pld3.484] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 01/26/2023] [Accepted: 01/27/2023] [Indexed: 06/18/2023]
Abstract
Diploid plant genomes typically contain ~35,000 genes, almost all belonging to highly conserved gene families. Only a small fraction are lineage-specific, which are found in only one or few closely related species. Little is known about how genes arise de novo in plant genomes and how often this occurs; however, they are believed to be important for plants diversification and adaptation. We developed a pipeline to identify lineage-specific genes in Triticeae, using newly available genome assemblies of wheat, barley, and rye. Applying a set of stringent criteria, we identified 5942 candidate Triticeae-specific genes (TSGs), of which 2337 were validated as protein-coding genes in wheat. Differential gene expression analyses revealed that stress-induced wheat TSGs are strongly enriched in putative secreted proteins. Some were previously described to be involved in Triticeae non-host resistance and cold response. Additionally, we show that 1079 TSGs have sequence homology to transposable elements (TEs), ~68% of them deriving from regulatory non-coding regions of Gypsy retrotransposons. Most importantly, we demonstrate that these TSGs are enriched in transmembrane domains and are among the most highly expressed wheat genes overall. To summarize, we conclude that de novo gene formation is relatively rare and that Triticeae probably possess ~779 lineage-specific genes per haploid genome. TSGs, which respond to pathogen and environmental stresses, may be interesting candidates for future targeted resistance breeding in Triticeae. Finally, we propose that non-coding regions of TEs might provide important genetic raw material for the functional innovation of TM domains and the evolution of novel secreted proteins.
Collapse
Affiliation(s)
- Manuel Poretti
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| | - Coraline R. Praz
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
- Centro de Biotecnología y Genómica de PlantasUniversidad Politécnica de Madrid (UPM)–Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA)MadridSpain
| | | | - Thomas Wicker
- Department of Plant and Microbial BiologyUniversity of ZurichZurichSwitzerland
| |
Collapse
|
10
|
Venus: An efficient virus infection detection and fusion site discovery method using single-cell and bulk RNA-seq data. PLoS Comput Biol 2022; 18:e1010636. [PMID: 36301997 PMCID: PMC9642901 DOI: 10.1371/journal.pcbi.1010636] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 11/08/2022] [Accepted: 10/04/2022] [Indexed: 11/09/2022] Open
Abstract
Early and accurate detection of viruses in clinical and environmental samples is essential for effective public healthcare, treatment, and therapeutics. While PCR detects potential pathogens with high sensitivity, it is difficult to scale and requires knowledge of the exact sequence of the pathogen. With the advent of next-gen single-cell sequencing, it is now possible to scrutinize viral transcriptomics at the finest possible resolution–cells. This newfound ability to investigate individual cells opens new avenues to understand viral pathophysiology with unprecedented resolution. To leverage this ability, we propose an efficient and accurate computational pipeline, named Venus, for virus detection and integration site discovery in both single-cell and bulk-tissue RNA-seq data. Specifically, Venus addresses two main questions: whether a tissue/cell type is infected by viruses or a virus of interest? And if infected, whether and where has the virus inserted itself into the human genome? Our analysis can be broken into two parts–validation and discovery. Firstly, for validation, we applied Venus on well-studied viral datasets, such as HBV- hepatocellular carcinoma and HIV-infection treated with antiretroviral therapy. Secondly, for discovery, we analyzed datasets such as HIV-infected neurological patients and deeply sequenced T-cells. We detected viral transcripts in the novel target of the brain and high-confidence integration sites in immune cells. In conclusion, here we describe Venus, a publicly available software which we believe will be a valuable virus investigation tool for the scientific community at large.
Collapse
|
11
|
Jia L, Liu M, Yang C, Li H, Liu Y, Han J, Zhai X, Wang X, Li T, Li J, Zhang B, Yu C, Li L. Comprehensive identification and characterization of the HERV-K (HML-9) group in the human genome. Retrovirology 2022; 19:11. [PMID: 35676699 PMCID: PMC9178832 DOI: 10.1186/s12977-022-00596-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 05/09/2022] [Indexed: 12/14/2022] Open
Abstract
Background Human endogenous retroviruses (HERVs) result from ancestral infections caused by exogenous retroviruses that became incorporated into the germline DNA and evolutionarily fixed in the human genome. HERVs can be transmitted vertically in a Mendelian fashion and be stably maintained in the human genome, of which they are estimated to comprise approximately 8%. HERV-K (HML1-10) transcription has been confirmed to be associated with a variety of diseases, such as breast cancer, lung cancer, prostate cancer, melanoma, rheumatoid arthritis, and amyotrophic lateral sclerosis. However, the poor characterization of HML-9 prevents a detailed understanding of the regulation of the expression of this family in humans and its impact on the host genome. In light of this, a precise and updated HERV-K HML-9 genomic map is urgently needed to better evaluate the role of these elements in human health. Results We report a comprehensive analysis of the presence and distribution of HERV-K HML-9 elements within the human genome, with a detailed characterization of the structural and phylogenetic properties of the group. A total of 23 proviruses and 47 solo LTR elements were characterized, with a detailed description of the provirus structure, integration time, potential regulated genes, transcription factor binding sites (TFBS), and primer binding site (PBS) features. The integration time results showed that the HML-9 elements found in the human genome integrated into the primate lineage between 17.5 and 48.5 million years ago (mya). Conclusion The results provide a clear characterization of HML-9 and a comprehensive background for subsequent functional studies. Supplementary Information The online version contains supplementary material available at 10.1186/s12977-022-00596-2.
Collapse
Affiliation(s)
- Lei Jia
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Mengying Liu
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, 100029, China
| | - Caiqin Yang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Hanping Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Yongjian Liu
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Jingwan Han
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Xiuli Zhai
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Xiaolin Wang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Tianyi Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Jingyun Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Bohan Zhang
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China
| | - Changyuan Yu
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, 100029, China.
| | - Lin Li
- Department of Virology, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China. .,State Key Laboratory of Pathogen and Biosecurity, Beijing, 100071, China.
| |
Collapse
|
12
|
Kwolek K, Kędzierska P, Hankiewicz M, Mirouze M, Panaud O, Grzebelus D, Macko‐Podgórni A. Diverse and mobile: eccDNA-based identification of carrot low-copy-number LTR retrotransposons active in callus cultures. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1811-1828. [PMID: 35426957 PMCID: PMC9324142 DOI: 10.1111/tpj.15773] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 03/15/2022] [Accepted: 03/29/2022] [Indexed: 05/28/2023]
Abstract
Long terminal repeat retrotransposons (LTR-RTs) are mobilized via an RNA intermediate using a 'copy and paste' mechanism, and account for the majority of repetitive DNA in plant genomes. As a side effect of mobilization, the formation of LTR-RT-derived extrachromosomal circular DNAs (eccDNAs) occurs. Thus, high-throughput sequencing of eccDNA can be used to identify active LTR-RTs in plant genomes. Despite the release of a reference genome assembly, carrot LTR-RTs have not yet been thoroughly characterized. LTR-RTs are abundant and diverse in the carrot genome. We identified 5976 carrot LTR-RTs, 2053 and 1660 of which were attributed to Copia and Gypsy superfamilies, respectively. They were further classified into lineages, families and subfamilies. More diverse LTR-RT lineages, i.e. lineages comprising many low-copy-number subfamilies, were more frequently associated with genic regions. Certain LTR-RT lineages have been recently active in Daucus carota. In particular, low-copy-number LTR-RT subfamilies, e.g. those belonging to the DcAle lineage, have significantly contributed to carrot genome diversity as a result of continuing activity. We utilized eccDNA sequencing to identify and characterize two DcAle subfamilies, Alex1 and Alex3, active in carrot callus. We documented 14 and 32 de novo insertions of Alex1 and Alex3, respectively, which were positioned in non-repetitive regions.
Collapse
Affiliation(s)
- Kornelia Kwolek
- Department of Plant Biology and Biotechnology, Faculty of Biotechnology and HorticultureUniversity of Agriculture in Krakow31 120KrakowPoland
| | - Patrycja Kędzierska
- Department of Plant Biology and Biotechnology, Faculty of Biotechnology and HorticultureUniversity of Agriculture in Krakow31 120KrakowPoland
| | - Magdalena Hankiewicz
- Department of Plant Biology and Biotechnology, Faculty of Biotechnology and HorticultureUniversity of Agriculture in Krakow31 120KrakowPoland
| | - Marie Mirouze
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS/UPVDUniversité de PerpignanVia Domitia, 52 Avenue Paul Alduy66 860Perpignan CedexFrance
- IRD, EMR IRD‐CNRS‐UPVD ‘MANGO’Université de PerpignanPerpignanFrance
| | - Olivier Panaud
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS/UPVDUniversité de PerpignanVia Domitia, 52 Avenue Paul Alduy66 860Perpignan CedexFrance
| | - Dariusz Grzebelus
- Department of Plant Biology and Biotechnology, Faculty of Biotechnology and HorticultureUniversity of Agriculture in Krakow31 120KrakowPoland
| | - Alicja Macko‐Podgórni
- Department of Plant Biology and Biotechnology, Faculty of Biotechnology and HorticultureUniversity of Agriculture in Krakow31 120KrakowPoland
| |
Collapse
|
13
|
Long Terminal Repeat Retrotransposon Afut4 Promotes Azole Resistance of Aspergillus fumigatus by Enhancing the Expression of sac1 Gene. Antimicrob Agents Chemother 2021; 65:e0029121. [PMID: 34516252 DOI: 10.1128/aac.00291-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Aspergillus fumigatus causes a series of invasive diseases, including the high-mortality invasive aspergillosis, and has been a serious global health threat because of its increased resistance to the first-line clinical triazoles. We analyzed the whole-genome sequence of 15 A. fumigatus strains from China and found that long terminal repeat retrotransposons (LTR-RTs), including Afut1, Afut2, Afut3, and Afut4, are most common and have the largest total nucleotide length among all transposable elements in A. fumigatus. Deleting one of the most enriched Afut4977-sac1 in azole-resistant strains decreased azole resistance and downregulated its nearby gene, sac1, but it did not significantly affect the expression of genes of the ergosterol synthesis pathway. We then discovered that 5'LTR of Afut4977-sac1 had promoter activity and enhanced the adjacent sac1 gene expression. We found that sac1 is important to A. fumigatus, and the upregulated sac1 caused elevated resistance of A. fumigatus to azoles. Finally, we showed that Afut4977-sac1 has an evolution pattern similar to that of the whole genome of azole-resistant strains due to azoles; phylogenetic analysis of both the whole genome and Afut4977-sac1 suggests that the insertion of Afut4977-sac1 might have preceded the emergence of azole-resistant strains. Taking these data together, we found that the Afut4977-sac1 LTR-RT might be involved in the regulation of azole resistance of A. fumigatus by upregulating its nearby sac1 gene.
Collapse
|
14
|
Regulation of retrotransposition in Arabidopsis. Biochem Soc Trans 2021; 49:2241-2251. [PMID: 34495315 DOI: 10.1042/bst20210337] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/10/2021] [Accepted: 08/12/2021] [Indexed: 01/01/2023]
Abstract
Plant genomes are largely comprised of retrotransposons which can replicate through 'copy and paste' mechanisms. Long terminal repeat (LTR) retrotransposons are the major class of retrotransposons in plant species, and importantly they broadly affect the expression of nearby genes. Although most LTR retrotransposons are non-functional, active retrotranspositions have been reported in plant species or mutants under normal growth condition and environmental stresses. With the well-defined reference genome and numerous mutant alleles, Arabidopsis studies have significantly expanded our understanding of retrotransposon regulation. Active LTR retrotransposon loci produce virus-like particles to perform reverse transcription, and their complementary DNA can be inserted into new genomic loci. Due to the detrimental consequences of retrotransposition, plants like animals, have developed transcriptional and post-transcriptional silencing mechanisms. Recently several different genome-wide techniques have been developed to understand LTR retrotransposition in Arabidopsis and different plant species. Transposome, methylome, transcriptome, translatome and small RNA sequencing data have revealed how host silencing mechanisms can affect multiple steps of retrotransposition. These recent advances shed light on future mechanistic studies of retrotransposition as well as retrotransposon diversity.
Collapse
|
15
|
Yedavalli VRK, Patil A, Parrish J, Kozak CA. A novel class III endogenous retrovirus with a class I envelope gene in African frogs with an intact genome and developmentally regulated transcripts in Xenopus tropicalis. Retrovirology 2021; 18:20. [PMID: 34261506 PMCID: PMC8278194 DOI: 10.1186/s12977-021-00564-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 06/29/2021] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Retroviruses exist as exogenous infectious agents and as endogenous retroviruses (ERVs) integrated into host chromosomes. Such endogenous retroviruses (ERVs) are grouped into three classes roughly corresponding to the seven genera of infectious retroviruses: class I (gamma-, epsilonretroviruses), class II (alpha-, beta-, delta-, lentiretroviruses) and class III (spumaretroviruses). Some ERVs have counterparts among the known infectious retroviruses, while others represent paleovirological relics of extinct or undiscovered retroviruses. RESULTS Here we identify an intact ERV in the Anuran amphibian, Xenopus tropicalis. XtERV-S has open reading frames (ORFs) for gag, pol (polymerase) and env (envelope) genes, with a small additional ORF in pol and a serine tRNA primer binding site. It has unusual features and domain relationships to known retroviruses. Analyses based on phylogeny and functional motifs establish that XtERV-S gag and pol genes are related to the ancient env-less class III ERV-L family but the surface subunit of env is unrelated to known retroviruses while its transmembrane subunit is class I-like. LTR constructs show transcriptional activity, and XtERV-S transcripts are detected in embryos after the maternal to zygotic mid-blastula transition and before the late tailbud stage. Tagged Gag protein shows typical subcellular localization. The presence of ORFs in all three protein-coding regions along with identical 5' and 3' LTRs (long terminal repeats) indicate this is a very recent germline acquisition. There are older, full-length, nonorthologous, defective copies in Xenopus laevis and the distantly related African bullfrog, Pyxicephalus adspersus. Additional older, internally deleted copies in X. tropicalis carry a 300 bp LTR substitution. CONCLUSIONS XtERV-S represents a genera-spanning member of the largely env-less class III ERV that has ancient and modern copies in Anurans. This provirus has an env ORF with a surface subunit unrelated to known retroviruses and a transmembrane subunit related to class I gammaretroviruses in sequence and organization, and is expressed in early embryogenesis. Additional XtERV-S-related but defective copies are present in X. tropicalis and other African frog taxa. XtERV-S is an unusual class III ERV variant, and it may represent an important transitional retroviral form that has been spreading in African frogs for tens of millions of years.
Collapse
Affiliation(s)
- Venkat R K Yedavalli
- Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, Bethesda, MD, 20892, USA
| | - Akash Patil
- Department of Biomedical Engineering, John Hopkins University, Baltimore, MD, 21205, USA
| | - Janay Parrish
- Internal Medicine, Northwell Health, Lenox Hill Hospital, New York, NY, 10075, USA
| | - Christine A Kozak
- Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, Bethesda, MD, 20892, USA.
| |
Collapse
|
16
|
Jones AR, Iacoangeli A, Adey BN, Bowles H, Shatunov A, Troakes C, Garson JA, McCormick AL, Al-Chalabi A. A HML6 endogenous retrovirus on chromosome 3 is upregulated in amyotrophic lateral sclerosis motor cortex. Sci Rep 2021; 11:14283. [PMID: 34253796 PMCID: PMC8275748 DOI: 10.1038/s41598-021-93742-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 05/21/2021] [Indexed: 02/06/2023] Open
Abstract
There is increasing evidence that endogenous retroviruses (ERVs) play a significant role in central nervous system diseases, including amyotrophic lateral sclerosis (ALS). Studies of ALS have consistently identified retroviral enzyme reverse transcriptase activity in patients. Evidence indicates that ERVs are the cause of reverse transcriptase activity in ALS, but it is currently unclear whether this is due to a specific ERV locus or a family of ERVs. We employed a combination of bioinformatic methods to identify whether specific ERVs or ERV families are associated with ALS. Using the largest post-mortem RNA-sequence datasets available we selectively identified ERVs that closely resembled full-length proviruses. In the discovery dataset there was one ERV locus (HML6_3p21.31c) that showed significant increased expression in post-mortem motor cortex tissue after multiple-testing correction. Using six replication post-mortem datasets we found HML6_3p21.31c was consistently upregulated in ALS in motor cortex and cerebellum tissue. In addition, HML6_3p21.31c showed significant co-expression with cytokine binding and genes involved in EBV, HTLV-1 and HIV type-1 infections. There were no significant differences in ERV family expression between ALS and controls. Our results support the hypothesis that specific ERV loci are involved in ALS pathology.
Collapse
Affiliation(s)
- Ashley R. Jones
- grid.13097.3c0000 0001 2322 6764Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 9NU UK
| | - Alfredo Iacoangeli
- grid.13097.3c0000 0001 2322 6764Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 9NU UK ,grid.13097.3c0000 0001 2322 6764Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK ,grid.451056.30000 0001 2116 3923National Institute for Health Research Biomedical Research Centre and Dementia Unit at South London and Maudsley NHS Foundation Trust and King’s College London, London, UK
| | - Brett N. Adey
- grid.13097.3c0000 0001 2322 6764Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK ,grid.13097.3c0000 0001 2322 6764Social Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK ,grid.13097.3c0000 0001 2322 6764NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, King’s College London, London, UK
| | - Harry Bowles
- grid.13097.3c0000 0001 2322 6764Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 9NU UK ,grid.13097.3c0000 0001 2322 6764Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK ,grid.451056.30000 0001 2116 3923National Institute for Health Research Biomedical Research Centre and Dementia Unit at South London and Maudsley NHS Foundation Trust and King’s College London, London, UK ,grid.451056.30000 0001 2116 3923National Institute for Health Research Biomedical Research Centre at Guy’s and St Thomas’ NHS Foundation Trust and King’s College London, London, UK
| | - Aleksey Shatunov
- grid.13097.3c0000 0001 2322 6764Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 9NU UK
| | - Claire Troakes
- grid.13097.3c0000 0001 2322 6764MRC London Neurodegenerative Diseases Brain Bank, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
| | - Jeremy A. Garson
- grid.83440.3b0000000121901201Division of Infection and Immunity, University College London, London, UK
| | - Adele L. McCormick
- grid.12896.340000 0000 9046 8598School of Life Sciences, University of Westminster, London, UK
| | - Ammar Al-Chalabi
- grid.13097.3c0000 0001 2322 6764Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 9NU UK
| |
Collapse
|
17
|
Yañez-Santos AM, Paz RC, Paz-Sepúlveda PB, Urdampilleta JD. Full-length LTR retroelements in Capsicum annuum revealed a few species-specific family bursts with insertional preferences. Chromosome Res 2021; 29:261-284. [PMID: 34086192 DOI: 10.1007/s10577-021-09663-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 05/04/2021] [Accepted: 05/06/2021] [Indexed: 01/01/2023]
Abstract
Capsicum annuum is a species that has undergone an expansion of the size of its genome caused mainly by the amplification of repetitive DNA sequences, including mobile genetic elements. Based on information obtained from sequencing the genome of pepper, the estimated fraction of retroelements is approximately 81%, and previous results revealed an important contribution of lineages derived from Gypsy superfamily. However, the dynamics of the retroelements in the C. annuum genome is poorly understood. In this way, the present work seeks to investigate the phylogenetic diversity and genomic abundance of the families of autonomous (complete and intact) LTR retroelements from C. annuum and inspect their distribution along its chromosomes. In total, we identified 1151 structurally full-length retroelements (340 Copia; 811 Gypsy) grouped in 124 phylogenetic families in the base of their retrotranscriptase. All the evolutive lineages of LTR retroelements identified in plants were present in pepper; however, three of them comprise 83% of the entire LTR retroelements population, the lineages Athila, Del/Tekay, and Ale/Retrofit. From them, only three families represent 70.8% of the total number of the identified retroelements. A massive family-specific wave of amplification of two of them occurred in the last 0.5 Mya (GypsyCa_16; CopiaCa_01), whereas the third is more ancient and occurred 3.0 Mya (GypsyCa_13). Fluorescent in situ hybridization performed with family and lineage-specific probes revealed contrasting patterns of chromosomal affinity. Our results provide a database of the populations LTR retroelements specific to C. annuum genome. The most abundant families were analyzed according to chromosome insertional preferences, suppling useful tools to the design of retroelement-based markers specific to the species.
Collapse
Affiliation(s)
- Anahí Mara Yañez-Santos
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.,Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| | - Rosalía Cristina Paz
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.
| | - Paula Beatriz Paz-Sepúlveda
- Instituto Multidisciplinario de Biología Celular (IMBICE), Consejo Nacional de Investigaciones Científicas y Técnicas de la República Argentina (CONICET) - Comisión de Investigaciones Científicas (CIC) - Universidad Nacional de La Plata (UNLP), La Plata, Argentina
| | - Juan Domingo Urdampilleta
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| |
Collapse
|
18
|
Higgins P, Grace CA, Lee SA, Goddard MR. Whole-genome sequencing from the New Zealand Saccharomyces cerevisiae population reveals the genomic impacts of novel microbial range expansion. G3-GENES GENOMES GENETICS 2021; 11:6044130. [PMID: 33561237 PMCID: PMC7849907 DOI: 10.1093/g3journal/jkaa027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/23/2020] [Indexed: 11/14/2022]
Abstract
Saccharomyces cerevisiae is extensively utilized for commercial fermentation, and is also an important biological model; however, its ecology has only recently begun to be understood. Through the use of whole-genome sequencing, the species has been characterized into a number of distinct subpopulations, defined by geographical ranges and industrial uses. Here, the whole-genome sequences of 104 New Zealand (NZ) S. cerevisiae strains, including 52 novel genomes, are analyzed alongside 450 published sequences derived from various global locations. The impact of S. cerevisiae novel range expansion into NZ was investigated and these analyses reveal the positioning of NZ strains as a subgroup to the predominantly European/wine clade. A number of genomic differences with the European group correlate with range expansion into NZ, including 18 highly enriched single-nucleotide polymorphism (SNPs) and novel Ty1/2 insertions. While it is not possible to categorically determine if any genetic differences are due to stochastic process or the operations of natural selection, we suggest that the observation of NZ-specific copy number increases of four sugar transporter genes in the HXT family may reasonably represent an adaptation in the NZ S. cerevisiae subpopulation, and this correlates with the observations of copy number changes during adaptation in small-scale experimental evolution studies.
Collapse
Affiliation(s)
- Peter Higgins
- The School of Life Sciences, College of Science, University of Lincoln, Brayford Pool, Lincoln, LN6 7TS, UK
| | - Cooper A Grace
- Department of Biology, York Biomedical Research Institute, University of York, Heslington, York, YO10 5DD, UK.,Department of Biological and Geographical Sciences, University of Huddersfield, Queensgate, Huddersfield, HD1 3DH, UK
| | - Soon A Lee
- The School of Biological Sciences, University of Auckland, Auckland 1142, New Zealand
| | - Matthew R Goddard
- The School of Life Sciences, College of Science, University of Lincoln, Brayford Pool, Lincoln, LN6 7TS, UK.,The School of Biological Sciences, University of Auckland, Auckland 1142, New Zealand
| |
Collapse
|
19
|
Abstract
Retroviruses infect a broad range of vertebrate hosts that includes amphibians, reptiles, fish, birds and mammals. In addition, a typical vertebrate genome contains thousands of loci composed of ancient retroviral sequences known as endogenous retroviruses (ERVs). ERVs are molecular remnants of ancient retroviruses and proof that the ongoing relationship between retroviruses and their vertebrate hosts began hundreds of millions of years ago. The long-term impact of retroviruses on vertebrate evolution is twofold: first, as with other viruses, retroviruses act as agents of selection, driving the evolution of host genes that block viral infection or that mitigate pathogenesis, and second, through the phenomenon of endogenization, retroviruses contribute an abundance of genetic novelty to host genomes, including unique protein-coding genes and cis-acting regulatory elements. This Review describes ERV origins, their diversity and their relationships to retroviruses and discusses the potential for ERVs to reveal virus-host interactions on evolutionary timescales. It also describes some of the many examples of cellular functions, including protein-coding genes and regulatory elements, that have evolved from ERVs.
Collapse
|
20
|
Orozco-Arias S, Isaza G, Guyot R, Tabares-Soto R. A systematic review of the application of machine learning in the detection and classification of transposable elements. PeerJ 2019; 7:e8311. [PMID: 31976169 PMCID: PMC6967008 DOI: 10.7717/peerj.8311] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Accepted: 11/28/2019] [Indexed: 12/16/2022] Open
Abstract
Background Transposable elements (TEs) constitute the most common repeated sequences in eukaryotic genomes. Recent studies demonstrated their deep impact on species diversity, adaptation to the environment and diseases. Although there are many conventional bioinformatics algorithms for detecting and classifying TEs, none have achieved reliable results on different types of TEs. Machine learning (ML) techniques can automatically extract hidden patterns and novel information from labeled or non-labeled data and have been applied to solving several scientific problems. Methodology We followed the Systematic Literature Review (SLR) process, applying the six stages of the review protocol from it, but added a previous stage, which aims to detect the need for a review. Then search equations were formulated and executed in several literature databases. Relevant publications were scanned and used to extract evidence to answer research questions. Results Several ML approaches have already been tested on other bioinformatics problems with promising results, yet there are few algorithms and architectures available in literature focused specifically on TEs, despite representing the majority of the nuclear DNA of many organisms. Only 35 articles were found and categorized as relevant in TE or related fields. Conclusions ML is a powerful tool that can be used to address many problems. Although ML techniques have been used widely in other biological tasks, their utilization in TE analyses is still limited. Following the SLR, it was possible to notice that the use of ML for TE analyses (detection and classification) is an open problem, and this new field of research is growing in interest.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Manizales, Caldas, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales, Caldas, Colombia
| | - Romain Guyot
- Institut de Recherche pour le Développement, CIRAD, University of Montpellier, Montpellier, France.,Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia
| | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia
| |
Collapse
|
21
|
Minarovits J, Niller HH. Truncated oncoproteins of retroviruses and hepatitis B virus: A lesson in contrasts. INFECTION GENETICS AND EVOLUTION 2019; 73:342-357. [DOI: 10.1016/j.meegid.2019.05.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 05/14/2019] [Accepted: 05/27/2019] [Indexed: 02/07/2023]
|
22
|
Orozco-Arias S, Isaza G, Guyot R. Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning. Int J Mol Sci 2019; 20:E3837. [PMID: 31390781 PMCID: PMC6696364 DOI: 10.3390/ijms20153837] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 01/26/2023] Open
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Due to their natural repetitive numbers and their high structural diversity, the identification and classification of TEs remain a challenge in sequenced genomes. Although TEs were initially regarded as "junk DNA", it has been demonstrated that they play key roles in chromosome structures, gene expression, and regulation, as well as adaptation and evolution. A highly reliable annotation of these elements is, therefore, crucial to better understand genome functions and their evolution. To date, much bioinformatics software has been developed to address TE detection and classification processes, but many problematic aspects remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically for TEs, classification with encouraging results. In this review, we will discuss important aspects of TEs, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We will also address current methods and their limitations in identifying and classifying TEs.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales 170001, Colombia
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Romain Guyot
- Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales 170001, Colombia.
- Institut de Recherche pour le Développement, CIRAD, University Montpellier, 34000 Montpellier, France.
| |
Collapse
|
23
|
Krupovic M, Blomberg J, Coffin JM, Dasgupta I, Fan H, Geering AD, Gifford R, Harrach B, Hull R, Johnson W, Kreuze JF, Lindemann D, Llorens C, Lockhart B, Mayer J, Muller E, Olszewski NE, Pappu HR, Pooggin MM, Richert-Pöggeler KR, Sabanadzovic S, Sanfaçon H, Schoelz JE, Seal S, Stavolone L, Stoye JP, Teycheney PY, Tristem M, Koonin EV, Kuhn JH. Ortervirales: New Virus Order Unifying Five Families of Reverse-Transcribing Viruses. J Virol 2018; 92:e00515-18. [PMID: 29618642 PMCID: PMC5974489 DOI: 10.1128/jvi.00515-18] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Mart Krupovic
- Department of Microbiology, Institut Pasteur, Paris, France
| | - Jonas Blomberg
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
| | - John M Coffin
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, USA
| | - Indranil Dasgupta
- Department of Plant Molecular Biology, University of Delhi, New Delhi, India
| | - Hung Fan
- Department of Molecular Biology and Biochemistry, University of California, Irvine, California, USA
| | - Andrew D Geering
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Queensland, Australia
| | - Robert Gifford
- MRC-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
| | - Balázs Harrach
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungarian Academy of Sciences, Budapest, Hungary
| | - Roger Hull
- Child Okeford, Blandford Forum, Dorset, United Kingdom
| | - Welkin Johnson
- Biology Department, Boston College, Chestnut Hill, Massachusetts, USA
| | - Jan F Kreuze
- Crop and System Sciences Division, International Potato Center (CIP), Lima, Peru
| | - Dirk Lindemann
- Institute of Virology, Technische Universität Dresden, Dresden, Germany
| | - Carlos Llorens
- Biotechvana, Parc Cientific, Universitat de Valencia, Valencia, Spain
| | - Ben Lockhart
- Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota, USA
| | - Jens Mayer
- Institute of Human Genetics, University of Saarland, Homburg, Germany
| | - Emmanuelle Muller
- CIRAD, UMR BGPI, Montpellier, France
- BGPI, Université Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Neil E Olszewski
- Department of Microbial and Plant Biology, University of Minnesota, St. Paul, Minnesota, USA
| | - Hanu R Pappu
- Department of Plant Pathology, Washington State University, Pullman, Washington, USA
| | | | - Katja R Richert-Pöggeler
- Julius Kühn-Institut, Institute for Epidemiology and Pathogen Diagnostics, Braunschweig, Germany
| | - Sead Sabanadzovic
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi, USA
| | - Hélène Sanfaçon
- Agriculture and Agri-Food Canada, Summerland Research and Development Centre, Summerland, BC, Canada
| | - James E Schoelz
- Division of Plant Sciences, University of Missouri, Columbia, Missouri, USA
| | - Susan Seal
- Natural Resources Institute, University of Greenwich, Chatham, Kent, United Kingdom
| | - Livia Stavolone
- Consiglio Nazionale delle Ricerche, Istituto per la Protezione Sostenibile delle Piante, Bari, Italy
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | - Jonathan P Stoye
- The Francis Crick Institute and Department of Medicine, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Pierre-Yves Teycheney
- CIRAD, UMR AGAP, Capesterre Belle Eau, Guadeloupe, France
- AGAP, Université Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Michael Tristem
- Imperial College London, Silwood Park Campus, Ascot, Berkshire, United Kingdom
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Jens H Kuhn
- Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Fort Detrick, Frederick, Maryland, USA
| |
Collapse
|
24
|
Grandi N, Cadeddu M, Pisano MP, Esposito F, Blomberg J, Tramontano E. Identification of a novel HERV-K(HML10): comprehensive characterization and comparative analysis in non-human primates provide insights about HML10 proviruses structure and diffusion. Mob DNA 2017; 8:15. [PMID: 29118853 PMCID: PMC5667498 DOI: 10.1186/s13100-017-0099-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 10/24/2017] [Indexed: 11/23/2022] Open
Abstract
Background About half of the human genome is constituted of transposable elements, including human endogenous retroviruses (HERV). HERV sequences represent the 8% of our genetic material, deriving from exogenous infections occurred millions of years ago in the germ line cells and being inherited by the offspring in a Mendelian fashion. HERV-K elements (classified as HML1–10) are among the most studied HERV groups, especially due to their possible correlation with human diseases. In particular, the HML10 group was reported to be upregulated in persistent HIV-1 infected cells as well as in tumor cells and samples, and proposed to have a role in the control of host genes expression. An individual HERV-K(HML10) member within the major histocompatibility complex C4 gene has even been studied for its possible contribution to type 1 diabetes susceptibility. Following a first characterization of the HML10 group at the genomic level, performed with the innovative software RetroTector, we have characterized in detail the 8 previously identified HML10 sequences present in the human genome, and an additional HML10 partial provirus in chromosome 1p22.2 that is reported here for the first time. Results Using a combined approach based on RetroTector software and a traditional Genome Browser Blat search, we identified a novel HERV-K(HML10) sequence in addition to the eight previously reported in the human genome GRCh37/hg19 assembly. We fully characterized the nine HML10 sequences at the genomic level, including their classification in two types based on both structural and phylogenetic characteristics, a detailed analysis of each HML10 nucleotide sequence, the first description of the presence of an Env Rec domain in the type II HML10, the estimated time of integration of individual members and the comparative map of the HML10 proviruses in non-human primates. Conclusions We performed an unambiguous and exhaustive analysis of the nine HML10 sequences present in GRCh37/hg19 assembly, useful to increase the knowledge of the group’s contribution to the human genome and laying the foundation for a better understanding of the potential physiological effects and the tentative correlation of these sequences with human pathogenesis. Electronic supplementary material The online version of this article (10.1186/s13100-017-0099-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nicole Grandi
- Department of Life and Environmental Sciences, University of Cagliari, Cagliari, Italy
| | - Marta Cadeddu
- Department of Life and Environmental Sciences, University of Cagliari, Cagliari, Italy
| | - Maria Paola Pisano
- Department of Life and Environmental Sciences, University of Cagliari, Cagliari, Italy
| | - Francesca Esposito
- Department of Life and Environmental Sciences, University of Cagliari, Cagliari, Italy
| | - Jonas Blomberg
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
| | - Enzo Tramontano
- Department of Life and Environmental Sciences, University of Cagliari, Cagliari, Italy.,Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Cagliari, Italy
| |
Collapse
|
25
|
|
26
|
Mason AS, Fulton JE, Hocking PM, Burt DW. A new look at the LTR retrotransposon content of the chicken genome. BMC Genomics 2016; 17:688. [PMID: 27577548 PMCID: PMC5006616 DOI: 10.1186/s12864-016-3043-1] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 08/24/2016] [Indexed: 11/23/2022] Open
Abstract
Background LTR retrotransposons contribute approximately 10 % of the mammalian genome, but it has been previously reported that there is a deficit of these elements in the chicken relative to both mammals and other birds. A novel LTR retrotransposon classification pipeline, LocaTR, was developed and subsequently utilised to re-examine the chicken LTR retrotransposon annotation, and determine if the proposed chicken deficit is biologically accurate or simply a technical artefact. Results Using LocaTR 3.01 % of the chicken galGal4 genome assembly was annotated as LTR retrotransposon-derived elements (nearly double the previous annotation), including 1,073 that were structurally intact. Element distribution is significantly correlated with chromosome size and is non-random within each chromosome. Elements are significantly depleted within coding regions and enriched in gene sparse areas of the genome. Over 40 % of intact elements are found in clusters, unrelated by age or genera, generally in poorly recombining regions. The transcription of most LTR retrotransposons were suppressed or incomplete, but individual domain and full length retroviral transcripts were produced in some cases, although mostly with regularly interspersed stop codons in all reading frames. Furthermore, RNAseq data from 23 diverse tissues enabled greater characterisation of the co-opted endogenous retrovirus Ovex1. This gene was shown to be expressed ubiquitously but at variable levels across different tissues. LTR retrotransposon content was found to be very variable across the avian lineage and did not correlate with either genome size or phylogenetic position. However, the extent of previous, species-specific LTR retrotransposon annotation appears to be a confounding factor. Conclusions Use of the novel LocaTR pipeline has nearly doubled the annotated LTR retrotransposon content of the chicken genome compared to previous estimates. Further analysis has described element distribution, clustering patterns and degree of expression in a variety of adult tissues, as well as in three embryonic stages. This study also enabled better characterisation of the co-opted gamma retroviral envelope gene Ovex1. Additionally, this work suggests that there is no deficit of LTR retrotransposons within the Galliformes relative to other birds, or to mammalian genomes when scaled for the three-fold difference in genome size. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3043-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andrew S Mason
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK.
| | - Janet E Fulton
- Hy-Line International, 1915 Sugar Grove Avenue, Dallas Grove, IA, 50063, USA
| | - Paul M Hocking
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK
| | - David W Burt
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK.
| |
Collapse
|
27
|
Identification and characterization of a LTR retrotransposon from the genome of Cyprinus carpio var. Jian. Genetica 2016; 144:325-33. [PMID: 27178280 DOI: 10.1007/s10709-016-9901-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 04/27/2016] [Indexed: 10/21/2022]
Abstract
A Ty3/gypsy-retrotransposon-type transposon was found in the genome of the Jian carp (Cyprinus carpio var. Jian) in a previous study (unpublished), and was designated a JRE retrotransposon (Jian retrotransposon). The full-length JRE retrotransposon is 5126 bp, which includes two long terminal repeats of 470 bp at the 5' end and 453 bp at the 3' end, and two open reading frames between them: 4203 bp encoding the group-specific antigen (GAG) and polyprotein (POL). The pol gene has a typical Ty3/gypsy retrotransposon structure, and the gene order is protease, reverse transcriptase, RNase H, and integrase (PR-RT-RH-IN). A phylogenetic analysis of the pol gene showed that it has similarities of 40.7, 40, and 32.8 %, to retrotransposons of Azumapecten farreri, Mizuhopecten yessoensis, and Xiphophorus maculatus, respectively. Therefore, JRE might belong to the JULE retrotransposon family. The copy number of the JRE transposon in the genome of the Jian carp is 124, determined with real-time quantitative PCR. The mRNA of the JRE retrotransposon is expressed in five Jian carp tissues, the liver, kidney, blood, muscle, and gonad, and slightly higher in the kidney and liver than in the other tissues.
Collapse
|
28
|
Tycowski KT, Shu MD, Steitz JA. Myriad Triple-Helix-Forming Structures in the Transposable Element RNAs of Plants and Fungi. Cell Rep 2016; 15:1266-76. [PMID: 27134163 DOI: 10.1016/j.celrep.2016.04.010] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 02/25/2016] [Accepted: 03/28/2016] [Indexed: 01/08/2023] Open
Abstract
The ENE (element for nuclear expression) is a cis-acting RNA structure that protects viral or cellular noncoding RNAs (ncRNAs) from nuclear decay through triple-helix formation with the poly(A) tail or 3'-terminal A-rich tract. We expanded the roster of nine known ENEs by bioinformatic identification of ∼200 distinct ENEs that reside in transposable elements (TEs) of numerous non-metazoan and one fish species and in four Dicistrovirus genomes. Despite variation within the ENE core, none of the predicted triple-helical stacks exceeds five base triples. Increased accumulation of reporter transcripts in human cells demonstrated functionality for representative ENEs. Location close to the poly(A) tail argues that ENEs are active in TE transcripts. Their presence in intronless, but not intron-containing, hAT transposase genes supports the idea that TEs acquired ENEs to counteract the RNA-destabilizing effects of intron loss, a potential evolutionary consequence of TE horizontal transfer in organisms that couple RNA silencing to splicing deficits.
Collapse
Affiliation(s)
- Kazimierz T Tycowski
- Department of Molecular Biophysics and Biochemistry, Howard Hughes Medical Institute, Yale University School of Medicine, 295 Congress Avenue, New Haven, CT 06536, USA
| | - Mei-Di Shu
- Department of Molecular Biophysics and Biochemistry, Howard Hughes Medical Institute, Yale University School of Medicine, 295 Congress Avenue, New Haven, CT 06536, USA
| | - Joan A Steitz
- Department of Molecular Biophysics and Biochemistry, Howard Hughes Medical Institute, Yale University School of Medicine, 295 Congress Avenue, New Haven, CT 06536, USA.
| |
Collapse
|
29
|
Aswad A, Katzourakis A. Convergent capture of retroviral superantigens by mammalian herpesviruses. Nat Commun 2015; 6:8299. [PMID: 26400439 PMCID: PMC4667437 DOI: 10.1038/ncomms9299] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Accepted: 08/07/2015] [Indexed: 12/23/2022] Open
Abstract
Horizontal gene transfer from retroviruses to mammals is well documented and extensive, but is rare between unrelated viruses with distinct genome types. Three herpesviruses encode a gene with similarity to a retroviral superantigen gene (sag) of the unrelated mouse mammary tumour virus (MMTV). We uncover ancient retroviral sags in over 20 mammals to reconstruct their shared history with herpesviral sags, revealing that the acquisition is a convergent evolutionary event. A retrovirus circulating in South American primates over 10 million years ago was the source of sag in two monkey herpesviruses, and a different retrovirus was the source of sag in a Peruvian rodent herpesvirus. We further show through a timescaled phylogenetic analysis that a cross-species transmission of monkey herpesviruses occurred after the acquisition of sag. These results reveal that a diverse range of ancient sag-containing retroviruses independently donated sag twice from two separate lineages that are distinct from MMTV. Horizontal gene transfer from retroviruses to mammals is rare between unrelated viruses. Here the authors show the convergent acquisition by herpesviruses of a virulence gene of ancient retroviruses, which occurred at least twice from different donor lineages, to distinct herpesviruses that infect mammals.
Collapse
Affiliation(s)
- Amr Aswad
- Department of Zoology, University of Oxford, Oxford OX1 3PS, UK
| | | |
Collapse
|
30
|
Mata H, Gongora J, Eizirik E, Alves BM, Soares MA, Ravazzolo AP. Identification and characterization of diverse groups of endogenous retroviruses in felids. Retrovirology 2015; 12:26. [PMID: 25808580 PMCID: PMC4373062 DOI: 10.1186/s12977-015-0152-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2014] [Accepted: 02/23/2015] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Endogenous retroviruses (ERVs) are genetic elements with a retroviral origin that are integrated into vertebrate genomes. In felids (Mammalia, Carnivora, Felidae), ERVs have been described mostly in the domestic cat, and only rarely in wild species. To gain insight into the origins and evolutionary dynamics of endogenous retroviruses in felids, we have identified and characterized partial pro/pol ERV sequences from eight Neotropical wild cat species, belonging to three distinct lineages of Felidae. We also compared them with publicly available genomic sequences of Felis catus and Panthera tigris, as well as with representatives of other vertebrate groups, and performed phylogenetic and molecular dating analyses to investigate the pattern and timing of diversification of these retroviral elements. RESULTS We identified a high diversity of ERVs in the sampled felids, with a predominance of Gammaretrovirus-related sequences, including class I ERVs. Our data indicate that the identified ERVs arose from at least eleven horizontal interordinal transmissions from other mammals. Furthermore, we estimated that the majority of the Gamma-like integrations took place during the diversification of modern felids. Finally, our phylogenetic analyses indicate the presence of a genetically divergent group of sequences whose position in our phylogenetic tree was difficult to establish confidently relative to known retroviruses, and another lineage identified as ERVs belonging to class II. CONCLUSIONS Retroviruses have circulated in felids along with their evolution. The majority of the deep clades of ERVs exist since the primary divergence of felids' base and cluster with retroviruses of divergent mammalian lineages, suggesting horizontal interordinal transmission. Our findings highlight the importance of additional studies on the role of ERVs in the genome landscaping of other carnivore species.
Collapse
|
31
|
Cruz GMQ, Metcalfe CJ, de Setta N, Cruz EAO, Vieira AP, Medina R, Van Sluys MA. Virus-like attachment sites and plastic CpG islands:landmarks of diversity in plant Del retrotransposons. PLoS One 2014; 9:e97099. [PMID: 24849372 PMCID: PMC4029996 DOI: 10.1371/journal.pone.0097099] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2014] [Accepted: 04/14/2014] [Indexed: 11/18/2022] Open
Abstract
Full-length Del elements from ten angiosperm genomes, 5 monocot and 5 dicot, were retrieved and putative attachment (att) sites were identified. In the 2432 Del elements, two types of U5 att sites and a single conserved type of U3 att site were identified. Retroviral att sites confer specificity to the integration process, different att sites types therefore implies lineage specificity. While some features are common to all Del elements, CpG island patterns within the LTRs were particular to lineage specific clusters. All eudicot copies grouped into one single clade while the monocots harbour a more diverse collection of elements. Furthermore, full-length Del elements and truncated copies were unevenly distributed amongst chromosomes. Elements of Del lineage are organized in plants into three clusters and each cluster is composed of elements with distinct LTR features. Our results suggest that the Del lineage efficiently amplified in the monocots and that one branch is probably a newly emerging sub-lineage. Finally, sequences in all groups are under purifying selection. These results show the LTR region is dynamic and important in the evolution of LTR-retrotransposons, we speculate that it is a trigger for retrotransposon diversification.
Collapse
Affiliation(s)
- Guilherme M. Q. Cruz
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
| | - Cushla J. Metcalfe
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
| | | | - Edgar A. O. Cruz
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
| | - Andréia Prata Vieira
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
| | - Rosario Medina
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
| | - Marie-Anne Van Sluys
- Departamento de Botânica, Instituto de Biociências (IB), Universidade de São Paulo (USP), São Paulo, São Paulo, Brasil
- * E-mail:
| |
Collapse
|
32
|
Schulman AH. Retrotransposon replication in plants. Curr Opin Virol 2013; 3:604-14. [PMID: 24035277 DOI: 10.1016/j.coviro.2013.08.009] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 08/16/2013] [Accepted: 08/19/2013] [Indexed: 12/31/2022]
Abstract
Retrotransposons comprise the bulk of large plant genomes, replicating via an RNA intermediate whereby the original, integrated element remains in place. Of the two main orders, the LTR retrotransposons considerably outnumber the LINEs. LINEs integrate into target sites simultaneously with the RNA transcript being copied into cDNA by target-primed reverse transcription. LTR retrotransposon replication is basically equivalent to the intracellular phase of retroviral life cycles. The envelope gene giving extracellular mobility to retroviruses is in fact widespread in plants and their retrotransposons. Evolutionary analyses of the retrotransposons and retroviruses suggest that both form an ancient monophyletic group. The particular adaptations of LTR retrotransposons to plant life cycles enabling their success remain to be clarified.
Collapse
Affiliation(s)
- Alan H Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, P.O. Box 65, Helsinki FIN-00014, Finland; Biotechnology and Food Research, MTT Agrifood Research Finland, Jokioinen FIN-31600, Finland.
| |
Collapse
|